Dear Stata Users,
I am wondering if anyone has a strategy to speed up the estimation of the cmxtmixlogit command?
I am estimating a set of choice models using a large panel dataset with approximately 60,000 cases and 200,000 observations. My preferred specification includes both fixed and random parameters, as well as a set of case-specific covariates.
Although I have also considered user-written commands such as mixlogit, I am using cmxtmixlogit due to its post-estimation capabilities with margins and its ease of specifying case-specific variables. I am aware that the cmxtmixlogit is not parallelized and hence using a multi-core computer with Stata MP (with e.g. 8 cores) or even a computing cluster would not improve the estimation speed. Due to the large sample size, it is also not feasible to set a small number of intpoints, as the models will not converge. The simulation procedure takes especially long when including additional (case-specific) covariates, such as week fixed effects (24 weeks x 5 alternatives) which amounts to a large number of covariates. So far, I have not been able to successfully estimate a model using all data and all covariates (and I have let the command run for up to a week).
I am hoping that there is something that I can tweak to be able to use the cmxtmixlogit command with my data. Any suggestions would be greatly appreciated.
Thanks,
Paul
Related Posts with Speeding up random parameter mixed-logit estimation (cmxtmixlogit) with large dataset
Simulation of repeated measures data in a stepped-wedge contextHello, I have written the following code for a three-level repeated measures model in a stepped-wed…
Generating a dummy variable with value from other variable to a new row (cell)Hi, my job here is to generate a new variable (dummy for months unemployed) for December. In my anal…
Color schemes and color palettes - setting default colors for graphsI have a list of colors from my workplace that we should use for all the graphs I create. Ideally I …
I am confused about the confidence interval of survival analysis between two methodsIf I analyze a set of data use syntax ”ci means variate” and I get a result like this ci means y V…
Cubic splines to model relationships between continuous variables and outcomes: a guide for cliniciansFor those interested, I've just come across this interesting article, published on Bone Marrow Trans…
Subscribe to:
Post Comments (Atom)
0 Response to Speeding up random parameter mixed-logit estimation (cmxtmixlogit) with large dataset
Post a Comment