Hi there,
I am running a mixed logit model using the user written command mixlogit on Stata 16/MP8 with only one random coefficients and around 100 additional (constant) parameters to estimate. I have a very large number of observations (NXJ is more than 200 millions). Convergence is not an issue. The main problem is the time it takes to compute each iteration of the Sim Lik. I have tried with a few random sub-samples and have some questions on how to improve on speed for the overall sample:
1. I noticed that speed is quite sensitive to the number of halton draws -- nrep(#) option. How small is too small in order to increase speed at the cost of inducing noise and bias in the estimated coefficients?
2. Does the number of halton draws affect only the estimates of the random coefficients or also the other estimates of the model? I did not find any monte-carlo result on this, any suggestions?
3. Any other trick to improve speed in this case? What about the number of initial sequence elements to drop when creating Halton sequence? Can I reduce this to a very small number through the burn(#) option since I have only random coefficient?
Thanks a lot in advance for your attention and time. Best,
Matteo
Related Posts with Speeding up Mixlogit with large datasets
Problem with instruments in the spivregressDear All, I am estimating a spatial IV model using the files that you can download from here. They …
How to make a graph for U-shaped relationshipsHi, I often check estimated relationships in a graph by using the Stata marginsplot or marginscontp…
Data transformationDear Statatist, I have try reshape and stack command but still can't solve the problem related to …
"Expanding" a count variable within a panel datasetHello. My PI has asked me to create new rows from two existing variables (see num and den below). Us…
Divide panel data into subgroupsDear Statalist, I am working on a panel dataset (xtset firm year). I want to divide the firms into…
Subscribe to:
Post Comments (Atom)
0 Response to Speeding up Mixlogit with large datasets
Post a Comment