I want to estimate a probit regression on a large dataset.
In particular, a 1 refers to a positive trade flow from country i to country j for a particular product. Any directed country-pair can export many products, i.e. a link = 1 if i exports to j this particular good. No 0's are recorded. For any given year in the data, this results in around 5 million observations.
I want to estimate the probability that a link is present given covariates: probit link x1 x2..., vce(cluster ...)
Creating all possible i-j-product combinations results in 228 million observations. Hence, I would like to estimate a probit on all observed 1's and create a random subsample of all 0's, estimate the probit, and then reweigh the coefficients to correct for the true number of 0's in the data. I can create and store the large dataset of 228 mln observations, but the server chokes on the probit. What would be the correct way to proceed?
Related Posts with Estimating probit on random subsample of zeroes
pooled mean group estimationHello. I get errors when I run below command and could not figure out what is the cause. It would be…
Creating new variables on the basis of relationship to head, age and wages?Hi, I want to do two things in STATA but I do not know the commands to it. The data is of two rounds…
Putpdf with R2Hi all, I'm trying to save some regression results, and I'm using putpdf: Code: putpdf begin //…
Using for loop to plot multiple graphs.Hey Everyone, I am trying to plot multiple graphs using a for loop. At the time, I am trying to mak…
drop in all rangeHi there, I have some difficulties dropping observations. If the ID did not receive subsidy in all…
Subscribe to:
Post Comments (Atom)
0 Response to Estimating probit on random subsample of zeroes
Post a Comment