BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

Estimating probit on random subsample of zeroes
Estimating probit on random subsample of zeroes

I want to estimate a probit regression on a large dataset.
In particular, a 1 refers to a positive trade flow from country i to country j for a particular product. Any directed country-pair can export many products, i.e. a link = 1 if i exports to j this particular good. No 0's are recorded. For any given year in the data, this results in around 5 million observations.
I want to estimate the probability that a link is present given covariates: probit link x1 x2..., vce(cluster ...)
Creating all possible i-j-product combinations results in 228 million observations. Hence, I would like to estimate a probit on all observed 1's and create a random subsample of all 0's, estimate the probit, and then reweigh the coefficients to correct for the true number of 0's in the data. I can create and store the large dataset of 228 mln observations, but the server chokes on the probit. What would be the correct way to proceed?

BJ Data Tech Solution

Home / Data Cleaning / Data management / Data Processing / Estimating probit on random subsample of zeroes
Estimating probit on random subsample of zeroes

0 Response to Estimating probit on random subsample of zeroes

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Estimating probit on random subsample of zeroes Estimating probit on random subsample of zeroes

Related Posts with Estimating probit on random subsample of zeroes

0 Response to Estimating probit on random subsample of zeroes