Hi, I'm working with a big database (15 million observations, 2 GB) and I would like to make it as light as possible. The only way I have come up with to do that is to code string variables (e.g. gender: "male" vs. "female" as 0 and 1). However, it's unclear to me what role value labels play. Does a label such as 0 = "Male" make the database heavier? I'm asking because I have, for example, a variable "country" with more than 200 possible values. Also, are there any other tricks or things to consider in order to make a database lighter?
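For reference, this is roughly the kind of recoding I have in mind. It is only a minimal sketch, assuming Stata and two string variables called gender and country; the new variable names and the label name are made up for illustration.

```stata
* Hypothetical example: gender is a string variable holding "male" or "female"
generate byte gender_code = 0 if gender == "male"
replace gender_code = 1 if gender == "female"

* Attach a value label so the codes still display as text
label define gender_lbl 0 "Male" 1 "Female"
label values gender_code gender_lbl

* For a variable with many distinct values, such as country,
* encode builds the numeric codes and the value label in one step
* (codes start at 1 and follow the alphabetical order of the strings)
encode country, generate(country_code)

* Compare the storage types of the original and coded variables
describe gender gender_code country country_code
```

I would only drop the original string variables after checking that the coded versions match them.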
Cheers