Hi, I'm working with a big database (15 million of observations, 2 gb) and I would like to make it as lighter as possible. The only ways that I come up to do that is code string variables (i.e. gender: "male" vs. "female" as 0 and 1). However it's unclear to me which role values' labels plays. Does a label 0 = "Male", makes it heaver? I'm asking this because I have for example a variable "country" with more than 200 possible values. Furthermore, is there any other trick or thing to consider in order to make a database lighter?
Cheers
Related Posts with Coding variables to make the database lighter
Mixed Results LM-Test and Hausman-TestHello everyone, I am running some regression on my dataset and I do get mixed results which is very…
SortI think this is a simple question. I have a categorical variable (85 departments) in a long data set…
Counting the min and max in a panel level dataHi All, I'm trying to count the min and max number of plants constructed each year by the vendors. …
Syntax error in estoutI am producing tables using estout but even though the code creates a row for them, it won't produce…
IV with dummy variableDear all, I am trying to run this model: Array Where i is individual i, in strata s, and time t. …
Subscribe to:
Post Comments (Atom)
0 Response to Coding variables to make the database lighter
Post a Comment