Hi, I'm working with a big database (15 million of observations, 2 gb) and I would like to make it as lighter as possible. The only ways that I come up to do that is code string variables (i.e. gender: "male" vs. "female" as 0 and 1). However it's unclear to me which role values' labels plays. Does a label 0 = "Male", makes it heaver? I'm asking this because I have for example a variable "country" with more than 200 possible values. Furthermore, is there any other trick or thing to consider in order to make a database lighter?
Cheers
Related Posts with Coding variables to make the database lighter
average at sector level within state (excluding current sector)Hi all, I have a company level data as below and I want to create a new variable that is the averag…
Conditional logit (clogit) for cross-sectional dataDear all, I'm working on a project in which we analyze firms' location decision, i.e. the decision …
Why should I use difference GMM instead of system GMM (Arellano-Bond)What are the reasons to use Arellano-Bond's difference GMM instead of system GMM? Reasons to apply …
-xtsfkk- and fixed effectDear Dr Karakaplan, thank you very much for providing the community with this new methodology. I'm…
Error code in Google Places API--Observation numbers out of rangeI've been trying to run googleplaces to return latitude/longitude for some facilities I am analyzing…
Subscribe to:
Post Comments (Atom)
0 Response to Coding variables to make the database lighter
Post a Comment