[Note: PhD student new to Stata and still somewhat of a beginner with stats analysis]
In my dataset, the variable 'Inputs' reflects monetary values for which some observations are 0. I have logged all values of 'Inputs' for running regressions, but of course Stata drops +/- 25 observations for which 'Inputs' =0. I would prefer not to lose those observations because my sample is only n=147.
On the advice of my supervisor, I have replaced 'Inputs'=0 with 'Inputs'=1 for the latter observations so as not to drop them from the sample, then I logged the values again. Now instead of dropping those observations, they remain in the sample with 'Log_Inputs'=0. However, this weakens the R-squared value and therefore the model.
Which is the better choice: Drop the observations that cannot be logged, or weaken the model but maintain the sample size?
Related Posts with Logged variables w/value of 0 : drop observations or weaken the model?
Dissimilarity Index for Neighborhood Level DataHello, I need to calculate a racial dissimilarity index score for each of 52 neighborhoods in my st…
Use Log(Stock Prices) or (Stock Returns)I am running a regression model to find the impact of Federal Funds Rate(FFR) on Stock market. I hav…
How to use a .prn file in stataHey there, I am currently working on my bachelor thesis about economic growth and I downloaded a da…
Fixed Effects & Ordered Probit (RE) for same panel?Hey Folks! I hope I am not getting too annoying with my questions. I have encountered the following…
Panel threshold regressionDear all, I am trying to do panel threshold regression developed by Hansen(1999). Our dependent vari…
Subscribe to:
Post Comments (Atom)
0 Response to Logged variables w/value of 0 : drop observations or weaken the model?
Post a Comment