Hello,
first of all I would like to thank you all for all the help I have been getting through this medium the last few weeks. I am in the middle of my data analysis, and I am struggling with some stata related questions, and some statistical questions.
1. Before performing OLS, I winsorize all my independent variables and my dependent variable. Do all variables have to be winsorized at the same % level? for example, I have very big outliers in my dependent variable, but these outliers are a lot smaller in my independent variables. Also, I have taken the natural log of my firm size variable, does it make sense to winsorize this variable, given that it is already expressed as a log?
2. I have performed the hettest for heteroskedasticity. If this test indicates that there is heteroskedasticity, is it sufficient to regress making use of ,robust? and what is this robust standard error exactly? is it White's?
3. I have expressed most of my variables as a ratio (decimal number), and I am strugglign on how to interpret these results. For example, my regression shows a coefficient for independent variable leverage of -0,165119. My dependent variable is the relative change in carbon intensity between 2013 and 2018. Is it correct if I interpret the coefficient as follows: if the debt-to-equity ratio of a firm (= my measure for leverage) is 10 percentage point higher, the carbon intensity will be reduced with 16,51 percentage point more. Or should I multiply these coefficients with 100?
4. if some of my control variables are statistically significant, what does this tell me exactly? Or do they just serve for a more accurate estimation of the coefficients of the independent variables and do not need to be interpret? Same with a constant term that is statiscally significant?
Kind regards,
Timea De Wispelaere
Related Posts with various small questions
Omitted Variables in Hybrid ModelI am running a difference-in-difference model using the SSC command -xthybrid. I am interested in bo…
Generating scores using combinations of weighted variables and dividing by the sum of weightsHi all, I would like to create weighted genetic risk scores using a combination of 27 SNPs (variable…
-npregress- average derivatives change each time I rerun the model, why?I am running the following nonparametric regression: Code: set seed 12345 npregress kernel spending…
Creating new variable keeping just the first letter of each word in a stringHello all, I have a list of 250+ schools and want to create a variable that takes the first letter …
Variables significant when changing reference category?Hi all, I'm doing a research about risk tolerance and demographic factors using a logistic regressi…
Subscribe to:
Post Comments (Atom)
0 Response to various small questions
Post a Comment