I have a few similar variables and I'm not sure if I should consider one of each or all in my logistic regression.
For instance, i have :
1. An altitude variable and a hill/terrain variable and also Latitude and longitude as variables
2. A Number of children variable and a Household size variable
3. A smoking variable and tobacco use variable
My concern is that the fact that these variables represent very similar information and having them in a logistic regression table may be redundant and may affect the quality of my data. Would an adjusted odds ratio account for any overlap in this case or should I decide on which variables to use, and if so, how do I go about doing that?
I have many other variables, sex, education, exercise etc the above arent the only ones going into the adjusted odds ratio table
Thanks
Related Posts with Determining which variables to use in logistic regression
FMM with GLM yields strange results on simulated dataHello Statalist Community, I am trying to test the capabilities of STATA 15's FMM procedure to esti…
Advice for FE or FD model for panel dataHello, I am looking for advice on the correct form of my model. I am planning to run a regression to…
FMMwith GLM yields strange results on simulated dataHello Statalist Community, I am trying to test the capabilities of STATA 15's FMM procedure to esti…
Simultaneous equationsHello, I am doing accounting research. Recently I read a research paper using the generalized metho…
Simultaneity bias in a logit model?OLS yields inconsistent estimates when an explanatory variable is endogenous, because the explanator…
Subscribe to:
Post Comments (Atom)
0 Response to Determining which variables to use in logistic regression
Post a Comment