subsetByVIF selects subsets of the covariates such that each covariate in a given subset has a VIF that is less than or equal to a value specified by the user. This program has been posted on the Statistical Software Components (SSC) archive.
We are frequently faced with analyzing data sets in which the ratio of covariates to patients is high. There are several approaches to analyzing such data including penalized regression methods, k-fold cross-validation techniques, and bagging. A problem with any of these approaches is that, even after the elimination of variables causing multicollinearity, the variance-covariance matrix of the remaining covariates is often highly ill-conditioned. The subsetByVIF program reduces the number of covariates to the largest subsample such that the maximum VIF for each variable in the subsample is less than some value specified by the user. These variables are selected without regard to the dependent variable of interest, which should mitigate problems due to overfitting. The use of this program should improve the convergence properties of many methods of exploratory data analysis.
Please see the subsetByVIF help file in the SSC archive for further details.
Related Posts with subsetByVIF -- An ado file that selects a subset of covariates constrained by variance inflation factors (VIFs)
Descriptive table that summarizes all my dataI have 2 groups in my dataset referring to Credit and LC and would like to know, if its possible to …
Do files which are written in Windows are not well aligned when being used under LinuxHi, I am using Stata MP 14 do-file editor, both on a Windows and Ubuntu Linux machine. The transiti…
questions about regress a tableHere are the questions: (a) Run the regression with all years (Table 3 Column 1). (c) Test if the co…
How to create internal choiceDear all, My data is as follows. Respondents were asked 5 times. I am new to Stata and I am in troub…
Using decimal numlist in foreach loop Dear Statalisters, I want to build a table that records prevalence of a variable at a varying hypo…
Subscribe to:
Post Comments (Atom)
0 Response to subsetByVIF -- An ado file that selects a subset of covariates constrained by variance inflation factors (VIFs)
Post a Comment