subsetByVIF selects subsets of the covariates such that each covariate in a given subset has a VIF that is less than or equal to a value specified by the user. This program has been posted on the Statistical Software Components (SSC) archive.
We are frequently faced with analyzing data sets in which the ratio of covariates to patients is high. There are several approaches to analyzing such data including penalized regression methods, k-fold cross-validation techniques, and bagging. A problem with any of these approaches is that, even after the elimination of variables causing multicollinearity, the variance-covariance matrix of the remaining covariates is often highly ill-conditioned. The subsetByVIF program reduces the number of covariates to the largest subsample such that the maximum VIF for each variable in the subsample is less than some value specified by the user. These variables are selected without regard to the dependent variable of interest, which should mitigate problems due to overfitting. The use of this program should improve the convergence properties of many methods of exploratory data analysis.
Please see the subsetByVIF help file in the SSC archive for further details.
Related Posts with subsetByVIF -- An ado file that selects a subset of covariates constrained by variance inflation factors (VIFs)
Extracting labels and renaming variables using loopsDear Statalist members, I am working with a dataset in .csv format that is separated in different f…
How to check Study Size (count of participants included in meta-analysis) when conducting meta-analysis?Hello I am conducting meta-analysis on Stata 16 using meta set and meta summarize commands. The out…
Setting the level() for lincomest, decimals don't workHi, I am trying to set a specific level for lincomest, but it only accepts numbers without decimals…
Add prefix on every Nth column in row X in panel data. Then, make row X the new variable names.I have a dataset containing stock data, looking like this: Code: * Example generated by -dataex-.…
Replacing missing values with previous non-missing values [Panel data]Hi everyone, I have 2-year panel data for households and it's members. I have a variable ramadan09 …
Subscribe to:
Post Comments (Atom)
0 Response to subsetByVIF -- An ado file that selects a subset of covariates constrained by variance inflation factors (VIFs)
Post a Comment