I am working with data for Indian schools. I intend to create an Index by combining several variables(availability of library, no. of computers in the school, no. of toilets in the school) that can reflect the net infrastructure available in the school. I wish to use this index later in my regression specification.
I performed a pca to construct this index. However, my first principal component only explained around 23% of the variation in the data. The second component also explained around 20% variation. I read that 23% is quite low and the first component cannot directly be used as the index in such cases.
Can somebody recommend what shall I do in such a case. Is there any other way to construct the index. Is it sensible to combine the first two components. If yes, how can I do it?
Related Posts with Principal Component Analysis
How best to control for / identify changes in variables over timeHi Statalist. I want to capture the change in variables in my panel dataset, such as marital status…
Creating panel structure - any precise way?Hello, I am trying to create a panel structure with three dimensions: exporting country, importing c…
Extracing dates from both long and file name in stringDear Statalists, I am trying to do a subtraction between dates to come up with the time interval be…
Graph line to show the mean of two variables over a periodHello everyone. I need to know how to display a graph line that shows the mean of my Y variables by…
mi impute chained - STATA Program did not show the final resultsHello, I had some issues with running multiple imputation. Using the following codes, I ran multip…
Subscribe to:
Post Comments (Atom)
0 Response to Principal Component Analysis
Post a Comment