Question re: 'gsample' procedure.
I am currently analyzing the Healthcare Cost and Utilization Project (HCUP) National Inpatient Sample (NIS) 2016 dataset, which is a stratified systematic sample of discharges: roughly 7 million observations with patient ICD-10 diagnosis and procedure codes. The NIS documentation recommends using the 'svyset' command to account for the complex survey design and the weights:
svyset hosp_nis [pweight = discwt], strata(nis_stratum)
I'd like to work with a smaller sample of the data, because my computer has limited resources for running such a large (10 GB) dataset. However, simply removing observations would significantly alter the nationally estimated standard errors, because the data are weighted. This is where I'd like to consider 'gsample'. As I understand the documentation, though, 'gsample' is meant for frequency weights and NOT probability weights, which are what my data use. I've been told that I should still be able to use 'gsample', but I'm not convinced. Has anyone used 'gsample' with a similar dataset or in a similar situation, and if so, what would you suggest? I thought I had found a similar post here, but I'm not sure it answers my exact question about frequency vs. probability weights for 'gsample'.
Cheers!