Hi all,
I have a big data set and am trying to QC some variables such as blood pressure and cholesterol measurement. I am trying to see if there is any drift pattern/consistency of the measurements taken by the devices we have overtime. Basically we have multiple daily (1 year worth of data) measurements for each variable and devices. Eg:
device date sbp dbp cholesterol
1 22/8/18 108
1. 22/8/18 110
2 22/8/18 115
2 23/8/18 118
2 23/8/18 115
My questions:
1. What is the best way to analyse these dataset? Do you think CUMSUM plot is applicable? Any examples?
2. I tried taking the weekly average reading and then do a basic scatter plot ? But I have problem adding the mean line, and +/- SD line into the plot to identify outlier measurements. I keep getting error message when using the common yline? Eg of my command:
global mean1= r(mean)
global lowerCL1= r(mean) - 1*r(sd)
global upperCL1= r(mean) + 1*r(sd)
twoway (scatter ave_height weekly_date, ///
legend(off) mcolor(black) ///
yline($mean1, lpattern(shortdash) lcolor(gray))///
yline($lowerCL1, lpattern(dash) lcolor(gray)) ///
yline($upperCL1, lpattern(dash) lcolor(gray)) ///
graphregion(color(white)) ylabel(, grid glcolor(gs14)) ///
ylabel(150(1)154) xlabel(1(2)60) ///
aspectratio(1.09) //
Please help!
Many thanks!
Related Posts with Data quality control check with CUMSUM
Counting Distinct Observations: number of new investments of specific VCs in an industry 5 years prior to the focal observationDear All, I am using Stata 15.0 I need to calculate how often a VC firm X (variable name: FirmName…
Comparing regression coefficients across groups in presence of fixed effectsHey I have some company data and i run a panel regression with firm fixed effects. I have a list of …
Storing one regression coefficient out of many others in "reghdfe"I have a fixed effects regression with many controls. Code: reghdfe pm p_diff l1.pm dd ddXpay rnd…
Group categorical variables into a groupHello, for my Master thesis I am at the step of preparing the data to run a multinomial logit model…
sort using twoway line vs. twoway scatter, connected(L)Hi everyone, This may have a very simple solution that I'm missing, but I've gotten stuck. Basical…
Subscribe to:
Post Comments (Atom)
0 Response to Data quality control check with CUMSUM
Post a Comment