I ran an experiment to see if a particular gamification design (2 treatment groups) would perform better in terms of quality than the control. The design was run on a reporting application where citizens can report issues in their environment to the government, and the government can act upon it.
Quality in my data is measured by the number of successful reports of an individual/total reports made of the same individual. (Successful reports are those acted on by the government).
The data would look something like this, where each row represents 1 individual:
The Competition and Inter-Team represent the IV which is categorical, the baseline is no design. My question is which of the following formula/commands in STATA should I use?:
Approach 1
glm Succesful_Report Competition Inter-Team, family(binomial Total_Report) link(logit) vce(robust) nolog
Here the family (binomial Total_Report) takes into account the fact that each individual has a different denominator (total number of reports). The fact that the individuals in the control report less would then be accounted for. Since I run the formula with varying denominators for the DV, Stata does not allow me to run a mfx command.
Approach 2
glm Proportion Competition Inter-Team, family(binomial) link(logit) vce(robust) nolog
In this case the proportion is directly taken as the DV. However, the varying number of reports made by an individual are not accounted for.
I am unsure which approach to take to test whether a the treatment group (Competition and Inter-Team) outperform the control in regard to the quality. What would be advised?
Kind regards, Michiel
0 Response to GLM with proportion DV and categorical IV. Which STATA formula should I use?
Post a Comment