Dear Statalist,

I am struggling to reassign records that have a missing value on the categorical variable identifying each record's province. I would like to reassign those records semi-randomly to the existing provinces, that is, at random but with each province's chance of being picked proportional to its existing frequency. In the example below, 20 records are already assigned to one of nine provinces, and the variable freq holds each province's share of those 20 records (for example, province 4 has 5 of the 20 records, so freq is .25). I would like to reassign the three records with a missing province according to those frequencies. I am using Stata 14 MP.

Thanks



clear
input float(province sex freq)
1 1 .15
1 1 .15
1 2 .15
2 1 .1
2 2 .1
3 2 .1
3 2 .1
4 2 .25
4 2 .25
4 1 .25
4 1 .25
4 1 .25
5 2 .1
5 1 .1
6 2 .1
6 1 .1
7 1 .05
8 1 .05
9 2 .1
9 2 .1
. 2 .
. 1 .
. 1 .
end
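
To make the goal concrete, here is a minimal sketch of the kind of thing I have in mind, not something I am sure is the right way to do it. For each record with a missing province it picks one of the already-assigned records at random and copies that record's province, so a province holding 25% of the assigned records should be picked about 25% of the time. The seed value and the variable name donor are my own arbitrary choices, and the sketch assumes missing values sort last, as they do in Stata.

```stata
* Sketch only: the seed and the name donor are arbitrary choices
set seed 12345

* Put records with a known province first (missing values sort last)
sort province, stable

* Number of records that already have a province
count if !missing(province)
local n = r(N)

* For each missing record, draw a random row among the first n records
gen long donor = ceil(runiform() * `n') if missing(province)

* Copy the province of the randomly drawn record
replace province = province[donor] if missing(province)
drop donor
```

With only three missing records the result will of course vary a lot with the seed, and freq stays missing for the reassigned records unless it is recomputed afterwards.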