I have been trying to merge 2 datasets. I have household sample and individual sample. Their sample sizes are different. I would like to merge variales in the hh sample with individual sample. HH sample includes 2 key variables: HV009 and HV010. I want HV009 HV010 variables to appear in my individual sample. To do this, I first opened my individual sample and run the following code:
merge 1:1 _n using "C:\Users\cansu\Desktop\Merging HH and Women\DHS-2018\2018 HH KEPT.dta"
drop if _merge==1
drop if _merge==2
Both samples has the same HHID number. I tried to match it based on HHID that appears both, but when I run this code, I see that the code fails. It does not correctlt matched. The only thing that I want is the exact match based on HHID number.
How should I solve this problem? Thank you very much in adcance.
This is my household sample as having an HHID number:
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input str12 HHID double(HV009 HV010) " 01010004" 3 1 " 01010007" 4 1 " 01010008" 1 0 " 01010009" 4 1 " 01010011" 3 1 " 01010012" 4 0 " 01010013" 4 1 " 01010014" 5 1 " 01010016" 6 1 " 01010019" 5 1 " 01010021" 3 1 " 01020001" 1 0 " 01020002" 1 0 " 01020005" 2 0 " 01020006" 2 1 " 01020007" 2 0 " 01020010" 2 1 " 01020011" 1 0 " 01020013" 4 1 " 01020015" 2 1 " 01020017" 2 0 " 01030004" 2 1 " 01030007" 1 0 " 01030012" 3 2 " 01030014" 2 0 " 01030016" 4 2 " 01030018" 2 0 " 01030020" 2 0 " 01040002" 3 0 " 01040003" 4 1 " 01040004" 3 1 " 01040005" 2 0 " 01040006" 3 0 " 01040007" 1 0 " 01040008" 4 2 " 01040009" 4 2 " 01040010" 2 0 " 01040011" 4 1 " 01040013" 2 1 " 01040014" 5 1 " 01040015" 5 1 " 01040016" 2 0 " 01040018" 3 0 " 01040019" 1 0 " 01040020" 4 1 " 01050001" 3 2 " 01050003" 4 1 " 01050005" 3 1 " 01050010" 2 0 " 01050011" 3 1 " 01050014" 5 1 " 01050016" 4 1 " 01050019" 4 1 " 01050021" 4 0 " 01060001" 6 3 " 01060002" 6 2 " 01060003" 4 2 " 01060004" 4 0 " 01060005" 3 1 " 01060006" 9 2 " 01060007" 10 2 " 01060008" 4 0 " 01060009" 4 1 " 01060011" 1 0 " 01060012" 1 0 " 01060013" 3 1 " 01060014" 4 0 " 01060015" 5 2 " 01060017" 5 2 " 01060018" 5 2 " 01060019" 3 1 " 01060020" 3 1 " 01070002" 4 1 " 01070003" 4 1 " 01070005" 6 1 " 01070007" 3 0 " 01070008" 8 1 " 01070009" 4 1 " 01070010" 1 0 " 01070013" 4 1 " 01070015" 4 1 " 01070016" 2 1 " 01070017" 3 1 " 01070018" 4 0 " 01070019" 3 0 " 01070020" 4 1 " 01070021" 2 0 " 01080003" 2 0 " 01080005" 4 1 " 01080008" 3 1 " 01080009" 2 0 " 01080010" 4 0 " 01080011" 2 0 " 01080012" 1 1 " 01080013" 4 1 " 01080017" 4 0 " 01080018" 2 0 " 01080020" 1 0 " 01090001" 2 1 " 01090004" 2 0 end
This is my individual sample where HHID is also available:
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input str15 CASEID str8 HHID str3 V000 " 01010004 02" "01010004" "TR7" " 01010007 02" "01010007" "TR7" " 01010011 02" "01010011" "TR7" " 01010013 03" "01010013" "TR7" " 01010014 02" "01010014" "TR7" " 01010016 02" "01010016" "TR7" " 01010021 02" "01010021" "TR7" " 01020006 01" "01020006" "TR7" " 01030004 01" "01030004" "TR7" " 01030012 02" "01030012" "TR7" " 01040003 02" "01040003" "TR7" " 01040004 02" "01040004" "TR7" " 01040008 02" "01040008" "TR7" " 01040008 04" "01040008" "TR7" " 01040009 02" "01040009" "TR7" " 01040009 03" "01040009" "TR7" " 01040011 02" "01040011" "TR7" " 01040013 02" "01040013" "TR7" " 01040014 02" "01040014" "TR7" " 01040015 05" "01040015" "TR7" " 01040020 02" "01040020" "TR7" " 01050001 01" "01050001" "TR7" " 01050003 02" "01050003" "TR7" " 01050011 03" "01050011" "TR7" " 01050014 02" "01050014" "TR7" " 01050016 02" "01050016" "TR7" " 01060002 02" "01060002" "TR7" " 01060002 04" "01060002" "TR7" " 01060003 02" "01060003" "TR7" " 01060005 02" "01060005" "TR7" " 01060006 03" "01060006" "TR7" " 01060006 07" "01060006" "TR7" " 01060007 05" "01060007" "TR7" " 01060007 06" "01060007" "TR7" " 01060009 01" "01060009" "TR7" " 01060013 02" "01060013" "TR7" " 01060015 02" "01060015" "TR7" " 01060015 03" "01060015" "TR7" " 01060017 02" "01060017" "TR7" " 01060017 04" "01060017" "TR7" " 01060018 01" "01060018" "TR7" " 01060020 02" "01060020" "TR7" " 01070002 02" "01070002" "TR7" " 01070003 02" "01070003" "TR7" " 01070005 02" "01070005" "TR7" " 01070008 02" "01070008" "TR7" " 01070009 02" "01070009" "TR7" " 01070016 02" "01070016" "TR7" " 01070017 03" "01070017" "TR7" " 01070020 02" "01070020" "TR7" " 01080005 02" "01080005" "TR7" " 01080008 02" "01080008" "TR7" " 01080012 01" "01080012" "TR7" " 01080013 03" "01080013" "TR7" " 01090001 02" "01090001" "TR7" " 01090009 02" "01090009" "TR7" " 01090014 01" "01090014" "TR7" " 01090015 02" "01090015" "TR7" " 01090020 03" "01090020" "TR7" " 01100003 02" "01100003" "TR7" " 01100004 04" "01100004" "TR7" " 01100008 02" "01100008" "TR7" " 01100012 03" "01100012" "TR7" " 01100017 02" "01100017" "TR7" " 01100018 02" "01100018" "TR7" " 01110001 02" "01110001" "TR7" " 01110002 01" "01110002" "TR7" " 01110002 02" "01110002" "TR7" " 01110002 03" "01110002" "TR7" " 01110003 03" "01110003" "TR7" " 01110004 02" "01110004" "TR7" " 01110009 02" "01110009" "TR7" " 01110010 01" "01110010" "TR7" " 01110013 02" "01110013" "TR7" " 01110016 01" "01110016" "TR7" " 01110016 02" "01110016" "TR7" " 01110019 02" "01110019" "TR7" " 01110021 02" "01110021" "TR7" " 01120001 03" "01120001" "TR7" " 01120001 05" "01120001" "TR7" " 01120002 02" "01120002" "TR7" " 01120002 03" "01120002" "TR7" " 01120003 02" "01120003" "TR7" " 01120004 02" "01120004" "TR7" " 01120006 02" "01120006" "TR7" " 01120006 03" "01120006" "TR7" " 01120007 02" "01120007" "TR7" " 01120007 03" "01120007" "TR7" " 01120007 05" "01120007" "TR7" " 01120009 01" "01120009" "TR7" " 01120009 03" "01120009" "TR7" " 01120011 02" "01120011" "TR7" " 01120012 02" "01120012" "TR7" " 01120014 02" "01120014" "TR7" " 01120015 02" "01120015" "TR7" " 01120015 03" "01120015" "TR7" " 01120015 04" "01120015" "TR7" " 01120015 05" "01120015" "TR7" " 01120016 02" "01120016" "TR7" " 01120018 02" "01120018" "TR7" end
0 Response to Merging 2 datasets with the same ID number
Post a Comment