BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

Dropping duplicate observations conditioned on another variable
Dropping duplicate observations conditioned on another variable

Hi,

I am using Stata 16.1, and have the following (general) issue. I want to drop duplicate observations of one variable (educ) from my dataset but conditioned on another variable (year).
I've found an online dataset, such that it may be easier to talk about. First up I apologize that I am new Stata.

My problem: I want to drop all duplicates of educ for each corresponding year (72 & 74). This means I do not want educ's value to enter for year 72 if it's value has already been observed.
Here is what I've tried do to:

use http://fmwww.bc.edu/ec-p/data/wooldridge/fertil1
keep if year == 72 | year == 74
drop if educ == educ[_n-1] & (year == 72 | year == 74)

The problem:
I am not dropping all duplicates of educ

Any suggestions would be appreciated

BJ Data Tech Solution

Home / Data Cleaning / Data management / Data Processing / Dropping duplicate observations conditioned on another variable
Dropping duplicate observations conditioned on another variable

0 Response to Dropping duplicate observations conditioned on another variable

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Dropping duplicate observations conditioned on another variable Dropping duplicate observations conditioned on another variable

Related Posts with Dropping duplicate observations conditioned on another variable

0 Response to Dropping duplicate observations conditioned on another variable