BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

Dropping duplicates based on another variable
Dropping duplicates based on another variable

I have a large dataset with patient encounters, of which some patients may have had multiple encounters, and I only want to include their FIRST encounter. I'm trying to figure out how to drop duplicate variables such that for each duplicate, the one with the lowest age is the only one that's included. For example, if I have the following dataset

ID	age
1	4
3	3
2	1
1	5
2	2
3	4
4	2
3	5
4	4

I want to convert it to:

ID	age
1	4
2	1
3	3
4	2

Is there a way to do this using a the duplicates command? Or a different command?

BJ Data Tech Solution

Home / Data Cleaning / Data management / Data Processing / Dropping duplicates based on another variable
Dropping duplicates based on another variable

0 Response to Dropping duplicates based on another variable

Post a Comment

Home / Data Cleaning / Data management / Data Processing / Dropping duplicates based on another variable Dropping duplicates based on another variable

Related Posts with Dropping duplicates based on another variable

0 Response to Dropping duplicates based on another variable