Hello,

I am working on a dataset with almost 450,000 cases with several columns of diagnostic codes(DX), labeled DX1-DX30. Each row of Dx per case represents clinical diagnostic codes per patient. A case may have up to 30 obs/visits with different or similar diagnostic codes written in ICD-9 format (International Classification of Disease codes); these codes are string variables in the form "415.11". A patient could have 30 rows of different codes representing different or the same diagnoses.

I want to be able to count the number of specific diagnoses (for instance number of code 415.11 or 415.13) per patient for all the Dx and create a new variable containing these numbers.

I am not using the right commands as the results are not similar to what I am getting on SPSS (I am very familiar with the process, using SPSS). Please I need guidance in the right direction.

Thanks.

Olowu