Hello everyone,

I am writing a thesis in which I will analyze the application behaviour of top-performing, low-income students.

In my main dataset, I have the postal codes of the applicants, but no information regarding family income. I have found a dataset that links each FSA (Forward Sortation Area) to its average family income. I have also found a dataset that gives the FSA for every postal code.

What I need to do now is recode each postal code to its assigned FSA, and then use the average income data to recode each FSA to a family income level.
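
To make the intended result concrete, here is a minimal sketch of the two-step lookup in Python. It is only an illustration under assumptions: the names `postal_to_fsa`, `fsa_to_income`, `normalize`, and `attach_income` are hypothetical, and the postal codes and income figures are made-up placeholders, not values from my datasets.

```python
# Hypothetical lookup tables. In practice these would be loaded from the two
# reference datasets (postal code -> FSA, and FSA -> average family income).
# All values below are invented placeholders.
postal_to_fsa = {
    "K1A0B1": "K1A",
    "M5V2T6": "M5V",
    "H2X1Y4": "H2X",
}
fsa_to_income = {
    "K1A": 98000,
    "M5V": 112000,
}


def normalize(postal_code):
    """Remove spaces and upper-case so that 'k1a 0b1' matches 'K1A0B1'."""
    return postal_code.replace(" ", "").upper()


def attach_income(applicants):
    """Return a copy of each applicant row with 'fsa' and 'avg_family_income' added.

    Rows whose postal code or FSA has no match get None instead of raising an error,
    so unmatched observations can be counted and inspected afterwards.
    """
    result = []
    for row in applicants:
        fsa = postal_to_fsa.get(normalize(row["postal_code"]))
        income = fsa_to_income.get(fsa)
        result.append({**row, "fsa": fsa, "avg_family_income": income})
    return result


# Hypothetical applicant records standing in for the main dataset.
applicants = [
    {"id": 1, "postal_code": "k1a 0b1"},
    {"id": 2, "postal_code": "M5V 2T6"},
    {"id": 3, "postal_code": "H2X 1Y4"},  # FSA known, but no income on file
    {"id": 4, "postal_code": "V6B 4Y8"},  # postal code missing from the lookup
]
linked = attach_income(applicants)
```

Each lookup is done once per observation against a table, so the same logic should apply to a million rows without any manual recoding. In a statistics package, I assume the equivalent would be two merges (joins) on the postal code and then on the FSA.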

If only a limited number of values needed to be switched, I feel I could do this manually through a series of recodings. However, since there are close to a million observations, I am trying to understand how this could be done.

Any help would be greatly appreciated.