Dear Stata Community,

I need some advice on geocoding:

My sample has more than a million observations, consisting of pairs of US addresses. For each address I have the street address, ZIP code, and state.

I want to compute two things: first, the distance between the two addresses in each pair, and second, the census tract in which each address is located.

I have been looking at the available commands. As I understand it, I first need to obtain latitude/longitude coordinates for each address, and I can then use the geodist command to compute the distance between the two coordinates in each pair.
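
To make the distance step concrete, here is a minimal Python sketch of the great-circle (haversine) calculation I have in mind once the coordinates are available. It is only an illustration under stated assumptions: the function name haversine_km and the mean Earth radius of 6371 km are my own choices, and it treats the Earth as a sphere, so the results may differ slightly from an ellipsoidal calculation.

```python
import math

# Assumption: spherical Earth with mean radius 6371 km (illustrative value).
EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in decimal degrees.

    Hypothetical helper for illustration; not part of any Stata command.
    """
    phi1 = math.radians(lat1)
    phi2 = math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)

    # Haversine formula: a is the squared half-chord length between the points.
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)

    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(a)))
```

For each observation, the function would be called with the two geocoded coordinate pairs, for example haversine_km(lat_a, lon_a, lat_b, lon_b), where the four variable names are placeholders for whatever the geocoding step returns.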

Geocoding (converting a street address into geographic coordinates) is usually provided by third-party vendors, such as Google, MapQuest, HERE, and OpenCage.

Given the size of my sample, I am not sure whether there are more cost-effective alternatives. Any guidance would be greatly appreciated.

Thank you.

Best wishes,
Manish