Hello

Am a novice Stata user/ statistician. I have created a two way scatter of aggregated travel distances (y axis) and passenger counts (x axis) for suburbs of Cape Town. This I created from aggregated data bins from the original dataset using a collapse function (I think this was the correct method).I have a regression line. The data has non-normal distribution. I am trying to identify the 'worst outliers' for travel distance per capita. Is there a way to represent this in a numerical way? I can 'see' them on the graph, but for accuracy I would like to isolate the top 5 locations and describe them with more detail.

Many thanks. Mark
Array