Hello,

I am working on my thesis and was my professor explained that for my dataset a probit analysis is best.
The dependent variable is the presence of logos on wine labels, and the independent variable is price of the product (available in either ordinal or numeric)

I have 4 control variables (region, country, white/red, store) and transformed them into dummies (60 variables).
Some dummy variables have a very low occurance, sometimes only one row.
E.g. a unknown wine region will only occur once and has only one corresponding wine.

When I command:
Probit logo price (+60 dummy control variables)
I get many omitted/perfect predicting findings in the analyses, I suspect partially due to the low N of the dummies.

The goal of these control variables is to exclude other explanatory influences.

Is this the right test/setting to analyse this?

Looking very forward to your replies!