I used vce(cluster) to account for clustering within 9 groups in a set of nested logistic regression models. Stata doesn't want to give me Wald chi2 stats because I have too many variables in the model in relation to # of clusters, and used up my df. Stata also said both Wald and lrtest would be misleading. So, what *wouldn't* be misleading to report to describe fit and compare fit among nested models? Are pseudo-R square, AIC, BIC, and log likelihood #s still meaningful to interpret? Or are there other stats I don't know about?

Thanks in advance!