Hi everyone,

I have individual-level data for 20 regions over 18 years. The goal is to estimate the effect of x on y (a binary variable). However, there is no individual identifier. The data look like this:
region  year   x  y  age  gender
     1  1993  20  1   20       1
     1  1993  26  0   25       1
     1  1993  12  1   40       1
     1  1994  13  1   21       0
     1  1994  20  1   30       1
     2  1993  25  0   25       1
The same individuals are not tracked from year to year. My questions are:
  • Is this still panel data even though I cannot identify individuals across years?
  • I ran the following regression: logit y x gender age i.region i.year, vce(cluster region). Is this the correct way to include region and year fixed effects?
  • Should I instead declare the data as a panel using xtset and then run the xtreg command? I read in a different post that this might not be the right approach when there are multiple observations within a given region and year.
  • What if I want to run a non-parametric regression on this? Does it matter that the dependent variable and some of the controls are binary?
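
To make the comparison concrete, here is a rough sketch of the two specifications I am weighing. This is only my guess at how each would be written, not something I have confirmed is appropriate for these data. Option B assumes that xtset can be given region alone (without a time variable) because each region-year contains many observations, and it uses xtlogit rather than xtreg because y is binary.

```stata
* Option A: pooled logit with region and year dummies,
* standard errors clustered by region
logit y x age i.gender i.region i.year, vce(cluster region)

* Option B (assumption): declare region as the grouping variable only,
* since region-year pairs are not unique, then fit a conditional
* fixed-effects logit with year dummies
xtset region
xtlogit y x age i.gender i.year, fe
```

In both cases the fixed effects are at the region level, not the individual level, since individuals cannot be linked across years.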