Hi,
I have a data set on mortgage lending for single-family homes with a total of 10,000 observations. My dependent variable is default, a binary variable equal to 1 if the borrower's payment was 90+ days late and 0 otherwise. I also have a list of x variables, such as adjustable-rate mortgage, refinance, etc.
My question is: how can I split my observations into two groups, training and testing, with 6,000 observations randomly assigned to the training data set and the remaining 4,000 to the testing data set? I need to tabulate my dependent variable default for both the training and testing data sets.
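To make the goal concrete, here is a minimal sketch of the logic I have in mind, written in plain Python rather than Stata purely for illustration. The function name `split_train_test`, the seed values, and the simulated `default` column (a placeholder with a roughly 10% default rate, not my actual data) are all assumptions made for the example.

```python
import random
from collections import Counter

def split_train_test(n_obs, n_train, seed=12345):
    """Label each observation 'train' or 'test', with exactly n_train in 'train'.

    Hypothetical helper: draws n_train row indices without replacement.
    """
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train_idx = set(rng.sample(range(n_obs), n_train))
    return ["train" if i in train_idx else "test" for i in range(n_obs)]

# Placeholder outcome: simulated 0/1 values standing in for the real default variable.
sim = random.Random(1)
default = [1 if sim.random() < 0.10 else 0 for _ in range(10_000)]

# Randomly assign exactly 6,000 observations to training; the rest go to testing.
group = split_train_test(len(default), 6_000)

# Two-way tabulation of default by group.
tab = Counter(zip(group, default))
for g in ("train", "test"):
    print(g, "default=0:", tab[(g, 0)], "default=1:", tab[(g, 1)])
```

Drawing the indices without replacement gives exactly 6,000 training observations, whereas flagging each row with probability 0.6 would only give about 6,000.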