Hi all,
Currently I am working with a mortgage portfolio on which I would like to perform a logistic regression.
The portfolio consists of 14 datasets (per quarter) and each dataset contains all the loans at that given time with the related characteristics/variables. Since the data in the dataset is captured per loan type and not per loan, loans with multiple loan types (f.e. 3or 5) are overrepresented in the data in relation to loans that only consist of 1 loan type. (a loan with 3 loan types has 3 observations in the dataset and a loan 1 with 1 loan type has 1 observation). Secondly, there is an overrepresentation in the date due to the age of a specific loan. Older loans could be observed in all 14 datasets, but newer loans occur in less datasets.
I merged all the datasets into 1 and reshaped the dataset from long to wide to overcome this form of over/under representation. Ideally, I would like to have 1 observation per loan but currently each observation(row) contains the quarterly information of a loan and has all the loan types in that observation (wide).
As reshaping of the data led to many variables, my dependent variables (behind on payment) is divided into many variables and therefore I am not able to perform the logit regression.
Anny suggestion on how I could tackle these issues and correct the dataset in order to perform a logistic regression?
Thank you in advance,
Django
Related Posts with Logistic Regression Model on mortgage portfolio
Obtain coefficients for cross-sectional regression by id monthlyDear Statalists, I like to run this cross-sectional regression for my panel data and obtain only th…
Tobit Model Decomposition (Probability of adoption for non-adopter group and Intensity of Participation for adopter group)I am working on adoption study with non-adopter category and non-adopter category. How can I decompo…
regexmHi STATALIS I want to select patients with "IAM" in the studydescription_all variable . The "studyd…
How to do two part model when there is one independent variable is binary and endogenous?I want to use the two-part model to estimate food expenditure (Y), in the Xs, one variable is endoge…
Regarding GMMHi, There are multiple commands (xtabond , xtabond2, xtdpdsys) for GMM in Stata. What is less clear…
Subscribe to:
Post Comments (Atom)
0 Response to Logistic Regression Model on mortgage portfolio
Post a Comment