I am currently working on a research project and I have a dataset of bankrupt and non-bankrupt European companies.
First of all, before I can conduct this research I have to think about my methodology how I will evaluate the hypothesizes.
My knowledge in stata is still limited, any help will be highly appreciated.
I need to construct a default predication model using a number of variables.
I use R&D as independent variable and some control variables (age, size, leverage, liquidity, Z-score, industry, and acquisitions).
The model I use is the logit model.
My first basic-hypothesis is: R&D spending has a positive impact on the prediction of bankruptcy because it improves the financial performance of a firm.
The regression specification that I use is: 𝑃r (failure = 1) = 𝛽0 + 𝛽1X1 + 𝛽2𝑋2 + 𝛽3𝑋3 + 𝛽4𝑋4 + 𝛽5X5 + 𝛽6𝑋6 + 𝛽7𝑋7 + errorterm)
- X1 = R&D
- X2 = Leverage
- X3 = Size
- X4 = Liquidity
- X5 = Z-score
- X6 = age
- X7 = industry
-
The regression specification that I use is: 𝑃r (failure = 1) = 𝛽0 + 𝛽1X1 + 𝛽2𝑋2 + 𝛽3𝑋3 + 𝛽4𝑋4 + 𝛽5X5 + 𝛽6𝑋6 + 𝛽7𝑋7 + errorterm)
- X1 = R&D
- X2 = Size
- X3 = Age
- X4 = Liquidity
- X5 = Z-score
- X6 = leverage
- X7 = industry
By not controlling size and age, I can deduce whether the prediction model improves by looking at the significance of the variables and the adjusted R squared of the model. Here I'm not sure yet, can someone confirm please?
The final hypothesis: There is a non-linear U-shaped relationship between R&D spending and the probability of bankruptcy.
Here I include a quadratic term and check whether the quadratic term of R&D is significant. Is this a correct method? In this hypothesis age and size will be control variables as original.
The regression specification that I use is: 𝑃r (failure = 1) = 𝛽0 + 𝛽1X1 + 𝛽2𝑋2 + 𝛽3𝑋3 + 𝛽4𝑋4 + 𝛽5X5 + 𝛽6𝑋6 + 𝛽7𝑋7 + 𝛽8𝑋8 + errorterm)
- X1 = R&D
- X2 = R&D2
- X3 = Leverage
- X4 = Size
- X5 = Liquidity
- X6 = Z-score
- X7 = age
- X8 = industry
Kind regards,
Chun H
0 Response to How to evaluate the following hypothesizes?
Post a Comment