Dear all,
First and foremost, a happy new year to all of you!
In my current research project, I am analyzing the use of language on Twitter and its effect on a financing event.
My data is structured as follows:
1. For all observations, I have Twitter language values for a certain period.
2. This period starts with the first financing event and ends either with the second financing event or, if no second event occurred (i.e., the observation is right-censored), with the last recorded tweet.
3. What do I want to find out? Does the language used on Twitter affect the probability of a second financing event, or shorten the time until it occurs?
4. I set up my data using the following code:
stset TimeToSecond, failure(SecondFunding)
TimeToSecond is the number of days between the first and the second event (or the last tweet if no second event happened). SecondFunding indicates the financing event and is coded 0/1 (1 = happened, 0 = did not happen in the considered period).
5. I then estimate the effects with a Cox model:
stcox languageVariables* controlVariables*
What's my problem?
The results I get are plausible. Nevertheless, I suspect that the proportional hazards assumption does not hold for my data. To test the assumption, I re-estimated my models with the independent variables interacted with my time variable (as suggested in a textbook):
stcox languageVariables* controlVariables*, tvc(languageVariables* controlVariables*) texp(TimeToSecond)
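A side note on the command above, offered as a sketch rather than a correction: Stata documents texp() as a function of the analysis time _t, with texp(_t) as the default. Writing the interaction against _t avoids any ambiguity about whether each subject's own exit time is used instead of the current failure time. The varlists are the placeholders from the post.

```stata
* Sketch only: same model, with the time interaction written against
* analysis time _t (texp(_t) is also the default if texp() is omitted).
stcox languageVariables* controlVariables*, ///
    tvc(languageVariables* controlVariables*) texp(_t)
```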
The result of this estimation is that some of the interactions with my control variables are significant, which (according to the book) is a sign of non-proportional hazards.
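For comparison, a commonly used alternative check is the test based on scaled Schoenfeld residuals, which Stata provides as estat phtest after a plain stcox fit (without tvc()). The following is a minimal sketch, assuming the data are already stset as above; control1 is a hypothetical variable name used only for illustration.

```stata
* Sketch only: languageVariables* and controlVariables* are the
* placeholder varlists from the post.
stcox languageVariables* controlVariables*

* Global test plus one test per covariate (scaled Schoenfeld residuals)
estat phtest, detail

* Optional visual check for a single covariate (control1 is hypothetical)
estat phtest, plot(control1)
```

A roughly flat smoothed line in the plot is consistent with proportional hazards for that covariate; a clear trend suggests a time-dependent effect.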
My question would now be:
Is it actually a problem if only some of the control variable interactions are significant, while the interactions with the explanatory variables (languageVariables*) are not? And what alternatives are there for handling this correctly (i.e., something like "disproportional hazards")?
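To make the question concrete, two options that are often discussed for this situation are stratifying on the offending control or keeping a time interaction only for that control. The sketch below assumes hypothetical variables control1 (the one that appears to violate the assumption), control2, and control3; the choice of ln(_t) as the functional form is also an assumption, not something derived from my data.

```stata
* Option A: stratify on the offending control. Each stratum gets its own
* baseline hazard; control1 must be categorical (or binned) and its own
* coefficient is no longer estimated.
stcox languageVariables* control2 control3, strata(control1)

* Option B: keep all covariates, but let only control1 have an effect
* that changes with (log) analysis time.
stcox languageVariables* controlVariables*, tvc(control1) texp(ln(_t))
```

In both variants the coefficients on languageVariables* are still interpreted as constant-over-time effects on the hazard.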
Best regards and stay healthy