Stata has the best data management of a single data set (a single data frame) that I have come across. Is anyone willing to give a brief description of the data management capabilities in Python (propably meaning pandas and numpy)? For instance, Stata has flexible and convenient variations of the -egen- command for grouped data. Another example is Stata's interpretation of if/else, it adapts to the object we use it on (a simple scalar vs a vector, with a vector if/else automatically becomes vectorised and applied to each observation separately).
Do we have similar data management capabilities within Python when we send a small program from Stata to Python?
Related Posts with Stata vs Python for data management
Seemingly unrelated regression with panel data fixed effectsI have a panel data set with id=52 and time variable year, 2000 to 2018 (988 observations). I would …
'type mismatch' error when using 'fp' (fractional polynomial tool) with 'nbreg'Hello, I am getting a 'type mismatch' error when running an nbreg regression with the 'fp' command,…
qregplot A module for plotting quantile regression CoefficientsDear all, Thanks to Prof. Baum, a new command "qregplot" is available from SSC. This new command is…
Summarizing first row data within subgroups conditioned on IDHello! I want to accumulate values from the first occurrences or the first row given a specific rou…
Difference in Difference Analysis Using Panel and Time Series DataI am trying to conduct difference in difference analysis. I want to look at whether bank holding com…
Subscribe to:
Post Comments (Atom)
0 Response to Stata vs Python for data management
Post a Comment