BJ Data Tech Solution

Specializing in data processing, data management implementation plans, data collection tools (electronic and paper-based), data cleaning specifications, data extraction, transformation, and loading, analytical datasets, and data analysis. BJ Data Tech Solutions teaches the design and development of electronic data collection tools using CSPro, and Stata commands for data manipulation. It also sets up data management systems using modern data technologies such as relational databases, C#, PHP, and Android.

Create Risk Prediction Variable from 2 Different Heckman Correction Models

Monday, October 31, 2022 Data Cleaning Data management Data Processing
Hi everyone, I'm wondering if there's a way to create a risk prediction variable from the results of two different Heckman Correct...
Using mca or pca to generate a single variable

10:23 PM Data Cleaning Data management Data Processing
Dear colleagues, I know I can use principal component analysis (PCA) on categorical variables by first coding them as dummies. Multiple corre...
ipdmetan - estimation_command

7:23 PM Data Cleaning Data management Data Processing
I would like to use ipdmetan to do an IPD meta-analysis for studies with treatment response yes/no as the outcome. The sample code provided ...
Testing dominance between Lorenz curves

4:23 PM Data Cleaning Data management Data Processing
Hello Stata Experts, I would like to request information on the Stata code to test for dominance between two or more Lorenz curves. I am using Stat...
Conducting multinomial regression on multiply imputed variables for subset data

2:23 PM Data Cleaning Data management Data Processing
Hi, is there a way to conduct multinomial regression for a weighted subsample dataset? E.g., I used mi estimate: mlogit DV IV1 IV2 IV2[i...
Date values are different

2:23 AM Data Cleaning Data management Data Processing
Hello, I have values for the dates given in LB21B. I used the code egen LB21B_12=convertCMC(LB21B). When I apply %td to both values, none o...
A conceptual question about when should I add fixed effect and cluster the fixed effect?

2:23 AM Data Cleaning Data management Data Processing
Suppose I add a time-fixed effect to a panel data regression that I want to estimate using OLS. My question is conceptually when should I al...
How can I get several variables' mean values at the same

2:23 AM Data Cleaning Data management Data Processing
I want to create a vector containing several variables' mean values. But the summarize command only returns the last variable's mean...
Interpreting coefficients (percentage points vs percentage)

12:23 AM Data Cleaning Data management Data Processing
I was reading a paper that ran a difference-in-differences regression and the coefficient value was -0.036. The dependent variable is vote s...
Interpreting coefficients (percentage vs percentage points)

12:23 AM Data Cleaning Data management Data Processing
I was reading a paper that ran a difference-in-differences regression and the coefficient value was -0.036. The dependent variable is vote s...
Old bug

Sunday, October 30, 2022 Data Cleaning Data management Data Processing
Dear All, I hope someone following Stata for many years can help me with remembering: there used to be a problem (and I hope I can recolle...
What is a GROUP variable in difference in differences analysis?

2:23 PM Data Cleaning Data management Data Processing
Query: What is a GROUP variable in difference in differences analysis? Study : Newspaper articles from 1985-2017 were categorized as eith...
Testing for model specification error in fractional regression

7:23 AM Data Cleaning Data management Data Processing
Hello, I employed the fractional regression model proposed by Wooldridge and Papke to estimate the dependent variable (a proportional variable...
VAR and Impulse response function

3:23 AM Data Cleaning Data management Data Processing
Dear Statalists, I am working on a project where I need to 1. estimate the reduced form VAR to predict the residuals ( the shock/ the inn...
Separating a string variable into separate variables

12:23 AM Data Cleaning Data management Data Processing
I have a string variable CODEX which has the underlying cause of death coded first, and any secondary causes of death following. The cause o...
Generating Variable That Depicts Increase

Saturday, October 29, 2022 Data Cleaning Data management Data Processing
My dataset comes from a survey with 5 waves that has a general score (variable GScore). I am trying to do the following: 1) I am trying to...
SEM modeling (path model)

5:23 PM Data Cleaning Data management Data Processing
Dear all, I hope you are doing well. I'm working on my paper and I would like to study the mediating role of M on the relationship be...
sdid in case of unbalanced panel

3:23 PM Data Cleaning Data management Data Processing
I need to run a synthetic difference-in-differences regression. Therefore, I need balanced panel data. But, as you can see, my sample year is fr...
Calculate age from other occurrences and attribute it to specific observation

10:23 AM Data Cleaning Data management Data Processing
Hi, I'm working with consecutive censuses. I can follow the same individuals through several decades. However, age is not always repor...
Creating Graphs with Regression Output

10:23 AM Data Cleaning Data management Data Processing
I am attempting to determine whether hours worked and wages have changed over time differently for different generations. So I am running the...
Mediation Analysis - Stata sem and medsem commands with fixed effects models

7:23 AM Data Cleaning Data management Data Processing
Hello everyone, I am trying to perform a mediation analysis, in which the variables of concern are as follows: Dependent Variable (Y) -...
Portfolio construction

1:23 AM Data Cleaning Data management Data Processing
Hello, I have data on individual investors' trading behavior. Below you see an extract of my data. My data is sorted by investor and date....
Reference period in new DID estimators

Friday, October 28, 2022 Data Cleaning Data management Data Processing
Hello all! I am running a few checks using the new heterogeneity-robust diff-in-diff estimators, specifically: csdid, did_imputation, and di...
Matching data over time?

5:23 PM Data Cleaning Data management Data Processing
Hello, I have a sample from the Current Population Survey (CPS). My data is monthly from 2015-present for the variables empsame , empstat ...
Creating new variable (subtracting) same individual

3:23 AM Data Cleaning Data management Data Processing
Hi, I am using an individual-level dataset across 4 years to try and calculate the mobility out of low wage between 2020 and 2017. I created ...
vlookup generates same id

1:23 AM Data Cleaning Data management Data Processing
I read the thread on Complicated Vlookup-type problem in a large dataset. I am facing a similar problem with my research. I was able to g...
Cox model with an unbalanced panel

Thursday, October 27, 2022 Data Cleaning Data management Data Processing
Hello everyone, I'm running a Cox model with an unbalanced panel from 2015-2021 to measure the factors that influence the survival of ...
When including interacted fixed effects, do I also have to include each fixed effect separately in the model?

10:23 PM Data Cleaning Data management Data Processing
When including interacted fixed effects, do I also have to include each fixed effect separately in the model? For example, if I want to in...
Difference in Difference analysis - how to create a group variable

6:23 PM Data Cleaning Data management Data Processing
I think my question is pretty straightforward. I'd like to know what my "Group" is for a difference in differences analysis. ...
My Lorenz curves are coming out as straight lines.

5:23 PM Data Cleaning Data management Data Processing
Hello Stata Experts, I am trying to create Lorenz curves for a variable with four categories. I have done this before but this time my cur...
Adding and simplifying observations; from daily to weekly

1:23 AM Data Cleaning Data management Data Processing
Hello to all. I'm kind of new to using Stata, and I'm having trouble figuring out how to merge observations. I have binary values for sever...
Interpretation of "/logs" in Stata17 output using mestreg

12:23 AM Data Cleaning Data management Data Processing
Dear Stata users, Can someone please kindly explain in simple terms what the "/logs" output refers to in the following code? I...
Plotting a bar graph with two y-axes

Wednesday, October 26, 2022 Data Cleaning Data management Data Processing
Dear colleagues, I am trying to create a bar graph with two y-axes from my data and code below and am getting the error "invalid span"....
Marketing Communication Mix

6:33 PM Research Methodology
Defined as a “dialogue between business unit and its present and potential customers that takes place during pre-selling, selling and post-s...
eivreg in rolling window setting (estimates not stored properly)

5:23 PM Data Cleaning Data management Data Processing
Dear Statalist, I might be shooting my shot with this post but I want to implement the eivreg command in a rolling-windows setting using ...
Combining surveys with distinct analytical weights

4:51 PM Data Cleaning Data management Data Processing
Hi. I have a dataset which combines 14 household surveys in 14 countries. Each survey was conducted in different years and each survey has ...
regress with panel data

12:23 AM Data Cleaning Data management Data Processing
Code: * Example generated by -dataex-. To install: ssc install dataex clear input str7 ym byte hs_c float imports str3(importer exporter) ...
asroprobit may estimate the variables that are not defined in the code?

12:23 AM Data Cleaning Data management Data Processing
Hello, When I followed the tutorial to learn the asroprobit command in the Stata software, I typed the following code: use http://www.st...
Tabulating estimates of endogenous variables in dynamic forecast models

Tuesday, October 25, 2022 Data Cleaning Data management Data Processing
Hi, I am running some dynamic forecast models, each of which contains 7 endogenous variables. Since there are missing values for some of th...
[LASSO] Collinear covariates: Suggested addition to the documentation

6:23 PM Data Cleaning Data management Data Processing
I would like to suggest an addition to the documentation for collinear covariates in LASSO models. The Summary section currently reads as f...
How to weight data using CPS weights in STATA?

4:44 PM Data Cleaning Data management Data Processing
Hello, I have CPS monthly data 2015-present. I want to calculate unemployment rate. I am using empstat and labforce variables from the C...
Stata dataset to latex table

2:50 PM Data Cleaning Data management Data Processing
I have a dataset in Stata that looks like this:
quintile  mean    t_stat
1         0.0100  4.100
2         0.0200  4.200
3         0.0300  4.300
4         0.0400  4.400
5         0.0500  ...
Help with reshaping the data

2:50 PM Data Cleaning Data management Data Processing
Dear community, I am trying to bring some data into a different format. The data is World Uncertainty Index, which is publicly available f...
Show the working directory and save it in a macro

3:23 AM Data Cleaning Data management Data Processing
Hi, I can use the command cd or pwd to show the working directory. But how can I save the returned results in a macro? If I can do this, it wi...
Adding % to stacked bar chart

1:34 AM Data Cleaning Data management Data Processing
Dear Statalisters, is there a way to add %-signs (by code) to my stacked bar chart? Here is the code for data preparation and graph: ...
Cleaning string variable

Monday, October 24, 2022 Data Cleaning Data management Data Processing
Dear all, I have a variable called place_birth in my dataset. Some of the locations weren't recorded properly. place_birth Feucher...
Error merging master data with using data keeping the merge key as city and statefip

8:23 PM Data Cleaning Data management Data Processing
I need to convert my master data to county level. For that, my master data has statefip and city name. In my using data I have city, s...




About Me

Mtenga Baltazar

© BJ Data Tech Solution