BJ Data Tech Solution

Specializing in data processing, data management implementation plans, electronic and paper-based data collection tools, data cleaning specifications, data extraction, transformation, and loading, analytical datasets, and data analysis. BJ Data Tech Solution teaches the design and development of electronic data collection tools using CSPro, and Stata commands for data manipulation. It also sets up data management systems using modern data technologies such as relational databases, C#, PHP, and Android.

Logistic regression, propensity score matching, or IPTW?

Friday, July 31, 2020 Data Cleaning Data management Data Processing
Hello, I would like to compare the "adjusted" mortality after surgery between two cohorts with very different sample sizes. The ...
Why can't we cluster on anything we like?

8:25 PM Data Cleaning Data management Data Processing
I often get error messages indicating that Stata doesn't like the variable that I have chosen to cluster on. Consider the following mode...
Writing a Loop

6:25 PM Data Cleaning Data management Data Processing
Dear Statalist Users, I have multiple txt files and I want to convert them into stata files so that I can append the data. This is the m...
Problem with reaching 1:n ratio with psmatch2 !

6:25 PM Data Cleaning Data management Data Processing
Hi everyone, I am trying to perform a 1:4 propensity score matching using the command psmatch2. After I run the code, the ratio of individ...
stata ttest significance

6:25 PM Data Cleaning Data management Data Processing
Hi everyone, I have the following dataset Code: * Example generated by -dataex-. To install: ssc install dataex clear input int NEWID...
Best command for cross-classified models

6:25 PM Data Cleaning Data management Data Processing
I'd like to fit a model with student and teacher random effects. The ID variables for students and teachers are stdntid and tchid. There...
Asdoc "conformability error"

5:25 PM Data Cleaning Data management Data Processing
Dear Statalist, I am getting an error message when running the asdoc command. The first code works fine, but the second code returns: Cod...
mixed vs. xtreg, re

5:25 PM Data Cleaning Data management Data Processing
Here are four ways to estimate the same model (I think): Code: xtreg absent ib(2).classtype ib("k").gradenum i.schid, re xtreg...
Writing a Loop

9:25 AM Data Cleaning Data management Data Processing
Dear Statalist Users, I have multiple txt files and I want to convert them into stata files so that I can append the data. This is the m...
Operations over lags

8:25 AM Data Cleaning Data management Data Processing
Hello, I need to do some operations over all the lags up to the current value, but the loop I wrote takes forever, could you please tell m...
Force Merge

8:25 AM Data Cleaning Data management Data Processing
Hi, I am trying to merge household-level data for two different rounds but I keep on getting an error. I have attached the STATA code and th...
Interpreting interaction

7:25 AM Data Cleaning Data management Data Processing
Dear Statalist, I am running the following regression with an interaction between a categorical education variable and continuous variable...
Points of mass in running variable - RD

7:25 AM Data Cleaning Data management Data Processing
Dear all, I am struggling with my RD specification. I am using the rdrobust package from SSC and when I try to compute the optimal bandwid...
ppmlhdfe with disaggregated data: exporter, importer, sector fixed effects

7:25 AM Data Cleaning Data management Data Processing
Hello, I am running a gravity model. My data looks something like this Code: use "http://fmwww.bc.edu/RePEc/bocode/e/EXAMPLE_TRADE...
Fine and Gray with Censoring

6:25 AM Data Cleaning Data management Data Processing
When I produce cumulative incidence curves (CIC) for the Fine and Gray model using stcrreg and stcurve, I noticed that the estimated plate...
Interactions between sex and country of birth : do I also have to include interactions between control variables and sex?

3:25 AM Data Cleaning Data management Data Processing
Dear STATA community, This is my first post and I hope that you can help me with my problem. In a nutshell, I run a regression to estimat...
Complete Time Series

3:25 AM Data Cleaning Data management Data Processing
Hello everyone, I'm wondering how to 'insert' empty values in my panel data. My data looks as follows: For each b_id...
Creating a new variable differentiated by gender

3:25 AM Data Cleaning Data management Data Processing
Hi Statalist. I want to be able to test if there is a difference in the effect of level of education by gender. Here's my draft code. ...
Using an interaction between a categorical and a continuous interaction where the base category is highly informative

3:25 AM Data Cleaning Data management Data Processing
Dear Statalist, Sorry if my question is too general for this forum. I hope it is okay. I am looking at the impact of trade on the return...
ivprobit and cmp ivprobit

2:25 AM Data Cleaning Data management Data Processing
Hi. I am using an individual-based survey and I am trying to estimate the impact of migration and remittances on child education in Egypt. T...
Merging datasets code issue

1:25 AM Data Cleaning Data management Data Processing
Hi all! I can't figure out a certain line of code. I'd like to merge two data sets. The master dataset looks as follows: ...
Replacing "NA" with missing

12:25 AM Data Cleaning Data management Data Processing
I'm super new to using Stata and could really use some help! I have imported a csv file from R into Stata and am not sure if there is a ...
eintreg

Thursday, July 30, 2020 Data Cleaning Data management Data Processing
Hi, I am using eintreg for interval regression with sample selection. I would like to know the number of points used to do the numerical in...
p values

10:25 PM Data Cleaning Data management Data Processing
Hi, We are comparing the clinical characteristics of infants who received Octreotide for chylothorax versus those who did not. I used Wilco...
how to backup files better

9:25 PM Data Cleaning Data management Data Processing
Hello! When I use Stata to clean data, I always want to keep track of every step I take so that I can use backup file to get every dta whi...
Centering on Mean - Interaction of 2 continuous variables using the first difference estimator

9:25 PM Data Cleaning Data management Data Processing
Hi, My research project is looking at the impact of financial development (bank development - proxied by private credit & stock market ...
Dropping observations based on multiple conditions

8:25 PM Data Cleaning Data management Data Processing
Hi Everyone, Thanks for taking the time to read my query. I currently am cleaning a very big dataset (52 variables, 82284 observations) f...
define after a certain range

8:25 PM Data Cleaning Data management Data Processing
Hello, I want to define a variable conditioned after a certain range of another variable. For instance, each id has a binary variable ...
Error in mi impute chained (logit)

7:25 PM Data Cleaning Data management Data Processing
Hello, I am trying to run an mi impute chained using (logit) and I get the following error message when I try to impute medications which ...
Obtaining the Spatially Weighted Regressors Using spregress

6:25 PM Data Cleaning Data management Data Processing
Hi Everyone: I'm looking for a simple way to obtain the regressors W*X when using spregress, where W is the chosen spatial weighting m...
Simple sum within the same variable

5:25 PM Data Cleaning Data management Data Processing
hello, this may be a very simple question but is it possible to create a variable that simply sums consecutive numbers within the same varia...
Using GEE for repeated cross sectional, nested data

12:25 PM Data Cleaning Data management Data Processing
I'm conducting an impact evaluation for a school level program and interested in evaluating if there is a dose response relationship bet...
Change degrees of freedom after estimating an OLS regression with sem

12:25 PM Data Cleaning Data management Data Processing
Dear Stata experts, I recently learned how to change the degrees of freedom in the regress command using the dof() option (Thanks Trent Mi...
Help using loop to replace values in several variables with conditions

11:25 AM Data Cleaning Data management Data Processing
Hello, Can anyone suggest a way to use a loop to replace values in several variables with conditions? Here is the situation. I created sev...
Data in Stata format for Card & Krueger

11:25 AM Data Cleaning Data management Data Processing
Does anyone know where I can get the data for the famous Card & Krueger (1994) AER paper on minimum wages, "Minimum Wages and Empl...
Coefplot with note() option

8:25 AM Data Cleaning Data management Data Processing
Dear Ladies and Gentlemen, I would like to have a longer note comment under my graphs stretching over two lines. I run the following comm...
Create Dataset in Stata using a Loop

8:25 AM Data Cleaning Data management Data Processing
I'm having trouble finding guidance on this. I want to create a dataset that has a variable called "location" and contains the...
meta regression missing data

7:25 AM Data Cleaning Data management Data Processing
Hi everybody, I have to do a meta-analysis. The aim of this meta-analysis (37 studies) is the prevalence of bipolar disorder in an ASD (AU...
Transforming survey results to numerical codes appropriately (Encoding help)

6:25 AM Data Cleaning Data management Data Processing
Stata and Stata Forum Beginner here. Situation: Using Limesurvey data for a health-related QOL study. This has questions where the respons...
ologit in favour of parsimoniousness despite violated parallel lines?

5:25 AM Data Cleaning Data management Data Processing
Dear Statalists, I could use your input on the following 😊 My case: I am testing the influence of a factor variable (4 different count...
Unexpected coefficients in GMM sys

5:25 AM Data Cleaning Data management Data Processing
Hello everyone, I'm new with GMM, I'm using this estimator because I found it's the most suitable for my case. I have a panel d...
Need some help on a loop

4:25 AM Data Cleaning Data management Data Processing
Hello dear all. I merged two datasets. The first dataset contained the listing of households' members. In this dataset, I had the ag...
Unable to read .dta file through do file - works fine otherwise

2:25 AM Data Cleaning Data management Data Processing
Hello! I am working on a do file alongside a colleague but its commands will not load successfully and it generates an r(601) error. The...
Add a vertical line?

2:25 AM Data Cleaning Data management Data Processing
Dear all, I generate a data set and estimate an interaction model as follows. Code: clear set seed 123 set obs 1000 gen x = runiform() ge...
Import multiple sas7bdat files into Stata with a loop?

Wednesday, July 29, 2020 Data Cleaning Data management Data Processing
Hello I have 524 sas files I would like to import as dta files, ex. bef201012.sas7bdat bef201112.sas7bdat bef201212.sas7bdat faik2010...
test for significant difference in number of children between 2 groups

11:25 PM Data Cleaning Data management Data Processing
Hi there, I've been searching for a few hours on the internet but right now I can't see the wood for the trees. I want to test i...
95% Confidence Interval for relative concentration index

10:25 PM Data Cleaning Data management Data Processing
Hello I would like to calculate 95% CI of relative concentration index. However, I could not find a stata command for this. Could anyone k...
Loop of different files using capture

7:25 PM Data Cleaning Data management Data Processing
Hello Statalisters I am running a loop over several files, like: Code: local originals "/Users/onedrive/stata/files" // Defi...
Leave out 90th percentile

7:25 PM Data Cleaning Data management Data Processing
I am using a census dataset which has these following variables: county industry income I know how to calculate the normal one: col...
Variable label in generate?

6:25 PM Data Cleaning Data management Data Processing
Dear All, I'd like to confirm whether there is any syntax that would allow me to prescribe variable labels in the generate's synt...
Can I do a OLS Regression if the distribution of my DV is like this?

6:25 PM Data Cleaning Data management Data Processing
The dependent variable is the proportion of the ** time / *** time. Because this is a proportion, the values range from 0 to 1. As the his...
xtwest command

6:25 PM Data Cleaning Data management Data Processing
I am trying to apply the xtwest command, but I get an error. I have one dependent variable and 5 independent variables (t=28, n=5): xtwest net fdi ...
How to save a Kaplan-Meier survival point estimate to a local macro?

5:25 PM Data Cleaning Data management Data Processing
Hello STATA Community, I am currently using version 15.1 of STATA. I have some straightforward survival time data, (time in years, censo...
Importing oddly formatted txt data into stata

5:25 PM Data Cleaning Data management Data Processing
Hi statalist, I am dealing with precinct by precinct voting results from counties in Texas. My goal is to record elections results (how many...
Iterations in logistic regression

1:25 PM Data Cleaning Data management Data Processing
Is there any work on what affects the number of iterations required to achieve convergence in logistic regression -- number of X variables, ...
Finding Most Common String Values Across Variables

1:25 PM Data Cleaning Data management Data Processing
Hi all: I am trying to find the most common string values across variables. While I work in criminal justice data, I can't share that ...
Interpretation of sdtest

12:25 PM Data Cleaning Data management Data Processing
Hi everyone, I am using the sdtest command to evaluate variances in two groups. But in Stata's output, I have problems with th...
Help producing line graph

11:25 AM Data Cleaning Data management Data Processing
Hi all, I have a somewhat embarrassing question. I am trying to make a line graph that displays what percentage of the observations (or i...
The Hausman test for endogeneity

11:25 AM Data Cleaning Data management Data Processing
I have read in a paper that we can use the Hausman test for endogeneity, and the authors mentioned that we can use the error term as follows: HTML...
Using analytical weight in STATA's mixed effects model

10:25 AM Data Cleaning Data management Data Processing
I have a learning assessment dataset of over 60 countries and 2-5 years/waves. While countries are the same in all years, individuals (stude...
Fixed effects and cluster error in gravity model

8:25 AM Data Cleaning Data management Data Processing
Dear all. I need your help. I am working on a gravity model that aims to estimate Colombian exports to 136 partners from 2005 to 2018 ...

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multiple times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thus far been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

About Me

Mtenga Baltazar

© BJ Data Tech Solution