BJ Data Tech Solution

Specializing in data processing, data management implementation plans, data collection tools (electronic and paper-based), data cleaning specifications, data extraction, transformation, and loading, analytical datasets, and data analysis. BJ Data Tech Solution teaches the design and development of electronic data collection tools using CSPro, and Stata commands for data manipulation. It also sets up data management systems using modern data technologies such as relational databases, C#, PHP, and Android.

Standard errors and 95% Confidence Intervals for Proportions - Differences between different Stata versions

Wednesday, March 31, 2021 Data Cleaning Data management Data Processing
Dear community: I am having doubts about how Stata calculates standard errors and 95% confidence intervals for proportions, especially bec...
Identifying common observations between two groups

9:25 PM Data Cleaning Data management Data Processing
Dear Statalist, I have a dataset with firms and owners. This is a minimum working example: Code: * Example generated by -dataex-. To in...
Commands for sleep and retry if a file is being accessed at the moment

9:25 PM Data Cleaning Data management Data Processing
I have multiple instances of Stata accessing the same excel file to record some information. Every once in a while they try to access the ex...
sqom subcost matrix problem?

8:25 PM Data Cleaning Data management Data Processing
Hi, I am trying to run the latest version of sqom on Stata16 using subcost(meanprobdistance) and am getting an error. My command is: sq...
Histogram with lpattern(non-solid line) adds a weird slant line

6:25 PM Data Cleaning Data management Data Processing
Hi all, I have just found that, when I specify 'lpattern(dash)' or 'dot' or, anything other than the solid line for 'two...
Restructuring data for survival analysis

5:25 PM Data Cleaning Data management Data Processing
Hello everyone, Please I need help here on re-structuring my data (Nigeria Demographic and Health Survey) for survival analysis. I'm...
Happy birthday to statalist.org

7:25 AM Data Cleaning Data management Data Processing
Statalist re-invented itself as a forum on March 31, 2014. That doesn't sound that long ago, but in the last 7 years, there have been mo...
Combinations of binary variables

7:25 AM Data Cleaning Data management Data Processing
Hello, I have a dataset with 10 variables and 5 binary variables (A, B, C, D, E). I'm trying to get all possible combinations of 2, 3, 4 ...
convert hexadecimal to binary

6:25 AM Data Cleaning Data management Data Processing
I have variables whose values come as hexadecimal strings 32 characters long as a result of applying an MD5 hash function to an original string. I ne...
HGLM - Model 2 level variables not significant but postestimation plot shows variation

4:25 AM Data Cleaning Data management Data Processing
Good day, I am using a two-level HGLM model. Model 1 - Level 1 variables have a significant effect on the outcome. Model 2 - Level 2 variables d...
Robustness checks

4:25 AM Data Cleaning Data management Data Processing
Dear statalist, I am running a bootstrap on an OLS model with the dependent variable being log(maximal grip strength) on a set of independ...
Comparing coefficients while keeping difference between other coefficients constant in mixed model

4:25 AM Data Cleaning Data management Data Processing
Dear Statalist users, I would like to compare whether the slope is statistically different between two groups while pretending that the in...
How to calculate year in month with conditions

3:25 AM Data Cleaning Data management Data Processing
Dear all, Given the year of birth and month of birth of each individual in a dataset, I now want to generate two variables as follows: 1) Yea...
Interpreting Sargan-Hansen Tests

3:25 AM Data Cleaning Data management Data Processing
Hi all, I am running diff GMM using Sebastian's xtdpdgmm command. However, I am not sure how to interpret the 2 different Sargan-Hanse...
How to generate a variable which takes for every combination another value?

3:25 AM Data Cleaning Data management Data Processing
Dear all I have a question. I have a dataset with lots of different variables and want to generate a new variable which takes for every pos...
sort row with string variables?

3:25 AM Data Cleaning Data management Data Processing
Dear All, Is it possible to sort "string" variable? Suppose that the data set is Code: * Example generated by -dataex-. To insta...
outreg2 keep option does not work

3:25 AM Data Cleaning Data management Data Processing
Hello, I am in desperate need for some help for my thesis regarding Stata. I would like to make use of the outreg2 command. However, whe...
Difference in Consecutive Values by Group Labels

3:25 AM Data Cleaning Data management Data Processing
Dear Statalisters, I need to generate another variable called 'difference' which contains the difference between 'consecutive...
IV ordered probit using CMP (syntax help)

3:25 AM Data Cleaning Data management Data Processing
Hi There, I am estimating the causal effect of maternal education on child health. My dependent variable ==> Breastfeeding duration. It...
Reshape data in order to calculate gini coefficient panel data using income groups

3:25 AM Data Cleaning Data management Data Processing
Dear Statalist, I'm trying to calculate the Gini of counties using the population of 20+ income groups and the median income of sa...
Series 0 not found using grc1leg2

3:25 AM Data Cleaning Data management Data Processing
Dear all, I am using Stata 14 on Windows 10. The following code is supposed to create a combined graph of several scatter plots for two vari...
Drop if Strmatch with multiple conditions

2:25 AM Data Cleaning Data management Data Processing
Hello everyone, I'm new to Stata and I have the following problem: I want to drop all observations that don't contain the words ...
Merger Simulation Nested Logit Model (Björnerstedt & Verboven, 2013)

1:25 AM Data Cleaning Data management Data Processing
Dear forum, I follow the paper written by Björnerstedt and Verboven (2013) and Berry (1994) to simulate a merger: In Berry (1994) they u...
ITSA Error on Monthly Time Series Data with a Gap Due to Zero Count

12:25 AM Data Cleaning Data management Data Processing
Hi, I am currently looking at datasets from several hospitals to assess the impact of COVID-19 pandemic on the surgical volume changes throu...
Combined plots

Tuesday, March 30, 2021 Data Cleaning Data management Data Processing
Hi I recently came across a graph where instead of making bar graphs the authors/researchers had combined three plots (namely scatter, viol...
Graphing variables with extreme values

8:25 PM Data Cleaning Data management Data Processing
Hi Statalist. I want to graph a number of financial variables, such as total household assets, and compare the values between a few dichot...
Extract substring between nth and (n+1)th commas in a variable

6:25 PM Data Cleaning Data management Data Processing
How can I extract a substring between the nth and (n+1)th commas in a variable? For example, consider ID = 3 and beta = "eight,nine,t...
How to adjust (select or handpick) correlation matrix column(s) output to be esttab after estpost correlation table?

6:25 PM Data Cleaning Data management Data Processing
Dear Statalisters, I would like to find out whether it is possible to select the specific column(s) to be produced in the correlation matrix...
MATCHIT- Stata for data consolidation and cleaning using fuzzy string comparisons

6:25 PM Data Cleaning Data management Data Processing
Hello, I came across your matchit command in Stata for data consolidation and cleaning using fuzzy string comparisons. I would like to us...
need help: panel data analysis

11:27 AM Data Cleaning Data management Data Processing
Hello, I have a panel data sample consisting of observations of several years. For each year I have some duplicates which I won't drop sinc...
Problem performing an ado function

10:25 AM Data Cleaning Data management Data Processing
Dear all, I am new to this forum and to Stata; thank you for your help. My question is: I want to run an ado program according to the...
Can I pick the model with lower AIC but higher BIC?

9:25 AM Data Cleaning Data management Data Processing
Good day everyone, I was comparing two models. Model 2 has a lower AIC but a higher BIC. Can I pick Model 2 as the better model since its ...
Controlling for year when appending two datasets

9:25 AM Data Cleaning Data management Data Processing
Hello, One newbie question but I'm stuck on this basic task. I'm pooling two datasets from two different years and have successful...
Quantile regression (QR) for panel data

8:25 AM Data Cleaning Data management Data Processing
Dear all, I hope you are doing well. I'm trying to work on a nonlinear relationship. Is quantile regression an appropriate model...
Reverse Causality and Panel Data

8:25 AM Data Cleaning Data management Data Processing
Hi all, I hope all is well. Please, I am studying if some of the variables affect firm instrument issuance. Assume that I have these varia...
Table1_mc stata 16 issue

7:25 AM Data Cleaning Data management Data Processing
Hello, I am trying to create a table 1 in Stata 16. I am using the following code: table1_mc, by(race_ethnicity) vars(age contn %4.0f \...
Analysis of Demographic Factors Across Syllabi

7:25 AM Data Cleaning Data management Data Processing
Hi all! I'm very new to Stata so apologies in advance if my question isn't very well formed. I am performing an analysis of gender...
Why do I need to run the keep command twice to delete some observations?

7:25 AM Data Cleaning Data management Data Processing
Code: ------------------------------------------------------------------------------------------------------------------------------------...
Panel regression model / Multiple regression model?

6:25 AM Data Cleaning Data management Data Processing
Hi everyone, I have a dataset available which links the Covid-19 situation to the change in port activity (expressed in difference of ...
Regressing a variable recorded in 2018 on a variable of 2011

4:25 AM Data Cleaning Data management Data Processing
Hello everybody, I'm a Stata beginner and I'm already facing my first issues. I have panel data where my dependent variable (...
Omitted interaction terms in the fixed effect due to collinearity

4:25 AM Data Cleaning Data management Data Processing
Hi all, I have a question when I am running a fixed effect regression which is used to investigate how the dependent variable will be when...
Replace odd commas by spaces in a variable

4:25 AM Data Cleaning Data management Data Processing
Hi, Thanks in advance for your help. Maybe, what I am going to ask is an easy question for many of you but I have been unable to find a sol...
Importing an Imputed File

4:25 AM Data Cleaning Data management Data Processing
Hi everyone, I created an Imputed version of my data on a PC machine using Stata/SE. I've now opened the data set using Stata/IC on a ...
Mark latest observation with unbalanced panel data

3:25 AM Data Cleaning Data management Data Processing
I want to mark the latest observation in an unbalanced panel dataset. Time variable: t (varies from 0 to 17). Person ID: id. The ...
Need help with sampling process

2:25 AM Data Cleaning Data management Data Processing
Hello, I am working with a dataset with merged data from 2 survey rounds (2005 & 20011). The first column/variable 'id' is t...
Cartesian product of Columns

2:25 AM Data Cleaning Data management Data Processing
Dear Statalisters, I have 2 sets Set_1= {HH, HT, TT, TH} and Set_2 = {1, 2, 3, 4}. These are the initial column values and names. I want ...
GMM for time series

12:25 AM Data Cleaning Data management Data Processing
Hello Statalist, 1. I am using time series data. My main equation is $pb_t = a_0 + a_1 pb_{t-1} + a_2 pd_{t-1} + og_t + e$, but there is a problem o...
Stationarity, Markov switching

Monday, March 29, 2021 Data Cleaning Data management Data Processing
Hi everyone, I was trying to model time series with Markov-switching regimes. Preliminarily, do I need my time series to be station...
Out of sample estimation following xtdpdgmm

9:25 PM Data Cleaning Data management Data Processing
Stata throws a 301 error (last estimates not found) whenever I try to use estimates from a dynamic model (using xtdpdgmm) to predict out of ...
Identifying missing Data & Interpolation of Panel Data

6:25 PM Data Cleaning Data management Data Processing
Hello, I've created a dataset of 30 countries over 30 years for 29 different variables. For some of these countries, the 30-year time ...
Shifting observations across rows

5:25 PM Data Cleaning Data management Data Processing
Hello, I am fairly new to Stata and am having trouble even starting on this problem. I am working with the National Longitudinal Survey of ...
Reading image properties such as GPS and date taken

5:26 AM Data Cleaning Data management Data Processing
Hi everyone, I was wondering if there is a way to read photo/image properties such as date taken, date modified, and GPS coordinates. I have...
Alternatives to Propensity Score Matching (PSM)

4:26 AM Data Cleaning Data management Data Processing
Hello everyone, I have a large dataset of forest pixels with different control variables such as altitude, slope, distance to nearest rive...
FE constant vs RE constant

4:26 AM Data Cleaning Data management Data Processing
Hi, I am estimating a model by FE and the constant I get is not close to the mean of my only explanatory variable, but when I run the same...
Categorical x Categorical interaction

4:26 AM Data Cleaning Data management Data Processing
Hi all, I am hoping someone can clear something up for me. I want to run a poisson regression model - with a continuous dependent variab...
xttab

3:25 AM Data Cleaning Data management Data Processing
Hi, I am busy with creating a word doc with my panel data descriptive statistics. I am using the following code, which works. asdoc xttab...
Shaping graphs using grc1leg

3:25 AM Data Cleaning Data management Data Processing
Hi all, I'm making a set of different scatters using the following code: Code: foreach x in physical_strength conceptualisation s...
create dataset based on all possible pairs of identifiers within each group in Stata

3:25 AM Data Cleaning Data management Data Processing
Hi, I have a dataset that looks like this: Code: * Example generated by -dataex-. To install: ssc install dataex clear input str7 coun...

Popular Articles

  • How to drop random years from panel data?
    I have a panel dataset consisting of 125 countries and 36 years. I want to run an IV regression multiple times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thus far been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

About Me

Mtenga Baltazar