BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

  • Home
  • Data Management
  • Data Analysis
  • Data Collection Tools Tips
Exporting Logistic Regression output table - command using 'svyset'

Exporting Logistic Regression output table - command using 'svyset'

Thursday, April 30, 2020 Data Cleaning Data management Data Processing
Dear experts, I am running logistic regression model using 'svyset' command in Stata 15. I am unable to export the logistic regress...
Correct for Selection on Independent Variables

Correct for Selection on Independent Variables

10:26 PM Data Cleaning Data management Data Processing
Dear Statalists, I am confused about how to correct for selection on one independent variable. I want to estimate Y_ft=beta*Certified_ft+...
Replacing missing rows of a variable

Replacing missing rows of a variable

9:26 PM Data Cleaning Data management Data Processing
Dear All I have a file with more than 1k observations and two variables. One of the variables Y is complete but the variable Country is no...
How to sort data for distinct IDs with multiple visits and multiple values

How to sort data for distinct IDs with multiple visits and multiple values

9:26 PM Data Cleaning Data management Data Processing
Dear all, I have a dataset that has multiple visits for each person. However each person had a differing number of visits The dataset is ...
Panel VECM in STATA

Panel VECM in STATA

9:26 PM Data Cleaning Data management Data Processing
I am giving a general description of the model in a panel setting: Let y, a, b, and z are four variables. There exist a vector of co-integr...
Replace missing values from a different row

Replace missing values from a different row

9:26 PM Data Cleaning Data management Data Processing
I have data that are in long format where one ID has four rows of data. Only one row has information and I want to fill in the other three r...
Bootstrapped SE with Two-Sample IV: "insufficient observations to compute bootstrap standard errors"

Bootstrapped SE with Two-Sample IV: "insufficient observations to compute bootstrap standard errors"

8:26 PM Data Cleaning Data management Data Processing
I am not sure if this is kosher but I am trying to calculate bootstrapped standard errors for a two-sample IV. (As an aside, for some reason...
Code for Marginsplot Interaction Terms - Error/questions

Code for Marginsplot Interaction Terms - Error/questions

8:26 PM Data Cleaning Data management Data Processing
Hi all, I am trying to determine whether the effect of income (continuous var) on cash usage depends on credit card ownership (categorical...
Esttab - Compress labels

Esttab - Compress labels

6:26 PM Data Cleaning Data management Data Processing
Hi everyone, Is there a way to compress labels in esttab command? If two variables from two regressions have same variable names, we can s...
Wilcoxon rank sum test for more than 3 groups?

Wilcoxon rank sum test for more than 3 groups?

5:26 PM Data Cleaning Data management Data Processing
So Stata is not letting me do a rank sum test for more than 3 groups with the command: ranksum score if outcome~=1, by(outcome). Does anyone...
Filling in missing values in long data

Filling in missing values in long data

5:26 PM Data Cleaning Data management Data Processing
I have some sample data pasted below. Code: * Example generated by -dataex-. To install: ssc install dataex clear input double(studentid ...
Hausman Test - "V_b-V_B is not positive definite" appears

Hausman Test - "V_b-V_B is not positive definite" appears

5:26 PM Data Cleaning Data management Data Processing
Background of question I am an economics student, currently writing my bachelor thesis, and quite inexperienced with Stata. I would be gra...
Histogram by groups

Histogram by groups

5:26 PM Data Cleaning Data management Data Processing
Hello, I would like to know the code to create a bar graph with the values of the receipts of each group by year. Therefore, in each year ...
Renaming variables

Renaming variables

5:56 AM Data Cleaning Data management Data Processing
Hi, I am facing a problem trying to rename variables. I am using Stata 13 MP. This started with me having to reshape a long data into a w...
Code for including confidence intervals on both curves

Code for including confidence intervals on both curves

5:26 AM Data Cleaning Data management Data Processing
Hi all, I am trying to compare a linear vs quadratic predictive margins in one graph. However, I am trying to show the confidence interval...
Graphing problem

Graphing problem

4:26 AM Data Cleaning Data management Data Processing
I collapsed my data for it to include count, mean and a dummy variable for the category they belong to. My aim was to create a bar graph s...
Difference-in-Difference analysis after PSM

Difference-in-Difference analysis after PSM

3:56 AM Data Cleaning Data management Data Processing
Dear statist, I need your help. I am doing work that involves assessing the impact of issuing a specific obligation on governance variable...
Combining duplicate names into one

Combining duplicate names into one

3:56 AM Data Cleaning Data management Data Processing
Hello everyone, I have a dataset containing data about Board Members and it is formatted like this: Name Start Date End Date Role N...
Error calculating margins after melogit (could not calculate numerical derivatives -- discontinuous region with missing values encountered)

Error calculating margins after melogit (could not calculate numerical derivatives -- discontinuous region with missing values encountered)

3:26 AM Data Cleaning Data management Data Processing
Hello all, I am having trouble calculating margins after running a melogit with 3 levels. The command for the melogit is formulated as: ...
Counting number of people by gender

Counting number of people by gender

3:26 AM Data Cleaning Data management Data Processing
Hi all, I am currently using Stata Version 14. In the sample data attached below, I have information on the gender, rank, area of residenc...
putexcel

putexcel

2:26 AM Data Cleaning Data management Data Processing
Good day to everybody, just a couple of quick questions concerning the putexcel command in Stata 16: 1. I'd like to see the results ...
sem - visualizing interaction between latent variables

sem - visualizing interaction between latent variables

1:26 AM Data Cleaning Data management Data Processing
Dear all, I would like to visualize a latent variable interaction after running sem in Stata 14. To interpret the interaction effect, I w...
merging data files to create a large panel data by VDS_Id and SUR_MON_YR

merging data files to create a large panel data by VDS_Id and SUR_MON_YR

1:26 AM Data Cleaning Data management Data Processing
here is data below Code: * Example generated by -dataex-. To install: ssc install dataex clear input str10 VDS_ID str5 SUR_MON_YR str1 GI...
ANOVA REPEATE MEASURES - HOW TO INTERPRET CONTRADICTORY RESULTS BETWEEN ANOVA AND MULTIPLE PAIRWISE COMPARISONS (post hoc)?

ANOVA REPEATE MEASURES - HOW TO INTERPRET CONTRADICTORY RESULTS BETWEEN ANOVA AND MULTIPLE PAIRWISE COMPARISONS (post hoc)?

12:26 AM Data Cleaning Data management Data Processing
Good mornig to everybody. HOW TO INTERPRET CONTRADICTORY RESULTS BETWEEN ANOVA AND MULTIPLE PAIRWISE COMPARISONS (post hoc)?How to write t...
xtabond2 vs xtreg / xtregar

xtabond2 vs xtreg / xtregar

12:26 AM Data Cleaning Data management Data Processing
Dear Stata users, I have always assumed that in the presence of serial autocorrelation and assumptions of endogeneity, a correctly specifi...
Time Series Graph Percentage Deviations from Trend

Time Series Graph Percentage Deviations from Trend

12:26 AM Data Cleaning Data management Data Processing
Hello, I am currently working on time series analysis, trying to identify the effect of an increase in income on consumption. Therefore I...
ANOVA REPEATE MEASURES - HOW TO INTERPRET CONTRADICTORY RESULTS BETWEEN ANOVA AND MULTIPLE PAIRWISE COMPARISONS (post hoc)?

ANOVA REPEATE MEASURES - HOW TO INTERPRET CONTRADICTORY RESULTS BETWEEN ANOVA AND MULTIPLE PAIRWISE COMPARISONS (post hoc)?

12:26 AM Data Cleaning Data management Data Processing
Good mornig to everybody. 1)HOW TO INTERPRET CONTRADICTORY RESULTS BETWEEN ANOVA AND MULTIPLE PAIRWISE COMPARISONS (post hoc)? I ran a mod...
Xtreg with Demeaned variables

Xtreg with Demeaned variables

12:26 AM Data Cleaning Data management Data Processing
I have a panel dateset and my dependent variable and independent variables are demeaned and standardized. Assuming that I do not need to inc...
Heteroskedasticity or other problem with regression?

Heteroskedasticity or other problem with regression?

Wednesday, April 29, 2020 Data Cleaning Data management Data Processing
Hello everyone, so I run a pooled OLS regression on panel data of log real monthly wage on education, age, sex, marital status, job sector, ...
Sum of values of a conditional variable

Sum of values of a conditional variable

7:26 PM Data Cleaning Data management Data Processing
I am new to using Stata, I have always made corrections using already clean bases. I have the variables "sales" and "economi...
Performing chi squared with one variable against a group of variables individually

Performing chi squared with one variable against a group of variables individually

7:26 PM Data Cleaning Data management Data Processing
I've done a ton of looking around, and I've likely just not got the right key words, but my goal is to perform a simple chi squared ...
re case control matching

re case control matching

7:26 PM Data Cleaning Data management Data Processing
Hi everyone, Can someone please suggest a method/any packages by which I can derive a case control dataset (from a larger unmatched datase...
Foreach loop using macro in numlist

Foreach loop using macro in numlist

7:26 PM Data Cleaning Data management Data Processing
Probably a simple syntax issue, but I can't figure this out after reading the documentation on foreach, macros, and numlist. I run the f...
Quintile

Quintile

6:26 PM Data Cleaning Data management Data Processing
hi I am new to the stata forum. I am currently working on my thesis topic " poverty and consumption inequality". I need help in ma...
Transforming panel data with different time points

Transforming panel data with different time points

5:26 PM Data Cleaning Data management Data Processing
Hello, I am using data collected from 36 different hospitals over the course of 7 quarters (2015q2 - 2016q4). The data consist of assessment...
Problem with Hausman test in xsmle

Problem with Hausman test in xsmle

5:26 PM Data Cleaning Data management Data Processing
I was estimating spatial panel data models in STATA, when I tried to perform Hausman test, the following message appears: ... estimating f...
Xtabond2 Newbie question

Xtabond2 Newbie question

5:56 AM Data Cleaning Data management Data Processing
Hi, I am a quite newbie to dynamic panels. For my project, I want to run a simple model on government approval rates on a dataset consistin...
Averaging values from different "sum, detail outputs"

Averaging values from different "sum, detail outputs"

5:56 AM Data Cleaning Data management Data Processing
Hi everyone, I was wondering if there is a chance to average "sum, detail outputs" over various months. First, I sorted my data...
Latent class analysis using gsem - Cross validation

Latent class analysis using gsem - Cross validation

4:56 AM Data Cleaning Data management Data Processing
Hi, I'm trying to learn LCA/LPA using gsem command in Stata by walking myself through Masyn (2013) - cited in SEM example 52 - and tr...
Three-year volatility

Three-year volatility

4:26 AM Data Cleaning Data management Data Processing
I have been looking on the forum for a topic about three-year volatility but I didn't find what I wanted. I have panel data which look...
IV using panel data and fixed effects

IV using panel data and fixed effects

4:26 AM Data Cleaning Data management Data Processing
Hi, I am using panel data about women's wellbeing and influencing factors, therefore, I have been using a fixed effects panel regressi...
Computing the consistency ratio and consistency index for analytic hierarchy process with mata

Computing the consistency ratio and consistency index for analytic hierarchy process with mata

3:56 AM Data Cleaning Data management Data Processing
hey everyone, i'm completely new to stata and mata as well. im trying to figure out if there is a possibilty of getting the consistenc...
Linear regression using a time variable

Linear regression using a time variable

3:56 AM Data Cleaning Data management Data Processing
Hi all, I'm having some difficulties in doing a linear regression for my research. My dataset consists of 361 respondents, with each o...
Forecasting a variable

Forecasting a variable

2:56 AM Data Cleaning Data management Data Processing
Hi, I am using a panel dataset and for one of my variables I only have data 2014 to 2017. For my dataset to balanced I need to forecast w...
Estimating asymmetrical confidence intervals for ICC using -nlcom-

Estimating asymmetrical confidence intervals for ICC using -nlcom-

2:56 AM Data Cleaning Data management Data Processing
Dear all Out of curiosity I want to reproduce the calculations from: Code: cls use https://www.stata-press.com/data/r16/judges, clear ic...
LaTeX font on eps figures: cannot get writepsfrag package to work

LaTeX font on eps figures: cannot get writepsfrag package to work

2:26 AM Data Cleaning Data management Data Processing
Hello, I cannot get writepsfrag to work. I am trying to have the same fonts on my Stata-produced figures and the rest of my LaTeX document...
Dropping multiple missing observations

Dropping multiple missing observations

2:26 AM Data Cleaning Data management Data Processing
Currently I'm working on a project in which I use Item response theory. I have 8 variables from a lot of cases, which I would like to fi...
Which F stats should I look at with ivlasso?

Which F stats should I look at with ivlasso?

2:26 AM Data Cleaning Data management Data Processing
Hi, I'm running IV regression with ivlasso in Stata. It reports several different first stage F statistics, does any one know which on...
Difference and Difference Design Model Specification

Difference and Difference Design Model Specification

2:26 AM Data Cleaning Data management Data Processing
Hey everyone, I have a question concerning a model specification for a Difference and Difference Design. I have cross sectional data over ...
Expanding a time series dataset by one month

Expanding a time series dataset by one month

2:26 AM Data Cleaning Data management Data Processing
Hi, I have panel data and I would like to expand the dataset by one month. So I would like to expand each "stock" time series to...
How to check for significant heterogeneity for a categorical variable with 2 interactions

How to check for significant heterogeneity for a categorical variable with 2 interactions

1:26 AM Data Cleaning Data management Data Processing
Dear Statlist, I am estimating whether elderly age in a healthier way because of the Long-term care system they are in. I do this by group...
Regression on Panel Data

Regression on Panel Data

12:26 AM Data Cleaning Data management Data Processing
Hello, I am having a bit of trouble running the xtreg command on my dataset. I have panel data on 24 countries for a 12 year time period (...
How to do Reality check and SPA test by Stata?

How to do Reality check and SPA test by Stata?

12:26 AM Data Cleaning Data management Data Processing
Dear Statalist, I want to know whether there are any user written commands by stata for performing Reality Check (White 2000) and Superior...
Parametric survival analysis

Parametric survival analysis

12:26 AM Data Cleaning Data management Data Processing
Good morning. I am doing parametric survival analysis through streg. After streg, I got the following graph through the stcurve option. Wha...
Question about diff-in-diff with multiple control groups and one treatment group

Question about diff-in-diff with multiple control groups and one treatment group

12:26 AM Data Cleaning Data management Data Processing
Hello! I am running a Difference in Difference (DD) regression to see whether the introduction of a policy affected school enrolment for a...
How to model "Sparial Variability" or "Choice of Location"

How to model "Sparial Variability" or "Choice of Location"

Tuesday, April 28, 2020 Data Cleaning Data management Data Processing
I have a survey data on 10000 delivery person. I have number of delivery they made in 49 neighborhoods (that is 49 columns plus 01 as "...
Newer Posts Older Posts Home
Subscribe to: Posts (Atom)

Latest Articles

Categories

  • CouchDb Skills
  • Data Analysis
  • Data Cleaning
  • Data management
  • Data Processing
  • Research Methodology

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multible times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thusfar been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

Recomended Articles

Powered by Blogger.

About Me

Mtenga Baltazar
View my complete profile

Blog Archive

  • ►  2024 (6)
    • ►  February (6)
  • ►  2023 (877)
    • ►  November (1)
    • ►  October (9)
    • ►  September (14)
    • ►  July (9)
    • ►  June (15)
    • ►  May (133)
    • ►  April (174)
    • ►  March (176)
    • ►  February (157)
    • ►  January (189)
  • ►  2022 (2201)
    • ►  December (181)
    • ►  November (180)
    • ►  October (198)
    • ►  September (182)
    • ►  August (182)
    • ►  July (194)
    • ►  June (174)
    • ►  May (167)
    • ►  April (181)
    • ►  March (186)
    • ►  February (170)
    • ►  January (206)
  • ►  2021 (7379)
    • ►  December (327)
    • ►  November (645)
    • ►  October (646)
    • ►  September (639)
    • ►  August (557)
    • ►  July (649)
    • ►  June (656)
    • ►  May (697)
    • ►  April (683)
    • ►  March (697)
    • ►  February (518)
    • ►  January (665)
  • ▼  2020 (7956)
    • ►  December (653)
    • ►  November (659)
    • ►  October (598)
    • ►  September (654)
    • ►  August (660)
    • ►  July (682)
    • ►  June (683)
    • ►  May (708)
    • ▼  April (692)
      • Exporting Logistic Regression output table - comma...
      • Correct for Selection on Independent Variables
      • Replacing missing rows of a variable
      • How to sort data for distinct IDs with multiple vi...
      • Panel VECM in STATA
      • Replace missing values from a different row
      • Bootstrapped SE with Two-Sample IV: "insufficient ...
      • Code for Marginsplot Interaction Terms - Error/que...
      • Esttab - Compress labels
      • Wilcoxon rank sum test for more than 3 groups?
      • Filling in missing values in long data
      • Hausman Test - "V_b-V_B is not positive definite" ...
      • Histogram by groups
      • Renaming variables
      • Code for including confidence intervals on both cu...
      • Graphing problem
      • Difference-in-Difference analysis after PSM
      • Combining duplicate names into one
      • Error calculating margins after melogit (could not...
      • Counting number of people by gender
      • putexcel
      • sem - visualizing interaction between latent varia...
      • merging data files to create a large panel data by...
      • ANOVA REPEATE MEASURES - HOW TO INTERPRET CONTRADI...
      • xtabond2 vs xtreg / xtregar
      • Time Series Graph Percentage Deviations from Trend
      • ANOVA REPEATE MEASURES - HOW TO INTERPRET CONTRADI...
      • Xtreg with Demeaned variables
      • Heteroskedasticity or other problem with regression?
      • Sum of values of a conditional variable
      • Performing chi squared with one variable against a...
      • re case control matching
      • Foreach loop using macro in numlist
      • Quintile
      • Transforming panel data with different time points
      • Problem with Hausman test in xsmle
      • Xtabond2 Newbie question
      • Averaging values from different "sum, detail outputs"
      • Latent class analysis using gsem - Cross validation
      • Three-year volatility
      • IV using panel data and fixed effects
      • Computing the consistency ratio and consistency in...
      • Linear regression using a time variable
      • Forecasting a variable
      • Estimating asymmetrical confidence intervals for I...
      • LaTeX font on eps figures: cannot get writepsfrag ...
      • Dropping multiple missing observations
      • Which F stats should I look at with ivlasso?
      • Difference and Difference Design Model Specification
      • Expanding a time series dataset by one month
      • How to check for significant heterogeneity for a c...
      • Regression on Panel Data
      • How to do Reality check and SPA test by Stata?
      • Parametric survival analysis
      • Question about diff-in-diff with multiple control ...
      • How to model "Sparial Variability" or "Choice of L...
      • How to Handle Heterogeneity in Panel Data using Sy...
      • Venn diagram
      • Subanalysis
      • Create graph with different colors when y<0
      • Independent variables based on same variable
      • Randomization test and descriptive statistics
      • brant test STATA 16
      • Paneldata Fixed Effects with and without robust le...
      • hetprobit and robust
      • Showing 95%-CI intervals in bar graph
      • Marginal effects-interpretation
      • Delete observations under conditions
      • fillin (a question on Twitter)
      • How to get the start and end round of each user?
      • Pooled OLS, fixed & random effects: Panel Data
      • Demeaning and standardizing variables in panel reg...
      • Looping regression to determine one set of control...
      • Topic impact access to microfinancial institution ...
      • No constant in fixed-effect regression
      • How to do Reality check and SPA test by Stata?
      • How to create interaction variable in stata
      • Blinder–Oaxaca decomposition
      • How to recode missing values within a range in Stata
      • Comparing different cohorts amongst different years
      • Graphing: Scatter Plot
      • Why stata still reports log likelihood results for...
      • Robust correlation matrix/ covariance matrix
      • Multinomial Logistic Regression Taking hour+
      • Bug saving funnel plots
      • combining "by" and loops
      • npss: a STATA module to estimate nonparametric het...
      • Code for counting ID within a variable
      • Truncating long variable names
      • Multiple interaction terms in panel data model
      • Interpreting values on the Y-axis in hazard functi...
      • Dot plot for two categorical variables
      • Difference-in-Difference analysis after PSM
      • Frequency Matching in a retrospective cohort: outp...
      • Exporting odds ratios, confidence intervals, and p...
      • How to compute the right eigenvectors from a matrix A
      • Interpreting coefficients in a dummy * log transfo...
      • Identify variables with identical prefixes
      • Stochastic frontier analysis-flexible Fourier(augm...
      • How to replace missing values for certain rows in ...
    • ►  March (698)
    • ►  February (638)
    • ►  January (631)
  • ►  2019 (9458)
    • ►  December (601)
    • ►  November (643)
    • ►  October (650)
    • ►  September (637)
    • ►  August (645)
    • ►  July (681)
    • ►  June (654)
    • ►  May (1034)
    • ►  April (1079)
    • ►  March (1122)
    • ►  February (876)
    • ►  January (836)
  • ►  2018 (931)
    • ►  December (692)
    • ►  November (239)

© BJ Data Tech Solution | Theme by Rifki.id | Premium Blogger Templates | PBT | Powered by Blogger |-| About | Privacy Policy | Sitemap | Contact | Disclaimer