BJ Data Tech Solution

Specializing in data processing, data management implementation plans, data collection tools (electronic and paper-based), data cleaning specifications, data extraction, transformation, and loading, analytical datasets, and data analysis. BJ Data Tech Solutions teaches the design and development of electronic data collection tools using CSPro, and Stata commands for data manipulation. It also sets up data management systems using modern data technologies such as relational databases, C#, PHP, and Android.

gmatch and psmatch2

Sunday, January 31, 2021 Data Cleaning Data management Data Processing
I am trying to conduct a logistic regression after propensity score matching. I tried both gmatch and psmatch2 and am wondering if I should expe...
Add Optional t-test to Stata Program

8:25 PM Data Cleaning Data management Data Processing
I have a Stata program that displays a twoway bar graph and I want to add a t-test to the program, but the t-test has to be optional. I am a...
esttab

8:25 PM Data Cleaning Data management Data Processing
Why is it when I enter the following command: "forvalues i = 1/5 { regress edge percentFRL ipr_se_variation popdensity if schoolsize...
xtivreg2 instrumental variables with an endogenous variable involved in interaction terms

7:25 PM Data Cleaning Data management Data Processing
Hi all, I'm conducting analysis in the form below: Code: y = x1 + x2 + x3 + x1*x2 In which x1 is the variable of interest and it ...
reshape data long to wide with several IDs

4:25 PM Data Cleaning Data management Data Processing
I could not reshape my data from my previous posting. Please help me solve this problem. This is a part of my data. code: input yea...
Count return clients

3:25 PM Data Cleaning Data management Data Processing
Dear All, I would like to count the number of return clients in 2020 (the first occurrence/visit either was in 2018 or 2019 for example an...
Reshape data long to wide but not normal wide

2:25 PM Data Cleaning Data management Data Processing
Hello, I want to reshape my data long to wide. So, I googled it and found this web page: https://stats.idre.ucla.edu/stata/mo...-long-to...
Test to show a regressor is non stochastic?

1:25 PM Data Cleaning Data management Data Processing
Other than producing and eyeballing a scatter diagram, is there a standard way to prove/discover that an independent variable is non-stochas...
Graphing one product versus an average of all other products

1:25 PM Data Cleaning Data management Data Processing
Hi All, I am having trouble thinking how I can graph an average of two products ("Coca-Cola" and "tonic water") as ...
How to make a Difference in Differences Graph

12:25 PM Data Cleaning Data management Data Processing
I want to make a graph showing parallel slopes for the following data. The variable REGION consists of Northeast regions coded as 1 and sout...
How to change the look of a subset of bars using specific criteria - bar graphs

12:25 PM Data Cleaning Data management Data Processing
Dear all, I am working with a dataset that contains the number of candidates for each call of a study abroad program and I prepared the fo...
need help with import excel

11:25 AM Data Cleaning Data management Data Processing
Sorry, I have tried this using import excel, copying the data directly, and using Stat/Transfer, but everything I do gives a value for the ...
Confusing behavior by -more-

11:25 AM Data Cleaning Data management Data Processing
I'm trying to use -more- to force a pause between display commands in a loop. Here is the code: Code: set more on forvalues x = 1/5{...
linear Regression with treatment and control variables

8:25 AM Data Cleaning Data management Data Processing
Dear all, I am interested in studying the impact of monetary policy changes on the stock market return over the last 20 years. The list of ...
Calculating a gender wage gap

7:25 AM Data Cleaning Data management Data Processing
I have a data set with University degree subjects, salary bands and then the number of men and women in each salary band. How would I go abo...
calculating how percentiles performed in a panel dataset

6:25 AM Data Cleaning Data management Data Processing
Dear Statalist, My panel dataset is as follows year firm size sales 2000 firm1 size1 sales1 2000 firm2 size2 sales2 ... 2001 ...
"0 (empty)" during logistic regression

Saturday, January 30, 2021 Data Cleaning Data management Data Processing
I was measuring the odds ratio for systolic hypertension using bmi as a variable, but when I do so the 4th category gives me "0 (empty)...
Graph of Parallel Trend Lines

8:25 PM Data Cleaning Data management Data Processing
Hello, I have the data set below. This is a data set of total number of visits to the emergency room by region in the US. I want to implemen...
Filling missing observations with successor and predecessors ID's values

8:25 PM Data Cleaning Data management Data Processing
Dear Stata Users, Can you please help me with the following issue: I have a company (“gvkey”) with a person ID (pers_ID) who is associated ...
Regarding the use of Matchit to fuzzy merge two different datasets

7:25 PM Data Cleaning Data management Data Processing
Hello everyone, I am trying to merge two datasets by their unique string variables. However, these two datasets do not share the same form...
Identifying observations of var1 that have different values for var2

4:25 PM Data Cleaning Data management Data Processing
I have a dataset in which the data are identified by state, county, and ZIP code - each of these is a different variable. My problem is th...
Reshaping data with three dimensions

4:25 PM Data Cleaning Data management Data Processing
Hello, I am having trouble reshaping my data as it will require something beyond the usual long to wide format. My dataset is a relationship...
Creating a matrix with Euclidean distances between variables.

1:25 PM Data Cleaning Data management Data Processing
Hello everybody, I use Stata 13.1 and I am working with a dataset that contains 25 numerical variables (var1-var25) and around 400 observa...
Reshaping wide error values of variable date not unique within id

1:25 PM Data Cleaning Data management Data Processing
Hi, I'm fairly new to Stata and I know this question has been answered previously; however, I could not make it work for me. This is the type of ...
Developing person weights from other variables

1:25 PM Data Cleaning Data management Data Processing
I have to "develop person weights from the age and sex variables using an iterative proportional fitting process with the variables of ...
How to utilise sample weights in Stata? - QUICK QUESTIONS

11:25 AM Data Cleaning Data management Data Processing
Hi Everybody, I am running a regression analysis using HLFS on Stata. I have been told that I need to use sample weights to have reliable...
Frequency on the X and Y axes in histograms and scatter plots

8:25 AM Data Cleaning Data management Data Processing
Hi all, Sorry to bother you, I have another question regarding histograms/scatter plots. I would like to be able to select the f...
Frequency distributions with intervals

8:25 AM Data Cleaning Data management Data Processing
Hi all, I have the data of a study with two variables, and I would like to generate a table with frequency distributions with intervals o...
How to convert a variable in the format "day, month, year" to another variable in the format "month, year"?

7:25 AM Data Cleaning Data management Data Processing
Hello, I would like to convert a "date" variable that is in the format "day, month, year" to another variable that gives...
matcell option in a table with weights

5:25 AM Data Cleaning Data management Data Processing
While running table son father[pw=pn], center format(%9.0f) matcell(PQ) I am getting an error message "option matcell() not allowed...
New package: xtbalance2 - Create a balanced subsample from unbalanced panel data.

4:25 AM Data Cleaning Data management Data Processing
Thanks to Kit Baum, a new package called xtbalance2 is available on SSC. xtbalance2 creates an indicator variable to identify a balanced s...
Difference in Difference with propensity score matching

4:25 AM Data Cleaning Data management Data Processing
Hello Statalist team, may I ask you the following question? I would like to use the DID with propensity score matching in the following set...
How to give different color for all the scatter plots in a graph?

3:25 AM Data Cleaning Data management Data Processing
Hello everyone, I am using Stata 16.0. The data is in wide format with 29 observations. It is national data with state variables. I woul...
Tobit regression with "outreg2" output?

2:25 AM Data Cleaning Data management Data Processing
Dear All, I run this Tobit model Code: sysuse auto, clear gen wgt=weight/1000 // Censored from below tobit mpg wgt foreign, ll(17) outreg2...
Lagged dependent variable

2:25 AM Data Cleaning Data management Data Processing
Hi dear, I have a question. In my dataset, the dependent variable is related to 2020 but the independent ones, like GDP per capita, unemplo...
How to convert to date when data is saved as long format (%tdD_m_Y); for example, 15 Oct 13

1:25 AM Data Cleaning Data management Data Processing
How to convert to date when data is saved as long format (%tdD_m_Y); for example, 15 Oct 13
How to report the differences in a nice table/graph?

12:25 AM Data Cleaning Data management Data Processing
Hi all, I have self-reported and actual household income from the national register in 2010. I have compared them (see below), but I am not ...
xtsur with restriction

Friday, January 29, 2021 Data Cleaning Data management Data Processing
Dear All, I use xtsur for unbalanced panel data. Is there a way to restrict the coefficients of two equations to one? It is allowed in sureg c...
lag regression: Not sorted error

11:25 PM Data Cleaning Data management Data Processing
Hi, I have tried to run lagged regressions but the error keeps coming up as not sorted. I have sorted it several times such as sort country...
Repeated time values within panel help

10:25 PM Data Cleaning Data management Data Processing
Hi All, I am having some trouble with understanding the xtset command even after reading through some past posts and the help section on i...
Collapsing multiple variables using the count option

9:25 PM Data Cleaning Data management Data Processing
Hi all, I have a dataset where every row represents a study. Each row contains a column describing the region and country where the study ...
Retaining the entire ID if there are certain variables in the panel data

7:25 PM Data Cleaning Data management Data Processing
Good morning, world. English is not my language, so Google helps me. Please understand me. I have Stata 15. I'm using panel data with 191 ...
Replacing missing values with draws from a subset of nonmissing values

5:25 PM Data Cleaning Data management Data Processing
I am replicating an imputation procedure for a household survey, and I'm struggling to find an elegant coding solution for bracketed resp...


Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multiple times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thus far been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is the 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...


About Me

Mtenga Baltazar

© BJ Data Tech Solution