BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

  • Home
  • Data Management
  • Data Analysis
  • Data Collection Tools Tips
System GMM and AB test for AR(2)

System GMM and AB test for AR(2)

Thursday, January 31, 2019 Data Cleaning Data management Data Processing
Hi All: I am using system GMM method to estimate my panel data (T=6). I am new to this method. I know that the assumption of system GMM is...
Documentation for non existent function

Documentation for non existent function

10:50 PM Data Cleaning Data management Data Processing
I was learning string functions in Stata. After obtaining help using help string functions command, one thing that particularly caught my a...
How to declare weekly data as time series data in Stata 15

How to declare weekly data as time series data in Stata 15

10:50 PM Data Cleaning Data management Data Processing
Dear All I want to use time series data on a weekly basis. When I try, I get the error "repeated time values in sample" I have ...
Creating and storing residuals in a loop

Creating and storing residuals in a loop

10:11 PM Data Cleaning Data management Data Processing
Hi all I am trying to run the following code. The purpose is to capture industry and year wise residuals from the stated model. It works ...
Creating and storing residuals in a loop

Creating and storing residuals in a loop

10:11 PM Data Cleaning Data management Data Processing
Code: forvalues y = 2015/2017 { // Define a for/next loop spanning years in sample forvalues i = 1/45 { // Define a for...
Question about reghdfe

Question about reghdfe

8:35 PM Data Cleaning Data management Data Processing
Could you please answer my questions about reghdfe command? 1. When I conducted estimation using reghdfe, the following error messages ...
Can I use tssmooth for a fixed number of periods like a rolling forecast, safe the forecast t+5, then start in t+1, save forecast t+6...

Can I use tssmooth for a fixed number of periods like a rolling forecast, safe the forecast t+5, then start in t+1, save forecast t+6...

7:20 PM Data Cleaning Data management Data Processing
Hello everybody, I'm trying to forecast values for the strategy of companies. I have a panel dataset for companies from 2000 - 2015, b...
Calibration of logistic regression on large dataset.

Calibration of logistic regression on large dataset.

4:10 PM Data Cleaning Data management Data Processing
Evaluating goodness-of-fit for a logistic regression model using the Hosmer-Lemeshow test is not reliable in large datasets. Which method w...
Weak IV test postestimation test when using ivreghdfe command?

Weak IV test postestimation test when using ivreghdfe command?

2:01 PM Data Cleaning Data management Data Processing
Dear STATA community, I am hoping you can help me find a command I am looking for. My coauthor and I ran the following regression (I dropp...
Generate a moving window average

Generate a moving window average

1:40 PM Data Cleaning Data management Data Processing
Is there an easy way to generate a moving window average? For instance, for every 5 minutes of Ta_NOAA, I want to generate the average Ta_NO...
Guarantee 3 consecutive observations before and after the event

Guarantee 3 consecutive observations before and after the event

1:25 PM Data Cleaning Data management Data Processing
Dear Stata Users, I need to keep just those firms (gvkey) that have 3 year consecutive observations before and after the event. The event ...
Simple help with global macros

Simple help with global macros

1:10 PM Data Cleaning Data management Data Processing
Hello, I believe I incorrectly posted this to the Statalist earlier today. I recognize this question is simple but I need help. I try to d...
LPOLY: How can I change the scale of the axis on the lpoly graph?

LPOLY: How can I change the scale of the axis on the lpoly graph?

12:15 PM Data Cleaning Data management Data Processing
Dear Stata users: When I do the lpoly graph, it seems no option allows me to restrict my scale of the axis to a certain range. What should...
How can I make a line graph for data from a certain date range?

How can I make a line graph for data from a certain date range?

10:50 AM Data Cleaning Data management Data Processing
Hello! I'm working with daily time series data with the date originally in the format "01/29/19" (string) which I changed to 2...
Missing R-squared from IV regression

Missing R-squared from IV regression

8:51 AM Data Cleaning Data management Data Processing
Question : How can I display/find the missing "within R-squared" from an IV regression? Example of the problem: Suppose I use ...
Longitudinal data - generating variables dependent on observations within each subject

Longitudinal data - generating variables dependent on observations within each subject

8:26 AM Data Cleaning Data management Data Processing
Hi everyone, I have longitudinal data (see dataex below). I need to censor each id according to a few conditions. Condition 1: if within ...
Residuals in a panel data model

Residuals in a panel data model

8:26 AM Data Cleaning Data management Data Processing
Hi I am running a regression (using panel data) looking at the effect of income on food consumption and controlling for age. I am trying to...
Adding an interaction term into a model or stratifying data , which method is more preferable to analyse interaction terms?

Adding an interaction term into a model or stratifying data , which method is more preferable to analyse interaction terms?

8:10 AM Data Cleaning Data management Data Processing
Hi Statlists, Hope this post finds you well. May I know why stratification seems to be less preferable than adding an interaction term int...
Marginal effects Tobit (mfx vs margins)

Marginal effects Tobit (mfx vs margins)

8:00 AM Data Cleaning Data management Data Processing
Hello I'm trying to calculate the marginal effects of a Tobit model using the margins command instead of mfx, because margins is faster...
Help with using expand>2 while replacing values in duplicates generated

Help with using expand>2 while replacing values in duplicates generated

7:51 AM Data Cleaning Data management Data Processing
Hi, I am trying to use the expand command to create duplicates and replacing one of the variables in the row. For example, expand 2 if ...
Individual Caliper for Variables Nearest-Neighbour Matching (psmatch2)

Individual Caliper for Variables Nearest-Neighbour Matching (psmatch2)

6:39 AM Data Cleaning Data management Data Processing
Dear Community, I aim to apply nearest neighbour matching using the mahalanobis option in the the psmatch2 package and Stata15. Given the ...
A question on macro expression.

A question on macro expression.

5:54 AM Data Cleaning Data management Data Processing
If we want to write i = i + 1, we could use ++i. I wonder if there is also a short expression for i = i + 2 (or any number > 2)?
Launching a Free Online Course: Introduction to GIS in R

Launching a Free Online Course: Introduction to GIS in R

5:29 AM Data Analysis
We’re so excited to announce the launch of our second online course about geospatial data in R. Sign up here. When you hear “geospatial da...
Panel data - dropping cross section based on missing values

Panel data - dropping cross section based on missing values

3:59 AM Data Cleaning Data management Data Processing
In a panel dataset, there are companies with revenues for multiple years. I would like to drop all companies if their revenue information ...
Problems when running optimal k-means cluster solution program

Problems when running optimal k-means cluster solution program

3:19 AM Data Cleaning Data management Data Processing
Dear Team, After reading the excellent "Stata tip 110: How to get the optimal k-means cluster solution, Stata Journal (2012) 12, Numb...
Studentized deleted residuals and DFfits after logistic regression in Stata. How to calculate?

Studentized deleted residuals and DFfits after logistic regression in Stata. How to calculate?

2:34 AM Data Cleaning Data management Data Processing
How can I calcilate studentized deleted (externally, jackknifed) residuals and dffits after performing logistic regression in Stata? The rst...
Results Interpretation

Results Interpretation

2:14 AM Data Cleaning Data management Data Processing
Hi everyone, Can anyone help me interpret these results, specifically the F-test below. What does this mean? Thank you in advance!
Why do i have large z test statistics when i ran translog model

Why do i have large z test statistics when i ran translog model

1:44 AM Data Cleaning Data management Data Processing
First i ran frontier model and then i did translog method here i have obtained large wald chisquare and large z test statistics my lnv2sig2v...
Replacing missing variable with other observations that satisfy certain conditions

Replacing missing variable with other observations that satisfy certain conditions

1:44 AM Data Cleaning Data management Data Processing
Hi all, I am working on a cross-country dataset. The dataset is created when I merge the bilateral trade data with the country characterist...
Collapse different columns differently

Collapse different columns differently

1:34 AM Data Cleaning Data management Data Processing
In a panel database of companies (bisnode), the revenue information is not always for the full year...sometimes it's quarterly, for exam...
Machine Learning setup

Machine Learning setup

12:54 AM Data Cleaning Data management Data Processing
How can I break down my loaded dataset into training set and test set, and develop random forest on the training set, calculating fit for bo...
Wrangling panel data - calculating growth rates and cagr

Wrangling panel data - calculating growth rates and cagr

12:14 AM Data Cleaning Data management Data Processing
I am working with bisnode panel data https://www.bisnodegroup.com/solutio.../company-data/ which has ID, year, and company revenue info. ...
Comparing predictions and regression fitted values between two regression models with an additional explanatory varaiable

Comparing predictions and regression fitted values between two regression models with an additional explanatory varaiable

Wednesday, January 30, 2019 Data Cleaning Data management Data Processing
Dear Statalisters, I am struggling with a task in which I want to investigate how an additional variable (i.e., human rights) affects my r...
Importing previously imputed data using mi import

Importing previously imputed data using mi import

7:59 PM Data Cleaning Data management Data Processing
Hello Stata Users, I have been trying to import previously imputed data (m=0, 1,2,…..20) using mi import command. The data set includes the...
Microsoft Organizational Structure: Divisional Structure with Focus on Innovation

Microsoft Organizational Structure: Divisional Structure with Focus on Innovation

5:26 PM Research Methodology
Microsoft organizational structure can be classified as divisional.  In June 2015, the senior management announced a change in Microsoft org...
Microsoft Leadership: A New Era for Multinational Technology Company

Microsoft Leadership: A New Era for Multinational Technology Company

5:26 PM Research Methodology
Co-founder of the company, Bill Gates was at the helm of Microsoft leadership since its inception in 1972 until 2000, when Steve Ballmer suc...
Scoring measures using STATA

Scoring measures using STATA

4:59 PM Data Cleaning Data management Data Processing
I have dataset that includes items measuring diagnostic criteria for personality disorders. For each criterion there are multiple items. For...
Problem with nlsur command

Problem with nlsur command

4:49 PM Data Cleaning Data management Data Processing
Dear Stata users, I am currently running the nlsur command to estimate a system of 2 equations using the following syntax: Code: nlsu...
How significance of stata pwcorr is calculated?

How significance of stata pwcorr is calculated?

4:19 PM Data Cleaning Data management Data Processing
When i run pwcorr, with the option sig, how stata test de significance of the correlation coeficient? How its calculated?
Propensity score weighting on samples

Propensity score weighting on samples

3:14 PM Data Cleaning Data management Data Processing
I want to weight my sample with propensity scores first, then run OLS with weighted sample. I know PHP Code: teffect  does this job by c...
Statistical comparison between 6 groups with unequal variance and 1 observation.

Statistical comparison between 6 groups with unequal variance and 1 observation.

2:59 PM Data Cleaning Data management Data Processing
Dear Statalists, I am analyzing a dataset which includes two variable "tech changing rate" and "Group". "Group...
Looking for US data base for tuition fees

Looking for US data base for tuition fees

2:59 PM Data Cleaning Data management Data Processing
Hi, Im currently working in a research on higher education in the United States. I have searched for a database with the average tuition c...
Error "initial values not feasible" for multiple imputation.

Error "initial values not feasible" for multiple imputation.

2:14 PM Data Cleaning Data management Data Processing
Dear experts, I'd like to ask for your help with syntax. I am doing a multiple imputation as below, but I get an error "initial ...
Geographic Regression discontinuity

Geographic Regression discontinuity

2:05 PM Data Cleaning Data management Data Processing
Hello guys, I am comparing the economic well-being of some group of people residing on opposite sides of a certain border. I believe there ...
Weighting without knowing psu

Weighting without knowing psu

1:54 PM Data Cleaning Data management Data Processing
Dear Stata users, I want to conduct some cross-sectional analysis with data from the South African NIDS (National Income Dynamics Study)/h...
Way to identify first successful loop iteration?

Way to identify first successful loop iteration?

1:14 PM Data Cleaning Data management Data Processing
Hi Stata, I have a loop: forv x = 1/100{ cap{ [stata_commands] [if first successful iteration, execute command x ] } } How can I...
-mimrgns- and -marginsplot-

-mimrgns- and -marginsplot-

1:04 PM Data Cleaning Data management Data Processing
The help file for mimrgns states that while "[i]n principle, marginsplot works after mimrgns […], the plotted confidence intervals a...
Confusion about how to keep one row of data for each student with the most number of classes taken in a program

Confusion about how to keep one row of data for each student with the most number of classes taken in a program

1:04 PM Data Cleaning Data management Data Processing
I am using Stata 15.1 for Windows. I would like to keep a certain student in the program that he concentrated in the most. If he took 1 cl...
Use past quarter average of data to regress with next quarter data.

Use past quarter average of data to regress with next quarter data.

11:29 AM Data Cleaning Data management Data Processing
Hello all, I am a newbie to STATA and struggling with a peculiar problem. Hoping for some insight from you all. I am looking to see the ...
problems with moving average - panel data

problems with moving average - panel data

11:04 AM Data Cleaning Data management Data Processing
Hi everyone, I am using Italian administrative longitudinal data for 10 years. Since I would like to understand how household income has ev...
Dropping variables in the batch structure

Dropping variables in the batch structure

10:44 AM Data Cleaning Data management Data Processing
My dataset contains ~4000 variables from a survey that collected data on upto 10 members from the same household. (AGE_01, AGE_02...AGE_10/S...
Principal component regression*using the*multinomial logit model

Principal component regression*using the*multinomial logit model

10:19 AM Data Cleaning Data management Data Processing
Hello everybody. I have a few questions on the principal component regression. The latter consists of three steps which are well summarize...
Reshaping multiple variables in one dataset using STATA

Reshaping multiple variables in one dataset using STATA

8:35 AM Data Cleaning Data management Data Processing
Good morning, I would like to reshape my data from wide to long. I have multiple variables within the dataset that I would like to do this...
Preparing household data

Preparing household data

8:25 AM Data Cleaning Data management Data Processing
Dear Stata list My data has the following format: Code: clear input householdID personID personinHHID personsfatherID 1001 ...
Implement Multinomial Logit Model using ml command

Implement Multinomial Logit Model using ml command

8:10 AM Data Cleaning Data management Data Processing
I would like to implement multinomial logit model using maximum likelihood command to generate the same result as mlogit. Pr(y=1) = exp(bet...
panelsubmatrix(): 3301 subscript invalid

panelsubmatrix(): 3301 subscript invalid

7:59 AM Data Cleaning Data management Data Processing
I use stata15 and I write Code: xsmle INDEX CREDDEV CHEXCHRATE UNINF, wmat(Z) model(sdm) and I get this error Code: panelsubmatrix(...
How to use "/" in Stata

How to use "/" in Stata

7:05 AM Data Cleaning Data management Data Processing
Hi everyone, I would like to know how to build a code that lets me display the following data: Individual's heights of those individ...
Newer Posts Older Posts Home
Subscribe to: Posts (Atom)

Latest Articles

Categories

  • CouchDb Skills
  • Data Analysis
  • Data Cleaning
  • Data management
  • Data Processing
  • Research Methodology

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multible times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thusfar been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

Recomended Articles

Powered by Blogger.

About Me

Mtenga Baltazar
View my complete profile

Blog Archive

  • ►  2024 (6)
    • ►  February (6)
  • ►  2023 (877)
    • ►  November (1)
    • ►  October (9)
    • ►  September (14)
    • ►  July (9)
    • ►  June (15)
    • ►  May (133)
    • ►  April (174)
    • ►  March (176)
    • ►  February (157)
    • ►  January (189)
  • ►  2022 (2201)
    • ►  December (181)
    • ►  November (180)
    • ►  October (198)
    • ►  September (182)
    • ►  August (182)
    • ►  July (194)
    • ►  June (174)
    • ►  May (167)
    • ►  April (181)
    • ►  March (186)
    • ►  February (170)
    • ►  January (206)
  • ►  2021 (7379)
    • ►  December (327)
    • ►  November (645)
    • ►  October (646)
    • ►  September (639)
    • ►  August (557)
    • ►  July (649)
    • ►  June (656)
    • ►  May (697)
    • ►  April (683)
    • ►  March (697)
    • ►  February (518)
    • ►  January (665)
  • ►  2020 (7956)
    • ►  December (653)
    • ►  November (659)
    • ►  October (598)
    • ►  September (654)
    • ►  August (660)
    • ►  July (682)
    • ►  June (683)
    • ►  May (708)
    • ►  April (692)
    • ►  March (698)
    • ►  February (638)
    • ►  January (631)
  • ▼  2019 (9458)
    • ►  December (601)
    • ►  November (643)
    • ►  October (650)
    • ►  September (637)
    • ►  August (645)
    • ►  July (681)
    • ►  June (654)
    • ►  May (1034)
    • ►  April (1079)
    • ►  March (1122)
    • ►  February (876)
    • ▼  January (836)
      • System GMM and AB test for AR(2)
      • Documentation for non existent function
      • How to declare weekly data as time series data in ...
      • Creating and storing residuals in a loop
      • Creating and storing residuals in a loop
      • Question about reghdfe
      • Can I use tssmooth for a fixed number of periods l...
      • Calibration of logistic regression on large dataset.
      • Weak IV test postestimation test when using ivregh...
      • Generate a moving window average
      • Guarantee 3 consecutive observations before and af...
      • Simple help with global macros
      • LPOLY: How can I change the scale of the axis on t...
      • How can I make a line graph for data from a certai...
      • Missing R-squared from IV regression
      • Longitudinal data - generating variables dependent...
      • Residuals in a panel data model
      • Adding an interaction term into a model or stratif...
      • Marginal effects Tobit (mfx vs margins)
      • Help with using expand>2 while replacing values in...
      • Individual Caliper for Variables Nearest-Neighbour...
      • A question on macro expression.
      • Launching a Free Online Course: Introduction to GI...
      • Panel data - dropping cross section based on missi...
      • Problems when running optimal k-means cluster solu...
      • Studentized deleted residuals and DFfits after log...
      • Results Interpretation
      • Why do i have large z test statistics when i ran t...
      • Replacing missing variable with other observations...
      • Collapse different columns differently
      • Machine Learning setup
      • Wrangling panel data - calculating growth rates an...
      • Comparing predictions and regression fitted values...
      • Importing previously imputed data using mi import
      • Microsoft Organizational Structure: Divisional Str...
      • Microsoft Leadership: A New Era for Multinational ...
      • Scoring measures using STATA
      • Problem with nlsur command
      • How significance of stata pwcorr is calculated?
      • Propensity score weighting on samples
      • Statistical comparison between 6 groups with unequ...
      • Looking for US data base for tuition fees
      • Error "initial values not feasible" for multiple i...
      • Geographic Regression discontinuity
      • Weighting without knowing psu
      • Way to identify first successful loop iteration?
      • -mimrgns- and -marginsplot-
      • Confusion about how to keep one row of data for ea...
      • Use past quarter average of data to regress with n...
      • problems with moving average - panel data
      • Dropping variables in the batch structure
      • Principal component regression*using the*multinomi...
      • Reshaping multiple variables in one dataset using ...
      • Preparing household data
      • Implement Multinomial Logit Model using ml command
      • panelsubmatrix(): 3301 subscript invalid
      • How to use "/" in Stata
      • is it possible to do a repeated Measures ANOVA wit...
      • Merging two databases. With different dates. Best ...
      • defining perfect sequence pairs and average non-pe...
      • destringing
      • Labeling multiple variables using the foreach loop
      • bootstrapping after xtreg
      • xtabond2, Is my model valid?
      • Repeated time values within panel r(451)
      • type mismatch error in htlist command of htopen pa...
      • Factor variable and menl
      • When does multi-collinearity increase significance...
      • "randomselect" in a program used in a simulation
      • Looping through multiple dependent variables with ...
      • looping over observations - retaining the values f...
      • Interpreting coefficients in logit
      • Factor variable and menl
      • create Adjacency matrix nwcommands
      • Heteroscedasticity and auto correlation in dummy v...
      • Blanks in polychoric matrix
      • Issue with predictnl
      • Pedroni's Panel Cointegration Tests and Panel Dyna...
      • Renaming Windows directory using Stata syntax?
      • Issue with two-step multinomial logit estimation (...
      • Forecast applications of dynamic panel data models...
      • Generalized ordered probit: "Looping over program"...
      • vce(cluster clustvar) & fe with logit
      • Advice regarding estimating competing risk
      • Confidence intervals of medians by group
      • Multiple Regression Tables into a Single Word File...
      • Interpretation: Difference in difference on repeat...
      • Interpretation: Difference in difference on repeat...
      • Margins for multi equation regression with transfo...
      • Examine treatment effect at different follow-up pe...
      • Date Variable to show only year
      • retrospective power analysis on panel data
      • Microsoft Business Strategy and Competitive Advantage
      • Internal validation of risk prediction model - Boo...
      • if [varname] = [values]?
      • Help with a detailed specification using multiple ...
      • Number of parameters in exploratory factor analysis
      • Exogeneity test for nonlinear unbalanced panel wit...
      • Test differences in firms that adopted org change
      • Multilevel analysis
  • ►  2018 (931)
    • ►  December (692)
    • ►  November (239)

© BJ Data Tech Solution | Theme by Rifki.id | Premium Blogger Templates | PBT | Powered by Blogger |-| About | Privacy Policy | Sitemap | Contact | Disclaimer