BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

  • Home
  • Data Management
  • Data Analysis
  • Data Collection Tools Tips
Rename variables to remove part of the text

Rename variables to remove part of the text

Thursday, February 28, 2019 Data Cleaning Data management Data Processing
Dear Statalists, I have the following variables and would like to rename them to remove untitled### from the name. Please help me do this....
Betareg

Betareg

9:12 PM Data Cleaning Data management Data Processing
Hi all, I have a propotion dependent variable that’s greater than zero and less then one. The mean is 0.1240891 and SD 0.1363 and its posi...
ivreg, ivprobit and biprobit which one to use? (any theoretical reasoning?)

ivreg, ivprobit and biprobit which one to use? (any theoretical reasoning?)

8:42 PM Data Cleaning Data management Data Processing
As I do not have any econometric background I find it difficult to understand which one to use when outcomes differ a lot. My dependent (fem...
Question spmap

Question spmap

6:27 PM Data Cleaning Data management Data Processing
Hi, I have a question about spmap. I'm just a beginner with STATA and I want to know how to use spmap for dummy variables. In my exam...
renaming using loop

renaming using loop

5:02 PM Data Cleaning Data management Data Processing
Code: * Example generated by -dataex-. To install: ssc install dataex clear input str43 Q2_8a1 double(Q2_8b1 Q2_8c1 Q2_8d1 Q2_8e1) str43 ...
creating a variable that takes a value of the difference of the last non-missing value and the first non-missing value in panel data

creating a variable that takes a value of the difference of the last non-missing value and the first non-missing value in panel data

1:38 PM Data Cleaning Data management Data Processing
i have panel time series. I want to create a variable that is the difference between the last non-missing value and the first non-missing v...
Binscatter2 - faster, enhanced binned scatterplots in Stata

Binscatter2 - faster, enhanced binned scatterplots in Stata

1:32 PM Data Cleaning Data management Data Processing
Hi all, I wanted to make note of a program that I've had available on GitHub for a while now to generate binned scatterplots in Stata,...
How does one put formatted date labels into excel with Putexcel?*

How does one put formatted date labels into excel with Putexcel?*

11:17 AM Data Cleaning Data management Data Processing
Hi there, Here's my code: local vars "date" foreach pre in `vars'{ putexcel set "tabout/cleaningtest_`pre...
Extracting a specific part of a string if the string contains an exact syntax

Extracting a specific part of a string if the string contains an exact syntax

10:47 AM Data Cleaning Data management Data Processing
OK, so I'm not explaining this very well but here is what I want to do. I have data as shown below. The incentive has a maximum of ...
Proportional odd assumption for ordered logit regression in panel data

Proportional odd assumption for ordered logit regression in panel data

10:12 AM Data Cleaning Data management Data Processing
Respected sir, I am using xtologit command for ordered logit regression for panel data. My dependent variable has three categories. I want t...
How to solve autocorrelation and hetero ?

How to solve autocorrelation and hetero ?

9:42 AM Data Cleaning Data management Data Processing
I using xtserial and xttest3 What should I do to deal with autocorrelation and hetero ? My data have 9 countries and 10-40 quarters ( T ...
a new variable based on two existing variables

a new variable based on two existing variables

8:42 AM Data Cleaning Data management Data Processing
Hi, I'm using SHARE dataset where there is a dummy variable for people having played chess/cards in the previous year (ac035d10) and a...
Simple loop question

Simple loop question

8:27 AM Data Cleaning Data management Data Processing
Dear Listers, I would like to ask you a very simple question regarding loop. I want to make a variable for each of countries listed in m...
How to recode data with multiple groups

How to recode data with multiple groups

8:17 AM Data Cleaning Data management Data Processing
Hi Everyone, I am fairly new to stata and I am looking at cancer data. I have histology groupings for lung cancer. The groups all over the...
dummy for panel data

dummy for panel data

7:57 AM Data Cleaning Data management Data Processing
Hi, how to create dummy variable for panel data? I want to create a dummy for 130 districts for 3 years; 2010, 2012 and 2014. If the distri...
Issues with Hausman-Wu test

Issues with Hausman-Wu test

7:57 AM Data Cleaning Data management Data Processing
Hi everyone, I am having issues with running a Hausman-Wu test. The output is as follows: " hausman fe re Note: the rank of the d...
Local macro list and tuples

Local macro list and tuples

7:42 AM Data Cleaning Data management Data Processing
I am in need of two different tuples lists, so I would like to store one set as a local and call the other in a different loop. Below is t...
Help on modelling- endogeneity-panel data

Help on modelling- endogeneity-panel data

6:47 AM Data Cleaning Data management Data Processing
Hello, I am running a model to find the effect of corurption on GDP in across Italian regions using panel data. I have found endogeneity b...
Joining datasets when a variable is in a different format

Joining datasets when a variable is in a different format

6:42 AM Data Cleaning Data management Data Processing
Hi all, I currently have a data set that looks like this where each quarter is represented by a seperate variable and contains values of t...
Report estimates from Heckman AND margins command

Report estimates from Heckman AND margins command

6:32 AM Data Cleaning Data management Data Processing
Dear all, I am using Stata 14 and have some questions regarding the commands -Heckman-, -margins- and -outreg2-. I am trying to produce t...
Help with reshape

Help with reshape

6:17 AM Data Cleaning Data management Data Processing
Hello everyone, I have got the following matrix. * Example generated by -dataex-. To install: ssc install dataex clear input int(isin ...
reshape in Stata

reshape in Stata

6:17 AM Data Cleaning Data management Data Processing
Dear All, I have some issues about reshape. Here is the original matrix I have. Code: * Example generated by -dataex-. To install: ssc i...
Question about Goodness of fit result

Question about Goodness of fit result

6:12 AM Data Cleaning Data management Data Processing
Hi, I am running a SEM model on Stata. I've 5 IVs, 1 mediator, and DV. with the sample size of 400. And I obtained this result, do you ...
VISUA Dialog programming GUI

VISUA Dialog programming GUI

5:22 AM Data Cleaning Data management Data Processing
I have seen the presentation on the below link, they have used a VISUA GUI for creating Stata Dialog boxes and survey plugin for data entry....
Converting string to date/time variables fails somehow

Converting string to date/time variables fails somehow

4:47 AM Data Cleaning Data management Data Processing
Hello Statalist community, I am struggeling with converting a string containing date and time information inti SIF/HRF. It seems that the se...
Problem with importing excel on new MacBook

Problem with importing excel on new MacBook

4:27 AM Data Cleaning Data management Data Processing
Hi, I recently started working on a new MacBook from my old MacBook (i.e. I transferred all files, and am now using the new MacBook, Stata...
Problem with graph combine

Problem with graph combine

4:03 AM Data Cleaning Data management Data Processing
Hello, I am trying to combine some graphs. I generate the graphs and save them into a folder. When I try to combine them Stata returns erro...
Time Series Regression

Time Series Regression

1:57 AM Data Cleaning Data management Data Processing
Hello, I am looking at logreturns of two different stock market indices in two countries and seeing if the weather (Cloud, temp, humidity,...
Garch model regression

Garch model regression

1:47 AM Data Cleaning Data management Data Processing
Hello, I am looking at logreturns of two different stock market indices in two countries and seeing if the weather (Cloud, temp, humidity,...
How to destring date variable formatted as year and week number

How to destring date variable formatted as year and week number

Wednesday, February 27, 2019 Data Cleaning Data management Data Processing
Dear Statalist, I was just wondering if there is an easy way to destring a date variable formatted as "year and week number". f...
IVREG LIML yields zero cefficients and p-value=1.

IVREG LIML yields zero cefficients and p-value=1.

7:37 PM Data Cleaning Data management Data Processing
I am using Stata to run "IVREG LIML" which is an IV with Limited Maximum Likelihood estimator. However, some of the coefficients a...
Page break in PDF output

Page break in PDF output

4:52 PM Data Cleaning Data management Data Processing
I need to put page breaks in PDF strategically to keep certain lines of text together on the same page. Is there any way to measure the ...
importing multiple excel files using a loop

importing multiple excel files using a loop

12:17 PM Data Cleaning Data management Data Processing
Hi statalists, I have the following code but I always get the error message local dpto AMAZONAS ANCASH APURIMAC foreach x in local dpto {...
How to get the quintile points in survey data?

How to get the quintile points in survey data?

12:07 PM Data Cleaning Data management Data Processing
I am using a survey data now and trying very hard to figure out the quintile points (20%, 40%, 60%, 80%) of a continuous variable. I know t...
Is it possible to test coefficient estimates across two samples using GMM model?

Is it possible to test coefficient estimates across two samples using GMM model?

11:27 AM Data Cleaning Data management Data Processing
Dear All, I run the same GMM model in two different samples (firms which have high imported input vs low imported input). I would like to ...
Analysis for two continuous variables (not normally distributed)

Analysis for two continuous variables (not normally distributed)

11:27 AM Data Cleaning Data management Data Processing
Dear Statlist, I have two continuous variables, the dependent variable is not normally distributed while the independent variable is norma...
Average Partial Effects (APE) after Dynamic Probit Model ala (Wooldridge, 2005)

Average Partial Effects (APE) after Dynamic Probit Model ala (Wooldridge, 2005)

11:07 AM Data Cleaning Data management Data Processing
I am trying to calculate the average partial effects of the state dependence and some other covariates after running a dynamic probit model....
Bysort

Bysort

10:32 AM Data Cleaning Data management Data Processing
Hello, I have reproduced a simple replica of a bigger problem. I have data which looks as attached. I need, the value against HS 3201 as...
Struggling with Collinearity in Panel Data

Struggling with Collinearity in Panel Data

10:32 AM Data Cleaning Data management Data Processing
Hi, I am struggling with how to set up my regression robustness check. I am currently running a regression of recycling rates on income, p...
Problems with Nesting in Panel Regression "panels are not nested within clusters"

Problems with Nesting in Panel Regression "panels are not nested within clusters"

10:03 AM Data Cleaning Data management Data Processing
Hi, I am not sure how to overcome this error message I am receiving "panels are not nested within clusters". The code I am tryin...
problema with local extended_fcn local list : dir . files "*"

problema with local extended_fcn local list : dir . files "*"

9:42 AM Data Cleaning Data management Data Processing
Hello everyone. this is my first post in Statalist, so please forgive any mistake on it. I'm trying to capture the filenames of seve...
Replace missing observations in VAR1 with observations from VAR2 f

Replace missing observations in VAR1 with observations from VAR2 f

9:02 AM Data Cleaning Data management Data Processing
Dear forum, I am trying to replace the missing observation(s) in ExportMarkets with the value in MirrorExportMarkets. May I ask what the c...
Exporting underlying data behind stata generated graphs

Exporting underlying data behind stata generated graphs

8:32 AM Data Cleaning Data management Data Processing
Hi, i was able to export the underlying data behind a graph i created using a dataset (below) but when i do, i only get the mediam values a...
How to compare prognostic models with a survivaldecision curve analysis for survival outcomes?

How to compare prognostic models with a survivaldecision curve analysis for survival outcomes?

8:02 AM Data Cleaning Data management Data Processing
Hello, i am currently using the dca (stdca) command in stata 14.0 I research a dataset of patients with a prognostic marker and would like...
Subpop MLM: Xtmixed

Subpop MLM: Xtmixed

7:57 AM Data Cleaning Data management Data Processing
Hi all, I am running a multi-level modeling using complex survey data, which used a stratified, clustered, and unequally sampling design. ...
Sample selection in the control function approach

Sample selection in the control function approach

6:51 AM Data Cleaning Data management Data Processing
I am trying to understand what sample it is correct to use when estimating the models using the control function (CF) approach. Below, I exp...
Can't open miest.ster

Can't open miest.ster

6:51 AM Data Cleaning Data management Data Processing
Dear Statalist, I am trying do run a Cox regression on an imputed dataset using restricted cubic splines. I used the following code (the d...
lower what does mean?

lower what does mean?

5:51 AM Data Cleaning Data management Data Processing
hi friends. please help me to understand this code. what does mean lower? gen cityname2=lower(cityofprop)+", "+ lower(stateofprop...
Adding missing values in a varlist

Adding missing values in a varlist

5:21 AM Data Cleaning Data management Data Processing
Hi, In my data set answers below zero each show a different type of missing value (e.g. not answered or not asked etc. from -1 to -10) and...
Endogenous binary regressor in multinomial logit model

Endogenous binary regressor in multinomial logit model

5:01 AM Data Cleaning Data management Data Processing
Hello forum members, i do have a multinomial logit model where one binary explanatory variable is endogenous (EEV). I am aware of solution...
Help with event study on the impact of Hurricane Harvey on the US stock market using SP500 company data

Help with event study on the impact of Hurricane Harvey on the US stock market using SP500 company data

4:56 AM Data Cleaning Data management Data Processing
Dear Reader, I would like some help with a study I am doing at University. I have downloaded data regarding stock prices for companies loc...
Bootstrap after two stage probit

Bootstrap after two stage probit

4:41 AM Data Cleaning Data management Data Processing
Dear all, I was using ivprobit but it seems there is a problem with the margins command when I want to get average marginal effects. It giv...
Hypothesis Testing on coefficients

Hypothesis Testing on coefficients

4:21 AM Data Cleaning Data management Data Processing
Hi I am interested in testing the hypothesis highlighted in the picture. Ho: gamma>=beta and Ha: gamma<beta Is there a way in stat...
Identifying units of analysis in STATA

Identifying units of analysis in STATA

2:11 AM Data Cleaning Data management Data Processing
Hi, I'm starting a master thesis and am very new to STATA with very little experience. I am looking at innovation in various technologie...
log using

log using

12:56 AM Data Cleaning Data management Data Processing
hi friends. i am run this code but faced with error.although it is run without error yesterday, i am confused please help me code: log us...
Plotting categorical variables

Plotting categorical variables

12:41 AM Data Cleaning Data management Data Processing
Hi all, can any of you tel me how I can get ht below graph? My rep variable is life satisfaction and the independent include - marital s...
Newer Posts Older Posts Home
Subscribe to: Posts (Atom)

Latest Articles

Categories

  • CouchDb Skills
  • Data Analysis
  • Data Cleaning
  • Data management
  • Data Processing
  • Research Methodology

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multible times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thusfar been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

Recomended Articles

Powered by Blogger.

About Me

Mtenga Baltazar
View my complete profile

Blog Archive

  • ►  2024 (6)
    • ►  February (6)
  • ►  2023 (877)
    • ►  November (1)
    • ►  October (9)
    • ►  September (14)
    • ►  July (9)
    • ►  June (15)
    • ►  May (133)
    • ►  April (174)
    • ►  March (176)
    • ►  February (157)
    • ►  January (189)
  • ►  2022 (2201)
    • ►  December (181)
    • ►  November (180)
    • ►  October (198)
    • ►  September (182)
    • ►  August (182)
    • ►  July (194)
    • ►  June (174)
    • ►  May (167)
    • ►  April (181)
    • ►  March (186)
    • ►  February (170)
    • ►  January (206)
  • ►  2021 (7379)
    • ►  December (327)
    • ►  November (645)
    • ►  October (646)
    • ►  September (639)
    • ►  August (557)
    • ►  July (649)
    • ►  June (656)
    • ►  May (697)
    • ►  April (683)
    • ►  March (697)
    • ►  February (518)
    • ►  January (665)
  • ►  2020 (7956)
    • ►  December (653)
    • ►  November (659)
    • ►  October (598)
    • ►  September (654)
    • ►  August (660)
    • ►  July (682)
    • ►  June (683)
    • ►  May (708)
    • ►  April (692)
    • ►  March (698)
    • ►  February (638)
    • ►  January (631)
  • ▼  2019 (9458)
    • ►  December (601)
    • ►  November (643)
    • ►  October (650)
    • ►  September (637)
    • ►  August (645)
    • ►  July (681)
    • ►  June (654)
    • ►  May (1034)
    • ►  April (1079)
    • ►  March (1122)
    • ▼  February (876)
      • Rename variables to remove part of the text
      • Betareg
      • ivreg, ivprobit and biprobit which one to use? (an...
      • Question spmap
      • renaming using loop
      • creating a variable that takes a value of the diff...
      • Binscatter2 - faster, enhanced binned scatterplots...
      • How does one put formatted date labels into excel ...
      • Extracting a specific part of a string if the stri...
      • Proportional odd assumption for ordered logit regr...
      • How to solve autocorrelation and hetero ?
      • a new variable based on two existing variables
      • Simple loop question
      • How to recode data with multiple groups
      • dummy for panel data
      • Issues with Hausman-Wu test
      • Local macro list and tuples
      • Help on modelling- endogeneity-panel data
      • Joining datasets when a variable is in a different...
      • Report estimates from Heckman AND margins command
      • Help with reshape
      • reshape in Stata
      • Question about Goodness of fit result
      • VISUA Dialog programming GUI
      • Converting string to date/time variables fails som...
      • Problem with importing excel on new MacBook
      • Problem with graph combine
      • Time Series Regression
      • Garch model regression
      • How to destring date variable formatted as year an...
      • IVREG LIML yields zero cefficients and p-value=1.
      • Page break in PDF output
      • importing multiple excel files using a loop
      • How to get the quintile points in survey data?
      • Is it possible to test coefficient estimates acros...
      • Analysis for two continuous variables (not normall...
      • Average Partial Effects (APE) after Dynamic Probit...
      • Bysort
      • Struggling with Collinearity in Panel Data
      • Problems with Nesting in Panel Regression "panels ...
      • problema with local extended_fcn local list : dir ...
      • Replace missing observations in VAR1 with observat...
      • Exporting underlying data behind stata generated g...
      • How to compare prognostic models with a survivalde...
      • Subpop MLM: Xtmixed
      • Sample selection in the control function approach
      • Can't open miest.ster
      • lower what does mean?
      • Adding missing values in a varlist
      • Endogenous binary regressor in multinomial logit m...
      • Help with event study on the impact of Hurricane H...
      • Bootstrap after two stage probit
      • Hypothesis Testing on coefficients
      • Identifying units of analysis in STATA
      • log using
      • Plotting categorical variables
      • Export Regression results without Fixed Effects us...
      • Fixed effect regression with some conditions
      • Counting the # of factors when using "i.index"
      • Create a var with some conditions in a same column
      • Stata update error: 'Could not move java to .old/'...
      • ML estimation goes forever in longitudinal analysis
      • LCA gsem output tables and intercept significance
      • Significance levels of combined coefficients
      • geodist - calculating distance
      • Omit range of values on the x-axis in a histogram
      • Homogeneity test (Hsiao test) on panel data balanc...
      • How to plot regression in an interval
      • Interpreting coefficient vs. margins from three-wa...
      • Setting e(sample) to compute boostrap standard errors
      • multiple fixed effect wiith areg
      • Non-linear graphs comparison
      • chow test for mixed models
      • Reghdfe Help
      • Apprpriate to use a log for my dependent variable
      • Possible to combine metan results into one forest ...
      • Aggregating and percentage distribution simultaneo...
      • Generating advanced variable in Stata (panel data)
      • Generating age variable in pannel dataset
      • Difference-in- model and data structure
      • Replace with value from another observation?
      • Variable selection in survival analyses
      • Panel data: Identify recurrent strings across columns
      • xtprobit panel data prediction
      • Creating new variable across rows
      • Question on AIC (and Log-likelihood)
      • What pseudo R squared and chi2 obtained with -pste...
      • elasticity calculation with a factor variable regr...
      • Generate time value with year and quarter and xtset
      • Too large R Squared
      • Oaxaca_rif error
      • note: index_msa omitted because of collinearity
      • note: index_msa omitted because of collinearity
      • Line graph with different style for forecasted values
      • egen anycount
      • Linear regression
      • views Creating box plots of the gap between two gr...
      • What observations are at the conditional quantile ...
      • Determining the P value for trends of continuous a...
      • Need help to interpret the interaction term in lon...
    • ►  January (836)
  • ►  2018 (931)
    • ►  December (692)
    • ►  November (239)

© BJ Data Tech Solution | Theme by Rifki.id | Premium Blogger Templates | PBT | Powered by Blogger |-| About | Privacy Policy | Sitemap | Contact | Disclaimer