BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

  • Home
  • Data Management
  • Data Analysis
  • Data Collection Tools Tips
generate a dummy varaible based on whetehr the other dummy has the same or more ==1 than ==0 over the sample period

generate a dummy varaible based on whetehr the other dummy has the same or more ==1 than ==0 over the sample period

Thursday, March 31, 2022 Data Cleaning Data management Data Processing
Hello I have a panel data with firm and year variables. I also have a dummy variable, litigation, which equals 1 if there are 1 or more li...
Random sampling according to group in Stata

Random sampling according to group in Stata

9:23 PM Data Cleaning Data management Data Processing
Hi everyone, I have a question about how to randomly sample data in Stata according to specific groups. Below is my data structure: I wa...
Asreg

Asreg

7:23 PM Data Cleaning Data management Data Processing
When I run the following code webuse grunfeld, clear bys company: asreg invest mvalue kstock, fmb newey(2) I get the following results ...
Loop function

Loop function

7:23 PM Data Cleaning Data management Data Processing
Hello, I want to generate following: gen capm1 = rmrf if decile == 1 and this 10x times, basically until gen capm10 = rmrf if decile ==...
Logit odds ratio

Logit odds ratio

6:23 PM Data Cleaning Data management Data Processing
I am using xtlogit and have a simple model that performs as we might wish. But the odds ratio for one x variable is 186.8 which seems imposs...
Parameter estimates scaled by SD

Parameter estimates scaled by SD

4:23 PM Data Cleaning Data management Data Processing
Hello Forum, Does anybody has an idea how to approach to get parameter estimates scaled by the standard deviation of an underlying variabl...
grouping dimensions with borders

grouping dimensions with borders

2:51 PM Data Cleaning Data management Data Processing
I want horizontal lines separating my the levels of my first dimension. I can almost obtain this using by adding the option "spacer...
Tabulation based on multiple categorical variables

Tabulation based on multiple categorical variables

Wednesday, March 30, 2022 Data Cleaning Data management Data Processing
I am using Stata 17 and I’d like to produce a frequency table ( tabulate one way ) based on multiple variables that have been manually enc...
Quarter Averages to Yearly Averages

Quarter Averages to Yearly Averages

6:25 PM Data Cleaning Data management Data Processing
Hello, Code: * Example generated by -dataex-. For more info, type help dataex clear input float(fundnr month year q_mf_expratio) 1 7 20...
From monthly to yearly in %

From monthly to yearly in %

5:24 PM Data Cleaning Data management Data Processing
Hello! Code: * Example generated by -dataex-. For more info, type help dataex clear input float(fundnr month year mf_m_cf) 1 7 2011 ...
Time series dummy variable

Time series dummy variable

3:23 PM Data Cleaning Data management Data Processing
Hi all, I am trying to make a time-series line plot for employment as a dummy variable (called "lfs", where 1="employed...
Ordinal independent variable treated as continuous & marginal effects in ordinal logistic regression

Ordinal independent variable treated as continuous & marginal effects in ordinal logistic regression

3:23 PM Data Cleaning Data management Data Processing
Hello, I am an economics student doing an undergraduate empirical dissertation using data from the European Working Conditions Survey, whe...
complex tag - I think

complex tag - I think

3:23 PM Data Cleaning Data management Data Processing
Hello, below is a case where a person has 2 rows of data and and 2 different values for the variable lx_item_id_rc on the same date (dos_rc)...
No Observation Error with correlation

No Observation Error with correlation

Tuesday, March 29, 2022 Data Cleaning Data management Data Processing
Hi everyone, I'm very new to Stata and I am very confused my this error. I have created two variables to describe flights from differen...
Lag length in panel data

Lag length in panel data

6:23 PM Data Cleaning Data management Data Processing
Dear Statalisters, I have an unbalanced panel and was looking for a way to determine optimal lag length in panels. ​​​ I run Fisher-type...
Missing data

Missing data

5:24 PM Data Cleaning Data management Data Processing
I am looking to analyze missing data so I tried with command mdesc and summ but it came back with no missing data. When I tried codebook it ...
Importing multiple excel worksheets using a loop

Importing multiple excel worksheets using a loop

4:23 PM Data Cleaning Data management Data Processing
Hello everyone, I have a single excel file that has multiple worksheets, the worksheets are named Country 1, Country 2, Country 3 etc. I w...
Error message when conducting Heckman correlation test

Error message when conducting Heckman correlation test

4:23 PM Data Cleaning Data management Data Processing
Dear all, I am trying to use the Heckman correlation test to investigate the possibility of a selection bias in my data. I have panel data...
Seeking advice on how to accurately calculate marginal effects after multivariate probit regression

Seeking advice on how to accurately calculate marginal effects after multivariate probit regression

3:23 PM Data Cleaning Data management Data Processing
Dear statalist users, Hope you all are doing well. I used Stata/MP 14.0 to run a multivariate probit model which has three binary dependen...
Variable labels lost due to reshape

Variable labels lost due to reshape

Monday, March 28, 2022 Data Cleaning Data management Data Processing
Hello, I have a dataset (first example below) that I reshape from wide to long. However I lost of the label information. How can I keep the...
Pearson's Chi-Squared Test - Requesting help with tab2

Pearson's Chi-Squared Test - Requesting help with tab2

5:23 PM Data Cleaning Data management Data Processing
I am currently working on an extension for research that requires me to measure whether the allocation of patents to examiners are random. I...
Merging multiple .tex files: How to sort the individual files?

Merging multiple .tex files: How to sort the individual files?

5:23 PM Data Cleaning Data management Data Processing
I am outputting a lot of tex tables as raw files and then merge them into one ordered tex document using an automated procedure. The code be...
scatterplot

scatterplot

5:23 PM Data Cleaning Data management Data Processing
I would like to do several scatterplots were I plot the correlation btw mortality and covid-cases and were I would like to identify the trea...
Missing Time-Series Data

Missing Time-Series Data

4:23 PM Data Cleaning Data management Data Processing
Hi Statalist, I have a time series dataset of ozone data for multiple different counties. It has quite a bit of missing data, sometimes se...
LSDVC method

LSDVC method

3:23 PM Data Cleaning Data management Data Processing
After regressions with LSDVC method stata indicates bootsraps error. I want to know what is differnce between Bootstraps and error standard?...
Ytitle with euro sign does not display horizontally

Ytitle with euro sign does not display horizontally

3:23 AM Data Cleaning Data management Data Processing
Despite putting angle(0) the title will not be displayed horizontally. Since this only happens for the title but not the label, I presume it...
Generating random variables based on some predefined correlations

Generating random variables based on some predefined correlations

1:23 AM Data Cleaning Data management Data Processing
Dear Stata Members I would like to ask a few questions and clear some doubts regarding regressions but for that, I need to create some rand...
Export Word putdocx

Export Word putdocx

12:23 AM Data Cleaning Data management Data Processing
Hi I have not had Stata for too long and am trying to create a Word export from Stata. Unfortunately, Stata rounds down or up my data whe...
What should the command be?

What should the command be?

Sunday, March 27, 2022 Data Cleaning Data management Data Processing
I would like to ask about the exact command. Assume yt= (infla t, unrate t, hwages t)is(3×1)-vector, in order to estimate the order of the ...
How can I recode the variable in this long format data set with Stata?

How can I recode the variable in this long format data set with Stata?

8:23 PM Data Cleaning Data management Data Processing
I want to create a new variable "signal" in this long format data set. The rule is set below, if all status values equal 1 within...
Wooldridge Introductory Econometrics 6th Edition CH13 C16 Qvi and Qvii.

Wooldridge Introductory Econometrics 6th Edition CH13 C16 Qvi and Qvii.

8:23 PM Data Cleaning Data management Data Processing
Hi, I am currently solving the question from Wooldridge, the question is as follow: Array The link for the data is: https://s2.smu.edu/t...
Create a Dummy indicating country pair has appeared previously in different variable

Create a Dummy indicating country pair has appeared previously in different variable

5:23 PM Data Cleaning Data management Data Processing
Code: * Example generated by -dataex-. For more info, type help dataex clear input float(Year country_pair2 BIT_pair) 1993 4447 1 1999 25...
Taking strings out of file names

Taking strings out of file names

4:23 PM Data Cleaning Data management Data Processing
I scraped together some xls files using the almighty Python. Before I can/want to work with these files I must rename them, because this cod...
Amazon McKinsey 7S Model

Amazon McKinsey 7S Model

Saturday, March 26, 2022 Research Methodology
Amazon McKinsey 7S model illustrates the ways in which seven key elements of businesses can be united to increase effectiveness. According t...
Imputing missing state-years using average of surrounding years

Imputing missing state-years using average of surrounding years

7:23 PM Data Cleaning Data management Data Processing
Hello, I have a state-year panel dataset, which is mostly balanced except for one missing year in Wisconsin (1998) and two missing years i...
How can I use pweight by hand?

How can I use pweight by hand?

7:23 PM Data Cleaning Data management Data Processing
Dear statalists: I want to use a mirco sampling data in R,but the package used in R is systemfit which doesn't support sampling weight(...
All Dates on Data Editor Coming Up as .

All Dates on Data Editor Coming Up as .

6:23 PM Data Cleaning Data management Data Processing
Hello! Currently new to operating STATA. My Data Editor does not show any dates despite me having dates put in on my REDCap. All dates app...
Estimating the modified Jones model by industry and year using panel data

Estimating the modified Jones model by industry and year using panel data

6:23 PM Data Cleaning Data management Data Processing
Hi all. I'm studying the relationship between restatements and firms' earnings management behaviour. I'm using DA to capture suc...
Combined new variable: code MV from other variables

Combined new variable: code MV from other variables

6:23 PM Data Cleaning Data management Data Processing
I would like to create a variable in which I code 1 if a certain threshold in several other indicators is met. 0 only if the threshold is no...
Singleton dummy variable (possible problem)

Singleton dummy variable (possible problem)

4:23 PM Data Cleaning Data management Data Processing
Dear all, I am running a couple of regressions with country and year fixed effects. But, in some regressions, when I include dummies for c...
Calculando o coeficiente de Gini

Calculando o coeficiente de Gini

3:23 PM Data Cleaning Data management Data Processing
Boa tarde pessoal, estou tentando calcular o índice de gini para alguns setores da economia, identificados de 1 a 5. Busquei em alguns fórun...
export spatial weight matrix

export spatial weight matrix

Friday, March 25, 2022 Data Cleaning Data management Data Processing
Hi, I'm exploiting district-level unbalanced panel data. I used the following command to generate a weight matrix w HTML Code: spw...
Event Study Graph: Do I need an indicator variable for each pre and post treatment period (except t-1)?

Event Study Graph: Do I need an indicator variable for each pre and post treatment period (except t-1)?

7:23 PM Data Cleaning Data management Data Processing
Code: * Example generated by -dataex-. For more info, type help dataex clear input float(id monthly independent sales TreatZero lead2 lea...
How to Count the Number of the Students Having Grade Retention in a Dataset in Stata Code?

How to Count the Number of the Students Having Grade Retention in a Dataset in Stata Code?

7:23 PM Data Cleaning Data management Data Processing
As the title suggested, I would like to count the number of the students who had grade retention in K-12 system. For example, I want to know...
Reshape wide to long - dataset

Reshape wide to long - dataset

5:23 PM Data Cleaning Data management Data Processing
Hello, I need to organize my data and convert it from wide to long. Since I have to many variables I use the command "set maxvar 3276...
Long data? Help. New to Stata

Long data? Help. New to Stata

4:23 PM Data Cleaning Data management Data Processing
Hi. I am new to Stata, and coming back to data analysis after 20 years since my undergraduate. I am sure that this is a simple fix, but ther...
Making a figure including confidence intervals for multiple groups

Making a figure including confidence intervals for multiple groups

4:23 PM Data Cleaning Data management Data Processing
Dear Stata-users, As I was working on my thesis, I was wondering about the following. I have made two figures with confidence intervals fo...
Newer Posts Older Posts Home
Subscribe to: Posts (Atom)

Latest Articles

Categories

  • CouchDb Skills
  • Data Analysis
  • Data Cleaning
  • Data management
  • Data Processing
  • Research Methodology

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multible times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thusfar been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

Recomended Articles

Powered by Blogger.

About Me

Mtenga Baltazar
View my complete profile

Blog Archive

  • ►  2024 (6)
    • ►  February (6)
  • ►  2023 (877)
    • ►  November (1)
    • ►  October (9)
    • ►  September (14)
    • ►  July (9)
    • ►  June (15)
    • ►  May (133)
    • ►  April (174)
    • ►  March (176)
    • ►  February (157)
    • ►  January (189)
  • ▼  2022 (2201)
    • ►  December (181)
    • ►  November (180)
    • ►  October (198)
    • ►  September (182)
    • ►  August (182)
    • ►  July (194)
    • ►  June (174)
    • ►  May (167)
    • ►  April (181)
    • ▼  March (186)
      • generate a dummy varaible based on whetehr the oth...
      • Random sampling according to group in Stata
      • Asreg
      • Loop function
      • Logit odds ratio
      • Parameter estimates scaled by SD
      • grouping dimensions with borders
      • Tabulation based on multiple categorical variables
      • Quarter Averages to Yearly Averages
      • From monthly to yearly in %
      • Time series dummy variable
      • Ordinal independent variable treated as continuous...
      • complex tag - I think
      • No Observation Error with correlation
      • Lag length in panel data
      • Missing data
      • Importing multiple excel worksheets using a loop
      • Error message when conducting Heckman correlation ...
      • Seeking advice on how to accurately calculate marg...
      • Variable labels lost due to reshape
      • Pearson's Chi-Squared Test - Requesting help with ...
      • Merging multiple .tex files: How to sort the indiv...
      • scatterplot
      • Missing Time-Series Data
      • LSDVC method
      • Ytitle with euro sign does not display horizontally
      • Generating random variables based on some predefin...
      • Export Word putdocx
      • What should the command be?
      • How can I recode the variable in this long format ...
      • Wooldridge Introductory Econometrics 6th Edition C...
      • Create a Dummy indicating country pair has appeare...
      • Taking strings out of file names
      • Amazon McKinsey 7S Model
      • Imputing missing state-years using average of surr...
      • How can I use pweight by hand?
      • All Dates on Data Editor Coming Up as .
      • Estimating the modified Jones model by industry an...
      • Combined new variable: code MV from other variables
      • Singleton dummy variable (possible problem)
      • Calculando o coeficiente de Gini
      • export spatial weight matrix
      • Event Study Graph: Do I need an indicator variable...
      • How to Count the Number of the Students Having Gra...
      • Reshape wide to long - dataset
      • Long data? Help. New to Stata
      • Making a figure including confidence intervals for...
      • Regression for all possible combinations of variables
      • Panel data - matching observations from an unsorte...
      • Seeking advice on how to address the correlations ...
      • Monthly excess return
      • Line graph for categorical data
      • How to suppress categorical outputs in proportiona...
      • DiD with back-to-back treatments
      • Has anyone benchmarked the performance of the new ...
      • Error reshape wide to long: "variable 0_welsh impl...
      • Panel regression - subgroup analysis
      • Principal component analysis
      • [DUVIDA] Separar valores de uma variável
      • duplicate data
      • Weighted average for panel groups and subperiods
      • Fractional response with censored zeros
      • Passing from monthly to yearly returns
      • Dynamic panel vs XTREG vs XTEGAR vs other
      • Interactions between continuous variables in panel...
      • How to make ivreghdfe work for reporting first sta...
      • Mediation using xtreg (indirect, direct, and total...
      • stuck on long to wide format
      • do year effects in xtreg capture year effects for ...
      • New Variable
      • Logit models and stationery predictors*
      • What does inflate mean in a ZINB model?
      • Continuously compounded returns in event studies (...
      • Looping over two tables
      • Comparing two dependent variables with different f...
      • Convert time with a 12 hrs setting, to 24 hrs
      • Can I simulate a children income distribution base...
      • tline on top of the graph using command twoway area
      • A Question about Analyzing Shool-nested Panel Data
      • how to get the coefficient of post and treat when ...
      • Switching regression model and Heckman procedure
      • How can I display the top five observations (by fr...
      • Convert a dataset from wide to long
      • Creating new variable from old variable for each u...
      • Creating new variable for each id from old variabl...
      • Error opening Excel after using merge option on -p...
      • option 3aster not allowed
      • STATA date format
      • stata help with multivariate regression and intera...
      • Proportional hazard model
      • Generate 2nd Highest Value
      • Explanatory and explained variables with opposite ...
      • Struggling to create a percent change variable tha...
      • Microfinance or Self Help Group
      • Continuous variable Treatment Group in DID Regression
      • "by" and missingness in table1_mc
      • &, +, and logical idiosyncrasies
      • Xtreg results in insufficient observations
      • Can't correctly convert timeseries time variable d...
      • GMM xtabond2 with external IVs
    • ►  February (170)
    • ►  January (206)
  • ►  2021 (7379)
    • ►  December (327)
    • ►  November (645)
    • ►  October (646)
    • ►  September (639)
    • ►  August (557)
    • ►  July (649)
    • ►  June (656)
    • ►  May (697)
    • ►  April (683)
    • ►  March (697)
    • ►  February (518)
    • ►  January (665)
  • ►  2020 (7956)
    • ►  December (653)
    • ►  November (659)
    • ►  October (598)
    • ►  September (654)
    • ►  August (660)
    • ►  July (682)
    • ►  June (683)
    • ►  May (708)
    • ►  April (692)
    • ►  March (698)
    • ►  February (638)
    • ►  January (631)
  • ►  2019 (9458)
    • ►  December (601)
    • ►  November (643)
    • ►  October (650)
    • ►  September (637)
    • ►  August (645)
    • ►  July (681)
    • ►  June (654)
    • ►  May (1034)
    • ►  April (1079)
    • ►  March (1122)
    • ►  February (876)
    • ►  January (836)
  • ►  2018 (931)
    • ►  December (692)
    • ►  November (239)

© BJ Data Tech Solution | Theme by Rifki.id | Premium Blogger Templates | PBT | Powered by Blogger |-| About | Privacy Policy | Sitemap | Contact | Disclaimer