BJ Data Tech Solution

Specializing in data processing, data management implementation plans, data collection tools (electronic and paper-based), data cleaning specifications, data extraction, transformation, and loading, analytical datasets, and data analysis. BJ Data Tech Solution teaches the design and development of electronic data collection tools using CSPro, and Stata commands for data manipulation. It also sets up data management systems using modern data technologies such as relational databases, C#, PHP, and Android.

Individual line colours in pcspike

Friday, April 30, 2021 Data Cleaning Data management Data Processing
Hello, I'm trying to create a parallel axis dot plot where each line has a different colour (I've managed to get everything else I...
Line Graph in Panel Dataset

9:28 PM Data Cleaning Data management Data Processing
Hi! I wanted to know how to create a single line graph for a panel dataset? The dataset looks something like this: Country Year GDP A ...
Creating a treatment for DID analysis: how to do it right?

8:25 PM Data Cleaning Data management Data Processing
Hi all. I have panel data containing the number of Brazilian institutions of higher education (variable "n_he_inst") across more t...
Weights in the rdbwselect command

8:25 PM Data Cleaning Data management Data Processing
I am trying to use the rdbwselect command and it has the following syntax (from The Stata Journal (2017) 17, Number 2, pp. 372–404) rd...
Three-level ZINB in Stata

5:25 PM Data Cleaning Data management Data Processing
Hi, I use Stata 16.1 and I need to run a 3-level Zero Inflated Negative Binomial regression. I have seen some previous answers suggest usi...
generating separate variables from one variable's observations, is there a loop?

7:25 AM Data Cleaning Data management Data Processing
Dear all, From the variable v I would like to generate 3 variables: v1, v2, v3. Given that these are the 3 observations of v. I need to ma...
Cleaning for Optimal Modeling (ft. Panel Data)

6:25 AM Data Cleaning Data management Data Processing
Hello Everyone, Let's say I would like to do a linear regression on some Panel data without knowing if linear regression is the most a...
exporting many tables to excel

6:25 AM Data Cleaning Data management Data Processing
Hi, I have a question about how I can export many tables in a good format. I am trying to run regressions for 49 industries (later I need u...
Mean of means, with means obtained from by year, sort + weights

4:25 AM Data Cleaning Data management Data Processing
Hello, I have obtained the mean, median, stdev by year (I have 18 years in the sample) for my expenditure variable a_rep_pos throught the...
parsing a variable based on digits

3:25 AM Data Cleaning Data management Data Processing
Dear All, I try to create a new variable (let's say caseid_copy) based on the original caseid. However, I have problems with its digit...
Replicating Excel Intercept/Slope

3:25 AM Data Cleaning Data management Data Processing
I have four variables, y1-y4 that I would like to regress on four fixed x's, x1-x4. (I have taken the logs of both the x's and y...
how to calculate pseudo-R2 using imputed data in a multinomial logistic regression

3:25 AM Data Cleaning Data management Data Processing
Dear all, I would like to calculate the pseudo-R2 for my multinomial logistic regression in multiple imputed data. This is my code for im...
Creating space between bar charts

2:25 AM Data Cleaning Data management Data Processing
Hi, I am using Stata 17 and need help in bar charts. Below is an example of my dataset: Code: * Example generated by -dataex-. For mor...
Multilevel model with olog?

2:25 AM Data Cleaning Data management Data Processing
I usually use the command: mixed dep_var || country_var: Problem is that my dependent variable is only scaled in two categories (0-1). I...
Generating new ID variable taking into account duplicates across 2 other variables

2:25 AM Data Cleaning Data management Data Processing
First time poster, so I’m sorry for any errors… I have two ID variables (ID1 and ID2). I want to create a new ID variable taking into acco...
Need help with Mean for Grouped Data

2:25 AM Data Cleaning Data management Data Processing
Good morning Stata Family, Please I want to create a new variable for the average wage earned by each worker for B and C shifts al...
Specification of treatment and control group in diff-in-diff

1:25 AM Data Cleaning Data management Data Processing
Dear all, I am doing a difference-in-differences estimation to see the effect of a parental leave reform on mothers' and fathers' w...
marginal effects

1:25 AM Data Cleaning Data management Data Processing
Dear everyone, I have a regression on GDP where I include both inflation, inflation^2 and one lag of these (I also have other control vari...
Endogeneity test for two endogenous variables using REIV (xtivreg, re)

1:25 AM Data Cleaning Data management Data Processing
Dear all, I am using an REIV using the command "xtivreg, re". I have two endogenous variables, y2 and y3 (y3 is an interaction o...
How to choose part of dataset based on date

1:25 AM Data Cleaning Data management Data Processing
Hi I need some help with Stata syntax. I have a dataset with persons who have participated in a health survey, and later had a specific su...
Multiple observations per year in the dependent variable and annual observations for the dependent variables

12:25 AM Data Cleaning Data management Data Processing
Hello Statalisters, I am trying to explore the impact of CSR expenses on the credit ratings of firms. Please find below a snapshot of the ...
Help needed: Export F-stats and P-values to an excel file

Thursday, April 29, 2021 Data Cleaning Data management Data Processing
Hi, I am using Stata 17 and need help in exporting results from Stata to excel. Here is an example of my data: Code: * Example generat...
Direct, indirect and total marginal effect in the mvprobit

10:25 PM Data Cleaning Data management Data Processing
How can I calculate the direct, indirect, and total marginal effects in mvprobit?
Predicted Probabilities with Logit and Fixed Effect

7:25 PM Data Cleaning Data management Data Processing
Hello, I am trying to compare 4 different groups (black women, white women, black men, and white men) and their probability for leadership...
Measuring environmental innovation patents

5:25 PM Data Cleaning Data management Data Processing
Dear Stata experts, For my master's thesis, I'm figuring out how to analyse in order to measure environmental innovation patents. I...
Plotting regression results in scatter plot

5:25 PM Data Cleaning Data management Data Processing
Hello, I have a regression of the form: Code: reg yvar xvar zvar i.year where year is a categorical variable from 1990 to 2020. I wan...
margins & marginsplot in LCA by group

5:25 PM Data Cleaning Data management Data Processing
Good morning colleagues, I'm having some trouble working out how to use margins and marginsplot to present bar charts for my Latent Cl...
Create a variable that summarizes the number of distinct combinations of string variables

5:25 PM Data Cleaning Data management Data Processing
Hi all, Apologies in advance if my problem is unclear. I'm a former Stata user who has since been using Excel and previously SAS so I...
Intersection among datasets

6:25 AM Data Cleaning Data management Data Processing
Hi all, so I have k databases (55) called name_country_mole.dta. Each database is an unbalanced panel of molecules observed for a certain ...
enquiry about using likelihood ratio test when employing conditional logistic regression!

6:25 AM Data Cleaning Data management Data Processing
Hello, I am testing a categorical variable (ethnicity) with a binary outcome (death). I want to obtain a single P value across all categori...
xtlogit with one dummy predictor

5:25 AM Data Cleaning Data management Data Processing
Hello all, I have been looking for days for an answer to this question, but have not found an answer so far. My problem is as follows: I...
Forecasting

5:25 AM Data Cleaning Data management Data Processing
I have been using STATA to do some forecasting on stock market data. When I generate the forecasts and look on a graph they are miles off, e...
Tesla McKinsey 7S Model

4:36 AM Research Methodology
Tesla McKinsey 7S model illustrates the ways in which seven elements of businesses can be aligned to increase effectiveness. According to th...
No observations r(2000)

4:25 AM Data Cleaning Data management Data Processing
Dear everyone, I am working on my master's thesis and I am trying to make a two-step system GMM regression on GDP with a five-year average. ...
Generalise value of dummy variable to all observations of a given panel ID

4:25 AM Data Cleaning Data management Data Processing
Hi, I need to set the value of a dummy variable = 1 for all observations of a given panelID, if it already equals 1 for any observations o...
Creating a categorical variable with multiple dummy variables

4:25 AM Data Cleaning Data management Data Processing
Hello everyone, I am looking for a way to create one categorical variable out of multiple dummy variables. More specifically, I have ten d...
Help needed in loops

3:25 AM Data Cleaning Data management Data Processing
Hi all, I am using Stata 17 and would need some help in my loops. Below is an example of my dataset: Code: * Example generated by -dat...

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multiple times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thus far been impressed with their utility. Specific to this query, I have been...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using an RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...


About Me

Mtenga Baltazar


© BJ Data Tech Solution