BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

  • Home
  • Data Management
  • Data Analysis
  • Data Collection Tools Tips
Possible to put interaction terms of X and indictors of Y quintile in a regression?

Possible to put interaction terms of X and indictors of Y quintile in a regression?

Thursday, June 30, 2022 Data Cleaning Data management Data Processing
Dear Statalist users, I am wondering if I would estimate a regression like this: Y = X + X * Y_Q2 + X * Y_Q3 + X * Y_Q4 + X * Y_Q5 + Y_Q...
how to replace/recode variables that are a multiple of some number

how to replace/recode variables that are a multiple of some number

11:23 PM Data Cleaning Data management Data Processing
Hi I have a variable "level" that ranges from 1 to 125 I would like to recode it into a variable "pitch", such that val...
Shortcut for generating new variables based on many existing

Shortcut for generating new variables based on many existing

6:23 PM Data Cleaning Data management Data Processing
I have groups of variables contactoutcome_con_<y>_<x> and contactmethod_con_<y>_<x>, where x is a program from 1-8 a...
create a loop to convert string variables in numeric variables

create a loop to convert string variables in numeric variables

6:23 PM Data Cleaning Data management Data Processing
Hello, I am Salvatore, happy to join the Stata Forum community. I am a new user who recently started using Stata. For my thesis research, ...
Matching without replacement from a file of pairs for case-control and other applications

Matching without replacement from a file of pairs for case-control and other applications

5:23 PM Data Cleaning Data management Data Processing
Short version: I’m seeking a solution to how to do 1:m matching of cases and controls without replacement, *given a file in long (“edge”) fo...
bysort query issues

bysort query issues

4:23 PM Data Cleaning Data management Data Processing
Hello all: I am trying to replace first row of path with "CLL" and second row of path as "RT" for _merge == 2 cases. T...
Replace command with non-mutually exclusive categorical data

Replace command with non-mutually exclusive categorical data

2:23 PM Data Cleaning Data management Data Processing
Hello, I am working with a dataset from a Twitter content analysis project and am stuck trying out figure out how to take 8 categorical tw...
Number of lags too high

Number of lags too high

Wednesday, June 29, 2022 Data Cleaning Data management Data Processing
Hi everyone, I am writing as I am currently working on my bachelor's thesis with Stata. I am doing a time series analysis. I have fo...
pweights in reghdfe allow colinear variable to generate a coefficient?

pweights in reghdfe allow colinear variable to generate a coefficient?

4:23 PM Data Cleaning Data management Data Processing
Hi, This is my first post here. This question pertains to the use of pweight in reghdfe. I have a variable that is colinear with the fixe...
VAR or PVAR different lags for endgenous variables

VAR or PVAR different lags for endgenous variables

4:23 PM Data Cleaning Data management Data Processing
Dear community, in a VAR or PVAR model would it be possible for endgenous variables to have different lags? The PVAR seems to always use...
comparing the parametric survival regression models

comparing the parametric survival regression models

3:23 PM Data Cleaning Data management Data Processing
What is the command for comparing the parametric survival regression models? For example, I estimate the model by both exponential and Weibu...
comparing the parametric survival regression models

comparing the parametric survival regression models

2:23 PM Data Cleaning Data management Data Processing
What is the command for comparing the parametric survival regression models? For example, I estimate the model by both exponential and Weibu...
Dummy for export-starters and non-exporters in the period just before the export-starter enters the export market

Dummy for export-starters and non-exporters in the period just before the export-starter enters the export market

Tuesday, June 28, 2022 Data Cleaning Data management Data Processing
I want to create a dummy for export starters to non-exporters in the years before entry. How can I code this? When Export[_n]==1 & Ex...
How to inform readers the context of hierarchical logistic regression that removes significant main effects

How to inform readers the context of hierarchical logistic regression that removes significant main effects

4:23 PM Data Cleaning Data management Data Processing
Hello all, I'm examining a hypothetical scenario to determine how living alone and the mechanism of feedback affects a person's wi...
PPML - generating a time dependent threshold

PPML - generating a time dependent threshold

4:23 PM Data Cleaning Data management Data Processing
Dear Stata community, I am working with a gravity model and what to cluster my observations depending on if they are in the highest, middl...
Generating dummy observations to balance a panel

Generating dummy observations to balance a panel

4:23 PM Data Cleaning Data management Data Processing
I hope this request makes sense, as it is just to aid in my estimation. Below is the dataex of a dummy dataset resembling my original, and b...
Determining the explanatory power of an interaction term

Determining the explanatory power of an interaction term

3:23 PM Data Cleaning Data management Data Processing
Hi everyone, I'm trying to fill a table with each line representing the explanatory power of a particular part of my model (such as fi...
Conditional loop analysis with sums in panel data

Conditional loop analysis with sums in panel data

2:23 PM Data Cleaning Data management Data Processing
I have an unbalanced data of company id (i) country code (c) year (y) ratio-1 (r1) ratio-2 (r2) ratio-3 (r3) ratio-4 (r4) ...
Using a loop to calculate new variable

Using a loop to calculate new variable

Monday, June 27, 2022 Data Cleaning Data management Data Processing
Hello all, I want to find the new sale price for each year represented through new_sales . I will use the sale price in 2000q4 as the base...
How to define treatment & control groups properly?

How to define treatment & control groups properly?

5:23 PM Data Cleaning Data management Data Processing
I’m working on a project examining the effect of a 2016 cash transfer on fertility. Who is eligible for the cash? All families with: 1.) ...
What miss option means in gunique ?

What miss option means in gunique ?

3:23 PM Data Cleaning Data management Data Processing
Deat Stata user, I found this gunique function, and using the same variable list I got different total observations and unique observations...
CI Decmposition Results not Showing percentage contribution

CI Decmposition Results not Showing percentage contribution

2:23 PM Data Cleaning Data management Data Processing
Hi All, I am using Stata 16 and trying to decompose the concentration index but the results show nothing for percentage contribution. My d...
synth_runner automatically generated predictorvars

synth_runner automatically generated predictorvars

1:23 AM Data Cleaning Data management Data Processing
I am using synth_runner in STATA 17. I got the exact same results when I run two specifications. depvar remains the same. In the second spec...
Extract country names from affiliations

Extract country names from affiliations

12:23 AM Data Cleaning Data management Data Processing
Hi, I have a dataset of about 1000 articles with variables such as id, title, abstract and affiliation. I was unable to get a dataex due t...
Flexible case-control matching command

Flexible case-control matching command

Sunday, June 26, 2022 Data Cleaning Data management Data Processing
Hello, First, thanks in advance for anyone who can help me with this. I haven't had much luck recently with other avenues so I hoped I...
Working with Time and DateTime variables from Excel

Working with Time and DateTime variables from Excel

7:23 PM Data Cleaning Data management Data Processing
Hi, I noticed that whenever I import an excel file that contains a time or datetime variable onto Stata (time up to seconds), the values f...
scatter graph with different styles and specific axis

scatter graph with different styles and specific axis

5:23 PM Data Cleaning Data management Data Processing
Hi statalist, I have the following data: Code: * Example generated by -dataex-. For more info, type help dataex clear input byte perio...
Twoway Line: Deleting a straight line

Twoway Line: Deleting a straight line

4:23 PM Data Cleaning Data management Data Processing
Hello, I am trying to create a figure using the code: twoway line sum mdate. The figure shows both a actual line and a straight fitted line....
Panel Data fixed effects and time effects

Panel Data fixed effects and time effects

3:23 PM Data Cleaning Data management Data Processing
Hello there, For my master thesis I am conducting research about the effects of the digital divide on the educational attainment in the Eur...
Question on metacumbounds

Question on metacumbounds

12:23 AM Data Cleaning Data management Data Processing
Hi everyone, I would like to seek advice on the metacumbounds package used for trial sequential analysis. I have 2 questions: 1. error ...
Populating a column with values

Populating a column with values

Saturday, June 25, 2022 Data Cleaning Data management Data Processing
I need to populate the empty columns with corresponding values within that year, it could be that I have to repeat the entry of the values (...
Failing to convert .shp and .dbf dile to .dta format

Failing to convert .shp and .dbf dile to .dta format

6:23 PM Data Cleaning Data management Data Processing
Hello, I'm trying to convert a gis file to .dta format. To answer a few questions: .shp and .dbf are in the same directory and there i...
chnaging values of a variable

chnaging values of a variable

6:23 PM Data Cleaning Data management Data Processing
Dear Listers, I am using Stata 15.1. I am working with a dataset with more than a million observations. I would like to change the value...
log differenced model and GMM Estimator

log differenced model and GMM Estimator

6:23 PM Data Cleaning Data management Data Processing
Hi all, I have panel data (T=30) and due to potential threat of non stationarity in my data i've transformed my data into log differen...
heterofactor and ML maximization

heterofactor and ML maximization

3:23 PM Data Cleaning Data management Data Processing
I am using the heterofactor command ( https://www.stata-journal.com/articl...article=st0431 ), but it is not interacting properly with mle m...
Significant in one-way, but not in two-way ANOVA

Significant in one-way, but not in two-way ANOVA

1:23 PM Data Cleaning Data management Data Processing
Hey, I'm currently doing my data analysis for my thesis but I encountered a problem. The main effect is significant in the one-way ANOVA...
Stset with durations

Stset with durations

8:23 AM Data Cleaning Data management Data Processing
Stset with durations. hello i have wide format data with different dates which i used to create different DURATIONS since the begining of o...
Question: Bayesian Vector Autoregression (BVAR)

Question: Bayesian Vector Autoregression (BVAR)

7:23 AM Data Cleaning Data management Data Processing
Hello everyone, is there any code to summarise and or combine results from BVAR model? For instance...quietly, esttab, eststo is not workin...
Split population duration

Split population duration

5:23 AM Data Cleaning Data management Data Processing
Hello, I'm trying to make a Split population duration with the spsurv command but I can't understand how to get on one side the surv...
Merge accuracy using str format when most contain only numbers

Merge accuracy using str format when most contain only numbers

4:23 AM Data Cleaning Data management Data Processing
Dear stata user, I have a question regarding the merge accuracy of str. I have dataset A whose firm_id are in string format, but most of th...
Summing over a value of variable for repeating county

Summing over a value of variable for repeating county

Friday, June 24, 2022 Data Cleaning Data management Data Processing
Hello respected stata community, I have a dummy variable which takes the value 0 and 1. The observation for different counites are given b...
Box plot help

Box plot help

4:23 PM Data Cleaning Data management Data Processing
Hello everyone, I am trying to make this visual from a book by Edward Tufte where he talks about using a stripped down version of the box ...
How to do a Box Plot with mean instead of median and SD instead of quartiles?

How to do a Box Plot with mean instead of median and SD instead of quartiles?

Thursday, June 23, 2022 Data Cleaning Data management Data Processing
Dear Statalisters, Please have a look at my data: Code: * Example generated by -dataex-. For more info, type help dataex clear input f...
Help with bootstrap in obtaining a standard error.

Help with bootstrap in obtaining a standard error.

7:23 PM Data Cleaning Data management Data Processing
Hi Everyone: I think my problem has nothing to do with the data set and so I'm not showing a data example. In the bootstrap, I want a st...
Newer Posts Older Posts Home
Subscribe to: Posts (Atom)

Latest Articles

Categories

  • CouchDb Skills
  • Data Analysis
  • Data Cleaning
  • Data management
  • Data Processing
  • Research Methodology

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multible times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thusfar been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

Recomended Articles

Powered by Blogger.

About Me

Mtenga Baltazar
View my complete profile

Blog Archive

  • ►  2024 (6)
    • ►  February (6)
  • ►  2023 (877)
    • ►  November (1)
    • ►  October (9)
    • ►  September (14)
    • ►  July (9)
    • ►  June (15)
    • ►  May (133)
    • ►  April (174)
    • ►  March (176)
    • ►  February (157)
    • ►  January (189)
  • ▼  2022 (2201)
    • ►  December (181)
    • ►  November (180)
    • ►  October (198)
    • ►  September (182)
    • ►  August (182)
    • ►  July (194)
    • ▼  June (174)
      • Possible to put interaction terms of X and indicto...
      • how to replace/recode variables that are a multipl...
      • Shortcut for generating new variables based on man...
      • create a loop to convert string variables in numer...
      • Matching without replacement from a file of pairs ...
      • bysort query issues
      • Replace command with non-mutually exclusive catego...
      • Number of lags too high
      • pweights in reghdfe allow colinear variable to gen...
      • VAR or PVAR different lags for endgenous variables
      • comparing the parametric survival regression models
      • comparing the parametric survival regression models
      • Dummy for export-starters and non-exporters in the...
      • How to inform readers the context of hierarchical ...
      • PPML - generating a time dependent threshold
      • Generating dummy observations to balance a panel
      • Determining the explanatory power of an interactio...
      • Conditional loop analysis with sums in panel data
      • Using a loop to calculate new variable
      • How to define treatment & control groups properly?
      • What miss option means in gunique ?
      • CI Decmposition Results not Showing percentage con...
      • synth_runner automatically generated predictorvars
      • Extract country names from affiliations
      • Flexible case-control matching command
      • Working with Time and DateTime variables from Excel
      • scatter graph with different styles and specific axis
      • Twoway Line: Deleting a straight line
      • Panel Data fixed effects and time effects
      • Question on metacumbounds
      • Populating a column with values
      • Failing to convert .shp and .dbf dile to .dta format
      • chnaging values of a variable
      • log differenced model and GMM Estimator
      • heterofactor and ML maximization
      • Significant in one-way, but not in two-way ANOVA
      • Stset with durations
      • Question: Bayesian Vector Autoregression (BVAR)
      • Split population duration
      • Merge accuracy using str format when most contain ...
      • Summing over a value of variable for repeating county
      • Box plot help
      • How to do a Box Plot with mean instead of median a...
      • Help with bootstrap in obtaining a standard error.
      • residual from multiple regresion
      • Listing out the frequency distributions for multip...
      • Export labels from alpha output to Excel
      • Splitting a dataset into multiple datasets
      • McDonald’s Value Chain Analysis
      • Help creating an index variable
      • Problems with FE in capital structure study
      • Multilevel model: about the decision of levels
      • Graph svy total with confidence intervals
      • McDonald’s Porter’s Five Forces Analysis
      • gen id = _n
      • McDonald’s 7Ps of Marketing
      • McDonald’s Segmentation, Targeting and Positioning
      • Generating and exporting summary table to Excel
      • P for trend in MV regression
      • sureg compare COX time-to-event regression compari...
      • McDonald’s Ansoff Matrix
      • coefplot help
      • How can check the assumption for survival analysis...
      • multiple lines in one graph with if conditions
      • McDonald’s Corporation Report
      • Reshaping Data and Variables for a Dynamic Panel D...
      • Decomposing Wagstaff Concentration Index
      • Deleting entire Firm based on condition from panel...
      • Graphic with graphs for sub catvars and overall graph
      • Convert Scientific Notations (e-07) into Decimal N...
      • Why some global parameter is in { } while the othe...
      • collect: rows and columns' name changed?
      • How to add fixed effect in sureg Stata?
      • Variance Covariance Matrix of Sureg in Stata
      • Adding linear time trends (dropped)
      • Fixed effect- three identifier variables and resul...
      • Logistic regression
      • No Output, Infinite Cycling Wheel (Multiple Imputa...
      • Interpretation of two-way interaction plot
      • Clustering SEs
      • Using EGEN to Create Count Variable
      • SEM panel model with three variables: cross-lagged...
      • Regressions Table in Stata
      • Predicting counterfactual from reghdfe
      • OLS with binary dependent variable
      • Convert variables to numeric
      • Export labels from alpha output to Excel
      • ciplot
      • Count frequencies of (binary) variable and merge t...
      • Descriptive Statistics by Segment
      • TWFE: Different results between Stata and R
      • Stochastic Frontier Analysis
      • Showing values that exist in a value label but are...
      • Milliseconds inconsistent when trying to find the ...
      • Loop for variable creation
      • Merging two datasets to calculate an amount of day...
      • multiple random sample from a data
      • Stata local macros not displaying
      • How to reshape/convert a household data set in the...
      • R^2 for diff-in-diff
    • ►  May (167)
    • ►  April (181)
    • ►  March (186)
    • ►  February (170)
    • ►  January (206)
  • ►  2021 (7379)
    • ►  December (327)
    • ►  November (645)
    • ►  October (646)
    • ►  September (639)
    • ►  August (557)
    • ►  July (649)
    • ►  June (656)
    • ►  May (697)
    • ►  April (683)
    • ►  March (697)
    • ►  February (518)
    • ►  January (665)
  • ►  2020 (7956)
    • ►  December (653)
    • ►  November (659)
    • ►  October (598)
    • ►  September (654)
    • ►  August (660)
    • ►  July (682)
    • ►  June (683)
    • ►  May (708)
    • ►  April (692)
    • ►  March (698)
    • ►  February (638)
    • ►  January (631)
  • ►  2019 (9458)
    • ►  December (601)
    • ►  November (643)
    • ►  October (650)
    • ►  September (637)
    • ►  August (645)
    • ►  July (681)
    • ►  June (654)
    • ►  May (1034)
    • ►  April (1079)
    • ►  March (1122)
    • ►  February (876)
    • ►  January (836)
  • ►  2018 (931)
    • ►  December (692)
    • ►  November (239)

© BJ Data Tech Solution | Theme by Rifki.id | Premium Blogger Templates | PBT | Powered by Blogger |-| About | Privacy Policy | Sitemap | Contact | Disclaimer