BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

  • Home
  • Data Management
  • Data Analysis
  • Data Collection Tools Tips
gen dummy with missing data

gen dummy with missing data

Sunday, October 31, 2021 Data Cleaning Data management Data Processing
Dear All, Suppose that I run the code Code: sysuse auto, clear gen d = (rep78 > 2) & !missing(rep78) i.e., d-=1, if rep78 ...
Tabulate two way by summarizing the sum

Tabulate two way by summarizing the sum

11:23 PM Data Cleaning Data management Data Processing
Hi Everyone, I have the following data on year (first dimension of panel) , market (second dimension of panel) and product code/type (thir...
Assigning colors to stacked bar plot with ordinal y-variable.

Assigning colors to stacked bar plot with ordinal y-variable.

9:23 PM Data Cleaning Data management Data Processing
Dear Forum, I would like to plot a stacked bar graph of an "outcome" over hiv status (positive vs. negative) and stratified by s...
egen problem

egen problem

9:23 PM Data Cleaning Data management Data Processing
Hi list, I want to generate a variable that equals the mean of the values of the first two nonmissing variables in a three-variable list (...
"Encode" a String with another String?

"Encode" a String with another String?

9:23 PM Data Cleaning Data management Data Processing
Hello: I am trying to create a long code list where var1 is a string and what it should represent is shown in var2, also a string. I then ...
Diff in diff: Fix data for one-to-one matching propensity score

Diff in diff: Fix data for one-to-one matching propensity score

8:23 PM Data Cleaning Data management Data Processing
I have a data set of companies with the date they were acquired by a business group and a control group in which the companies have been in ...
New to stata, Impossible problem for a summary table?

New to stata, Impossible problem for a summary table?

7:23 PM Data Cleaning Data management Data Processing
Hello all- I'm so glad I found this forum! I've just started working with stata for a biostats class. I've been given a problem ...
Maximum number of iterations exceeded

Maximum number of iterations exceeded

6:23 PM Data Cleaning Data management Data Processing
Hello, I hope you guys are doing well. I am estimating a model using panel ardl techniques, namely the dynamic fixed effect regressor. An...
Reshape Wide Issue

Reshape Wide Issue

4:23 PM Data Cleaning Data management Data Processing
Hi, I have been trying to reshape my data from long to wide format using reshape wide. The data looks like this: OrganisationName AssetCla...
Calculating F-statistic after constrained regression

Calculating F-statistic after constrained regression

1:23 PM Data Cleaning Data management Data Processing
I have run this unconstrained regression *Model: lcost=ß_1+ß_2loutput+ ß_3plabor+ ß_4lcapital+ß_5lpfuel and would like to set the restric...
How to convert SIC codes into Fama French 12

How to convert SIC codes into Fama French 12

12:23 PM Data Cleaning Data management Data Processing
Hello, I am a uni student and I'm terrible with computers. For a project that is due in a week, we are supposed to convert industry SIC ...
putdocx basic summary statistics

putdocx basic summary statistics

12:23 PM Data Cleaning Data management Data Processing
I am running Stata 17 SE. I would like to send the output of basic summary statistics commands, namely Code: summarize Code: tabulat...
Reshape dataset including variables names as an additional variable

Reshape dataset including variables names as an additional variable

12:23 PM Data Cleaning Data management Data Processing
Hello Statalist community, I don't have yet much experience using Stata and hope you can help me with a problem I am encountering. I...
xtunitroot error 'too many variables specified'

xtunitroot error 'too many variables specified'

10:23 AM Data Cleaning Data management Data Processing
Good afternoon. The dataset looks like this: Code: * Example generated by -dataex-. For more info, type help dataex clear input long p...
Rename Variables using a loop

Rename Variables using a loop

9:23 AM Data Cleaning Data management Data Processing
Good morning My problem is the following. I would like to rename my variables using a loop. My initial situation is this: I have a set o...
Losing a value after dropping duplicates

Losing a value after dropping duplicates

9:23 AM Data Cleaning Data management Data Processing
Hi everyone I need a general code to fill the remaining cells at "cvcie0" for the same "gvkey" and for the same "...
How to select optimal GMM model

How to select optimal GMM model

9:23 AM Data Cleaning Data management Data Processing
Greetings, I am estimating several specifications for the macroeconomic determinants of non-performing loans using system GMM with xtabond...
Poisson regression

Poisson regression

5:23 AM Data Cleaning Data management Data Processing
Hi everyone! this platform has been a great help for me. I would like to ask questions on Poison regression Background- i want to estimat...
Renaming date variables

Renaming date variables

5:23 AM Data Cleaning Data management Data Processing
Hello gang I'm trying to fix such that my date variable, specifically the one showing month, looks a little "cleaner". Initi...
Help - estout export to csv - a whole row is concentrated in one cell

Help - estout export to csv - a whole row is concentrated in one cell

5:23 AM Data Cleaning Data management Data Processing
Hi friends, I am trying to export a table to excel using estout. Instead of each value being put in a cell (="..."), for some of...
Collapse and generate average values

Collapse and generate average values

4:23 AM Data Cleaning Data management Data Processing
Dear, I want to collapse the years into periods of, say, two years each. For example, if I have a panel from 1990 to 1995, I want period I...
Robust Standard Errors how to get value for Wald Chi^2?

Robust Standard Errors how to get value for Wald Chi^2?

3:23 AM Data Cleaning Data management Data Processing
Hello everyone! I am running panel data regressions and am using the 'robust' command as I think my model has heteroskedasticity (...
egen, group

egen, group

3:23 AM Data Cleaning Data management Data Processing
Dear All, Suppose that I have this data set (the original question is here ), Code: * Example generated by -dataex-. For more info, type h...
How to create an aggregate observational unit by adding values by country and year.

How to create an aggregate observational unit by adding values by country and year.

1:23 AM Data Cleaning Data management Data Processing
Hello, I have the following dataset for a number of countries. I would like to generate an aggregate entry (call it africa under variable ...
Finding Market Shares

Finding Market Shares

12:23 AM Data Cleaning Data management Data Processing
Hi all, I have a data in the following format. I need to calculate the market share where market share would be simply percentage of sales...
group string var with random names

group string var with random names

Saturday, October 30, 2021 Data Cleaning Data management Data Processing
Hi, I have list of investor name which are not same like "investor group" var. I want to create a variable like investor group f...
ttesti equivalent of oneway ANOVA

ttesti equivalent of oneway ANOVA

10:23 PM Data Cleaning Data management Data Processing
Hi all, I need to compare means of a parameter of 8 groups (independent). I do have SD but I do not have individual data. I was wonderin...
repeated-measure vs. nested anova problems

repeated-measure vs. nested anova problems

7:23 PM Data Cleaning Data management Data Processing
I think I want to use Repeated-Measure Anova command, "anova, repeated ()" or "wsanova" commands based on the instructio...
Help with Recoding with Multiple Loops

Help with Recoding with Multiple Loops

4:23 PM Data Cleaning Data management Data Processing
Hello, I have a series of variables that looks like job[i]_sched[y]. [i] varies from 1 to 6. [y] varies from 1997 to 2017, but the last thre...
qq Plots

qq Plots

2:23 PM Data Cleaning Data management Data Processing
Hello everbody, I have estimated parameter alpha for each participant with six different methods. I wanted to plot each alpha of one metho...
svy:logit has more observations than simple logit command

svy:logit has more observations than simple logit command

11:23 AM Data Cleaning Data management Data Processing
I am working with survey data and use the svy command before running my logit regression as follows: svy: logit Y X1 X2. Running this comman...
addplot error after marginsplot

addplot error after marginsplot

9:23 AM Data Cleaning Data management Data Processing
I ran a model ( dependent variable = gender) and 3 independent variables – year, position_department_n, and interaction of year and positio...
svy: mean and sorting estimated means

svy: mean and sorting estimated means

4:23 AM Data Cleaning Data management Data Processing
Greetings, Statalisters. I'm hoping this is something someone is willing and able to explain to me, since I've failed at figuring ...
Graph with Y axis as string

Graph with Y axis as string

4:23 AM Data Cleaning Data management Data Processing
Hi everyone, I have data like below: 25 different methods that generate 25 different estiamtion on OR and its 95%CI. I want to generate a ...
Changing the value of many observations

Changing the value of many observations

4:23 AM Data Cleaning Data management Data Processing
Hello So, i'm working on a pretty big dataset for a Norwegian insurance company, and I wanted to change one variable in regards to its...
Get the standard error from --exlogistic-- command?

Get the standard error from --exlogistic-- command?

2:23 AM Data Cleaning Data management Data Processing
Hi all, I'm running the exlogistic command for meta-analysis, with id as study id: Code: exlogistic r id group, binomial(n) group(...
multilevel multinomial modelling using the gllamm command in stata

multilevel multinomial modelling using the gllamm command in stata

Friday, October 29, 2021 Data Cleaning Data management Data Processing
Good day everyone, I am using the gllamm command in stata 15.0 for multilevel (2-leveled) multinomial logistic regression. The outcome/depe...
Manually producing probabilites after logit

Manually producing probabilites after logit

5:23 PM Data Cleaning Data management Data Processing
Dear All, I estimate a logit model and then I need to calculate Prob[Y=1|X], where X is my set of regressors. Obviously, I can use: Cod...
how many observations per year in panel data

how many observations per year in panel data

4:23 PM Data Cleaning Data management Data Processing
Hi there, I have a panel dataset. How can I see how many observations there are per year? I see in the data viewer that some years have lo...
Eliminate a part of a string

Eliminate a part of a string

3:23 PM Data Cleaning Data management Data Processing
Hello there, I would like to eliminate a part of a string starting from a position recorded in another variable for 2000 characters backwar...
How do you de-trend an event study plot?

How do you de-trend an event study plot?

3:23 PM Data Cleaning Data management Data Processing
I have an event study graph that I plot using coefficients from my dependent variable regressed on the leads and lags of treatment. There ap...
Setting the outcome variable in Cox regression.

Setting the outcome variable in Cox regression.

10:23 AM Data Cleaning Data management Data Processing
Hi, I am trying to estimate the hazard ratios of the age of first drinking, the goal is to measure the risks of underage drinking (<18 ...
labsize in points generating strange result

labsize in points generating strange result

8:23 AM Data Cleaning Data management Data Processing
Per the request of my publisher, I am going through a set of graphs and specifying text in point sizes. This has worked fine except in one c...
Where is the dta file for math scores of pupils in the third and fifth years from different schools in Inner London (Mortimore et al. 1988)

Where is the dta file for math scores of pupils in the third and fifth years from different schools in Inner London (Mortimore et al. 1988)

7:23 AM Data Cleaning Data management Data Processing
I apologize for asking this question because I assume I am missing something obvious. Many Stata examples use data from Mortimore et al. 198...
Error in dictionary file

Error in dictionary file

7:23 AM Data Cleaning Data management Data Processing
I have a dataset in .txt, which I have to translate with a dictionary file. I have written the dictionary file according to instructions fro...
Elasticity analysis

Elasticity analysis

5:23 AM Data Cleaning Data management Data Processing
Hi all, I am in urgent need to find the solution of a problem. It would be great if you guys could help background of the problem: I ha...
Assigning random number while using pre-defined observation specific probabilities (Result of an LCA) - using rdiscrete

Assigning random number while using pre-defined observation specific probabilities (Result of an LCA) - using rdiscrete

5:23 AM Data Cleaning Data management Data Processing
Hello Statalist, As a result of an LCA I have three variables containing the probability of each observation to belong to a specific group...
Sum variables taking into account missing data

Sum variables taking into account missing data

4:23 AM Data Cleaning Data management Data Processing
Hello, I have the following dataset: Code: * Example generated by -dataex-. To install: ssc install dataex clear input long gid int(co...
Error: <istmt>: 3499 ASREGFMB() not found

Error: <istmt>: 3499 ASREGFMB() not found

3:23 AM Data Cleaning Data management Data Processing
Good morning, I try to run the asreg command and I receive this error. I checked that the installation is correct that I have everything up...
Cmxtmixlogit discrete choice experiment with choice card blocks

Cmxtmixlogit discrete choice experiment with choice card blocks

3:23 AM Data Cleaning Data management Data Processing
Dear all, I am encountering a convergence problem with the 'cmxtmixlogit' command, using STATA 16.1 for windows. I performed a d...
zero-inflated and right-censored count data

zero-inflated and right-censored count data

1:23 AM Data Cleaning Data management Data Processing
Dear all statalists, Thank you for clicking on my post. What is the proper way to deal with zero-inflated and right-censored count data? ...
Number of splitvoters from two variables

Number of splitvoters from two variables

1:23 AM Data Cleaning Data management Data Processing
I need to know the number of split voters in Denmark - those who did not vote for the same party at the general and the local election. An...
Panel data estimatioin

Panel data estimatioin

Thursday, October 28, 2021 Data Cleaning Data management Data Processing
Hello. Need help with panel data estimation. I am working on panel data with T=96 and N=260. I used Hausman test and it gives FE an appropr...
converting dates (year and month)

converting dates (year and month)

8:23 PM Data Cleaning Data management Data Processing
Dear All, Is there a more concise way to go from date to newdate below? Thanks. Code: * Example generated by -dataex-. For more info, type...
alternative to keep if for a large list of firm id s

alternative to keep if for a large list of firm id s

7:23 PM Data Cleaning Data management Data Processing
Hi all, I have a list of firm ids that contains 17,000 firm ids. I have a big dataset which contains over 5 million observations of an eve...
Why are Frames Useful

Why are Frames Useful

7:23 PM Data Cleaning Data management Data Processing
Perhaps this is a question for another forum, but I had a more general question about dataframes. I know the data frame feature was added ...
Reshaping Long Data Issue

Reshaping Long Data Issue

7:23 PM Data Cleaning Data management Data Processing
I am trying to collapse this data but I have continuously gotten errors regarding the i and j not being unique. Here is a snippit of the dat...
Using svy command with graph command

Using svy command with graph command

5:23 PM Data Cleaning Data management Data Processing
Dear all, I am trying to use graph in Stata 16 to plot bars of the means of categorical variables (5-point scale) while using the svy co...
Newer Posts Older Posts Home
Subscribe to: Posts (Atom)

Latest Articles

Categories

  • CouchDb Skills
  • Data Analysis
  • Data Cleaning
  • Data management
  • Data Processing
  • Research Methodology

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multible times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thusfar been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

Recomended Articles

Powered by Blogger.

About Me

Mtenga Baltazar
View my complete profile

Blog Archive

  • ►  2024 (6)
    • ►  February (6)
  • ►  2023 (877)
    • ►  November (1)
    • ►  October (9)
    • ►  September (14)
    • ►  July (9)
    • ►  June (15)
    • ►  May (133)
    • ►  April (174)
    • ►  March (176)
    • ►  February (157)
    • ►  January (189)
  • ►  2022 (2201)
    • ►  December (181)
    • ►  November (180)
    • ►  October (198)
    • ►  September (182)
    • ►  August (182)
    • ►  July (194)
    • ►  June (174)
    • ►  May (167)
    • ►  April (181)
    • ►  March (186)
    • ►  February (170)
    • ►  January (206)
  • ▼  2021 (7379)
    • ►  December (327)
    • ►  November (645)
    • ▼  October (646)
      • gen dummy with missing data
      • Tabulate two way by summarizing the sum
      • Assigning colors to stacked bar plot with ordinal ...
      • egen problem
      • "Encode" a String with another String?
      • Diff in diff: Fix data for one-to-one matching pro...
      • New to stata, Impossible problem for a summary table?
      • Maximum number of iterations exceeded
      • Reshape Wide Issue
      • Calculating F-statistic after constrained regression
      • How to convert SIC codes into Fama French 12
      • putdocx basic summary statistics
      • Reshape dataset including variables names as an ad...
      • xtunitroot error 'too many variables specified'
      • Rename Variables using a loop
      • Losing a value after dropping duplicates
      • How to select optimal GMM model
      • Poisson regression
      • Renaming date variables
      • Help - estout export to csv - a whole row is conce...
      • Collapse and generate average values
      • Robust Standard Errors how to get value for Wald C...
      • egen, group
      • How to create an aggregate observational unit by a...
      • Finding Market Shares
      • group string var with random names
      • ttesti equivalent of oneway ANOVA
      • repeated-measure vs. nested anova problems
      • Help with Recoding with Multiple Loops
      • qq Plots
      • svy:logit has more observations than simple logit ...
      • addplot error after marginsplot
      • svy: mean and sorting estimated means
      • Graph with Y axis as string
      • Changing the value of many observations
      • Get the standard error from --exlogistic-- command?
      • multilevel multinomial modelling using the gllamm ...
      • Manually producing probabilites after logit
      • how many observations per year in panel data
      • Eliminate a part of a string
      • How do you de-trend an event study plot?
      • Setting the outcome variable in Cox regression.
      • labsize in points generating strange result
      • Where is the dta file for math scores of pupils in...
      • Error in dictionary file
      • Elasticity analysis
      • Assigning random number while using pre-defined ob...
      • Sum variables taking into account missing data
      • Error: <istmt>: 3499 ASREGFMB() not found
      • Cmxtmixlogit discrete choice experiment with choic...
      • zero-inflated and right-censored count data
      • Number of splitvoters from two variables
      • Panel data estimatioin
      • converting dates (year and month)
      • alternative to keep if for a large list of firm id s
      • Why are Frames Useful
      • Reshaping Long Data Issue
      • Using svy command with graph command
      • Using svy with graph
      • Bootstrapped CI for effect size for one sample t-test
      • finding keywords in string variables
      • recovering the estimate of an interaction term usi...
      • One line code for performing arithmetic operation ...
      • catplot - how to show n to the left of the bars, p...
      • doiplot - postestimation module after metan for pl...
      • Identifying first and second re-admissions in a he...
      • How Can I Perform Panel Unit Root with Structural ...
      • How to replace dummy by same value of adjacent yea...
      • Bootstrapping more descriptive of more than one va...
      • Getting File Ready For Stata Analysis
      • Clustering SEs in Panel IVFE regression eliminates...
      • Measuring dynamics
      • cleaning the dataset for diff and diff model
      • Filtering in Stata
      • Replacing missing values by matching them with a d...
      • One line code for performing arithmetic operation ...
      • Strange results with -mi impute-
      • Multiple imputation
      • Unconditional Quantile Regression (rifreg) with an...
      • Again on revisiting the role of p-value
      • Repeated-Measure Ancova with Participants Randomiz...
      • Collinearity with time dummies in FE, but not in RE
      • New package: xtbreak - module for detecting and da...
      • Linear Regression for pre-post test
      • Treatment-covariate interaction in ipdmetan!
      • What is the difference between tab and tab1, and h...
      • How can I get GWR R-square?
      • Data Analysis
      • censusapi package
      • Create an indicator of participation in long format
      • Help Translating Factor Analysis Script from SPSS ...
      • Looping of a specific type of a variable list
      • color macros for graphs
      • Removing a single character from a string
      • I CANT Dowland this code 'xtpanicca'
      • Counterfactual decomposition in Stata
      • comparing the logit marginal effects to LPN coeffi...
      • Error when estimating dsge model
      • Adding test statistic to a table for a first stage...
      • Problem with Fixed effects in Cross Sectional Data...
    • ►  September (639)
    • ►  August (557)
    • ►  July (649)
    • ►  June (656)
    • ►  May (697)
    • ►  April (683)
    • ►  March (697)
    • ►  February (518)
    • ►  January (665)
  • ►  2020 (7956)
    • ►  December (653)
    • ►  November (659)
    • ►  October (598)
    • ►  September (654)
    • ►  August (660)
    • ►  July (682)
    • ►  June (683)
    • ►  May (708)
    • ►  April (692)
    • ►  March (698)
    • ►  February (638)
    • ►  January (631)
  • ►  2019 (9458)
    • ►  December (601)
    • ►  November (643)
    • ►  October (650)
    • ►  September (637)
    • ►  August (645)
    • ►  July (681)
    • ►  June (654)
    • ►  May (1034)
    • ►  April (1079)
    • ►  March (1122)
    • ►  February (876)
    • ►  January (836)
  • ►  2018 (931)
    • ►  December (692)
    • ►  November (239)

© BJ Data Tech Solution | Theme by Rifki.id | Premium Blogger Templates | PBT | Powered by Blogger |-| About | Privacy Policy | Sitemap | Contact | Disclaimer