BJ Data Tech Solution

Specializes in data processing, data management implementation plans, electronic and paper-based data collection tools, data cleaning specifications, data extraction, transformation, and loading, analytical datasets, and data analysis. BJ Data Tech Solutions teaches how to design and develop electronic data collection tools using CSPro, and how to use Stata commands for data manipulation. It also sets up data management systems using modern data technologies such as relational databases, C#, PHP, and Android.

How to get the value of the first occurrence on the first row

Thursday, December 31, 2020 Data Cleaning Data management Data Processing
I have data in the format below. I want to generate another column (X) that gives me the value of the first occurrence of column A. For ex...
Invalid lval

1:25 PM Data Cleaning Data management Data Processing
Hi, I am new to mata, and I'm replicating an old code that uses it. Unfortunately, I'm getting the errors `invalid lval r(3000)` a...
ivreghdfe: how to get first-stage regression predicted values

11:25 AM Data Cleaning Data management Data Processing
Hello, I'm using ivreghdfe, and I'm interested in obtaining predicted values of the endogenous variable from the first-stage regressi...
multiple imputation error after MI Estimate logistic regression

10:25 AM Data Cleaning Data management Data Processing
Hello: Working on MI. Everything works up to the MI estimate. See error below. Have been to the Stata manual and viewed the videos and beli...
Create for each time period the median of the 350 biggest values

9:25 AM Data Cleaning Data management Data Processing
Hello everyone, I am currently sitting on an issue regarding taking the median of the 350 highest values per period. Basically I have the ...
Create dummy variable to show change in certain timeframe of panel data

8:25 AM Data Cleaning Data management Data Processing
Hi, I am working with panel data and want to create a dummy variable that shows me if there is a change in the variable -occ- (occupation) i...
Graph with bold ylabels

8:25 AM Data Cleaning Data management Data Processing
Hi all, I am trying to get bold ylabels in a time series graph. I had success only in xlabel , example: Code: use "http://www.pr...
Dropping Observations of Variable according to frequency of occurrence

6:25 AM Data Cleaning Data management Data Processing
Hi, I am working with the following panel data and want to drop all observations for which I have fewer than six observations per Household I...
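For a question of this kind, a minimal Stata sketch might look as follows. The variable names `hhid` and `year` and the toy data are assumptions made for illustration; they are not taken from the original post.

```stata
* Toy panel (hypothetical): household 1 has 6 rows, household 2 has 4
clear
set obs 10
generate hhid = cond(_n <= 6, 1, 2)
bysort hhid: generate year = 2010 + _n

* Under by:, _N is the number of observations in the current group,
* so this drops every household observed fewer than six times
bysort hhid: drop if _N < 6

* Only the six rows of household 1 should remain
count
```

Using `_N` under `by:` avoids creating a helper count variable; if the count is needed later, `bysort hhid: generate nobs = _N` would store it first.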
Parallel trend test for DID model

6:25 AM Data Cleaning Data management Data Processing
Dear friends, why are the test results using ttable and pstest not consistent? Code: pstest $xlist Y , t(treated) The results is ...
Dropping duplicate observations conditioned on another variable

6:25 AM Data Cleaning Data management Data Processing
Hi, I am using Stata 16.1, and have the following (general) issue. I want to drop duplicate observations of one variable (educ) from my da...
Numerical format in regression tables using esttab/estout

6:25 AM Data Cleaning Data management Data Processing
Dear Statalist, I am using esttab (wrapper for estout) from SSC in Stata 16.1. I am trying to export regression tables in which there ar...
Static Model with xtabond2

5:25 AM Data Cleaning Data management Data Processing
Dear Users, I have the following equation for panel data xtreg zer esg cap turn tdrdta prbv betal i.year, fe vce(cluster id) However, ...
Updating input output models using RAS method

3:25 AM Data Cleaning Data management Data Processing
Happy new year! Is there any tool that enables to update IO tables in STATA?
Find unobserved values

3:25 AM Data Cleaning Data management Data Processing
Hey guys, I am currently trying to figure out how to find unobserved values. I have different sic codes (labeled "sic), accordingly v...
choropleth map stretched out

2:25 AM Data Cleaning Data management Data Processing
Hi everyone, I'm using Stata 16 to make a choropleth map of Connecticut. Reading the posts on here, I tried using geo2xy to correct th...
How to solve auto-correlation using unbalanced panel with gaps

1:25 AM Data Cleaning Data management Data Processing
Hi all, I am dealing with unbalanced panel data with gaps; it is a survey of manufacturing companies from 1990 to 2012. I am using xtreg,...
Finding minimum value out of 5 different variables for each row

Wednesday, December 30, 2020 Data Cleaning Data management Data Processing
I would like to create a new variable that equals the minimum value out of 5 different variables for each row. Example: For each "ID...
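A row-wise minimum of this kind can be sketched in Stata with `egen` and its `rowmin()` function. The variable names `id` and `v1` to `v5` and the two data rows below are hypothetical and serve only to illustrate the idea.

```stata
* Hypothetical data: one row per ID, five numeric variables
clear
input id v1 v2 v3 v4 v5
1 4 7 2 9 5
2 8 3 6 . 10
end

* rowmin() returns the smallest nonmissing value across the listed
* variables, so the missing v4 in the second row is ignored
egen minval = rowmin(v1 v2 v3 v4 v5)

list id minval
```

With these toy rows, `minval` is 2 for the first ID and 3 for the second. If every listed variable is missing in a row, `rowmin()` returns missing for that row.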
How would I create this graph?

9:25 PM Data Cleaning Data management Data Processing
I am very rusty with STATA and need some help with creating this graph shown in the image. What would be the command in order to create a be...
Predicting probability of a multivalued endogenous treatment

8:25 PM Data Cleaning Data management Data Processing
I am trying to figure out how to predict probability of participating in different treatments (or multivalued treatment). Since the entreat ...
Dealing with id problems in panel data

6:25 PM Data Cleaning Data management Data Processing
Hello, I am using person-firm-level panel data for an analysis of wage differentials. But I found that my dataset has some duplicated ide...
spmap

10:25 AM Data Cleaning Data management Data Processing
Dear All, I am using Stata 14 and trying to create maps of Bangladesh using the package spmap. I want two maps on the same scale. But when I...
Quantile Regression

10:25 AM Data Cleaning Data management Data Processing
Code: reg price treatment i.state i.year income rural [fw=round(weight)] , cluster(state) Hi I ran the above regression but now I wish...
regression coefplot with 3 variables

9:25 AM Data Cleaning Data management Data Processing
Hello, I want to make a regression coefplot for 3 variables, the independent variable being a standardised AIDS figure (number of known peop...
【imputation】how to fix r(459) in mi impute chained

9:25 AM Data Cleaning Data management Data Processing
Hello! I recently encountered a problem in using MI module in Stata. I want to impute variable wage by "mi impute chained" comman...
apc model

9:25 AM Data Cleaning Data management Data Processing
Hi, I want to estimate the following model: log_income = B*age with year and cohort fixed effects. I'm trying to use the command apc bu...
Estimating Intergenerational Correlation (IGC) with nlcom

9:25 AM Data Cleaning Data management Data Processing
Dear Statalisters, I am trying to estimate the Intergenerational Correlation (IGC) in education using nlcom but cannot figure out how to ...
spmap options

9:25 AM Data Cleaning Data management Data Processing
Hello, I used -spmap- to map US employment and this is the code I used: Code: spmap cbp_emp_sum using uscoord if NAME!="Alaska...
New version of xcontract (AGAIN) on SSC

9:25 AM Data Cleaning Data management Data Processing
Thanks once again to Kit Baum, a new version of the xcontract package (superseding yesterday's version) is now available for download f...
Overlapping values on y axis in Stata graph

9:25 AM Data Cleaning Data management Data Processing
Hi, is there a way to avoid overlapping of numbers on the y axis
Outreg2 to Excel R(198) error diagnosis

9:25 AM Data Cleaning Data management Data Processing
I am trying to run the below code in STATA 16.1. STATA is fully updated, and I've uninstalled and re-installed the outreg2 command, re-s...
Median, Tercile and Quartile

9:25 AM Data Cleaning Data management Data Processing
Hi, I want to create dummy variables based on the median. Sometimes I do that based on quartile and tercile. I do the following 1. This wi...
Use the lastvar local to read v1 v2 v3 variables from insheet with numeric names

7:25 AM Data Cleaning Data management Data Processing
I am reading in GIS data provided by a soil scientist on my research team that is exported to csv files with numeric names (years only). I h...
Generate weighted median variable by other variables

7:25 AM Data Cleaning Data management Data Processing
Hello, I have a dataset which lists the year, state, age group, income, and survey weight of individuals surveyed. Agegroup takes on 0 if ...
Postestimation test for cross-sectional time series FGLS regression

5:25 AM Data Cleaning Data management Data Processing
Hi I'm conducting a study on the determinants of bank profitability in my country. I have data from 16 of the total 18 banks (N=16) sp...
Different y-axis range on xtline plots

5:25 AM Data Cleaning Data management Data Processing
I'm trying to construct some xtline plots using 10 different ids. The range of values of y for id #s 1-9 is 0-100, but for id #10 the ra...
heteroskedasticity in logistic regression model

4:25 AM Data Cleaning Data management Data Processing
Hi, I have cross sectional data and am using logistic regression. My question is how do I check my data for heteroskedasticity and in case...
change the color of bars -BOXPLOT

4:25 AM Data Cleaning Data management Data Processing
Hi to everybody, I have done for the first time "Box plot by group with data point" By default, Stata gives me graphs with the ...
Equivalent for noisily option in fuzzy package?

3:25 AM Data Cleaning Data management Data Processing
Hello Statalist, does anyone know how to get Stata to print all calculations occurring due to commands from the 'fuzzy' package by ...
Mixed effects model on eyes observations

3:25 AM Data Cleaning Data management Data Processing
Hi Statalist team! I am glad to participate here! I would like to ask for some help with a "beginner's" problem that I am fa...
Converting annual data to semiannual

3:25 AM Data Cleaning Data management Data Processing
Hi everyone, I have to convert a set of annual observations on debt and deficit GDP ratios into semiannual ones. Can someone help me? It's...
interaction effect in instrument variable probit model

1:25 AM Data Cleaning Data management Data Processing
Dear All, my query is related to the interaction effect in the instrumental variable probit model. When I run the ivprobit model with my m...
How to group connected dates?

1:25 AM Data Cleaning Data management Data Processing
I have a panel containing daily data which are not all continuous. Within an ID, if those dates are continuous or the gap between two dates ...
datetime of march 31, 2012

1:25 AM Data Cleaning Data management Data Processing
Dear All, I have this data set Code: * Example generated by -dataex-. To install: ssc install dataex clear input long(patient_id appo_date...
Modifying dataset for relative time model

Tuesday, December 29, 2020 Data Cleaning Data management Data Processing
P.S. To express notation correctly, I uploaded the screenshot of my post. Sample dataset is attached.
Simulating the Dickey Fuller distribution in Stata

1:25 PM Data Cleaning Data management Data Processing
Dear all, Hope you are all well. Can I ask if there is a way of simulating the Dickey Fuller distribution in Stata? Best
stuck with nested loops

1:25 PM Data Cleaning Data management Data Processing
I am trying to select 10 controls per case (using risk-set sampling) matched by general practice. I used a loop for that as shown below, but...
Coarsened Exact Matching in unbalanced panel data

11:25 AM Data Cleaning Data management Data Processing
I am running Coarsened Exact Matching (CEM) on a highly unbalanced panel data (with 26 periods: 13 pre-shock and 13 post-shock), for a diff-...
Oaxaca command - Yun Decomposition

11:25 AM Data Cleaning Data management Data Processing
Hi there, I am trying to do Yun's decomposition for a Probit regression using the Oaxaca command (by Ben Jann). I'm having some tr...
Problem with egen rownonmiss, strok

11:25 AM Data Cleaning Data management Data Processing
Hello, I am trying to create a variable that will count how many nonmissing responses across several string variables exist per observatio...
Panel data and count dependent variable

9:25 AM Data Cleaning Data management Data Processing
Hi all, I have a panel data set of 18 years and 50 regions. My dependent variable is a count one, with over-dispersion. Hausman test indica...
New version of xcontract on SSC

9:25 AM Data Cleaning Data management Data Processing
Thanks as always to Kit Baum, a new version of the xcontract package is now available for download from SSC. In Stata, use the ssc command...
Problems with merge

9:25 AM Data Cleaning Data management Data Processing
Hi there, I was trying to merge two datasets (code: joinby); however, it seems that Stata cannot identify my key variable. The master datas...
fuzzy fsQCA bestfit displaying few cases

7:25 AM Data Cleaning Data management Data Processing
Hello Statalist, I am conducting a fuzzy-set QCA using the 'fuzzy' package written by Longest & Vaisey (2008) . I am looking a...
How to use tabstat's "format" option with estpost? Getting Error

6:25 AM Data Cleaning Data management Data Processing
Hi, I have a time variable in Stata internal form. I am trying to calculate its summary statistics by group and then exporting it t...
Panel 2sls with multiple interactions of endogenous variables

6:25 AM Data Cleaning Data management Data Processing
I'm using Stata 14 with Windows 10 OS. I have 2 endogenous and 8 exogenous variables. I need to run regressions with double and triple int...

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multiple times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thus far been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

About Me

Mtenga Baltazar

© BJ Data Tech Solution