BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

  • Home
  • Data Management
  • Data Analysis
  • Data Collection Tools Tips
Contract year reshape

Contract year reshape

Saturday, December 31, 2022 Data Cleaning Data management Data Processing
Dear Statalists, Hope all is well. I am dealing with the contract data with specific contract start and end years as shown below: ...
Removing duplicates

Removing duplicates

12:23 PM Data Cleaning Data management Data Processing
Hi, I have a dataset which i want to be uniques with the variables gvkey, fyear and loc. But there are duplicates. Incase of duplicates, ...
dynamic looped regressions

dynamic looped regressions

9:23 AM Data Cleaning Data management Data Processing
Dear Stata colleagues: I am using LASSO for inference. I want to independently assess the causality of 40 policy variables on a rate. Some...
How to use xtcointtest with runby? OR How to do cointegration tests for each individuals separately for panel data?

How to use xtcointtest with runby? OR How to do cointegration tests for each individuals separately for panel data?

Friday, December 30, 2022 Data Cleaning Data management Data Processing
Dear all: I'm trying to use xtcointtest to do cointegration tests for each individuals separately in panel data and store the results, ...
Renaming Matrix Rows With Spaces

Renaming Matrix Rows With Spaces

6:23 PM Data Cleaning Data management Data Processing
Suppose I wish to rename the rows of a matrix. Code: clear * cls matrix A = (2,1\3,2) cls loc rn foreach i in "City God Temple...
Extracting All Strings After a Word within a Variable Name

Extracting All Strings After a Word within a Variable Name

2:23 PM Data Cleaning Data management Data Processing
I have a dataset representing a treated unit (GDP), a series of estimated counterfactuals, (cf) and pointwise treatment effects (diff_). Let...
Assigning value from one variable to another if condition is met

Assigning value from one variable to another if condition is met

1:23 PM Data Cleaning Data management Data Processing
I have a data manipulation question: I have a dataset with the variables "id_1", "year", and "id_2." I would ...
Is it possible that log mis-calculates it or create weird ".b" when the workload is too heavy?

Is it possible that log mis-calculates it or create weird ".b" when the workload is too heavy?

7:24 AM Data Cleaning Data management Data Processing
I use Stata 17 (24 cores). I am sorry that I cannot provide the reproducible example because this problem is really a problem of Stata gen...
Is it possible that log mis-calculates it or create weird ".b" when the workload is too heavy?

Is it possible that log mis-calculates it or create weird ".b" when the workload is too heavy?

7:23 AM Data Cleaning Data management Data Processing
I use Stata 17 (24 cores). I am sorry that I cannot provide the reproducible example because this problem is really a problem of Stata gen...
Determining level of significance between two median and between two catergorial variables, pre and post treatment

Determining level of significance between two median and between two catergorial variables, pre and post treatment

6:23 AM Data Cleaning Data management Data Processing
Dear Stata community, I was wondering if anybody could help me with the following question. I am trying to figure out how to determine two...
Create graphs

Create graphs

4:24 AM Data Cleaning Data management Data Processing
Dear all, I have a dataset that contains the following string variables: "country" and "countrydestination". The first...
having issues plotting stock returns in panel data

having issues plotting stock returns in panel data

2:23 AM Data Cleaning Data management Data Processing
Dear statalist, i am trying to plot the stock return data for a panel data of some companies, however, each time i plot this, but each time ...
Average marginal effects of continuous variables by levels of a dichotomous variable

Average marginal effects of continuous variables by levels of a dichotomous variable

Thursday, December 29, 2022 Data Cleaning Data management Data Processing
Hi, I run this four-way interaction model in Stata 14: xtreg Y c.X1##c.X2##c.X3##X4 Controls i.country i.year,r Then, I want to get th...
general time trend

general time trend

6:24 AM Data Cleaning Data management Data Processing
Hi, we have a problem incorporating a general time trend in our panel data with year fixed and country fixed effects. The defendant variable...
Meta-analysis help: how to convert hazard ratio to risk ratio

Meta-analysis help: how to convert hazard ratio to risk ratio

Wednesday, December 28, 2022 Data Cleaning Data management Data Processing
I am working on a meta-analysis using Stata. All the primary studies report risk ratio except one study that reported hazard ratio. Is there...
Downloading data from Public API from json format to csv file or txt file so that I can read data in stata on macOS

Downloading data from Public API from json format to csv file or txt file so that I can read data in stata on macOS

7:23 PM Data Cleaning Data management Data Processing
I have to download the following data from Business Dynamics Statistics ( which is public data provided by census bureau ) from the followin...
kappaetc / Krippendorff's alpha with missing values and across multiple variables

kappaetc / Krippendorff's alpha with missing values and across multiple variables

4:23 PM Data Cleaning Data management Data Processing
My research team coded/classified a large set of documents. I took a random subset of 1000 from the coded docs to code and check the reliabi...
Contrast tables

Contrast tables

3:23 PM Data Cleaning Data management Data Processing
Hi all, New Statalist user here with a simple question (I'm a long time Stata user who is trying to learn to use tables). I am trying ...
Panel data: Creating a bilateral dataset with countries and years (replicating the dataset for each panel)

Panel data: Creating a bilateral dataset with countries and years (replicating the dataset for each panel)

2:23 PM Data Cleaning Data management Data Processing
Hi everyone, I have a panel dataset of exchange rate with 200 countries and 16 years, please find below an example: country_origin yea...
Estimating risk difference for binary outcome in cluster RCT

Estimating risk difference for binary outcome in cluster RCT

4:25 AM Data Cleaning Data management Data Processing
I am analysing data from a cluster RCT looking at the effect of an intervention (int) in health centres (centre) to reduce inappropriate ant...
Balance an unbalanced dataset

Balance an unbalanced dataset

4:25 AM Data Cleaning Data management Data Processing
Hi all, I have a strongly unbalanced dataset of countries observed by year. I would like to balance it by retaining as many observations ...
Weekly means

Weekly means

4:25 AM Data Cleaning Data management Data Processing
Hello Everyone!! I need to obtain weekly means for some measurements. I need a weekly mean by subject and week for each of them. I'm no...
Replace missing values with values from other observations

Replace missing values with values from other observations

1:23 AM Data Cleaning Data management Data Processing
Hello, I´m stucked with my data cleaning. Below, a excerpt of my dataset is shown. I only have included data from one gvkey code, but in t...
Using matching with DID for repeated cross sectional data

Using matching with DID for repeated cross sectional data

Tuesday, December 27, 2022 Data Cleaning Data management Data Processing
I am new to STATA and working on repeated cross-sectional data for Difference-in-Difference analysis. I have treatment and control groups be...
Add on consecutively by group

Add on consecutively by group

4:23 PM Data Cleaning Data management Data Processing
Hi all, I have the first observation for each id for experience, and I would like to replace the value of experience +1 for the next year...
Example of GMM moment-evaluator program with many equations

Example of GMM moment-evaluator program with many equations

3:23 PM Data Cleaning Data management Data Processing
Hi, Statalist, Does anybody know a simple example of how to implement the moment-evaluator program version of GMM with more than one equat...
testing the validy of the instrument in xtivreg

testing the validy of the instrument in xtivreg

2:23 PM Data Cleaning Data management Data Processing
Hi, I am using xtivreg , where xtivreg y (x=z) The issue is that how can i tell that the instrument z is a valid instrument? I don'...
how to know whether a placebo test is passed

how to know whether a placebo test is passed

4:23 AM Data Cleaning Data management Data Processing
Hello! I'm conducting a placebo test on a staggered DID, I randomly assign treatment firms and policy year to my sample firms, repeat ...
Something wrong with -colorpalette-

Something wrong with -colorpalette-

Monday, December 26, 2022 Data Cleaning Data management Data Processing
Dear Stata users, I have installed the -colorpalette- wrote by Ben Jann , and I remenber that it run successfully before. I run it today b...
Avergaing values of variables to reduce survey observations

Avergaing values of variables to reduce survey observations

5:23 PM Data Cleaning Data management Data Processing
Hi, I am looking for a solution to average the survey respones. Below S007 is suvey respondent ID, loc is country name and fyear is the y...
Stata for loop error

Stata for loop error

3:23 PM Data Cleaning Data management Data Processing
Hi, I am trying to run the below loop in stata. But i get an error. Can someone help me. I am a beginner in using Stata. The variables...
Interpreting significant interaction effect while main effect is insignificant

Interpreting significant interaction effect while main effect is insignificant

6:23 AM Data Cleaning Data management Data Processing
Hi! I am investigating the effect of balanced ambidexterity and combined ambidexterity on firm performance, with moderating variables envi...
Dealing with missing data after using business date for daily stock return

Dealing with missing data after using business date for daily stock return

6:23 AM Data Cleaning Data management Data Processing
Dear distinguished Prof, I have daily data on stock price of 15 companies, I want to calculate daily stock price for each companies in line...
Year and Industry dummies with GEE

Year and Industry dummies with GEE

3:23 AM Data Cleaning Data management Data Processing
Hello everyone, I have an urgent problem. I am investigating the implications of CEO´s political ideology and Corporate Strategic Change u...
How to run a regression excluding certain countries ids

How to run a regression excluding certain countries ids

Sunday, December 25, 2022 Data Cleaning Data management Data Processing
Dear all I am trying to run a regression excluding certain countries that don’t require disclosure. I have used this code: reg xy if coun...
How to run a regression excluding certain countries ids

How to run a regression excluding certain countries ids

7:23 PM Data Cleaning Data management Data Processing
Dear all I am trying to run a regression excluding certain countries that don’t require disclosure. I have used this code: reg xy if coun...
Manage dates with different digits

Manage dates with different digits

6:23 PM Data Cleaning Data management Data Processing
Dear Stata users, I'm using the Stata 17 MP and want to manipulate dates with different digits (see below). In the first row, for in...
Percentage problem regarding loss and networth

Percentage problem regarding loss and networth

12:23 PM Data Cleaning Data management Data Processing
Hello all I am using a definition to create a dummy variable. If the accumulated loss of the year is more than 50% of the previous four ...
Merge Datasets with multiple observations for one year

Merge Datasets with multiple observations for one year

5:23 AM Data Cleaning Data management Data Processing
Hello everybody, I am currently trying to merge two datasets. One Dataset (Platforms) contains the annual traffic of a company and the oth...
Command likes -fbar- that simultaneously show frequencies or percents of categorical variables

Command likes -fbar- that simultaneously show frequencies or percents of categorical variables

Saturday, December 24, 2022 Data Cleaning Data management Data Processing
Dear Stata users, The bar plot is frequently used in daily practice. Suppose that I have several categorical variables in my dataset, and I...
Comparing two groups of variables with each others

Comparing two groups of variables with each others

8:23 AM Data Cleaning Data management Data Processing
Hi all, first of all, I would like to wish you happy holidays. I have a question about comparing two groups with each other. Just to outli...
Omitted variable (log) in a panel data with FE in Stata 16.1

Omitted variable (log) in a panel data with FE in Stata 16.1

1:23 AM Data Cleaning Data management Data Processing
Dear Statalisters, My analysis is about finding to what extent government expenditures determine income inequality in an unbalanced panel ...
Newer Posts Older Posts Home
Subscribe to: Posts (Atom)

Latest Articles

Categories

  • CouchDb Skills
  • Data Analysis
  • Data Cleaning
  • Data management
  • Data Processing
  • Research Methodology

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multible times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thusfar been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

Recomended Articles

Powered by Blogger.

About Me

Mtenga Baltazar
View my complete profile

Blog Archive

  • ►  2024 (6)
    • ►  February (6)
  • ►  2023 (877)
    • ►  November (1)
    • ►  October (9)
    • ►  September (14)
    • ►  July (9)
    • ►  June (15)
    • ►  May (133)
    • ►  April (174)
    • ►  March (176)
    • ►  February (157)
    • ►  January (189)
  • ▼  2022 (2201)
    • ▼  December (181)
      • Contract year reshape
      • Removing duplicates
      • dynamic looped regressions
      • How to use xtcointtest with runby? OR How to do co...
      • Renaming Matrix Rows With Spaces
      • Extracting All Strings After a Word within a Varia...
      • Assigning value from one variable to another if co...
      • Is it possible that log mis-calculates it or creat...
      • Is it possible that log mis-calculates it or creat...
      • Determining level of significance between two medi...
      • Create graphs
      • having issues plotting stock returns in panel data
      • Average marginal effects of continuous variables b...
      • general time trend
      • Meta-analysis help: how to convert hazard ratio to...
      • Downloading data from Public API from json format ...
      • kappaetc / Krippendorff's alpha with missing value...
      • Contrast tables
      • Panel data: Creating a bilateral dataset with coun...
      • Estimating risk difference for binary outcome in c...
      • Balance an unbalanced dataset
      • Weekly means
      • Replace missing values with values from other obse...
      • Using matching with DID for repeated cross section...
      • Add on consecutively by group
      • Example of GMM moment-evaluator program with many ...
      • testing the validy of the instrument in xtivreg
      • how to know whether a placebo test is passed
      • Something wrong with -colorpalette-
      • Avergaing values of variables to reduce survey obs...
      • Stata for loop error
      • Interpreting significant interaction effect while ...
      • Dealing with missing data after using business dat...
      • Year and Industry dummies with GEE
      • How to run a regression excluding certain countrie...
      • How to run a regression excluding certain countrie...
      • Manage dates with different digits
      • Percentage problem regarding loss and networth
      • Merge Datasets with multiple observations for one ...
      • Command likes -fbar- that simultaneously show freq...
      • Comparing two groups of variables with each others
      • Omitted variable (log) in a panel data with FE in ...
      • Computing elasticities in a log-log model with sta...
      • Looping identify autosomal monosomies | regexm
      • About clogit in interventional study
      • Multiply obverations of a variable with a vector t...
      • Create new variable using coefficient from a varia...
      • Regress in fixed time period
      • Weights not working
      • How to fix when option is not allowed (r198)?
      • converting modate format string to numeric
      • placebo test using -permute-
      • Change xlabel in a graph
      • max or min of rows---- bysort and generate
      • What Estimation Method to use? (PSM, Probit, Logit...
      • Second minimum date
      • Qqvalue: q-values the same as p-values
      • Populating month level data from annual level data...
      • Interpreting time-varying coefficients in a fixed ...
      • Difference between interaction term or subgroup an...
      • Beta regressions
      • How to predict survival curve or risk beyond study...
      • Data collapse
      • IV regressions - modifying dates
      • Twoway bar and line with 2 y-axes and labels
      • Keep multiple observations in a variable
      • Interpreting MCA predict rowscores
      • Highlight lines on spaghetti plot
      • Displaying Percent and Rawpercent Statistics as co...
      • variable RR already defined variable Lower not fou...
      • Make zero line thicker
      • max and if together
      • Nearmrg: Please help
      • Within Variation Programming
      • Convert all elements of a matrix into respective i...
      • Creating dummy for unique value
      • Autocorrelation
      • How to plot only interaction terms in coefplot
      • What does xtset do exactly?
      • Running multinomial probit regressions with multip...
      • Comparing xtpedroni and xtcointtest pedroni
      • Position in ranking as dependent variable
      • Replacing the nth word of a string
      • Testing Equality of Gini Coefficients when samples...
      • regressions with Newey–West standard errors for a ...
      • Hausman Test for FE and RE
      • Using a Matrix to Identify Anaemia
      • How to standardize two different samples?
      • Constraining parameters in the nlsur command
      • PS Matching issues
      • Generate Unique Group ID in a Panel Data with Spel...
      • Organizational Structure
      • Need help with surveyset
      • I would like to turn my imperfect edgelist into a ...
      • Merging m:m - it seems to work for me. Why critise...
      • Independent Variable predicts is highly correlated...
      • CEM v/s psmatch2
      • Two way pyramid graph using countries instead of r...
      • Referring to local macros using unab
      • Data manipulation - your experience with merging
    • ►  November (180)
    • ►  October (198)
    • ►  September (182)
    • ►  August (182)
    • ►  July (194)
    • ►  June (174)
    • ►  May (167)
    • ►  April (181)
    • ►  March (186)
    • ►  February (170)
    • ►  January (206)
  • ►  2021 (7379)
    • ►  December (327)
    • ►  November (645)
    • ►  October (646)
    • ►  September (639)
    • ►  August (557)
    • ►  July (649)
    • ►  June (656)
    • ►  May (697)
    • ►  April (683)
    • ►  March (697)
    • ►  February (518)
    • ►  January (665)
  • ►  2020 (7956)
    • ►  December (653)
    • ►  November (659)
    • ►  October (598)
    • ►  September (654)
    • ►  August (660)
    • ►  July (682)
    • ►  June (683)
    • ►  May (708)
    • ►  April (692)
    • ►  March (698)
    • ►  February (638)
    • ►  January (631)
  • ►  2019 (9458)
    • ►  December (601)
    • ►  November (643)
    • ►  October (650)
    • ►  September (637)
    • ►  August (645)
    • ►  July (681)
    • ►  June (654)
    • ►  May (1034)
    • ►  April (1079)
    • ►  March (1122)
    • ►  February (876)
    • ►  January (836)
  • ►  2018 (931)
    • ►  December (692)
    • ►  November (239)

© BJ Data Tech Solution | Theme by Rifki.id | Premium Blogger Templates | PBT | Powered by Blogger |-| About | Privacy Policy | Sitemap | Contact | Disclaimer