BJ Data Tech Solution

Specialized on Data processing, Data management Implementation plan, Data Collection tools - electronic and paper base, Data cleaning specifications, Data extraction, Data transformation, Data load, Analytical Datasets, and Data analysis. BJ Data Tech Solutions teaches on design and developing Electronic Data Collection Tools using CSPro, and STATA commands for data manipulation. Setting up Data Management systems using modern data technologies such as Relational Databases, C#, PHP and Android.

  • Home
  • Data Management
  • Data Analysis
  • Data Collection Tools Tips
Help with R squared in Cox model with shared frailty

Help with R squared in Cox model with shared frailty

Wednesday, September 30, 2020 Data Cleaning Data management Data Processing
Hi all, I am fitting a Cox model with shared frailty and I hope to get the (pseudo) R squared for my model. Here is my code: stcox x1 x2...
Tabulation of percentages for an outcome variable by gender for each racial group

Tabulation of percentages for an outcome variable by gender for each racial group

7:25 PM Data Cleaning Data management Data Processing
Hi, I would please like some help with a tabulation issue I am having. So, I have about 10 causes of death and I would like the percentages...
How to add multiple regression lines to a marginsplot graph?

How to add multiple regression lines to a marginsplot graph?

7:25 PM Data Cleaning Data management Data Processing
Hello, I would like to use marginsplot to show many regressions on the same graph. My dependent variable is a scale (0, .5,1, 1.5, 2, 2.5,...
Crosstabulation Question: Options row vs. col

Crosstabulation Question: Options row vs. col

7:25 PM Data Cleaning Data management Data Processing
I know this is a really basic question, but the logic confuses me every time I do crosstabs - even within a few weeks of the last time I did...
Which is the correct approach in coding a dummy variable

Which is the correct approach in coding a dummy variable

7:25 PM Data Cleaning Data management Data Processing
Hi Statalist. I want to generated a dummy variable from a categorical variable with values ranging '0-10'. The range '0-2'...
Knots in Non parametric series regression

Knots in Non parametric series regression

12:25 PM Data Cleaning Data management Data Processing
Dear All, I am Maheswaran Kesavan doing masters in University college London. I am doing an Non parametric series regression using B spl...
Predict based on regression model

Predict based on regression model

12:25 PM Data Cleaning Data management Data Processing
Hi, I estimated on the regression model: Code: reg lnChild lnCash lnWhite lnCash*lnWhite Based on this regression, I want to predict ...
ICC (Intra-class correlation coefficient) vs "9% of this variation in mortality was attributable solely to the surgeon."

ICC (Intra-class correlation coefficient) vs "9% of this variation in mortality was attributable solely to the surgeon."

11:25 AM Data Cleaning Data management Data Processing
I am trying to find out the relationship between a. the ICC for surgeons and the b. the variation due to surgeons. In Udyavar,2018 (The imp...
power analysis for modified poisson regression

power analysis for modified poisson regression

11:25 AM Data Cleaning Data management Data Processing
Hello, I am trying to determine the power of one my analyses. The outcome is binary and relative risk if the effect estimate, determined u...
"Not sorted"

"Not sorted"

11:25 AM Data Cleaning Data management Data Processing
Dear All, I have individual level panel data that includes spells, with a 6 month *follow up* period after each spell as follows: Code: ...
initial values not feasible melogit

initial values not feasible melogit

9:25 AM Data Cleaning Data management Data Processing
Hello -- I am running an melogit on a 330,000 person nested in ~1900 neighborhoods in 47 countries. For my models, I keep getting "Init...
Comparing two datasets by two variables

Comparing two datasets by two variables

9:25 AM Data Cleaning Data management Data Processing
Hi, I am very new to stata. I have x1, y1, z variables in data1.dta and x2, y2, N in data2.dta. I am trying to run an analysis where: Ste...
Log transforming variables

Log transforming variables

9:25 AM Data Cleaning Data management Data Processing
Two questions related to log transforming variables. I understand that we would want to log transform the dependent variable if normal dis...
How to assess if there is enough variation in your dependent variable

How to assess if there is enough variation in your dependent variable

9:25 AM Data Cleaning Data management Data Processing
Is there a simple way to assess if there is enough variation in your dependent variable, or is it best to just run the regression and asses ...
Change in variance time series

Change in variance time series

8:25 AM Data Cleaning Data management Data Processing
Hi everyone, I am analysing a time series (stock returns) and I am trying to check whether variance in the second half of my sample is dif...
xtivreg , first failes with "conformability error" r(503)

xtivreg , first failes with "conformability error" r(503)

8:25 AM Data Cleaning Data management Data Processing
Dear Readers xtivreg without "first" runs fine, but fails when I add the "first" option with "conformability erro...
Split String

Split String

8:25 AM Data Cleaning Data management Data Processing
Code: clear input HAVE WANT1 WANT2 AA01 AA 01 AZ02 AZ 02 AV03 AV 03 AA04 AA 04 AA05 AA 05 A06 A 06 A07 A 07 ...
New version of wridit on SSC

New version of wridit on SSC

7:25 AM Data Cleaning Data management Data Processing
Thanks as always to Kit Baum, a new version of the wridit package is now available for download from SSC. In Stata, use the ssc command to...
compare value different variable and rows

compare value different variable and rows

6:25 AM Data Cleaning Data management Data Processing
hello, I want to compare values of two varible in diffrent rows. For example vr1 vr2 10 20 20 30 40 50 the vr1 has the value 20for ob...
table summarizing 3 categorical variables in string form between two groups (binary variable)

table summarizing 3 categorical variables in string form between two groups (binary variable)

5:25 AM Data Cleaning Data management Data Processing
Hello, I am having a hard time finding examples of summary tables between two groups let's say students who dropped out vs those who d...
Combining two datasets and keeping specific observations

Combining two datasets and keeping specific observations

4:25 AM Data Cleaning Data management Data Processing
I have two datasets. Dataset A and Dataset B. Dataset A is my existing data that I have organized and cleaned. Data examples are given below...
Replacing values from one observation to another

Replacing values from one observation to another

4:25 AM Data Cleaning Data management Data Processing
Dear Statalist users, this may be a trivial question to you, but I am a little bit struggling to do this. My data are as this example ...
Fixed effects regression is doing something I'm not noticing?

Fixed effects regression is doing something I'm not noticing?

4:25 AM Data Cleaning Data management Data Processing
In Stata I complete a fixed-effects conditional logistic regression model of a binary predictor (1==Unemployed | 0 ==Employed) on a binary o...
Interpreting stset- output

Interpreting stset- output

4:25 AM Data Cleaning Data management Data Processing
Hello, I use Stata 15.1 for survival analysis, using a Cox-model (stcox). My master dataset is the UCDP Peace Agreement Dataset V19.1, with...
Suest after fracreg

Suest after fracreg

2:25 AM Data Cleaning Data management Data Processing
Hello all, This seems like a simple question. I wanted to compare coefficients from two models estimated using fracreg command (fraction...
reshaping complex panel data

reshaping complex panel data

12:25 AM Data Cleaning Data management Data Processing
[CODE] * Example generated by -dataex-. To install: ssc install dataex clear input str7 _AIHWperiod int SA3 str20 _AIHWgeoname str61 _AIH...
Logit model for estimate Demand (Berry 1944)

Logit model for estimate Demand (Berry 1944)

12:25 AM Data Cleaning Data management Data Processing
Hello everybody I have the following excercise to estimate Demand allowing for heterogeneal preference shocks : Supermarkets market Ti...
Deciding equation when analyzing by ppml

Deciding equation when analyzing by ppml

Tuesday, September 29, 2020 Data Cleaning Data management Data Processing
Hello My data is panel data with strongly balanced . As I want to know the effect of lpi on export and import I have chosen export and impo...
Non integer weights - problem

Non integer weights - problem

10:25 AM Data Cleaning Data management Data Processing
Hi. I'm using weights from the European social Survey which have three different weights: design, population and post estimation weight...
Mark Highest Recurring Observation

Mark Highest Recurring Observation

10:25 AM Data Cleaning Data management Data Processing
Dear All, Hope you are well. I am wanting to generate a variable that marks the highest recurring violation type for each business. Th...
How to make mother education variable for each observation in a large data

How to make mother education variable for each observation in a large data

8:25 AM Data Cleaning Data management Data Processing
i have data in the form of Parent key KEY indvidual_number_ in_ roaster completed education mother line number in roaster PARENT_KEY KEY_...
How to use a Cox regression model for time-to-event analysis using different years as control

How to use a Cox regression model for time-to-event analysis using different years as control

8:25 AM Data Cleaning Data management Data Processing
Good day Background: I am trying to conduct a retrospective epidemiological study looking at the incidence of a certain disease (on admis...
Redirecting All Ado Paths to New Drive & Folder

Redirecting All Ado Paths to New Drive & Folder

8:25 AM Data Cleaning Data management Data Processing
At some unfortunate time, I named my partitioned drive with the letter "B". I am setting up a new computer and IT requires me to u...
Time series operators not allowed?

Time series operators not allowed?

7:25 AM Data Cleaning Data management Data Processing
Dear All, I would like to create a 5 month *follow up period* after a specific variable takes a value of 1. My data looks as follows: Co...
Dropping all companies with no observation at the start of the time period

Dropping all companies with no observation at the start of the time period

4:25 AM Data Cleaning Data management Data Processing
Hi Statalist, I have a database of a lot of companies for 7 years: 01-2013 to 01-2019. However, in order to calculate certain variables, I...
Solve a system of equations

Solve a system of equations

3:25 AM Data Cleaning Data management Data Processing
Hello everybody! I would like to solve in Stata ( or maybe Mata? I never used it) this system of equations: 99-x=(4999-y)*0.0198 99-x=(2...
I need help on merging two datasets.

I need help on merging two datasets.

3:25 AM Data Cleaning Data management Data Processing
Hello, I am having trouble merging two datasets for my thesis. To reduce clutter, I have only included 4 different variables. 'gvkey...
Choosing between OLS , RE, FE

Choosing between OLS , RE, FE

2:25 AM Data Cleaning Data management Data Processing
Hello I am analyzing the effect of LPI on export and import trade ( using panel data) when I ran OLS, the result was below Code: . reg...
summarizing in Stata

summarizing in Stata

1:25 AM Data Cleaning Data management Data Processing
Hi, I´m a rookie in using Stata and I am stuck at this point. I have an issue using the sum function. I have a data set of 1156 observations...
Comparing categorical variables over time in a randomised cluster trial

Comparing categorical variables over time in a randomised cluster trial

1:25 AM Data Cleaning Data management Data Processing
Good morning everyone, hope you're all well - and hope that you can help with some confusion. I have a dataset where I am looking at a...
Drop duplicate quarterly dates

Drop duplicate quarterly dates

1:25 AM Data Cleaning Data management Data Processing
Hi, I am working with a dataset which consisted of monthly obeservations of two variables m1 and m3. I have now converted these montly dat...
putexcel command - error

putexcel command - error

12:25 AM Data Cleaning Data management Data Processing
Hi all I have a question about the putexcel command. When I run the code, sometimes I get the following error message: file C:\...\rep...
Cumulative event duration with repeated events as a function of follow up time

Cumulative event duration with repeated events as a function of follow up time

12:25 AM Data Cleaning Data management Data Processing
Dear STATAlist I have a dataset with a starting date (different for each id), and different events which can occur repeatedly. What I am i...
Time specification

Time specification

Monday, September 28, 2020 Data Cleaning Data management Data Processing
Dear colleagues, I am working on 15 years of repeated cross-sectional data. I was wondering whether it is an appropriate strategy to incl...
Fillin/expand with panel and different dates

Fillin/expand with panel and different dates

11:25 PM Data Cleaning Data management Data Processing
Hi All, please, could someone help me? I have the data below: Code: * Example generated by -dataex-. To install: ssc install dataex clea...
Inclusion of both Age and Time Indicators in Panel Data Analysis

Inclusion of both Age and Time Indicators in Panel Data Analysis

10:25 PM Data Cleaning Data management Data Processing
Dear Colleagues, I am analyzing a panel data that collects information on children every two years since 2010, so the panel data has a tot...
Newer Posts Older Posts Home
Subscribe to: Posts (Atom)

Latest Articles

Categories

  • CouchDb Skills
  • Data Analysis
  • Data Cleaning
  • Data management
  • Data Processing
  • Research Methodology

Popular Articles

  • How to drop random years from panel data?
    I have a panel data set, consisting of 125 countries, 36 years. I want to run an IV regression multible times and randomly drop 5 (of the 36...
  • Saving pointer matrixes using -mata matsave-
    I am relatively new to the use of pointers in Mata and have thusfar been impressed with their utility. Specific to this query, I have been...
  • instrumenting a binary endogenous regressor
    Hello, I am trying to run a model with a binary endogenous regressor. I am still learning econometrics so I am sorry if this may be a trivi...
  • "tsegen" by group
    Hi, I would like to calculate the moving average of _b_LogSize _b_LogBM _b_MOM12 _b_cons by months of the year over the last 10 years. For...
  • Fixed Effects for a Panel at a Coarser Level
    Hello, I want to include some fixed effects in my model that I believe are difficult to include so any advice on how exactly this can be d...
  • RDD rdrobust problem
    Dear all, I am researching the effect of grade retention on exam results (which can vary from 0 to 20) and I am using a RDD to research th...
  • Growth model - No convergence
    I would like to develop a latent growth model (LGM) with Stata. The point is to illustrate estimated effects of predictors by using Stata...
  • Nvidia Organizational Structure: functional and hybrid
    Nvidia is 7th largest company in the world with a market cap of USD 1 trillion. Due to the size and scope of its operations, it is difficult...
  • Getting values from second to last loop of a -while- loop
    Hi fellow Statalisters, I am using a -while- loop for a particular application, where I need to retrieve a particular value from the secon...
  • Using weights with xtheckman | xtheckman's fixed effects equivalent
    Hi, I am using six waves of the PSID to estimate several determinants (particularly wealth) of the wage equation and the selection equatio...

Recomended Articles

Powered by Blogger.

About Me

Mtenga Baltazar
View my complete profile

Blog Archive

  • ►  2024 (6)
    • ►  February (6)
  • ►  2023 (877)
    • ►  November (1)
    • ►  October (9)
    • ►  September (14)
    • ►  July (9)
    • ►  June (15)
    • ►  May (133)
    • ►  April (174)
    • ►  March (176)
    • ►  February (157)
    • ►  January (189)
  • ►  2022 (2201)
    • ►  December (181)
    • ►  November (180)
    • ►  October (198)
    • ►  September (182)
    • ►  August (182)
    • ►  July (194)
    • ►  June (174)
    • ►  May (167)
    • ►  April (181)
    • ►  March (186)
    • ►  February (170)
    • ►  January (206)
  • ►  2021 (7379)
    • ►  December (327)
    • ►  November (645)
    • ►  October (646)
    • ►  September (639)
    • ►  August (557)
    • ►  July (649)
    • ►  June (656)
    • ►  May (697)
    • ►  April (683)
    • ►  March (697)
    • ►  February (518)
    • ►  January (665)
  • ▼  2020 (7956)
    • ►  December (653)
    • ►  November (659)
    • ►  October (598)
    • ▼  September (654)
      • Help with R squared in Cox model with shared frailty
      • Tabulation of percentages for an outcome variable ...
      • How to add multiple regression lines to a marginsp...
      • Crosstabulation Question: Options row vs. col
      • Which is the correct approach in coding a dummy va...
      • Knots in Non parametric series regression
      • Predict based on regression model
      • ICC (Intra-class correlation coefficient) vs "9% o...
      • power analysis for modified poisson regression
      • "Not sorted"
      • initial values not feasible melogit
      • Comparing two datasets by two variables
      • Log transforming variables
      • How to assess if there is enough variation in your...
      • Change in variance time series
      • xtivreg , first failes with "conformability error"...
      • Split String
      • New version of wridit on SSC
      • compare value different variable and rows
      • table summarizing 3 categorical variables in strin...
      • Combining two datasets and keeping specific observ...
      • Replacing values from one observation to another
      • Fixed effects regression is doing something I'm no...
      • Interpreting stset- output
      • Suest after fracreg
      • reshaping complex panel data
      • Logit model for estimate Demand (Berry 1944)
      • Deciding equation when analyzing by ppml
      • Non integer weights - problem
      • Mark Highest Recurring Observation
      • How to make mother education variable for each obs...
      • How to use a Cox regression model for time-to-even...
      • Redirecting All Ado Paths to New Drive & Folder
      • Time series operators not allowed?
      • Dropping all companies with no observation at the ...
      • Solve a system of equations
      • I need help on merging two datasets.
      • Choosing between OLS , RE, FE
      • summarizing in Stata
      • Comparing categorical variables over time in a ran...
      • Drop duplicate quarterly dates
      • putexcel command - error
      • Cumulative event duration with repeated events as ...
      • Time specification
      • Fillin/expand with panel and different dates
      • Inclusion of both Age and Time Indicators in Panel...
      • How to reset a loop with a continues set of codes?
      • Using value labels in graphs through loops
      • How to use the results of LCA(Latent Class Analysi...
      • 'SWEXP': piecewise exponential distribution function
      • Calculating 95% CI for a chi square analysis?
      • Reshaping data long to wide(NSS, 64th round)
      • putexcel with dates as row names
      • Using mysuest to compare coefficients from differe...
      • how to export stata result to excel using table an...
      • Negative independent values
      • Using rsens for sensitivity analysis after PSM
      • bootstrap likelihood ratio OR Lo-Mendell-Rubin adj...
      • Replacing missing values with the mean by group vi...
      • Different results xtmelogit vs. melogit
      • Using only end of month values for a daily dataset?
      • Assigning preferred IDs to to District Names of a ...
      • Variable assigning number to dates within periods
      • Do I need to check for heteroscedasticity in media...
      • Building time windows from years, starting from mo...
      • Removing duplicate words from a string variable
      • Not recounting new CEO role
      • Do I add mediator and control variables in Wooldri...
      • Exporting variable names, labels, and value codes ...
      • loop for finding lowest value of one
      • 3 way graph
      • Merging all datasets in directory
      • tabout version 3 topf and botf options not working...
      • ITSA Diagnostics
      • Child immunization for Nigeria
      • Replicating graph
      • Trying to make a plot with median and CI
      • Regression with Indicator Function splitting Indep...
      • Generating decile portfolio's
      • Stata/MP 16.1 Processing Speed
      • Using runiformint(a,b) - Create each Value between...
      • Sampclus vs Power (cluster) for observations per c...
      • Clustering under few treated clusters
      • Assumptions linear regressions
      • Question On Extracting Value Label and assign them...
      • World Health Organisation Cardio vasulcar risk chart
      • Read yaml file
      • How can I test CFA between formative and reflectiv...
      • Heckman Model for Selection Bias with Panel Data
      • Fitstat in STATA 16
      • Country and years fixed effect vs Random
      • getting result of table command in excel
      • Making Tables from Matrices using esttab
      • Making Tables from Matrices using esttab
      • Very novice question re calculating Wald test
      • panel ardl
      • Fractional logistic regression equation
      • Matrix of measures of association as a graphical t...
      • Matrix of measures of association as a graphical t...
      • Different test results using lincom and margins, o...
    • ►  August (660)
    • ►  July (682)
    • ►  June (683)
    • ►  May (708)
    • ►  April (692)
    • ►  March (698)
    • ►  February (638)
    • ►  January (631)
  • ►  2019 (9458)
    • ►  December (601)
    • ►  November (643)
    • ►  October (650)
    • ►  September (637)
    • ►  August (645)
    • ►  July (681)
    • ►  June (654)
    • ►  May (1034)
    • ►  April (1079)
    • ►  March (1122)
    • ►  February (876)
    • ►  January (836)
  • ►  2018 (931)
    • ►  December (692)
    • ►  November (239)

© BJ Data Tech Solution | Theme by Rifki.id | Premium Blogger Templates | PBT | Powered by Blogger |-| About | Privacy Policy | Sitemap | Contact | Disclaimer