Balance analysis for treatment effects

Order

<- See Stata's other features

Highlights

Test for balance for inverse-probability-weighted estimators
Comparison of model-adjusted covariate distributions across treatment groups

Summary statistics
Box plots
Kernel density plots

Kernel density plot comparing propensity scores across treatment groups

Treatment-effects models extract experimental-style causal effects from observational data.

In experimental data, treatment groups must be assigned randomly, meaning characteristics across groups will be approximately equal.

Treatment-effects estimators for observational data use a variety of techniques, including reweighting and matching, to achieve experimental-like balanced data results.

If the reweighting or matching is successful, the distribution of each covariate should be the same across treatment groups. In such cases, we say that the treatment model “balanced” the covariates.

We can examine whether the treatment model balanced the covariates and perform a statistical test.

Three diagnostics and one test are provided.

One diagnostic reports, for each covariate, the model-adjusted difference in means in the treatment groups and the ratio of variances.
Another diagnostic graphs the model-adjusted estimated densities of covariates; these densities can be examined visually to verify that they are approximately equal.
The third graphical diagnostic is the same as the second but uses box plots rather than probability densities.
The statistical test is an overidentification test. It tests whether the model-adjusted means of the covariates are the same between groups.

Let's see it work

Say that we estimate the effect of smoking during pregnancy on infant birthweight using an inverse-probability-weighted (IPW) treatment-effects estimator.

We assume that treatment (smoking during pregnancy) is determined by marital status, the mother's age, attendance to prenatal care during the first quarter of pregnancy, and whether this is the mother's first pregnancy.

. teffects ipw (bweight) (mbsmoke mmarried mage prenatal1 fbaby c.mage#(c.mage 
     i.mmarried prenatal1))

Iteration 0:   EE criterion =  9.365e-20
Iteration 1:   EE criterion =  2.884e-26

Treatment-effects estimation                     Number of obs     =      4,642
Estimator      : inverse-probability weights
Outcome model  : weighted mean
Treatment model: logit

                               Robust
       bweight  Coefficient  std. err.      z    P>|z|     [95% conf. interval]

ATE            
               mbsmoke 
(Smoker vs Nonsmoker)    -239.6875   26.43427    -9.07   0.000    -291.4977   -187.8773

POmean         
               mbsmoke 
            Nonsmoker     3403.638    9.56792   355.73   0.000     3384.885     3422.39

We find that the average treatment effect (ATE) is -240 grams.

Have we done an adequate job of balancing the covariates so that we can trust the estimated treatment effect?

We could use the overidentification test:

. tebalance overid, nolog

Overidentification test for covariate balance
H0: Covariates are balanced:

         chi2(8)      =  11.8612
         Prob > chi2  =   0.1575

We cannot reject the null hypothesis that the covariates are balanced, and that's good.

We can look at the various diagnostics (and in real life, we probably would have used the diagnostics before using the statistical test).

tebalance summarize reports the model-adjusted difference in means and ratio of variances between the treated and untreated for each covariate:

. tebalance summarize

Covariate balance summary

                         Raw     Weighted
 
Number of obs =        4,642      4,642.0
Treated obs   =          864      2,329.1
Control obs   =        3,778      2,312.9




                   Standardized differences          Variance ratio
                          Raw    Weighted           Raw   Weighted

         mmarried   -.5953009    .0053497      1.335944   .9953184
             mage    -.300179    .0410889      .8818025   1.076571
        prenatal1   -.3242695    .0009807      1.496155   .9985165
            fbaby   -.1663271   -.0130638      .9430944   .9965406
                  
             mage# 
             mage   -.3028275    .0477465      .8274389   1.109134
                  
         mmarried# 
             mage 
         married    -.6329701    .0197209      1.157026   1.034108
                  
        prenatal1# 
             mage 
             Yes    -.4053969    .0182109      1.226363   1.032561

Ignore the raw columns, at least to begin, and focus on the weighted columns. Differences in weighted means are negligible, and variance ratios are all near one. The Raw columns show where we started, and, before weighting, differences were large.

tebalance can show us densities or box plots so that we can examine the entire distribution. Below we have put the graphs produced by tebalance density and tebalance box together:

Tests and diagnostics confirm that our model balances the covariates.

Tell me more

To find out more about checking for balance after teffects or stteffects, see [CAUSAL] tebalance.

Products

New in Stata 19

Why Stata

All features

Disciplines

Stata/MP

StataNow

Order Stata

Purchase

Order Stata

Bookstore

Stata Press

Stata Journal

Gift Shop

Learn

Free webinars

NetCourses

Classroom and web training

Organizational training

Video tutorials

Third-party courses

Web resources

Teaching with Stata

Support

Training

Video tutorials

FAQs

Statalist: The Stata Forum

Resources

Technical support

Customer service

Alerts

Company

News and events

Customer service

Careers

We use cookies

We use cookies to ensure that we give you the best experience on our website—to enhance site navigation, to analyze usage, and to assist in our marketing efforts. By continuing to use our site, you consent to the storing of cookies on your device and agree to delivery of content, including web fonts and JavaScript, from third party web services.

Cookie Settings

Privacy policy

Last updated: 16 November 2022

StataCorp LLC (StataCorp) strives to provide our users with exceptional products and services. To do so, we must collect personal information from you. This information is necessary to conduct business with our existing and potential customers. We collect and use this information only where we may legally do so. This policy explains what personal information we collect, how we use it, and what rights you have to that information.

Required cookies

Advertising cookies

Required cookies

These cookies are essential for our website to function and do not store any personally identifiable information. These cookies cannot be disabled.
Advertising and performance cookies

This website uses cookies to provide you with a better user experience. A cookie is a small piece of data our website stores on a site visitor's hard drive and accesses each time you visit so we can improve your access to our site, better understand how you use our site, and serve you content that may be of interest to you. For instance, we store a cookie when you log in to our shopping cart so that we can maintain your shopping cart should you not complete checkout. These cookies do not directly store your personal information, but they do support the ability to uniquely identify your internet browser and device.

Please note: Clearing your browser cookies at any time will undo preferences saved here. The option selected here will apply only to the device you are currently using.

Accept Cookies


		Robust
bweight		Coefficient std. err. z P>\|z\| [95% conf. interval]

ATE
mbsmoke
(Smoker vs Nonsmoker)		-239.6875 26.43427 -9.07 0.000 -291.4977 -187.8773

POmean
mbsmoke
Nonsmoker		3403.638 9.56792 355.73 0.000 3384.885 3422.39


		Standardized differences Variance ratio
		Raw Weighted Raw Weighted

mmarried		-.5953009 .0053497 1.335944 .9953184
mage		-.300179 .0410889 .8818025 1.076571
prenatal1		-.3242695 .0009807 1.496155 .9985165
fbaby		-.1663271 -.0130638 .9430944 .9965406

mage#
mage		-.3028275 .0477465 .8274389 1.109134

mmarried#
mage
married		-.6329701 .0197209 1.157026 1.034108

prenatal1#
mage
Yes		-.4053969 .0182109 1.226363 1.032561