Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies

In a study comparing the effects of two treatments, the propensity score is the probability of assignment to one treatment conditional on a subject's measured baseline covariates. Propensity-score matching is increasingly being used to estimate the effects of exposures using observational data....

Full description

Bibliographic Details
Main Author: Austin, Peter C
Format: Online
Language:English
Published: John Wiley & Sons, Ltd. 2011
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3120982/
id pubmed-3120982
recordtype oai_dc
spelling pubmed-31209822011-06-28 Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies Austin, Peter C Main Paper In a study comparing the effects of two treatments, the propensity score is the probability of assignment to one treatment conditional on a subject's measured baseline covariates. Propensity-score matching is increasingly being used to estimate the effects of exposures using observational data. In the most common implementation of propensity-score matching, pairs of treated and untreated subjects are formed whose propensity scores differ by at most a pre-specified amount (the caliper width). There has been a little research into the optimal caliper width. We conducted an extensive series of Monte Carlo simulations to determine the optimal caliper width for estimating differences in means (for continuous outcomes) and risk differences (for binary outcomes). When estimating differences in means or risk differences, we recommend that researchers match on the logit of the propensity score using calipers of width equal to 0.2 of the standard deviation of the logit of the propensity score. When at least some of the covariates were continuous, then either this value, or one close to it, minimized the mean square error of the resultant estimated treatment effect. It also eliminated at least 98% of the bias in the crude estimator, and it resulted in confidence intervals with approximately the correct coverage rates. Furthermore, the empirical type I error rate was approximately correct. When all of the covariates were binary, then the choice of caliper width had a much smaller impact on the performance of estimation of risk differences and differences in means. Copyright © 2010 John Wiley & Sons, Ltd. John Wiley & Sons, Ltd. 2011-03 2010-04-27 /pmc/articles/PMC3120982/ /pubmed/20925139 http://dx.doi.org/10.1002/pst.433 Text en Copyright © 2011 John Wiley & Sons, Ltd. http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation.
repository_type Open Access Journal
institution_category Foreign Institution
institution US National Center for Biotechnology Information
building NCBI PubMed
collection Online Access
language English
format Online
author Austin, Peter C
spellingShingle Austin, Peter C
Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies
author_facet Austin, Peter C
author_sort Austin, Peter C
title Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies
title_short Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies
title_full Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies
title_fullStr Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies
title_full_unstemmed Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies
title_sort optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies
description In a study comparing the effects of two treatments, the propensity score is the probability of assignment to one treatment conditional on a subject's measured baseline covariates. Propensity-score matching is increasingly being used to estimate the effects of exposures using observational data. In the most common implementation of propensity-score matching, pairs of treated and untreated subjects are formed whose propensity scores differ by at most a pre-specified amount (the caliper width). There has been a little research into the optimal caliper width. We conducted an extensive series of Monte Carlo simulations to determine the optimal caliper width for estimating differences in means (for continuous outcomes) and risk differences (for binary outcomes). When estimating differences in means or risk differences, we recommend that researchers match on the logit of the propensity score using calipers of width equal to 0.2 of the standard deviation of the logit of the propensity score. When at least some of the covariates were continuous, then either this value, or one close to it, minimized the mean square error of the resultant estimated treatment effect. It also eliminated at least 98% of the bias in the crude estimator, and it resulted in confidence intervals with approximately the correct coverage rates. Furthermore, the empirical type I error rate was approximately correct. When all of the covariates were binary, then the choice of caliper width had a much smaller impact on the performance of estimation of risk differences and differences in means. Copyright © 2010 John Wiley & Sons, Ltd.
publisher John Wiley & Sons, Ltd.
publishDate 2011
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3120982/
_version_ 1611461609410002944