Article,

A critical appraisal of propensity-score matching in the medical literature between 1996 and 2003

P. Austin.
Stat Med, 27 (12): 2037-2049 (May 2008)
DOI: 10.1002/sim.3150

Abstract

Propensity-score methods are increasingly being used to reduce the impact of treatment-selection bias in the estimation of treatment effects using observational data. Commonly used propensity-score methods include covariate adjustment using the propensity score, stratification on the propensity score, and propensity-score matching. Empirical and theoretical research has demonstrated that matching on the propensity score eliminates a greater proportion of baseline differences between treated and untreated subjects than does stratification on the propensity score. However, the analysis of propensity-score-matched samples requires statistical methods appropriate for matched-pairs data. We critically evaluated 47 articles that were published between 1996 and 2003 in the medical literature and that employed propensity-score matching. We found that only two of the articles reported the balance of baseline characteristics between treated and untreated subjects in the matched sample and used correct statistical methods to assess the degree of imbalance. Thirteen (28 per cent) of the articles explicitly used statistical methods appropriate for the analysis of matched data when estimating the treatment effect and its statistical significance. Common errors included using the log-rank test to compare Kaplan-Meier survival curves in the matched sample, using Cox regression, logistic regression, chi-squared tests, t-tests, and Wilcoxon rank sum tests in the matched sample, thereby failing to account for the matched nature of the data. We provide guidelines for the analysis and reporting of studies that employ propensity-score matching.

BibTeX key: Austin:2008:Stat-Med:18038446
entry type: article
year: 2008
month: may
journal: Stat Med
number: 12
pages: 2037-2049
volume: 27
pmid: 18038446
DOI: 10.1002/sim.3150
url: https://www.ncbi.nlm.nih.gov/pubmed/18038446

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@article{Austin:2008:Stat-Med:18038446, abstract = {Propensity-score methods are increasingly being used to reduce the impact of treatment-selection bias in the estimation of treatment effects using observational data. Commonly used propensity-score methods include covariate adjustment using the propensity score, stratification on the propensity score, and propensity-score matching. Empirical and theoretical research has demonstrated that matching on the propensity score eliminates a greater proportion of baseline differences between treated and untreated subjects than does stratification on the propensity score. However, the analysis of propensity-score-matched samples requires statistical methods appropriate for matched-pairs data. We critically evaluated 47 articles that were published between 1996 and 2003 in the medical literature and that employed propensity-score matching. We found that only two of the articles reported the balance of baseline characteristics between treated and untreated subjects in the matched sample and used correct statistical methods to assess the degree of imbalance. Thirteen (28 per cent) of the articles explicitly used statistical methods appropriate for the analysis of matched data when estimating the treatment effect and its statistical significance. Common errors included using the log-rank test to compare Kaplan-Meier survival curves in the matched sample, using Cox regression, logistic regression, chi-squared tests, t-tests, and Wilcoxon rank sum tests in the matched sample, thereby failing to account for the matched nature of the data. We provide guidelines for the analysis and reporting of studies that employ propensity-score matching.}, added-at = {2019-10-28T04:45:14.000+0100}, author = {Austin, P C}, biburl = {https://www.bibsonomy.org/bibtex/2eb0b6555ab4287ddef3a757806fd9de5/jkd}, description = {A critical appraisal of propensity-score matching in the medical literature between 1996 and 2003. - PubMed - NCBI}, doi = {10.1002/sim.3150}, interhash = {fa5e515adb21ae7b1ca610b2efe0116a}, intrahash = {eb0b6555ab4287ddef3a757806fd9de5}, journal = {Stat Med}, keywords = {CausalInference statistics}, month = may, number = 12, pages = {2037-2049}, pmid = {18038446}, timestamp = {2019-10-28T04:45:14.000+0100}, title = {A critical appraisal of propensity-score matching in the medical literature between 1996 and 2003}, url = {https://www.ncbi.nlm.nih.gov/pubmed/18038446}, volume = 27, year = 2008 }

BibSonomy

A critical appraisal of propensity-score matching in the medical literature between 1996 and 2003

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on