Imputation of SF-12 Health Scores for Respondents with Partially Missing Data

Published in: Health Services Research, v. 40, no. 3, June 2005, p. 905-921

Posted on January 01, 2005

by Honghu H. Liu, Ron D. Hays, John L. Adams, Wen-Pin Chen, Diana M. Tisnado, Carol Mangione, Cheryl L. Damberg, Katherine L. Kahn


Access further information on this document at

This article was published outside of RAND. The full text of the article can be found at the link above.

OBJECTIVE: To create an efficient algorithm for imputing the SF-12 physical component summary (PCS) and mental component summary (MCS) scores when patients have one to eleven SF-12 items missing.

STUDY SETTING: Primary data collection was performed between 1996 and 1998.

STUDY DESIGN: Multi-pattern regression was conducted to impute the scores using only the available SF-12 items (simple model), and was then supplemented by demographics, smoking status, and comorbidity (enhanced model) to increase accuracy. A cut point in the number of missing SF-12 items was determined for choosing between the simple and the enhanced model. The algorithm was validated through simulation.

DATA COLLECTION: A total of 30,308 patients from 63 physician groups were surveyed for a quality of care study in 1996, which collected the SF-12 and other information. Patients were classified as chronic patients if they reported that they had diabetes, heart disease, asthma/chronic obstructive pulmonary disease, or low back pain. A follow-up survey was conducted in 1998.

PRINCIPAL FINDINGS: Thirty-one percent of the patients were missing at least one SF-12 item. The mean variance of prediction and the standard errors of the mean imputed scores increased with the number of missing SF-12 items. Correlations between the observed and the imputed scores derived from the enhanced models were consistently higher than those derived from the simple model, and the increments were significant for patients with 6 missing SF-12 items (p < .03).

CONCLUSION: Missing SF-12 items are prevalent and lead to reduced analytical power. Regression-based multi-pattern imputation using the available SF-12 items is efficient and can produce good estimates of the scores. The enhancement from the additional patient information can significantly improve the accuracy of the imputed scores for patients with 6 items missing, leading to estimated scores that are as accurate as those for patients with fewer than 6 missing items.
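The multi-pattern regression idea described in the abstract can be illustrated with a short Python sketch. This is not the authors' implementation. It assumes ordinary least squares fitted on complete cases, one model per distinct missing-item pattern, and the names impute_scores, covars, and cut are hypothetical. The default cut of 6 is only a placeholder suggested by the findings above, since the abstract does not state the cut point that was chosen.

```python
import numpy as np


def _ols_predict(train_X, train_y, new_X):
    """Fit least squares with an intercept on the training rows, then predict."""
    A = np.column_stack([np.ones(len(train_X)), train_X])
    beta, *_ = np.linalg.lstsq(A, train_y, rcond=None)
    return np.column_stack([np.ones(len(new_X)), new_X]) @ beta


def impute_scores(items, score, covars=None, cut=6):
    """Impute a summary score for rows with missing items (hypothetical helper).

    items  : (n, k) array of item responses, NaN where an item is missing
    score  : (n,) array, known for complete rows, NaN otherwise
    covars : optional (n, p) array of fully observed covariates
    cut    : assumed threshold; patterns with at least this many missing
             items also use the covariates (the "enhanced" model)
    """
    items = np.asarray(items, dtype=float)
    out = np.array(score, dtype=float)  # copy, so the input is not modified
    if covars is not None:
        covars = np.asarray(covars, dtype=float)

    missing = np.isnan(items)
    complete = ~missing.any(axis=1)
    if complete.all():
        return out

    # One regression per distinct missing-item pattern among incomplete rows.
    for pattern in np.unique(missing[~complete], axis=0):
        rows = (missing == pattern).all(axis=1)
        observed = ~pattern
        use_covars = covars is not None and pattern.sum() >= cut

        def design(mask):
            # Predictors: the items observed under this pattern,
            # plus the covariates when the enhanced model applies.
            parts = [items[mask][:, observed]]
            if use_covars:
                parts.append(covars[mask])
            return np.column_stack(parts)

        # Train on complete cases only, predict for rows with this pattern.
        out[rows] = _ols_predict(design(complete), out[complete], design(rows))
    return out
```

A simulation check in the spirit of the abstract would take complete cases, blank out items according to the observed missing patterns, call impute_scores, and compare the imputed scores with the known ones, for example through their correlation.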

This report is part of the RAND Corporation External publication series. Many RAND studies are published in peer-reviewed scholarly journals, as chapters in commercial books, or as documents published by other organizations.

The RAND Corporation is a nonprofit institution that helps improve policy and decisionmaking through research and analysis. RAND's publications do not necessarily reflect the opinions of its research clients and sponsors.