Differential Privacy Protections in 2020 U.S. Decennial Census Data Do Not Impede Measurement of Racial and Ethnic Disparities

Joshua Snoke, Ann C. Haas, Steven C. Martino, Marc N. Elliott

ResearchPosted on rand.org May 16, 2024Published in: Medical Care Research and Review. doi: 10.1177/10775587241251870

Census data are vital to health care research but must also protect respondents' confidentiality. The 2020 decennial Census employs a new Differential Privacy framework; this study examines its effect on the accuracy of an important tool for measuring health disparities, the Bayesian Improved Surname and Geocoding (BISG) algorithm, which uses Census Block Group data to estimate race and ethnicity when self-reported data are unavailable. Using self-reported race and ethnicity data as our standard, we compared the accuracy of BISG estimates calculated using the original 2010 Census counts to the accuracy of estimates calculated using 2010 data but with 2020 Differential Privacy in place. The Differential Privacy methodology slightly decreases BISG accuracy for American Indian and Alaska Native people but has little effect for other groups, suggesting that the methodology will not impede health disparities research that employs BISG and similar methods.

Topics

Document Details

  • Publisher: SAGE Publications Inc
  • Availability: Non-RAND
  • Year: 2024
  • Pages: 5
  • Document Number: EP-70474

Research conducted by

This publication is part of the RAND external publication series. Many RAND studies are published in peer-reviewed scholarly journals, as chapters in commercial books, or as documents published by other organizations.

RAND is a nonprofit institution that helps improve policy and decisionmaking through research and analysis. RAND's publications do not necessarily reflect the opinions of its research clients and sponsors.