New Williams et al. Research on Improving Survey Inference

November 25, 2024

Improving Survey Inference Using Administrative Records Without Releasing Individual-Level Continuous Data

Abstract

Probability surveys are challenged by increasing nonresponse rates, resulting in biased statistical inference. Auxiliary information about populations can be used to reduce bias in estimation. Continuous auxiliary variables in administrative records are often discretized before release to the public to avoid confidentiality breaches. This may weaken the utility of the administrative records in improving survey estimates, particularly when there is a strong relationship between the continuous auxiliary information and the survey outcome. In this paper, we propose a two-step strategy. In the first step, statistical agencies use the confidential continuous auxiliary data in the population to estimate the response propensity score of the survey sample, which is then included in a modified population dataset for data users. In the second step, data users who do not have access to the confidential continuous auxiliary data conduct predictive survey inference by including the discretized continuous variables and the propensity score as predictors using splines in a Bayesian model. We show by simulation that the proposed method performs well, yielding more efficient estimates of population means with 95% credible intervals providing better coverage than alternative approaches. We illustrate the proposed method using data from the Ohio Army National Guard Mental Health Initiative (OHARNG-MHI). The methods developed in this work are readily available in the R package AuxSurvey.
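As a rough illustration of the two-step idea described in the abstract, the Python sketch below simulates a population, has an "agency" estimate response propensities from a confidential continuous variable, and then lets a "data user" predict the outcome from only the discretized variable and the released propensity score. It is not the authors' implementation and does not use the AuxSurvey package. It makes several simplifying assumptions: every population unit is treated as sampled so that only nonresponse matters, the Bayesian spline model is swapped for ordinary least squares with a cubic polynomial in the logit of the propensity score, and all variable names and simulation settings are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical population: confidential continuous auxiliary x, outcome y tied to x.
N = 20000
x = rng.normal(size=N)
y = 2.0 + 1.5 * x + rng.normal(scale=0.5, size=N)

# Response depends on x, so respondents are not representative of the population.
true_p = 1.0 / (1.0 + np.exp(-(-1.0 + 1.0 * x)))
responded = rng.random(N) < true_p


def fit_logistic(X, t, iters=25):
    """Logistic regression by Newton-Raphson; X must include an intercept column."""
    beta = np.zeros(X.shape[1])
    for _ in range(iters):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        w = p * (1.0 - p)
        grad = X.T @ (t - p)
        hess = X.T @ (X * w[:, None])
        beta += np.linalg.solve(hess, grad)
    return beta


# Step 1 (agency side): estimate the response propensity from the confidential x.
X_conf = np.column_stack([np.ones(N), x])
beta = fit_logistic(X_conf, responded.astype(float))
pscore = 1.0 / (1.0 + np.exp(-X_conf @ beta))

# Released data: x appears only as quartile categories, alongside the propensity score.
cuts = np.quantile(x, [0.25, 0.5, 0.75])
x_cat = np.digitize(x, cuts)  # integer codes 0..3


def design(x_cat, pscore):
    """Design matrix: intercept, category dummies, cubic polynomial in logit(pscore)."""
    dummies = (x_cat[:, None] == np.arange(1, 4)).astype(float)
    lp = np.log(pscore / (1.0 - pscore))
    return np.column_stack([np.ones(len(x_cat)), dummies, lp, lp**2, lp**3])


# Step 2 (data user side): fit the outcome model on respondents only,
# then predict the outcome for nonrespondents and average over the population.
D = design(x_cat, pscore)
coef, *_ = np.linalg.lstsq(D[responded], y[responded], rcond=None)
y_filled = np.where(responded, y, D @ coef)

true_mean = y.mean()
naive_mean = y[responded].mean()
predictive_mean = y_filled.mean()

print(f"true mean:       {true_mean:.3f}")
print(f"respondent mean: {naive_mean:.3f}")
print(f"predictive mean: {predictive_mean:.3f}")
```

In this toy setting the released propensity score carries the continuous information that the quartile categories lose, which is why the predictive estimate can track the population mean more closely than the respondent mean does. A fuller treatment would use a spline basis and a Bayesian fit to obtain credible intervals, as the abstract describes.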

Keywords: Bayesian predictive inference; Rstan; continuous auxiliary variables; generalized additive model; inclusion propensity; poststratification.

Citation

Williams SZ, Zou J, Liu Y, Si Y, Galea S, Chen Q. Improving Survey Inference Using Administrative Records Without Releasing Individual-Level Continuous Data. Stat Med. 2024 Nov 18. doi: 10.1002/sim.10270. Epub ahead of print. PMID: 39557420.
