Impact of Small Group Size on Neighbourhood Influences in Multilevel Models

Published in: Journal of Epidemiology and Community Health, v. 65, no. 8, Aug. 2011, p. 688-695

Posted on on August 01, 2011

by Katherine P. Theall, Richard Scribner, Stephanie Broyles, Qingzhao Yu, Jigar Chotalia, Neal Simonsen, Matthias Schonlau, Bradley P. Carlin

Read More

Access further information on this document at Journal of Epidemiology and Community Health

This article was published outside of RAND. The full text of the article can be found at the link above.

BACKGROUND: Given the growing availability of multilevel data from national surveys, researchers interested in contextual effects may find themselves with a small number of individuals per group. Although there is a growing body of literature on sample size in multilevel modelling, few have explored the impact of group sizes of less than five. METHODS: In a simulated analysis of real data, the impact of a group size of less than five was examined on both a continuous and dichotomous outcome in a simple two-level multilevel model. Models with group sizes one to five were compared with models with complete data. Four different linear and logistic models were examined: empty models; models with a group-level covariate; models with an individual-level covariate and models with an aggregated group-level covariate. The study evaluated further whether the impact of small group size differed depending on the total number of groups. RESULTS: When the number of groups was large (N=459), neither fixed nor random components were affected by small group size, even when 90% of tracts had only one individual per tract and even when an aggregated group-level covariate was examined. As the number of groups decreased, the SE estimates of both fixed and random effects were inflated. Furthermore, group-level variance estimates were more affected than were fixed components. CONCLUSIONS: Datasets in which there is a small to moderate number of groups, with the majority of very small group size (n<5), size may fail to find or even consider a group-level effect when one may exist and also may be underpowered to detect fixed effects.

This report is part of the RAND Corporation External publication series. Many RAND studies are published in peer-reviewed scholarly journals, as chapters in commercial books, or as documents published by other organizations.

The RAND Corporation is a nonprofit institution that helps improve policy and decisionmaking through research and analysis. RAND's publications do not necessarily reflect the opinions of its research clients and sponsors.