Classification and regression tree analysis in public health: methodological review and comparison with logistic regression.
Document Type
Article, Non peer-reviewed
Publication Date
12-3-2003
Abstract
BACKGROUND: Audience segmentation strategies are of increasing interest to public health professionals who wish to identify easily defined, mutually exclusive population subgroups whose members share similar characteristics that help determine participation in a health-related behavior as a basis for targeted interventions. Classification and regression tree (C&RT) analysis is a nonparametric decision tree methodology that has the ability to efficiently segment populations into meaningful subgroups. However, it is not commonly used in public health. PURPOSE: This study provides a methodological overview of C&RT analysis for persons unfamiliar with the procedure. METHODS AND RESULTS: An example of a C&RT analysis is provided and interpretation of results is discussed. Results are validated with those obtained from a logistic regression model that was created to replicate the C&RT findings. Results obtained from the example C&RT analysis are also compared to those obtained from a common approach to logistic regression, the stepwise selection procedure. Issues to consider when deciding whether to use C&RT are discussed, and situations in which C&RT may and may not be beneficial are described. CONCLUSIONS: C&RT is a promising research tool for the identification of at-risk populations in public health research and outreach.
Recommended Citation
Lemon, Stephenie C.; Roy, Jason; Clark, Melissa A.; Friedmann, Peter D.; and Rakowski, William, "Classification and regression tree analysis in public health: methodological review and comparison with logistic regression." (2003). All Scholarly Works. 8468.
https://scholarlycommons.libraryinfo.bhs.org/all_works/8468
PMID
14644693