A Propensity Score Matched Case-Control
Study of Historical Physical Activity and
Mortality in U.S. Adults
Peter D Hart1,2,3*
1Health Promotion Program, Montana State University, France
2Kinesmetrics Lab, Montana State University, France
3Hea!th Demographics, Montana State University, France
Submission: November 3, 2017; Published: December 15, 2017
*Corresponding author: Peter D Hart, Associate Professor, Health Promotion College of Education, Arts & Sciences and Nursing Montana State University, Northern PO Box 7751 Havre, MT 59501-7751 Tel: 4062653719; Fax: 4062654129; Email: peter.hart@msun.edu
How to cite this article: Peter D Hart. A Propensity Score Matched Case-Control Study of Historical Physical Activity and Mortality in U.S. Adults. JOJ Pub Health. 2017; 2(5): 555600. DOI: 10.19080/JOJPH.2017.02.555600.
Abstract
Background: Propensity score matching (PSM) is a technique that matches individuals based on their predicted probability relative to specified characteristics. In a case-control study, controls can be selected using PSM so that they are similar to cases on all relevant covariates.The purpose of this study was to use PSM to investigate the relationship between historical physical activity (PA) and mortality in a large national survey.
Methods: Data for this research came from the 2001-02 NHANES and its corresponding linked mortality file. PA was determined by the reporting of moderate and vigorous leisure-time PA. A measure of sedentary (ST) behavior was also used and determined by reported screen time. Three separate PSM studies were carried out: all-cause mortality, cancer mortality, and coronary heart disease (CHD) mortality. Controls were propensity score matched to cases in each study using age, sex, race, income, and marital status. Logistic regression was used to model the relationship between PA and mortality and compute odds ratios (ORs) with 95% confidence intervals (CIs).
Results: A total of 512 (1,092) all-cause, 148 (441) cancer, and 86 (256) heart disease cases (controls) were used in the analyses with an average follow-up of 9.2 years. Those participating in MPA (OR=1.51, 95% CI: 1.21-1.89), VPA (2.35, 1.75-3.16), and MAVPA (2.60, 1.73-3.90) at baseline were more likely to have survived as compared to their less active counter parts. In addition, those reporting ST behavior were less likely (0.43, 0.33-0.55) to have survived as compared to their less sedentary counterparts. Marginally different findings were observed for the cancer and heart disease analyses.
Conclusion: Results from this study support the use of PSM with a large mortality linked survey. Furthermore, moderate and vigorous PA and less sedentary behavior were related to increased longevity in U.S. adults.
Keywords: Propensity Score Matching (PSM); Epidemiology; Mortality and Physical Activity
Introduction
Physical activity (PA) is a preventive health behavior that protects against chronic disease as well as premature mortality [1]. Current U.S. guidelines for PA recommend all adults accumulate 150+ minutes each week of moderate-intensity PA [2]. A recent study examined the relationship between increasing amounts of PA and all-cause mortality in a large sample of women, where PA was assessed objectively using accelerometers [3]. Results from this study showed that both total PA amount (counts/day) and moderate-to-vigorous PA amount (min/day) were inversely related to all-cause mortality risk in women. Another recent study examined the association between PA and all-cause mortality among elderly Hong Kong adults [4]. Results from this study showed that PA, assessed by an age-specific PA scale, was inversely related to all-cause mortality. Moreover, the researchers in this study showed that the PA and mortality relationship remained after controlling for cardio respiratory fitness. Similar inverse associations between PA and mortality have been reported for heart disease [5-7] and cancer-related [8-11] mortality (Table 1).
Note: p-values are from chi-square tests of independence.
Case-control studies are retrospective observational studies that are commonly used in epidemiological research [12]. In a case-control study, "cases" are first selected based on the presence of a specific disease while "controls" without the disease are then selected so as to match the cases on certain characteristics (i.e., age and sex) [13]. Historical exposures (i.e., risk factors) are assessed and then compared between the cases and controls to determine if associations exist. Case-control studies are considered "retrospective" because they begin with the outcomes and look back (historically) to the causes. For example, in a Fujian population, a case-control study was employed to determine if PA was associated with lung cancer [14]. In this study, cases were those with newly diagnosed lung cancer and controls were age- and sex-matched individuals without lung cancer. Participants reported their past two-year occupational as well as recreational PA. Results from this study showed that occupational PA was associated with an increased risk of lung cancer whereas recreational PA was associated with decreased risk of lung cancer. PA behavior has been examined historically in many case-control studies concerning such outcomes as cancer [15-17], stroke [18], diabetes [19], chronic obstructive pulmonary disease [20], and liver disease [21].
By design, large cross-sectional surveys do not have the data needed to be used in case-control studies. However, some U.S. health surveys do have follow-up probability-matched mortality status associated with their participants [22]. These linked datasets could then be used in case-control studies, where mortality status can provide information for case and control assignment and initial survey responses can serve as historical exposure information. A final case-control design feature, however, is the matching of controls to identified cases in these large datasets (Table 2). Propensity score matching (PSM) is a statistical technique that can match individuals based on their predicted probability relative to specified characteristics [23]. Thus, using large national health datasets with identified cases (e.g., death), controls can be selected using PSM so that they are similar to cases on all relevant covariates (e.g., age, sex, etc.) [24]. Therefore, the purpose of this study was to use PSM to investigate the relationship between historical PA and mortality in a large national survey.
Note: Historical PA was assessed in 2001-02. MOVPA is moderate or moderate and vigorous PA, ST is sedentary based on screen time.
Methods
Participants and design
Data for this research came from the 2001-2002 National Health and Nutrition Examination Survey (NHANES) along with its associated mortality file [25]. NHANES is a continuous survey created to assess health and nutrition status and behavior in the U.S. population. NHANES data are organized by category: demographics, dietary, examination, laboratory, questionnaire,vigorous PA, MPA is moderate PA, VPA is vigorous PA, MAVPA is and limited access. NHANES data are unique because of the physical examination (by health professionals) and clinical laboratory data components. Mortality data are matched to NHANES participants by the National Center for Health Statistics (NCHS) with the most recent mortality follow-up ending this past December 31, 2011. For this study, a total of 1,604 participants who were 18+ years of age, who answered all relevant questions, and who were eligible for mortality linkage were included (Table 3).
Note: OR in bold are significantly different from 1. MOVPA is moderate or vigorous PA, MPA is moderate PA, VPA is vigorous PA, MAVPA is moderate and vigorous PA, ST is sedentary based on screen time. Unadjusted model show crude odd ratios (ORs). Adjusted I shows ORs adjusted for body mass index (BMI). Adjusted II shows ORs fully adjusted for age, sex, race/ethnicity, income, and BMI.
Measures
Five different independent variables were used in this study: moderate or vigorous PA (MOVPA), moderate PA only (MPA), vigorous PA only (VPA), moderate and vigorous PA (MAVPA), and sedentary (ST) behavior. The four activity variables (MOVPA, MPA, VPA, and MAVPA) were determined from a series of questions asking respondents if they participated in moderate or vigorous activities [26]. Moderate-intensity activity was assessed by the following question "Over the past 30 days, did you do moderate activities for at least 10 minutes that cause only light sweating or a slight to moderate increase in breathing or heart rate? Some examples are brisk walking, bicycling for pleasure, golf, and dancing." Vigorous-intensity activity was assessed by the following question "Over the past 30 days, did you do any vigorous activities for at least 10 minutes that caused heavy sweating, or large increases in breathing or heart rate? Some examples are running, lap swimming, aerobics classes or fast bicycling." Those respondents answering "yes" to either question above were considered participating in MOVPA. Participants answering "yes" to only the first or only the second question above were considered participating in MPA and VPA, respectively. Finally, participants answering "yes" to both questions above were considered participating in MAVPA. ST behavior was assessed by the following question: "Over the past 30 days, on a typical day how much time altogether did you spend on a typical day sitting and watching TV or videos or using a computer outside of work?" Participants reporting 5+ hours per day were considered sedentary. Three different mortality status variables were used: all-cause (case/control), coronary heart disease (CHD) (case/control), and cancer (case/control). Finally, five covariates were used for PSM: age, sex, race, marital status, and income.
Statistical Analysis
Three separate PSM studies were carried out: all-cause mortality, cancer mortality, and CHD mortality. PSM required four distinct steps. Step 1 consisted of merging the NHANES survey dataset with the mortality linked file. Step 2 consisted of removing participants with any missing relevant data. Step 3 consisted of the creation of the three separate mortality datasets. Step 4 consisted of the PSM. Controls were propensity score matched to cases in each study using age, sex, race, income, and marital status. SPSS v24 was used for all PSM [27]. Logistic regression was used to model the relationship between PA and mortality and compute odds ratios (ORs) with 95% confidence intervals (CIs). SAS version 9.4 was used for all statistical modelling [28,29]. All significance levels were set to p=.05.
Results
Table 1 displays the socio demographic comparison between cases and matched controls across all-cause, CHD, and cancer studies. A total of 512 (1,092) all-cause, 148 (441) cancer, and 86 (256) heart disease cases (controls) were used in the analyses with an average follow-up of 9.2 years. After PSM, cases and controls appeared similar on most socio demographic variables (age, sex, race, and income) across all three studies. With only the exception of age in the all-cause mortality matching, where there was a tendency toward more older (70+ years) cases (52.2%) as compared to controls (33.2%, p<.001).
Table 2 displays sample-specific estimates of PA behavior among participants that experienced mortality. PA differences among those that experienced mortality were evident, with only 6.5% of adults dying from all-causes historically reporting MAVPA, as compared to 93.5% (p<.001) not reporting MAVPA. This difference was narrowly consistent across CHD and cancer- related mortality studies. Similarly, only 14.0% of adults dying from all-causes historically reported VPA, as compared to 86.0% (p<.001) not reporting VPA. As well, these differences were narrowly consistent across CHD and cancer-related mortality studies. In terms of ST behavior, fewer participants dying from all-causes were ST (30.8%) than not (69.2%) (p<.001). More noteworthy, ST behavior was more prevalent among those dying of CHD (37.5%) as compared to those dying of cancer (22.3%).
Table 3 displays the odds of all-cause, CHD, and cancer survival among adults historically reporting various forms of PA behavior. Adults reporting MOVPA (OR=1.80, 1.44-2.24), MPA (OR=1.51, 1.21-1.89), VPA (2.35, 1.75-3.16), and MAVPA (2.60, 1.73-3.90) at baseline were more likely to have survived as compared to their less active counterparts. In addition, those assessed as ST were less likely (0R=0.43, 0.33-0.55) to have survived as compared to their less ST counterparts. Similar findings were observed for the CHD study, with the exception that only MOVPA, VPA, and ST behaviours were related to survival. Finally, all PA measures were significantly related to cancer-related survival with exception of ST behavior.
Discussion
The purpose of this study was to first use PSM to individually match control group participants to case (experienced mortality) participants in the NHANES dataset. This part of the study was adequately performed, as seen by a relatively even socio demographic match between cases and controls.PSM has been used successfully by other researchers to aid in creating case- control designs in survey data [30-32]. An additional purpose of this study was to investigate the relationship between historical PA and all-cause, CHD, and cancer-related mortality. Results clearly showed that adults participating in various forms of PA at baseline were more likely to have survived to follow- up. In the all-cause mortality analysis, all forms of PA as well as ST behavior were related to survival. As well, these findings remained significant after controlling for confounding variables. The strongest survival relationship appeared related to MAVPA. This finding is consistent with other research. One recent study among Swedish adults used accelerometer-measured PA to examine its relationship with all-cause, CHD, and cancer mortality [33]. Results of this study showed an inverse relationship with time spent in moderate-to-vigorous PA and survival time, across all three mortality scenarios. Additionally, results of this study found that the most ST adults were at greatest risk for mortality, in all three mortality scenarios.
In the CHD mortality analysis, VPA was the strongest predictor of survival, with MPA showing unrelated to survival. Data supporting this specific finding are sparse. However, one study among older men examined the relationship between self-reported PA and CHD events [34]. This study found an inverse relationship between PA intensity (occasional PA, light PA, moderate PA, and vigorous groups) and risk of CHD event, with the vigorous-intensity group experiencing the lowest risk of these events. Finally, in the cancer mortality analysis, all forms of PA behavior were associated with survival, except for ST behavior. Albeit, ST was a suggestive predictor of survival, it was not strong enough to detect significance. This finding is consistent with other research, where PA measures have been found associated with cancer mortality despite a lack of association with ST behavior [35].
This study has limitations worth discussing. The first limitation is the use of self-reported PA behavior at baseline, as opposed to the use of a more objective method (e.g., accelerometers). In self-reported PA scenarios, participants must recall their activity, allowing for inaccurate recollection as well as inaccurate assessment of duration and intensity. In fact, many studies have shown the overestimation of PA and underestimation of ST behavior when assessed by questionnaire [36-38]. However, the current study used a lower-bound criteria for PA, where adults were considered active (or not active) at a given intensity merely by acknowledging (or not acknowledging) a 10-minute minimum participation at the given intensity. Therefore, the current study was not concerned with measuring specific PA amounts (i.e., min/day) at specific intensities. Thus, the bias in our findings should be minimal in comparison to other studies that use self-reported PA measures.
Another limitation worth mentioning is the use of historical PA as main exposure variables (risk factors) in this study Specifically, this study had no means of assessing changes in PA across the observational period. Therefore, it is possible that some adults in the sample were physically active (or inactive) at baseline assessment and then became inactive (or active) before the end of mortality follow-up. This inability to assess changes in participant PA may have caused a certain amount of un measurable exposure misclassification. Consequently, the findings in this study may suffer from a certain amount of bias and should then be interpreted with caution.
Conclusion
Results from this study support the use of PSM in developing a case-control study with large mortality linked data. Furthermore, moderate and vigorous PA and less sedentary behavior were related to increased longevity in U.S. adults. Additionally, vigorous PA and less sedentary behavior were related to CHD survival whereas all forms of PA behavior except sedentariness were related to cancer survival. Health promotion specialists should consider different types of PA behavior when planning interventions to increase longevity in adults.
Acknowledgement
No financial assistance was used to assist with this project.
References
- Healthy people (2020) US Department of Health and Human Services. Office of Disease Prevention and Health Promotion, USA.
- Physical Activity Guidelines Advisory Committee (2008) Washington DC: US Department of Health and Human Services, USA.
- Lee IM, Shiroma EJ, Evenson KR, Kamada M, La Croix AZ, et al. (2017) Accelerometer-Measured Physical Activity and Sedentary Behavior in Relation to All-Cause Mortality: The Women's Health Study. Circulation
- Lu Z, Woo J, K wok T (2017) The Effect of Physical Activity and Cardio respiratory Fitness on All-Cause Mortality in Hong Kong Chinese Older Adults. J Gerontol A Biol 2(6).
- Williams ED, Stamatakis E, Chandola T, Hamer M (2011) Physical activity behaviour and coronary heart disease mortality among South Asian people in the UK: an observational longitudinal study. Heart 97(8): 655-659.
- Schnohr P, Lange P, Scharling H, Jensen JS (2006) Long-term physical activity in leisure time and mortality from coronary heart disease, stroke, respiratory diseases, and cancer. The Copenhagen City Heart Study. European Journal of Cardiovascular Prevention & Rehabilitation 13(2): 173-179.
- Stewart RA, Held C, Hadziosmanovic N, Armstrong PW, Cannon CP, et al. (2017) Physical activity and mortality in patients with stable coronary heart disease. Journal of the American College of Cardiology 70(14): 1689-1700.
- Li Y, Gu M, Jing F, Cai S, Bao C, et al. (2016) Association between physical activity and all cancer mortality: Dose-response meta-analysis of cohort studies. International journal of cancer 138(4): 818-832.
- O Donovan G, Lee IM, Hamer M, Stamatakis E (2017) Association of "weekend warrior” and other leisure time physical activity patterns with risks for all-cause, cardiovascular disease, and cancer mortality. JAMA internal medicine 177(3): 335-342.
- Lee JY, Ryu S, Cheong E, Sung KC (2016) Association of physical activity and inflammation with all-cause, cardiovascular-related, and cancer- related mortality. In Mayo Clinic Proceedings 91(12): 1706-1716.
- Mok Y, Jeon C, Lee GJ, Jee SH (2016) Physical Activity Level and Colorectal Cancer Mortality: The Korean Metabolic Syndrome Mortality Study. Asia Pacific Journal of Public Health 28(7): 638-647.
- Dishman R, Heath G, Lee IM (2012) Physical activity epidemiology 2nd Edn. Human Kinetics
- Last JM, Abramson JH, Freidman GD (2001) A dictionary of epidemiology. New York: Oxford university press, USA.
- He F, Chen LM, Xiong WM, Xu QP, Xiao RD, et al. (2017) A case-control study of the association between self-reported occupational and recreational physical activity and lung cancer. Medicine 96(36): 79197923.
- Reulen RC, De Vogel S, Zhong W, Zhong Z, Xie LP, et al. (2017) Physical activity and risk of prostate and bladder cancer in China: The South and East China case-control study on prostate and bladder cancer. PloS one 12(6): 611-613.
- Yen SH, Knight A, Krishna MB, Muda WM, Rufai AA (2016) Lifetime Physical Activity and Breast Cancer: a Case-Control Study in Kelantan, Malaysia. Asian Pacific Journal of Cancer Prevention 17(8): 4083-4088.
- Golshiri P, Rasooli S, Emami M, Najimi A (2016) Effects of Physical Activity on Risk of Colorectal Cancer: A Case-control Study. International journal of preventive medicine 7: 11-32.
- Liang W, Lee AH, Binns CW, Zhou Q, Huang R, et al. (2009) Habitual physical activity reduces the risk of ischemic stroke: a case-control study in southern china. Cerebro vascular Diseases 28(5): 454-459.
- Wang L, Yamaguchi T, Yoshimine T, Katagiri A, Shirogane K, Ohashi Y (2002) A case-control study of risk factors for development of type 2 diabetes: emphasis on physical activity. Journal of epidemiology 12(6): 424-430.
- Hirayama F, Lee AH, Hiramatsu T (2010) Life-long physical activity involvement reduces the risk of chronic obstructive pulmonary disease: a case-control study in Japan. Journal of Physical Activity and Health 7(5): 622-626.
- Gibson PS, Lang S, Gilbert M, Kamat D, Bansal S, et al. (2015) Assessment of diet and physical activity in paediatric non-alcoholic fatty liver disease patients: a United Kingdom case control study. Nutrients 7(12): 9721-9733.
- CDC (2017) National Center for Health Statistics. NCHS Data Linked to NDI Mortality Files. Retrieved, USA.
- Fraeman KH, Evidera BA (2015) A General SAS® Macro to Implement Optimal N: 1 Propensity Score Matching Within a Maximum Radius.
- Parsons LS (2004) Performing a1: N case-control match on propensity score. In proceedings of the 29th Annual SAS users group international conference, SAS Institute, USA, pp. 165-229.
- National Center for Health Statistics (2013) Office of Analysis and Epidemiology. Analytic Guidelines for NCHS 2011 Linked Mortality Files, Hyattsville, Maryland, USA.
- Zipf G, Chiappa M, Porter KS (2013) National Health and Nutrition Examination Survey: Plan and operations. National Center for Health Statistics. Vital Health Stat 1(56): 1-37.
- Thoemmes F (2012) Propensity scores matching in SPSS. ArXiv preprint.
- 28. Stokes ME, Davis CS, Koch GG (2012) Categorical data analysis using SAS. SAS institute, USA.
- Cody RP, Smith JK (1985) Applied statistics and the SAS programming language. North Holland, Netherlands.
- Chun SY, Park H, Lee TH, Park EC (2015) Do long term cancer survivors have better health-promoting behavior than non-cancer populations? Case-control study in Korea. Asian Pac J Cancer Prev. 16(4): 1415-1420.
- Shetty V, Harrell L, Clague J, Murphy DA, Dye BA, et al. (2016) Methamphetamine users have increased dental disease: a propensity score analysis. Journal of dental research 95(7): 814-821.
- 32. Lee HS, Lee JH (2017) Vitamin D and Urinary Incontinence among Korean Women: a Propensity Score-matched Analysis from the 20082009 Korean National Health and Nutrition Examination Survey. Journal of Korean medical science. 32(4): 661-665.
- Dohrn M, Sjostrom M, Kwak L, Oja P, Hagstromer M (2017) Accelerometer-measured sedentary time and physical activity-A 15 year follow-up of mortality in a Swedish population-based cohort. Journal of Science and Medicine in Sport.
- Jefferis BJ, Whincup PH, Lennon LT, Papacosta O, Goya Wannamethee S (2014) Physical Activity in Older Men: Longitudinal Associations with Inflammatory and Hemostatic Biomarkers, N-Terminal ProBrain Natriuretic Peptide, and Onset of Coronary Heart Disease and Mortality. Journal of the American Geriatrics Society 62(4): 599-606.
- Wang A, Qin F, Hedlin H, Desai M, Chlebowski R, et al. (2016) Physical activity and sedentary behavior in relation to lung cancer incidence and mortality in older women: The Women's Health Initiative. International journal of cancer 139(10): 2178-2192.
- Watson ED, Micklesfield LK, van Poppel MN, Norris SA, Sattler MC, et al. (2017) Validity and responsiveness of the Global Physical Activity Questionnaire (GPAQ) in assessing physical activity during pregnancy. PloS one 12(5): 79-96.
- Menezes D, Laranjo L, Marmeleira J (2017) Criterion-related validity of the short form of the international physical activity questionnaire in adults who are Deaf. Disability and health journal 10(1): 33-38.
- Sebastiao E, Gobbi S, Chodzko Zajko W, Schwingel A, Papini CB, et al. (2012) The International Physical Activity Questionnaire-long form overestimates self-reported physical activity of Brazilian adults. Public Health 126(11): 967-975.