Behavioral Health Risk Profiles of Undergraduate University Students in England, Wales, and Northern Ireland: A Cluster Analysis

Background Limited research has explored clustering of lifestyle behavioral risk factors (BRFs) among university students. This study aimed to explore clustering of BRFs, composition of clusters, and the association of the clusters with self-rated health and perceived academic performance. Method We assessed (BRFs), namely tobacco smoking, physical inactivity, alcohol consumption, illicit drug use, unhealthy nutrition, and inadequate sleep, using a self-administered general Student Health Survey among 3,706 undergraduates at seven UK universities. Results A two-step cluster analysis generated: Cluster 1 (the high physically active and health conscious) with very high health awareness/consciousness, good nutrition, and physical activity (PA), and relatively low alcohol, tobacco, and other drug (ATOD) use. Cluster 2 (the abstinent) had very low ATOD use, high health awareness, good nutrition, and medium high PA. Cluster 3 (the moderately health conscious) included the highest regard for healthy eating, second highest fruit/vegetable consumption, and moderately high ATOD use. Cluster 4 (the risk taking) showed the highest ATOD use, were the least health conscious, least fruit consuming, and attached the least importance on eating healthy. Compared to the healthy cluster (Cluster 1), students in other clusters had lower self-rated health, and particularly, students in the risk taking cluster (Cluster 4) reported lower academic performance. These associations were stronger for men than for women. Of the four clusters, Cluster 4 had the youngest students. Conclusion Our results suggested that prevention among university students should address multiple BRFs simultaneously, with particular focus on the younger students.

Studies of the health and wellbeing of university populations revealed different extents of clustering of lifestyle BRFs across students (3,4,6,7). Among British university students, research has reported three distinctive health behavior risk profiles based on five lifestyle BRFs (4). Evidence indicates that single BRFs and especially alcohol consumption is related to poorer selfrated health and lower academic achievement of students (8). However, existing research that identified student clusters with specific health risk profiles, e.g., Ref. (3,4) did not investigate the association between belonging to risky behavior cluster/s and poor health or lower academic outcomes. Such information would be relevant, as the propensity of the collective clustering of unhealthy behaviors exponentially exacerbates the risk for comorbidity in later life (8,9).
Cluster analysis (CA) is a promising approach to assess students' health-related lifestyle characteristics in a collective manner. CA is premised on shared characteristics to categorize a given population into mutually exclusive subgroups (clusters) for which the properties or patterns within a cluster are similar to each other than they are to properties within a different cluster (10,11). Indeed, researchers have voiced that most health behavior research has adopted an approach where behaviors were studied in isolation and traditionally focused on individual risk behavior/s, in segregation from other BRFs (4,7), despite that BRFs co-exist together and are related to one another (12). Limited research has explored clustering of lifestyle BRFs; and the studies that undertook such approach rarely focused on university students. In addition, few of such studies considered a wide/diverse range of BRFs; and rarely assessed the relationships between the emerging BRFs clusters and students' self-rated health and academic performance.
To bridge these knowledge gaps and to add new insights to the limited research on clustering of BRFs, and its association with health and academic achievement of students, the current study employed a large sample of students at seven universities in three countries of the United Kingdom (England, Wales, and Northern Ireland) in order to: (1) identify and describe the clustering of five major lifestyle BRFs [health awareness, nutrition behavior, physical activity (PA), sleep, and alcohol, tobacco, and other drugs]; (2) characterize the student composition of each of the emerging clusters in terms of sociodemographics; and (3) examine the associations between the emerging BRFs clusters and students' self-rated health and academic performance.

ethics, sample, and Data collection
Ethical approval from the participating institutions (see below) was obtained prior to data collection. A self-administered general Student Health Survey collected health and well-being data through 2007-2008 (1, 3, 13, 14) during the last 10 min of the lectures. The research aims and objectives were explained in an information sheet delivered with the questionnaire to the participants. Students were informed that participation was voluntary and that by completing the questionnaire, they agreed to participate in the study. Hence, informed consent was obtained in accordance with the Declaration of Helsinki from all individual participants included in the study. Confidentiality was observed, participants were informed that the study was anonymous, no incentives were provided, and data were strictly protected at all stages. Representative sampling was achieved from each of the seven participating institutions, and an 80% response rate led to a sample of 3,706 undergraduate students (University of Chester n = 993, University of Gloucestershire n = 970, Bath Spa University n = 485, University of Ulster n = 475, Swansea University n = 406, Oxford Brookes University n = 208, and Plymouth University n = 169). Data quality assurance was optimized through centralized data entry using Teleform computer software.

Measures
Similar to other general student health and wellbeing studies (3,5,13,15), the data collection tool captured: sociodemographic information (age, gender, sufficiency of income, and type of accommodation); lifestyle features (PA, nutritional intake, restful sleep patterns, illicit drug use); self-rated health; and self-rated academic performance. The following questionnaire based on previous studies (3, 13) measured students' health behavior and lifestyle, self-rated health, and self-rated academic performance, and are included in the CA of this study.

Health Awareness/Consciousness
Health awareness/consciousness was assessed by the question "To what extent do you keep an eye on your health?" with four response options ("Not at all, " "not much, " "to some extent, " and "very much").

Nutritional Behavior
Consumption of fruits and vegetables was assessed with the question "How many servings of fruits and vegetables do you usually have per day (1 serving = 1 medium piece of fruit, 1/2 cup chopped, cooked, or canned fruits/vegetables, 3/4 cup fruit/ vegetable juice, small bowl of salad greens, or 1/2 cup dried)?" with four response options ("I don't eat fruits and vegetables, " "1-2, " "3-4, " and "5 or more" servings).
Consumption of sweets was measured by the question "How often do you eat sweets (chocolate, candy, etc.)?" with five response options ("Several times a day, " "daily, " "Several times a week, " "1-4 times a month, " and "never").
The importance of healthy eating was measured with the item "How important is it for you to eat healthy?" with five response options from "Very important" to "Not at all important. "

Physical Activity
Three forms of PA (i.e., vigorous PA, moderate PA, and muscle strengthening PA) were assessed with the following questions: "On how many of the past 7 days did you: (1) participate in vigorous exercise for ≥20 min?; (2) participate in moderate exercise for ≥30 min?; (3) do exercises to strengthen or tone your muscles, such as push-ups, sit-ups, or weight lifting?" For each form of PA, students reported the number of days for which they engaged in any such activity (ranging from 0 to 7 days).

Sleep
Sleep/rest was assessed with the question "On how many of the past 7 days did you get enough sleep so that you felt rested when you woke up in the morning?" Students reported the number of days (ranging from 0 to 7 days).
Alcohol, Tobacco, and Other Drug (ATOD) Use Alcohol (frequency) was assessed by the item "within the last 3 months, how often did you drink alcohol, e.g., beer?" with six response options "Several times/day, " "Everyday, " "Several times/ week, " "Once a week, " "Less than once/week, " and "Never. " Alcohol (binge drinking) was measured with the question "Think back over the last 2 weeks. How many times, if any, have you had five or more alcoholic drinks at a sitting?" Alcohol problem drinking was assessed using the 4 standard items that form the CAGE screening test (16) for problem alcohol use with 2 response options ("Yes, " "No"). From the total score to these items, a binary variable was formulated, where a cut-off of scores ≥2 indicated presence of "Problem drinking, " while scores <2 indicated "No problem drinking" (16).
Smoking was measured with the item "Within the last 3 months, how often did you smoke (cigarettes, pipes, cigarillos, cigars)?" with response options "Daily, " "Occasionally, " and "Never. " Illicit drug (ecstasy, marijuana, cocaine, heroin, crack, LSD, amphetamines) was assessed by the question "Have you ever use/ used drugs?" with response options "Yes, regularly, " "Yes but only a few times, " "Never. "

Self-Rated Health
Self-rated health was assessed by asking "How would you describe your general health?" with five response options "Excellent, " "Very good, " "Good, " "Fair, " and "Poor. "

Self-Rated Academic Performance
Self-rated academic performance was measured by the item "How do you rate your performance in comparison with your fellow students?" There were five response options: "Much better, " "Better, " "The same, " "Worse, " and "Much worse. "

statistical analyses
We undertook a two-step CA (11) based on 13 lifestyle BRFs (8 categorical, 5 continuous) using SPSS v23.0. Two-step CA combines pre-clustering and hierarchical methods to identify groupings that differ on criterion variables within a data set. This method is suitable for large datasets and can handle scale and ordinal data in the same model (16). In our clustering algorithm, we utilized a two-step procedure with a hierarchical clustering method, i.e., the Schwarz's Bayesian Criterion to automatically determine the number of clusters. We employed two distance measures namely, log-likelihood (for categorical variables) and Euclidean (for continuous variables) (11). Within each cluster, we computed the percentage (%) for specific categories of lifestyle behavioral factors. Uniform categories were utilized across each cluster per behavioral factor to ensure accurate comparability of outcomes. For lifestyle behavioral variables in continuous data format, for example days per week of PA, the mean was presented for each cluster. We conducted Chi-square tests together with Cramer's V test (17) to identify differences between the clusters in terms of sociodemographic characteristics and categorical BRFs. Analysis of variance (ANOVA) with post hoc pairwise comparisons using the Bonferroni method (18) was used to assess the significance of differences in BRFs that were in continuous variable format among the clusters. The ordinal regression (in statistical package STATA 14) examined the association between cluster membership (main exposure variable) and two dependent variables, namely: (1) students' self-rated health and (2) students' self-rated academic performance. Missing data in the original sample were handled through multiple imputation for non-response (19). We performed 20 imputations in SPSS v23.0 and utilized a complete sample of the twentieth imputation as basis for our analysis and results reported in this article.

clustering of lifestyle BrFs among students
Cluster analysis generated 4 clusters (Tables 2 and 3). Clusters 1 and 2 were of almost even size (ratio of largest cluster to smallest = 1.1) and approximately twice the size of Clusters 3 and 4. As depicted in Table 2, the clusters differed significantly by gender. The percentages of female students were highest in Clusters 1 and 4 and lowest in Cluster 3. All gender differences between clusters were significant (χ 2 tests, p < 0.001) except for the comparison between Clusters 1 and 3.
Further, the clusters differed significantly by type of student accommodation during the academic terms (χ 2 = 122.4, p < 0.001, Cramer's Phi = 0.109). All differences by type of accommodation were significant (χ 2 , p < 0.001) except for between Clusters 1 and 3.
Finally, all the clusters differed significantly by students' mean age (F = 45.9, p < 0.001) whereby; Cluster 4 had the youngest student sample of 22.6 (SD 6.4) years, while Cluster 1 students exhibited the highest mean age (27, SD 10.0). Table 3 provides a summary of the BRFs characteristics of the student clusters.

Cluster 1 (The High Physically Active and Health Conscious)
These students had very high health awareness/consciousness, high regard for healthy eating, and the highest fruit/vegetable consumption among all the four clusters. They were also the most

Cluster 2 (The Abstinent)
These students had the least ATOD use compared to other clusters. Only 0.2% of this cluster had been binge drunk in the    previous 2 weeks, and only 16.8% screened positive for problem drinking (CAGE test). This cluster also had the second highest percentage of nonsmokers (78.4%) within the last 3 months, and highest proportion (77.2%) of life-time never drug users. The cluster comprised highly health conscious students who regarded healthy eating as highly important, and their fruit/vegetable consumption was third highest (though much lower than cluster 1). Sweets consumption was medium high, and cluster members had a medium high mean of 2.7 days of adequate sleep per week.

Cluster 3 (The Moderately Health Conscious)
This cluster comprised students with the highest regard for healthy eating, second highest fruit/vegetable consumption (but much lower than cluster 1), and their consumption of sweets was lowest among all the clusters. Their ATOD use was lower than that of Cluster 4 and Cluster 1, and on average, slept adequately for 2.9 days per week. Their PA level was not particularly high and similar to that of Clusters 2 and 4 students.

Cluster 4 (The Risk Taking)
This cluster included students with the highest ATOD use among the 4 clusters. Likewise, these students were the least health conscious, attached the least importance to eating healthy, had the least daily fruit intake and highest intake of sweets. Although Cluster 4 members slept adequately for the least number of days (2.1), however, their PA level was similar to that of Clusters 2 and 3. Chi square and ANOVA results ( Table 3) indicated statistically significant differences between the clusters based on students' lifestyle BRFs, with the majority of the differences across the clusters being highly statistically significant, i.e., p < 0.001. association Between cluster Type and self-rated health Students self-rated their health as good (41.8%), very good (38.5%), excellent (9%), fair (9.2%), or poor (1.4%). Ordinal regression examined the association between students' self-rated health and cluster type. We found an interaction effect (χ 2 = 247.7, p < 0.001) between gender and the main exposure variable (cluster type). Therefore, we stratified the analysis by gender ( Table 4). For both genders, the odds for higher self-rated health level were lower for Cluster 2, Cluster 3, and Cluster 4 compared to Cluster 1. All associations were highly statistically significant (p < 0.001).

association Between cluster Type and self-rated academic Performance
Participants rated their academic performance compared to their peers as "the same" (65.5%), "worse" (16.9%), "much better" (1.5%), "better" (14.1%), or "much worse" (1.9%). We found an interaction effect (χ 2 = 93.47, p < 0.001) between gender and the main exposure variable (cluster type). Therefore, we stratified the analysis by gender ( Table 5). Ordinal regression showed ( Table 5) that compared to Cluster 1 as reference, Clusters 2 and 4 were associated with lower odds of higher self-rated academic performance among female students. Among the male students, compared to Cluster 1 as reference, Clusters 3 and 4 were both associated with lower odds for higher self-rated academic performance.

DiscUssiOn
We identified and described the clustering of five major health lifestyle BRFs, that included the "big four" modifiable health behaviors (ATOD, nutrition, PA, and sleep); and examined the associations of the resulting clusters with students' self-rated health and academic performance. Multiple BRFs were prevalent in a significant proportion of our undergraduates, denoting the coexistence of health damaging lifestyle characteristics.
Our main findings include: (a) less healthy cluster membership was associated with lower self-rated health; (b) less healthy cluster membership was associated with lower academic performance; and (c) there were subgroups of students with particularly high risk for a certain BRF and for a combination of multiple BRFs. Specifically, our findings indicated that student groups with a clustering of BRFs such as "the risk taking" or "the abstinent" and even "the moderately health conscious" exhibited lower self-rated health than "the physically active and health conscious" cluster. In addition, students in these clusters reported lower academic performance, although some effects were found only for male or for female students. This implied that certain clustering of BRFs does not only diminish the health of students, but may also affect their academic achievement negatively. Young students, financially deprived students and those living with roommates or not alone were most likely to belong to these groups with higher risk profile. Therefore, universities need to be aware that large proportions of their student population may practice harmful lifestyle behaviors. Accordingly, universities need to develop targeted interventions, e.g., programs that specifically address ATOD which tends to be high in combination with poor eating habits and poor sleep among certain groups of younger students.
The current survey found distinct lifestyle behavior patterns among undergraduates. Due to different methodologies, sample selection, different BRFs examined, and the scarcity of studies that explored BRFs clustering among university students, we could compare our results only to a certain extent with other research. Generally, our findings are in line with other studies. For instance, our results are supported by research among young people (15-21 years and older) in Switzerland, where multiple risk factors were observed in a substantial proportion of young people (12). Likewise, our four clusters of BRFs resonate with an Irish study among the general population, where research of health behavior clustering in a nationally representative adult sample observed six clusters (20). Likewise, in China, a two-step CA identified three health-related lifestyle clusters (21).
In terms of university students, other research also supported our cluster findings. For instance, among young female students in the USA, where the majority of women had more than two unhealthy behaviors and a CA defined three distinct clusters (22); or among undergraduates at a university in the UK, which found three distinct student clusters based upon PA, fruit/vegetable intake, binge drinking, and smoking (4).
There is a dearth of studies on the mechanisms explaining clustering of multiple risk behaviors, even though these behaviors are significant public health issues. Some authors observed "transfer" effects (e.g., nonsmokers consume less alcohol, regularly active people smoke less, nondrinkers smoke less); as well as "compensation" effects (e.g., regularly active people consume alcohol more frequently; alcohol drinkers are more active) (23). Other research, albeit applied to one risk behavior (smoking) rather than to multiple BRFs, has highlighted the role of psychosocial and behavioral protection and risk factors (24) and has shown that such factors, theoretically derived from the constructs in problem-behavior theory (25), were associated with the initiation of smoking in a longitudinal study (24). We suggest that a framework of protective and risk factors might be helpful in exploring college multiple BRFs. Risk factors that stimulate several risk behaviors simultaneously could be social pressures to use ATOD, to consume unhealthy food and to neglect sleep. More generic protective factors, as derived from problem-behavior theory, could be values and expectations for academic achievement or support and control from parents, friends, or partners (25).
This study has some limitations. It is cross-sectional and generalizations of the findings need caution. Self-reported data could have imprecisions (recall bias, social desirability, and sociability). Students were recruited during lessons/lectures, hence those not present in the class at the time of data collection, were not included in the survey. Some variables were measured by single items due to respondent burden, and that the study was a general student health survey undertaken within a short duration during lectures. This rendered the measurement of BRFs with more items unworkable. We did not assess serving sizes in the questions on nutrition. The extent of observed clustering of more or less favorable variables might be subject to many features (usually not measured) that would confound such complex and intricately associated constellations of BRFs relationships. Such confounders are usually challenging to unpack, let alone attribute to certain aspects of the university, region, country, or participating individuals (7). Future research should attempt to address these limitations. Nevertheless, the current research is a pioneer in examining a wider range of key BRFs across a large sample of undergraduates in three UK countries; and, it is one of the few studies that systematically considered self-rated health and academic achievement of the participants.
We conclude that the identification of subgroups of young people with a high prevalence of one or more of these risk factors allows for an optimization of the allocation of preventive measures. The identification of distinct clusters may aid in uncovering specific groups with higher risk behavior profiles. The clustering of risk factors provides support for multiplebehavior interventions. Our study calls for drug prevention campaigns and other preventive activities targeted at certain sub-groups of students who are likely to combine PA with alcohol and other drug use.

eThics sTaTeMenT
This study was carried out in accordance with the recommendations of the ethics committee of the University of Gloucestershire with informed consent from all subjects. All subjects gave informed consent in accordance with the Declaration of Helsinki. The protocol was approved by the ethics committees of the participating institutions, namely the University of Chester University of Gloucestershire, Bath Spa University, University of Ulster, Swansea University, Oxford Brookes University, and Plymouth University.
aUThOr cOnTriBUTiOns WA has conceptualized the study and is responsible for the data acquisition, interpretation of results, and drafted the manuscript. DS is responsible for the data analysis and contributed to the drafting of the manuscript. CS participated in the conceptualization of the study, the interpretation of results, and drafting of the manuscript. All authors have approved the final version of the manuscript.

FUnDing
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.