- Research article
- Open Access
Young adult cancer risk behaviours originate in adolescence: a longitudinal analysis using ALSPAC, a UK birth cohort study
BMC Cancer volume 21, Article number: 365 (2021)
An estimated 40% of cancer cases in the UK in 2015 were attributable to cancer risk behaviours. Tobacco smoking, alcohol consumption, obesity, and unprotected sexual intercourse are known causes of cancer and there is strong evidence that physical inactivity is associated with cancer. These cancer risk behaviours co-occur however little is known about how they pattern longitudinally across adolescence and early adulthood. Using data from ALSPAC, a prospective population-based UK birth cohort study, we explored patterns of adolescent cancer risk behaviours and their associations with cancer risk behaviours in early adulthood.
Six thousand three hundred fifty-one people (46.0% of ALSPAC participants) provided data on all cancer risk behaviours at one time during adolescence, 1951 provided data on all cancer risk behaviours at all time points. Our exposure measure was quartiles of a continuous score summarising cumulative exposure to cancer risk behaviours and longitudinal latent classes summarising distinct categories of adolescents exhibiting similar patterns of behaviours, between age 11 and 18 years. Using both exposure measures, odds of harmful drinking (Alcohol Use Disorders Identification Test-C ≥ 8),daily tobacco smoking, nicotine dependence (Fagerström test ≥4), obesity (BMI ≥30), high waist circumference (females: ≥80 cm and males: ≥94 cm, and high waist-hip ratio (females: ≥0.85 and males: ≥1.00) at age 24 were estimated using logistic regression analysis.
We found distinct groups of adolescents characterised by consistently high and consistently low engagement in cancer risk behaviours. After adjustment, adolescents in the top quartile had greater odds of all outcomes in early adulthood: nicotine dependency (odds ratio, OR = 5.37, 95% confidence interval, CI = 3.64–7.93); daily smoking (OR = 5.10, 95% CI =3.19–8.17); obesity (OR = 4.84, 95% CI = 3.33–7.03); high waist circumference (OR = 2.48, 95% CI = 1.94–3.16); harmful drinking (OR = 2.04, 95% CI = 1.57–2.65); and high waist-hip ratio (OR = 1.88, 95% CI = 1.30–2.71), compared to the bottom quartile. In latent class analysis, adolescents characterised by consistently high-risk behaviours throughout adolescence were at higher risk of all cancer risk behaviours at age 24, except harmful drinking.
Exposure to adolescent cancer risk behaviours greatly increased the odds of cancer risk behaviours in early adulthood. Interventions to reduce these behaviours should target multiple rather than single risk behaviours and should focus on adolescence.
Cancer is one of the leading causes of death in the UK  and has superseded cardiovascular disease as the leading cause of death in high income countries . Tobacco smoking, [3, 4] alcohol consumption, [3, 5,6,7,8] obesity, [3, 5, 9,10,11] and unprotected sexual intercourse [3, 12] are known causes of cancer. There is also strong evidence that physical inactivity is associated with cancer incidence, [5, 13] with emerging findings indicating a causal association between physical inactivity and prostate, colorectal and breast cancers [14,15,16]. These exposures are also known predictors of other adverse health outcomes and a significant burden to the NHS. In 2017–18, there were an estimated 489,300 hospital admissions and 77,800 deaths attributable to tobacco smoking, representing 16% of all deaths in the UK  and 10,660 hospital admissions with a primary diagnosis of obesity . In 2018/19 there were 358,000 estimated admissions to hospital and 5698 alcohol-specific deaths, where the main reason for admission was attributable to alcohol . While obesity is not a behaviour, for ease of description, we will refer to this group of exposures as behaviours henceforth.
Experimentation constitutes a normal part of growing up. However, these behaviours’ can become habitual and set the pattern for a life characterised by unhealthy practices . While there have been significant declines in adolescent tobacco smoking and alcohol consumption in the UK,  the prevalence of other risk behaviours remains high. Twenty-eight per cent of children aged 2 to 15 years are overweight and among them, 17% of boys and 15% of girls are obese with prevalence increasing with age . Physical inactivity and sedentary behaviour are common with only 18% of 5–16 years olds in England meeting current Chief Medical Officer guidelines of taking part in sport and physical activity for at least 60 min every day . Young people also have the highest diagnosed rates of the most common sexually transmitted infections (STIs) of all age groups .
There is evidence that these behaviours co-occur in cross-sectional data, [24,25,26,27] there is also evidence that single risk behaviours co-occur in longitudinal data, [28,29,30,31,32,33] however less is known about how this multiplicity of risk behaviours pattern longitudinally across adolescence and into early adulthood. The aim of this study was to investigate the patterns of multiple cancer risk behaviours across adolescence (age 11–18 years) and their associations with subsequent cancer risk behaviours in early adulthood (age 24 years). Our primary hypothesis was that adolescents engaged in more cancer risk behaviours across adolescence would be at greater risk of early adult cancer risk behaviours at age 24 years. We have previously shown in cross-sectional analyses that multiple risk behaviours cluster by number of behaviours rather than producing distinct risk profiles . Therefore, our secondary aim was to explore, in a longitudinal analysis, if multiple cancer risk behaviours cluster across adolescence to produce qualitatively distinct risk profiles, characterised by engagement in certain behaviours. To our knowledge this is the first study to develop longitudinal measures of multiple adolescent behaviours and explore associations with cancer risk outcomes in early adulthood.
Aims of the study
Using longitudinal data from the Avon Longitudinal Study of Parents and Children (ALSPAC), an ongoing prospective observational population-based birth cohort study the aims of this study were: (i) to investigate the patterns of multiple cancer risk behaviours across adolescence (age 11–18 years) using both quartiles of a continuous score summarising cumulative exposure and longitudinal latent class analysis; and (ii) to explore whether and how these patterns are associated with subsequent cancer risk behaviours in early adulthood (age 24 years).
Design & setting of the study
Data were drawn from ALSPAC, an ongoing prospective observational population-based birth cohort study investigating the effects of a wide range of influences on health and development across the life course [34, 35]. Pregnant women, resident in Avon, UK and with expected dates of delivery 1st April 1991 to 31st December 1992 were invited to take part in the study. The initial number of pregnancies enrolled was 14,541 (for these at least one questionnaire has been returned or a “Children in Focus” clinic had been attended by 19/07/99). Of these initial pregnancies, there was a total of 14,676 foetuses, resulting in 14,062 live births and 13,988 children who were alive at 1 year of age. Details of all available questionnaires and data can be found through a searchable data dictionary (http://www.bristol.ac.uk/alspac/researchers/our-data/). Ethical approval for the study was obtained from the ALSPAC Law and Ethics Committee and local Research Ethics Committees. Informed consent for the use of data collected via questionnaires and clinics was obtained from participants following the recommendations of the ALSPAC Ethics and Law Committee at the time.
Exposure measure - adolescent cancer risk behaviours
We used repeated measures of tobacco smoking, alcohol consumption, obesity, sexual risk and physical inactivity at ages ~ 11, ~ 14, ~ 16 and ~ 18 years (see Table 1). We include both physical inactivity and obesity as exposures because although physical activity may lie on the causal pathway to obesity,  there is strong evidence that both exposures have their own unconfounded, causal effect on cancer outcomes [14,15,16, 37]. Self-completed questionnaires issued during clinics, self-completed responses to postal questionnaires and parent or carer report questionnaire data were used to derive these measures. Details about the risk thresholds can be found in Supplementary Material 1. Our focus in this research and more widely, is adolescent multiple risk behaviours [27, 38,39,40]. We chose these exposures a priori, owing to their known effect on cancer incidence and mortality and while the outcomes at age 24 are illustrative of longer-term trajectories, the real focus of this work is in identifying the patterns of risk behaviour across adolescence in order to identify intervention strategies. We therefore have not excluded any of these behaviours from the exposure measure because they are under-reported outcomes at age 24 years.
Outcome measures – early adult cancer risk
The early adult outcome measures are, where possible, more severe presentations of the adolescent cancer risk behaviours. For example, the adolescent smoking exposure ranges from ever smoked to weekly smoking, whereas the early adult outcome measures were daily smoking and having nicotine dependence. General obesity, as defined by height and weight, was supplemented by measures of central obesity: high waist circumference (≥80 cm for females and ≥ 94 cm for males, and high waist-hip ratio (≥0.85 for females and ≥ 1.00 for males) at age 24 years.
Early adult cancer risk was based on measurements collected in clinics (measured height and weight to compute body mass index, waist circumference and waist-hip ratio), or responses to questionnaires (harmful drinking, daily smoking and nicotine dependence) by participants at age ~ 24 years (mean age 24 years and 6 months, SD = 9.78 months). We were unable to include measures of accelerometery measured physical inactivity, owing to low numbers with a valid minimum number of days of wear-time (only 380 participants with 3 days of data). We were unable to estimate sexual risk using data about Chlamydia incidence because perfect prediction from measures integral to the final analysis was observed in the multiple imputation model, which may bias the relation of interest . Binary indicators were derived for harmful drinking: a score of ≥8 in the Alcohol Use Disorders Identification Test-C (AUDIT-C); daily smoking; nicotine dependence (a score of ≥4 in the Fagerström test), obesity (a BMI of ≥30); high waist circumference, as defined by the National Institute of Health and Clinical Excellence (NICE) and World Health Organisation (WHO) guidelines: ≥80 cm (females) and ≥ 94 cm (males); and high waist-hip ratio (≥0.85 for females and ≥ 1.00 for males) [42, 43].
We identified potential confounders (common causes of both exposures and outcomes) that occurred before the exposure measures i.e. before age 11 years. All models were adjusted for: sex, intelligence quotient (IQ), parental socioeconomic status, adverse childhood experiences (ACEs),  maternal cannabis use, maternal harmful alcohol use, maternal smoking, child depressive symptoms (SMFQ), child total difficulties score (SDQ) and child antisocial behaviour. Models relating to the anthropometric outcomes (obesity, waist circumference and waist-hip ratio) were additionally adjusted for birthweight, gestational age, maternal obesity, maternal physical inactivity, and maternal unhealthy diet (see Supplementary Material 2 for more details of how confounder measures were derived).
We summarised exposure to our adolescent cancer risk behaviours of interest (tobacco smoking, alcohol consumption, obesity, unprotected sexual intercourse, and physical inactivity) in two ways. We assigned each participant a score of one (risk present) or zero (risk not present) for each of the five risk behaviours at each of the four time points. Using the total risk score at each time point we then calculated a cumulative continuous score, summarising exposure to the five risk behaviours across adolescence and expressed the score as the area under the curve. This was done by summing the product of the total number of risks and the time interval, at four time points between ages ~ 11 and ~ 18 years (detailed explanations outlining the methods used to calculate this measure can be found in Supplementary Material 3). Second, using the same data, we derived longitudinal latent growth curves to explore whether the same behaviours cluster to produce qualitatively distinct risk profiles (over and above the cumulative continuous score). The processes used to derive the latent classes are described in more detail in Supplementary Material 4.
We explored the patterning of adolescent cancer risk behaviours, using quartiles of a continuous score summarising cumulative exposure to provide a comparative measure for the latent classes. We compared models with between 2 and 7 classes using both complete case and imputed data (see below for imputation method). The optimum model, as determined by the lowest Bayesian information criterion (BIC), was a 6-class latent class growth analysis, for both the imputation and complete case samples. These models produce a class-assignment probability indicating the confidence with which each participant can be allocated to a specific latent class. Entropy summarises this information as a single measure ranging from zero to one (one indicating absolute certainty that individuals have been assigned to the correct class). We have additionally provided analyses of the four-class solution in supplementary materials 7, 8 and 9, which despite having a higher BIC, provides a useful comparator for the quartiles.
Logistic regression analysis was used to examine prospective associations between quartiles of the continuous score and early adult cancer risk behaviours at age 24 years. We ran unadjusted models for all outcomes, including only the exposure and outcome measures followed by a sequence of adjusted models, which additionally controlled for: (i) partially adjusted: sex, IQ, parental socioeconomic status and adverse childhood experiences (ACEs); and (ii) fully adjusted: maternal cannabis use, maternal harmful alcohol use, maternal smoking, child depressive symptoms (SMFQ), child total difficulties score (SDQ) and child antisocial behaviour. Models for obesity, waist circumference, and waist-hip ratio outcomes were additionally adjusted for birthweight, gestational age, maternal obesity, maternal physical inactivity, and maternal unhealthy diet.
Data on all exposures at one time point were available for 6351 (46.0%) of ALSPAC participants. As shown in Table 2, among this sample, data were complete for the sex variable, near complete for the confounder measures (for example, housing tenure n = 6169, 97.1%); available for ~ 50% of the sample for the outcome measures (obesity n = 3202, 50.4%); and available for 1951 (30.7%) for the exposure measure. In our primary analysis, multiple imputation was used to account for missing data (see below). In sensitivity analyses, we investigated associations for each 24-year outcome on complete case samples i.e. those with no missing data on any of exposure, outcome or confounder measures. The flow diagram for deriving the sample can be found in Supplementary Material 5. The subsample of ALSPAC participants selected for imputation is not a random sample: they are more likely to be female and less likely to be from the lowest income quintile; to be living in privately or subsidised rental property; to have a mother with fewer educational qualifications and have lower parental social class (p < 0.001). Please see Supplementary Material 6 which provides a comparison of the imputation sample and those excluded from the analysis by key demographic variables, where we also provide a detailed discussion about why we do not think this leads to selection bias. Multivariate imputation by chained equations was carried out using the ‘ice’ routine in Stata. This approach is based on the missing at random (MAR) assumption, i.e. that any differences between the missing and observed values, can be explained by differences in the observed data . All variables used in the analyses, including the outcome measures, exposure measures and confounders were included in the imputation model, along with alternative measures that had been collected at different times. These were included as auxiliary variables to reduce bias by improving the precision of the imputation model. Monte Carlo errors were used to compare the results obtained when imputing 25, 100 and 200 data sets. Imputed results shown have been pooled across the 200 data sets, having satisfied White et al.’s rules of thumb for the number of imputations . All analysis was conducted using Stata version 15  and Mplus version 8 .
Table 2 outlines the descriptive statistics for the complete case sample and the imputed data sample used in this research. For the complete case sample, high waist circumference with a prevalence of 411 (28.3%) was the most frequently occurring outcome at age 24 years. This was followed by harmful alcohol 273 (18.9%), obesity 183 (12.6%), daily smoking 121 (8.3%), high waist-to-hip ratio 114 (7.9%), and nicotine dependence 50 (3.5%).
Patterning of adolescent cancer risk behaviours
Examination of the quartiles of a continuous score summarising cumulative exposure to adolescent cancer risk shown in Fig. 1, revealed that adolescents in the top 25% of participants, (Q1: shown with the solid black line) consistently had the highest chance of exhibiting all adolescent risk behaviours at each of the four time points, indicating that there is a group of young people who consistently undertake multiple cancer risk behaviours across adolescence. There were six latent classes, which are described in more detail in Supplementary Materials 7 and 8. We found that quartiles of cumulative exposure were highly consistent with the latent classes. For example, 100% of those in the persistent very low-risk class belonged to the lowest quartile of the cumulative exposure measure, and vice versa, where 20% of those in the lowest quartile belonged to the persistent very low-risk class and 80% to the persistent low-risk class (see Supplementary Material 9).
However, while the quartiles and the latent classes were quantitatively similar, there were some qualitative differences (see Supplementary Material 7). For example, although the persistent-low and low-to-increasing risk classes had similar average cumulative risk scores, their risk profiles were different. Adolescents in the persistent low-risk class had consistently low-risk of all behaviours throughout adolescence, whereas those in the low-increasing-risk class had a rapidly increasing risk of alcohol use, physical inactivity, and sexual risk from age 14 years. Similarly, the moderate-increasing-risk and persistent-moderate-risk classes had similar average cumulative risk scores. However, while their obesity and physical inactivity risk was similar across adolescence, the moderate-increasing-risk class had increasing risk of tobacco and alcohol use and sexual risk, while the persistent-moderate-risk class had low cumulative tobacco use and sexual risk and decreasing alcohol use (a similar analysis using the 4-class solution can also be found in supplementary materials 7, 8 and 9). Unfortunately, entropy for the latent class results was poor (6-class solution: 0.64 for complete case and 0.53 for the imputed sample and 4-class solution: 0.61 for complete case and 0.51 for the imputed sample), meaning that around one third of participants have an incorrect class assignment for the complete case sample and just under half have an incorrect assignment for the imputed sample.
Social patterning of the exposure and outcome measures
There was evidence that the quartiles of adolescent cancer risk were socially patterned by both maternal education and parental social class (see Supplementary Material 10). However, the effect for income was non-monotonic and those in the middle-low income group, as opposed to the low-income group, had the highest adolescent risk scores.
There were some differences in the prevalence of the 24-year outcomes comparing males with females (Table 3). Males had increased odds of harmful drinking at age 24 years compared with females, (odds ratio (OR) = 1.68; 95%CI:1.42 to 1.99). However, males had decreased odds of all the anthropometric measures compared to females: obesity (OR = 0.72; 95%CI:0.57 to 0.89), high waist circumference (OR = 0.44; 95%CI:0.37 to 0.53) and high waist-hip ratio: (OR = 0.18 95%CI:0.12 to 0.27). There were no differences between males and females with respect to odds of either smoking outcomes. There was social patterning of the outcomes, when compared to the reference category (in each case the highest social class, level of maternal education, or income quintile). The odds of all outcomes increased for every incremental decrease in social position. There was one notable exception, harmful drinking, where the social patterning ran the opposite direction and for every incremental increase in social position, there were increased odds of harmful drinking.
Associations between multiple adolescent cancer risk behaviours and at age 24 years
Associations of large magnitude were present between adolescent and early adult cancer risk behaviours (see Table 4). When compared to the bottom quartile, those in the top quartile had more than five times greater odds of nicotine dependency at age 24 years: (odds ratio, OR = 5.37; 95% confidence interval, CI: 3.64 to 7.93) and daily smoking (OR = 5.10; 95%CI: 3.19 to 8.17); nearly five times the odds of being obese (OR = 4.84; 95%CI:3.33 to 7.03); nearly two and a half times the odds of a high waist circumference (OR = 2.48 95%CI:1.94 to 3.16); just more than twice the odds of harmful drinking (OR = 2.04 95%CI: 1.57 to 2.65); and nearly twice the odds of a high waist-hip ratio (OR = 1.88 95%CI:1.30 to 2.71).
With reference to Fig. 2 the 6-class solution, the persistent very low-risk and persistent low-risk classes in adolescence consistently had the lowest risk of all outcomes at age 24 years. The persistent high-risk class in adolescence consistently had the highest risk of all outcomes at age 24 years, except harmful drinking, which had a non-monotonic association with the adolescent latent classes (low-increasing- and persistent-moderate-risk classes in adolescence had the highest risk of harmful drinking at 24). The moderate-increasing- and persistent-high-risk classes in adolescence had the highest risk of both tobacco outcomes at age 24. Finally, the anthropometric outcomes (obesity, high waist circumference and high waist-hip ratio) largely followed a linear association between latent class in adolescence and risk of each outcome at age 24. With reference to Fig. 3, the results for the 4-class solution are very similar and provide a useful comparator for the quartiles. The persistent high-risk class consistently had the highest risk of all outcomes at age 24 years, except harmful drinking, which had a non-monotonic association with the adolescent latent classes (the low-increasing risk class had the same risk of harmful drinking at 24). Similarly, the anthropometric outcomes (obesity, high waist circumference and high waist-hip ratio) largely followed a linear association between latent class in adolescence and risk of each outcome at age 24.
Using repeated measures of cancer risk behaviours at four occasions across adolescence, we used two different methods to derive patterns of adolescent cancer risk. We found distinct groups of adolescents characterised by consistently high- and low-cancer risk behaviours during adolescence. We also found associations of large magnitude between adolescent and early adult cancer risk behaviours. We demonstrate for the first time that multiple cancer risk behaviours pattern longitudinally across adolescence and into young adulthood. Unlike previous research which focusses on either multiple risk behaviours at one time point (using cross-sectional data), or longer-term trajectories of single risk behaviours, our research combines the two. We have included a multiplicity of risk behaviours (five separate risk behaviours), using longitudinal data (at four different time points) covering a minimum 13-year period. Preventing multiple cancer risk behaviours during adolescence would likely reduce these behaviours in early adulthood and across the life-course, thereby reducing cancer incidence and mortality.
The strengths of this study were the long-term, longitudinal design. We were able to show associations between an exposure measure that covered the whole adolescent period (age 11–18 years) and outcomes 6 years later at age ~ 24 years. This has the advantage that most young people will have left education and entered the labour market and therefore our findings are indicative of a longer-term trajectory across different environments. We used both a cumulative score and latent classes to capture the exposure measure, which provided different insights, but were mutually reinforcing. We were also able to adjust for an extensive range of potential confounders, reducing the chance of residual confounding.
The weaknesses of the study include the potential for residual confounding and cohort attrition, as in all observational studies. However, for selection bias to pose a problem, our outcome measures would have to be conditionally related to whether a participant remains in the sample or missing not at random (MNAR) [45, 49]. We assume that our data is missing at random (MAR). Many of the measures were collected using self-report, which might mean participants favour responses they believe are more socially acceptable rather than choosing responses that reflect their true feelings. However, by the time ALSPAC participants had reached adolescence, following years of completing research questionnaires, it is likely that they will have developed trust in the maintenance of their anonymity, possibly reducing such bias. The risk behaviours were all reduced to binary variables in order to calculate the score total, which may lead to a loss of information. However, in line with our previous research and given the highly differential risks associated with these behaviours, at different levels and for different cancers, each risk has an equal weighting, as it would not be possible to accurately weight the risk behaviours differently. We were unable to include unhealthy diet as part of the adolescent exposure, as it was not collected in ALSPAC past the age of 13 years. We were also unable to include outcome measures relating to physical inactivity or sexual risk, owing to the extent of their missingness and the problem of perfect prediction as outlined in the methods. With reference to the latent class analysis, entropy was poor (6-class solution: 0.64 for complete case and 0.53 for the imputed sample and 4-class solution: 0.61 for complete case and 0.52 for the imputed sample). Therefore, while the classes may provide further insight into the patterns of adolescent behaviours, we cannot be confident that participants have been separated into the correct classes, which limits their utility as a targeting tool when considering public health interventions. They are also difficult to replicate because we would require the same number of similar exposures, at similar time points, which is not always available in other data sets. Finally, ALSPAC is not a nationally representative sample and therefore may limit the generalisability of our findings.
Given the co-occurrence of these behaviours and their associations with early adult risk behaviours, public health policies should adopt approaches that enable all health professionals who have contact with adolescents - including sexual health clinicians, general practitioners (GPs), public health workers and policy makers - to address a multitude of risks, at each contact. As a third of the cohort were identified to be persistently-moderate, or persistently-high engagers in risk throughout adolescence, it may be that early intervention is required to prevent engagement at an early stage in adolescence.
We found evidence that both the adolescent exposure measure and the outcomes at age 24 years (except harmful drinking) were socially patterned, i.e. the likelihood/odds of exposure to these risk behaviours increased for every incremental decrease in social position. However, interventions that focus on the most disadvantaged people, will not redress the social gradient and will only tackle a small part of the problem, while targeting subgroups excludes other deprived groups and risks stigmatising those targeted . Further, because the social patterning for harmful drinking, ran in the opposite direction, adopting this strategy would be ineffective if we targeted interventions in this way. Universal interventions aimed at behaviour change at the individual level, have been shown to further exacerbate health inequalities, because more advantaged people and those who already have healthy behaviours tend to be quicker and more likely to take up these types of interventions . Given this, and the social gradient of our exposure and outcome measures, interventions that are universal, but with a scale and intensity that is proportionate to the level of risk,  are preferable. Previous research relating to the social patterning of alcohol consumption has been mixed. For example, Melotti and colleagues found that adolescents that come from higher-income households in childhood were more likely to use alcohol. However, those with mothers with more educational qualifications were less likely to use alcohol . Possible explanations that young adults with higher socioeconomic status in ALSPAC are more likely to be harmful drinking, include having increased disposable income available to them, which facilitates the purchase of alcohol, not available to less well-off individuals. An alternative explanation is provided by Luthar  who argues that young people from affluent families feel more pressure to achieve and are more isolated from their parents which contributes to their unhealthy behaviours. It is also possible that young people from higher socioeconomic backgrounds who are more likely to attend higher education, maintain drinking habits they acquired at university, into their early twenties.
A recent Cochrane systematic review showed that applying the World Health Organisation (WHO) health promoting schools framework improves some aspects of student health, including body mass index (BMI), physical activity, physical fitness and tobacco use, to an extent that is important at the population level. However, it found little evidence for improvement in zBMI (BMI, standardized for age and gender), and no evidence for alcohol use . Another Cochrane Systematic Review looking at individual-, family-, and school-level interventions targeting multiple risk behaviours in young people, demonstrated that universal school-based interventions are most effective in preventing adolescent alcohol use, tobacco smoking, illicit drug use and antisocial behaviour, and increasing physical activity, but did not find strong evidence of benefit for family or individual-level interventions . Therefore, efforts at preventing these behaviours should focus on developing appropriate proportionate universalist interventions in secondary schools.
We have shown that a multitude of behaviours known to cause cancer (tobacco smoking, alcohol consumption, obesity, unprotected sexual intercourse, and physical inactivity), originate in adolescence. Future research should focus on identifying the antecedents of adolescent cancer risk behaviours to identify opportunities for earlier intervention before the behaviours begin. Research is also required into whether the associations observed in ALSPAC are evident in similar cohort studies and outside of the UK.
We have shown that exposure to adolescent cancer risk behaviours greatly increased the odds of cancer risk behaviours in early adulthood. Interventions to reduce these behaviours should target multiple rather than single risk behaviours and should focus on adolescence. Given this, and the social gradient of our exposure and outcome measures, school-based interventions that are universal, but with a scale and intensity that is proportionate to the level of risk are preferable.
Availability of data and materials
The datasets supporting the conclusions of this article are available from ALSPAC. The ALSPAC policy on data sharing is available at www.bristol.ac.uk/alspac. To discuss access to ALSPAC data, please contact the ALSPAC executive team on firstname.lastname@example.org. Details of all available questionnaires can be found using the following link (http://www.bristol.ac.uk/alspac/researchers/our-data/questionnaires/child-completed-questionnaires/). We have used several questionnaires, the details of which can be found in the supplementary materials file, Data sources.
Avon Longitudinal Study of Parents and Children
Body mass index
ONS. Deaths registered in England and Wales: 2019 [Internet]. 2019. Available from: https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/bulletins/deathsregistrationsummarytables/2019#main-points
Dagenais GR, Leong DP, Rangarajan S, Lanas F, Lopez-Jaramillo P, Gupta R, et al. Variations in common diseases, hospital admissions, and deaths in middle-aged adults in 21 countries from five continents (PURE): a prospective cohort study. Lancet [Internet]. 2020;395(10226):785–94. https://doi.org/10.1016/S0140-6736(19)32007-0 Variations in common diseases, hospital admissions, and deaths in middle-aged adults in 21 countries from five continents (PURE): a prospective cohort study.
Brown KF, Rumgay H, Dunlop C, Ryan M, Quartly F, Cox A, et al. The fraction of cancer attributable to modifiable risk factors in England, Wales, Scotland, Northern Ireland, and the United Kingdom in 2015. Br J Cancer [Internet]. 2018;118(8):1130–41. https://doi.org/10.1038/s41416-018-0029-6.
Parkin DM. Tobacco-attributable cancer burden in the UK in 2010. Br J Cancer [Internet]. 2011;105(S2):S6–13. https://doi.org/10.1038/bjc.2011.475.
WCRF. Diet, Nutrition Physical activity and Cancer: a global Perspective Continuous Update Project Expert Report 2018. London: World Cancer Research Fund (WCRF); 2018.
Praud D, Rota M, Rehm J, Shield K, Zatoński W, Hashibe M, et al. Cancer incidence and mortality attributable to alcohol consumption. Int J Cancer. 2016;138(6):1380–7. https://doi.org/10.1002/ijc.29890.
Parkin DM. Cancers attributable to consumption of alcohol in the UK in 2010. Br J Cancer. 2011;105(S2):S14–8. https://doi.org/10.1038/bjc.2011.476.
Connor J. Alcohol consumption as a cause of cancer. Addiction. 2016; Available from: http://www.ncbi.nlm.nih.gov/pubmed/27442501.
Brenner DR, Poirier AE, Ruan Y, Hebert LA, Grevers X, Walter SD, et al. Estimates of the current and future burden of cancer attributable to excess body weight and abdominal adiposity in Canada. Prev Med (Baltim). 2019;122:49–64. https://doi.org/10.1016/j.ypmed.2019.03.014.
Bitzur R, Brenner R, Maor E, Antebi M, Ziv-Baran T, Segev S, et al. Metabolic syndrome, obesity, and the risk of cancer development. Eur J Intern Med [Internet]. 2016;34:89–93. Available from:. https://doi.org/10.1016/j.ejim.2016.08.019.
Parkin DM, Boyd L. Cancers attributable to overweight and obesity in the UK in 2010. Br J Cancer [Internet]. 2011;105(S2):S34–7. https://doi.org/10.1038/bjc.2011.481.
IARC. World Cancer Report. P. K, editor. Lyon: IARC Press; 2003.
Parkin DM. Cancers attributable to inadequate physical exercise in the UK in 2010. Br J Cancer. 2011;105(S2):S38–41. https://doi.org/10.1038/bjc.2011.482.
Zhang X, Theodoratou E, Li X, Farrington SM, Svinti V, Campbell H, et al. Physical activity and colorectal cancer risk: a two-sample Mendelian randomisation study. Lancet. 2019;394:S101 Available from: http://www.sciencedirect.com/science/article/pii/S0140673619328983.
Papadimitriou N, Dimou N, Tsilidis KK, Banbury B, Martin RM, Lewis SJ, et al. Physical activity and risks of breast and colorectal cancer: a Mendelian randomisation analysis. Nat Commun. 2020;11(1):597. https://doi.org/10.1038/s41467-020-14389-8.
Kazmi N, Haycock P, Tsilidis K, Lynch BM, Truong T. The Practical Consortium CBCP, et al. Appraising causal relationships of dietary, nutritional and physical-activity exposures with overall and aggressive prostate cancer: two-sample Mendelian-randomization study based on 79 148 prostate-cancer cases and 61 106 controls. Int J Epidemiol. 2019. https://doi.org/10.1093/ije/dyz235.
Digital NHS. Statistics on smoking, England; 2019.
Digital NHS. Statistics on obesity, Physical Activity and Diet, England, 2019. p. 2019. Available from: https://digital.nhs.uk/data-and-information/publications/statistical/statistics-on-obesity-physical-activity-and-diet/statistics-on-obesity-physical-activity-and-diet-england-2019
Digital N. Statistics on alcohol, England 2020 [Internet]. 2020. Available from: https://digital.nhs.uk/data-and-information/publications/statistical/statistics-on-alcohol/2020
Jessor R. Risk behavior in adolescence: A psychosocial framework for understanding and action. J Adolesc Heal. 1991;12(8):597-605.
Digital N. Health survey for England 2018 Children’s health. Surrey: Health and Social Care Information Centre; 2019.
Digital N. Health survey for England 2018 overweight and obesity in adults and children [Internet]. Surrey: NatCen Social Research; 2019. Available from: https://digital.nhs.uk/data-and-information/publications/statistical/health-survey-for-england/2018
Public Health England. Sexually transmitted infections and screening for chlamydia in England, 2018 Health Protection Report. Vol. 13. London; 2019.
Dumith SC, Muniz LC, Tassitano RM, Hallal PC, AMB M. Clustering of risk factors for chronic diseases among adolescents from Southern Brazil. Prev Med (Baltim). 2012;54(6):393–6 Available from: https://pubmed.ncbi.nlm.nih.gov/22484392. 2012/03/29.
Alzahrani SG, Watt RG, Sheiham A, Aresu M, Tsakos G. Patterns of clustering of six health-compromising behaviours in Saudi adolescents. BMC Public Health. 2014;14(1):1–6.
Noel H, Denny S, Farrant B, Rossen F, Teevale T, Clark T, et al. Clustering of adolescent health concerns: a latent class analysis of school students in New Zealand. J Paediatr Child Health [Internet]. 2013;49(11):935–41. https://doi.org/10.1111/jpc.12397.
Wright C, Heron J, Campbell R, Hickman M, Kipping RR. Adolescent multiple risk behaviours cluster by number of risks rather than distinct risk profiles in the ALSPAC cohort. BMC Public Health. 2020;20:290.
Brook DW, Brook JS, Zhang C, Whiteman M, Cohen P, Finch SJ. Developmental trajectories of cigarette smoking from adolescence to the early thirties: personality and behavioral risk factors. Nicotine Tob Res [Internet]. 2008;10(8):1283–91. Available from: https://pubmed.ncbi.nlm.nih.gov/18686175. https://doi.org/10.1080/14622200802238993.
Costello DM, Dierker LC, Jones BL, Rose JS. Trajectories of smoking from adolescence to early adulthood and their psychosocial risk factors. Health Psychol [Internet]. 2008;27(6):811–8. Available from: https://pubmed.ncbi.nlm.nih.gov/19025277. https://doi.org/10.1037/0278-6188.8.131.521.
Jacob T, Koenig LB, Howell DN, Wood PK, Haber JR. Drinking trajectories from adolescence to the fifties among alcohol-dependent men. J Stud Alcohol Drugs [Internet]. 2009;70(6):859–69. Available from: https://pubmed.ncbi.nlm.nih.gov/19895762. https://doi.org/10.15288/jsad.2009.70.859.
Maggs JL, Schulenberg JE. Trajectories of alcohol use during the transition to adulthood. Alcohol Res Heal [Internet]. 2004;28(4):195–201 Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6601670/.
Britton A, Ben-Shlomo Y, Benzeval M, Kuh D, Bell S. Life course trajectories of alcohol consumption in the United Kingdom using longitudinal data from nine cohort studies. BMC Med. 2015;13(1):47. https://doi.org/10.1186/s12916-015-0273-z.
Callo Quinte G, Barros F, Gigante DP, de Oliveira IO, dos Santos Motta JV, Horta BL. Overweight trajectory and cardio metabolic risk factors in young adults. BMC Pediatr. 2019;19(1):75. https://doi.org/10.1186/s12887-019-1445-3.
Boyd A, Macleod J, Lawlor DA, Fraser A, Henderson J, Molloy L, et al. Cohort profile: the ‘children of the 90s’-the index offspring of the Avon longitudinal study of parents and children. Int J Epidemiol. 2013;42(1):111–27 Available from: http://www.ncbi.nlm.nih.gov/pubmed/22507743. 2012/04/18.
Fraser A, Macdonald-wallis C, Tilling K, Boyd A, Golding J, Davey Smith G, et al. Cohort profile: the Avon longitudinal study of parents and children: ALSPAC mothers cohort. Int J Epidemiol. 2013;42(1):97–110. https://doi.org/10.1093/ije/dys066.
Jago R, Salway RE, Ness AR, JPH S, Ridd MJ, Henderson AJ. Associations between physical activity and asthma, eczema and obesity in children aged 12–16: an observational cohort study. BMJ Open. 2019;9(1):e024858 Available from: http://bmjopen.bmj.com/content/9/1/e024858.abstract.
Bull CJ, Bell JA, Murphy N, Sanderson E, Davey Smith G, Timpson NJ, et al. Adiposity, metabolites, and colorectal cancer risk: Mendelian randomization study. BMC Med. 2020;18(1):396. https://doi.org/10.1186/s12916-020-01855-9.
Wright C, Kipping RR, Hickman M, Campbell R, Heron J. The effect of multiple risk behaviours in adolescence on educational attainment at age 16: a UK birth cohort study. BMJ Open. 2018;8(7):e020182. https://doi.org/10.1136/bmjopen-2017-020182.
Campbell R, Wright C, Hickman M, Kipping RR, Smith M, Pouliou T, et al. Multiple risk behaviour in adolescence is associated with substantial adverse health and social outcomes in early adulthood: Findings from a prospective birth cohort study. Prev Med (Baltim). 2020;138:106157 Available from: http://www.sciencedirect.com/science/article/pii/S009174352030181X.
Kipping Smith M, Heron J, Hickman M, Campbell R. Multiple risk behaviour in adolescence and socio-economic status: findings from a UK birth cohort. Eur J Public Heal. 2015;25(1):44–9 Available from: http://www.ncbi.nlm.nih.gov/pubmed/24963150.
White IR, Daniel R, Royston P. Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables. Comput Stat Data Anal. 2010;54(10):2267–75. Available from: https://pubmed.ncbi.nlm.nih.gov/24748700. https://doi.org/10.1016/j.csda.2010.04.005.
NICE. Obesity: identification, assessment and Obesity: identification, assessment and management. National Institute for Health and Care Excellence (NICE); 2014.
WHO. Waist Circumference and Waist–Hip Ratio: Report of a WHO Expert Consultation [Internet]. Geneva; 2008. Available from: https://apps.who.int/iris/bitstream/handle/10665/44583/9789241501491_eng.pdf?ua=1
Houtepen LC, Heron J, Suderman MJ, Tilling K, Howe LD. Adverse childhood experiences in the children of the Avon Longitudinal Study of Parents and Children (ALSPAC). Wellcome open Res. 2018;3:106 Available from: https://pubmed.ncbi.nlm.nih.gov/30569020.
Sterne White J, Carlin J, Spratt M, Royston P, Kenward M, Wood A, et al. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ. 2009;338(jun29 1):b2393. https://doi.org/10.1136/bmj.b2393.
White Royston P, Wood AM. Multiple imputation using chained equations: issues and guidance for practice. Stat Med. 2011;30(4):377–99 Available from: http://www.ncbi.nlm.nih.gov/pubmed/21225900.
LLC S. Stata Statistical Software: Release 15. 15th ed. Texas; 2017.
Muthén LK, Muthén BO. Mplus: statistical analysis with latent variables: User’s guide (version 8). Los Angeles CA: Authors; 2017.
Hughes RA, Heron J, Sterne JAC, Tilling K. Accounting for missing data in statistical analyses: multiple imputation is not always the answer. Int J Epidemiol. 2019;48(4):1294–304. https://doi.org/10.1093/ije/dyz032 Accounting for missing data in statistical analyses: multiple imputation is not always the answer.
Marmot M. Fair society healthy lives full report; 2010.
Bauld L, Judge KEN, Barnes M, Benzeval M, Mackenzie M, Sullivan H. Promoting Social Change: The Experience of Health Action Zones in England. J Soc Policy. 2005;34(3):427–45 Available from: https://www.cambridge.org/core/article/promoting-social-change-the-experience-of-health-action-zones-in-england/F032053AC2415ABE33049B2ACC7383BB. 2005/06/15.
Melotti R, Heron J, Hickman M, Macleod J, Araya R, Lewis G. Adolescent Alcohol and Tobacco Use and Early Socioeconomic Position: The ALSPAC Birth Cohort. Pediatrics. 2011;127(4):e948 Available from: http://pediatrics.aappublications.org/content/127/4/e948.abstract. LP-e955.
Luthar SS. The Culture of Affluence: Psychological Costs of Material Wealth. Child Dev. 2003;74(6):1581–93. https://doi.org/10.1046/j.1467-8624.2003.00625.x The Culture of Affluence: Psychological Costs of Material Wealth.
Langford R, Bonell CP, Jones HE, Pouliou T, Murphy SM, Waters E, et al. The WHO Health Promoting School framework for improving the health and well-being of students and their academic achievement. Cochrane Database Syst Rev. 2014;(4). https://doi.org/10.1002/14651858.CD008958.pub2.
MacArthur G, Caldwell DM, Redmore J, Watkins SH, Kipping R, White J, et al. Individual-, family-, and school-level interventions targeting multiple risk behaviours in young people. Cochrane Database Syst Rev. 2018;(10). https://doi.org/10.1002/14651858.CD009927.pub2.
Harris PA, Taylor R, Minor BL, Elliott V, Fernandez M, O’Neal L, et al. The REDCap consortium: Building an international community of software platform partners. J Biomed Inform. 2019;95:103208 Available from: http://www.sciencedirect.com/science/article/pii/S1532046419301261.
We are extremely grateful to all the families who took part in this study, the midwives for their help in recruiting them, and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists and nurses. The UK Medical Research Council and Wellcome (Grant ref.: 102215/2/13/2) and the University of Bristol provide core support for ALSPAC. This publication is the work of the authors and Caroline Wright will serve as guarantor for the contents of this paper.
Study data were collected and managed using REDCap electronic data capture tools hosted at University of Bristol . REDCap (Research Electronic Data Capture) is a secure, web-based software platform designed to support data capture for research studies, providing 1) an intuitive interface for validated data capture; 2) audit trails for tracking data manipulation and export procedures; 3) automated export procedures for seamless data downloads to common statistical packages; and 4) procedures for data integration and interoperability with external sources.
Our manuscript was written in accordance with STROBE guidelines, a completed checklist has been provided.
Patient and public involvement
No patients were involved in the design of this study. However, our choice of risk behaviours was informed by discussions with adolescents from the Centre for the Development and Evaluation of Complex Interventions for Public Health Improvement (DECIPHer) Advice leading to Public Health Advancement (ALPHA) young person’s research advisory group (https://decipher.uk.net/public-health-improvement-research-networks-phirns/public-involvement-alpha/).
CW is funded by a Cancer Research UK Population Research Postdoctoral Fellowship (C60153/A23895). ALSPAC receives core support from the UK Medical Research Council and the Wellcome Trust (grant reference 092731) and the University of Bristol and also from MRC and Alcohol Research UK (MR/L022206/1). This work was undertaken with the support of The Centre for the Development and Evaluation of Complex Interventions for Public Health Improvement (DECIPHer), a UKCRC Public Health Research Centre of Excellence. Joint funding (MR/KO232331/1) from the British Heart Foundation, Cancer Research UK, Economic and Social Research Council, Medical Research Council, the Welsh Government and the Wellcome Trust, under the auspices of the UK Clinical Research Collaboration, is gratefully acknowledged. RMM is supported in part by the National Institute for Health Research Bristol Biomedical Research Centre. RMM was supported by a Cancer Research UK (C18281/A29019) programme grant (the Integrative Cancer Epidemiology Programme). MH is supported by NIHR Health Protection Research Unit in Behavioural Science and Evaluation. MH&RC acknowledge support of NIHR School of Public Health Research.
The views and opinions expressed by authors in this publication are those of the authors and do not necessarily reflect those of the UK National Institute for Health Research (NIHR) or the Department of Health and Social Care.
The funders did not play any part in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or the decision to submit the article for publication. All the researchers are independent from funders and had full access to all of the data (including statistical reports and tables) in the study and can take responsibility for the integrity of the data and the accuracy of the data analysis.
Ethics approval and consent to participate
Ethical approval for the study was obtained from the Avon Longitudinal Study of Parents and Children Ethics and Law Committee and local Research Ethics Committees: Bristol and Weston Health Authority: E1808 Children of the Nineties: Avon Longitudinal Study of Pregnancy and Childhood (ALSPAC). (28th November 1989), Southmead Health Authority: 49/89 Children of the Nineties - “ALSPAC”. (5th April 1990), Frenchay Health Authority: 90/8 Children of the Nineties. (28th June 1990), 15 Year Clinic: Central & South Bristol Research Ethics Committee (UBHT): 06/Q2006/53 Avon Longitudinal Study of Parents and Children (ALSPAC), Hands on Assessments: Teen Focus 3 (Focus 15+). (7th August 2006) (Confirmed 15th September 2006) and 17 Year clinic: North Somerset & South Bristol Research Ethics Committee: 08/H0106/9 Avon Longitudinal Study of Parents and Children (ALSPAC), Hands on Assessments: Teen Focus 4 (Focus 17+) (18th November 2008). Informed consent for the use of data collected via questionnaires and clinics was obtained from participants following the recommendations of the Avon Longitudinal Study of Parents and Children Ethics and Law Committee at the time.
Study participants who complete questionnaires consent to the use of their data by approved researchers. Up until age 18 an overarching parental consent was used to indicate parents were happy for their child (the study participant) to take part in ALSPAC. Consent for data collection and use was implied via the written completion and return of questionnaires. Study participants have the right to withdraw their consent for specific elements of the study, or from the study as a whole, at any time.
Consent for publication
Informed consent for the use of data collected via questionnaires and clinics was obtained from participants following the recommendations of the ALSPAC Ethics and Law Committee at the time.
The authors declare that there are no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Material, Description of data: The supplementary material contains additional information concerning: 1. Risk thresholds; 2. Confounder measures and their derivation; 3.Deriving the adolescent cumulative risk exposure measure; 4. Deriving the latent classes; 5. Deriving the sample; 6. Comparison of imputation sample and those excluded from the analysis by key demographic variables; 7.Risk profiles for adolescence, percentage of sample and cumulative risk score for complete case and imputed samples; 8. Latent class trajectories; 9. Cross tabulations of cumulative risk score quartiles and proportion belonging to each of the latent classes; and 10. Associations between the adolescent cancer risk measure and socioeconomic status (SES) measures.
STROBE Statement—Checklist of items that should be included in reports of cohort studies, Description of data: a table providing information regarding where each of the components of the report are located in the document.
Data sources. Description of data: contains both a table showing the sources of ALSPAC questionnaire and clinic data used in research and links to each questionnaire or data source.
About this article
Cite this article
Wright, C., Heron, J., Kipping, R. et al. Young adult cancer risk behaviours originate in adolescence: a longitudinal analysis using ALSPAC, a UK birth cohort study. BMC Cancer 21, 365 (2021). https://doi.org/10.1186/s12885-021-08098-8
- Cancer risk behaviours
- UK birth cohort study
- Early adulthood
- Longitudinal latent class analysis