Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study

Objective: To derive and validate a set of clinical risk prediction algorithm to estimate the 10-year risk of 11 common cancers. Design: Prospective open cohort study using routinely collected data from 753 QResearch general practices in England. We used 565 practices to develop the scores and 18...

Full description

Bibliographic Details
Main Authors: Hippisley-Cox, Julia, Coupland, Carol
Format: Article
Published: BMJ Publishing Group Ltd 2015
Subjects:
Online Access:https://eprints.nottingham.ac.uk/47593/
_version_ 1848797584758407168
author Hippisley-Cox, Julia
Coupland, Carol
author_facet Hippisley-Cox, Julia
Coupland, Carol
author_sort Hippisley-Cox, Julia
building Nottingham Research Data Repository
collection Online Access
description Objective: To derive and validate a set of clinical risk prediction algorithm to estimate the 10-year risk of 11 common cancers. Design: Prospective open cohort study using routinely collected data from 753 QResearch general practices in England. We used 565 practices to develop the scores and 188 for validation. Subjects: 4.96 million patients aged 25–84 years in the derivation cohort; 1.64 million in the validation cohort. Patients were free of the relevant cancer at baseline. Methods: Cox proportional hazards models in the derivation cohort to derive 10-year risk algorithms. Risk factors considered included age, ethnicity, deprivation, body mass index, smoking, alcohol, previous cancer diagnoses, family history of cancer, relevant comorbidities and medication. Measures of calibration and discrimination in the validation cohort. Outcomes: Incident cases of blood, breast, bowel, gastro-oesophageal, lung, oral, ovarian, pancreas, prostate, renal tract and uterine cancers. Cancers were recorded on any one of four linked data sources (general practitioner (GP), mortality, hospital or cancer records). Results: We identified 228 241 incident cases during follow-up of the 11 types of cancer. Of these 25 444 were blood; 41 315 breast; 32 626 bowel, 12 808 gastro-oesophageal; 32 187 lung; 4811 oral; 6635 ovarian; 7119 pancreatic; 35 256 prostate; 23 091 renal tract; 6949 uterine cancers. The lung cancer algorithm had the best performance with an R2 of 64.2%; D statistic of 2.74; receiver operating characteristic curve statistic of 0.91 in women. The sensitivity for the top 10% of women at highest risk of lung cancer was 67%. Performance of the algorithms in men was very similar to that for women. Conclusions: We have developed and validated a prediction models to quantify absolute risk of 11 common cancers. They can be used to identify patients at high risk of cancers for prevention or further assessment. The algorithms could be integrated into clinical computer systems and used to identify high-risk patients. Web calculator: There is a simple web calculator to implement the Qcancer 10 year risk algorithm together with the open source software for download (available at http://qcancer.org/10yr/).
first_indexed 2025-11-14T20:06:12Z
format Article
id nottingham-47593
institution University of Nottingham Malaysia Campus
institution_category Local University
last_indexed 2025-11-14T20:06:12Z
publishDate 2015
publisher BMJ Publishing Group Ltd
recordtype eprints
repository_type Digital Repository
spelling nottingham-475932020-05-04T17:01:40Z https://eprints.nottingham.ac.uk/47593/ Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study Hippisley-Cox, Julia Coupland, Carol Objective: To derive and validate a set of clinical risk prediction algorithm to estimate the 10-year risk of 11 common cancers. Design: Prospective open cohort study using routinely collected data from 753 QResearch general practices in England. We used 565 practices to develop the scores and 188 for validation. Subjects: 4.96 million patients aged 25–84 years in the derivation cohort; 1.64 million in the validation cohort. Patients were free of the relevant cancer at baseline. Methods: Cox proportional hazards models in the derivation cohort to derive 10-year risk algorithms. Risk factors considered included age, ethnicity, deprivation, body mass index, smoking, alcohol, previous cancer diagnoses, family history of cancer, relevant comorbidities and medication. Measures of calibration and discrimination in the validation cohort. Outcomes: Incident cases of blood, breast, bowel, gastro-oesophageal, lung, oral, ovarian, pancreas, prostate, renal tract and uterine cancers. Cancers were recorded on any one of four linked data sources (general practitioner (GP), mortality, hospital or cancer records). Results: We identified 228 241 incident cases during follow-up of the 11 types of cancer. Of these 25 444 were blood; 41 315 breast; 32 626 bowel, 12 808 gastro-oesophageal; 32 187 lung; 4811 oral; 6635 ovarian; 7119 pancreatic; 35 256 prostate; 23 091 renal tract; 6949 uterine cancers. The lung cancer algorithm had the best performance with an R2 of 64.2%; D statistic of 2.74; receiver operating characteristic curve statistic of 0.91 in women. The sensitivity for the top 10% of women at highest risk of lung cancer was 67%. Performance of the algorithms in men was very similar to that for women. Conclusions: We have developed and validated a prediction models to quantify absolute risk of 11 common cancers. They can be used to identify patients at high risk of cancers for prevention or further assessment. The algorithms could be integrated into clinical computer systems and used to identify high-risk patients. Web calculator: There is a simple web calculator to implement the Qcancer 10 year risk algorithm together with the open source software for download (available at http://qcancer.org/10yr/). BMJ Publishing Group Ltd 2015-03-01 Article PeerReviewed Hippisley-Cox, Julia and Coupland, Carol (2015) Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study. BMJ Open, 5 (3). e007825/1-e007825/25. ISSN 2044-6055 QCancer; QResearch; Development; Validation; Risk Prediction algorithms; Common cancers; Primary care http://bmjopen.bmj.com/content/5/3/e007825.abstract doi:10.1136/bmjopen-2015-007825 doi:10.1136/bmjopen-2015-007825
spellingShingle QCancer; QResearch; Development; Validation; Risk Prediction algorithms; Common cancers; Primary care
Hippisley-Cox, Julia
Coupland, Carol
Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study
title Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study
title_full Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study
title_fullStr Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study
title_full_unstemmed Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study
title_short Development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study
title_sort development and validation of risk prediction algorithms to estimate future risk of common cancers in men and women: prospective cohort study
topic QCancer; QResearch; Development; Validation; Risk Prediction algorithms; Common cancers; Primary care
url https://eprints.nottingham.ac.uk/47593/
https://eprints.nottingham.ac.uk/47593/
https://eprints.nottingham.ac.uk/47593/