Overcoming challenges to data quality in the ASPREE clinical trial

© 2019 The Author(s). Background: Large-scale studies risk generating inaccurate and missing data due to the complexity of data collection. Technology has the potential to improve data quality by providing operational support to data collectors. However, this potential is under-explored in community...

Full description

Bibliographic Details
Main Authors: Lockery, J.E., Collyer, T.A., Reid, Christopher, Ernst, M.E., Gilbertson, D., Hay, N., Kirpach, B., McNeil, J.J., Nelson, M.R., Orchard, S.G., Pruksawongsin, K., Shah, R.C., Wolfe, R., Woods, R.L.
Format: Journal Article
Language:English
Published: BMC 2019
Subjects:
Online Access:http://hdl.handle.net/20.500.11937/80064
_version_ 1848764153291866112
author Lockery, J.E.
Collyer, T.A.
Reid, Christopher
Ernst, M.E.
Gilbertson, D.
Hay, N.
Kirpach, B.
McNeil, J.J.
Nelson, M.R.
Orchard, S.G.
Pruksawongsin, K.
Shah, R.C.
Wolfe, R.
Woods, R.L.
author_facet Lockery, J.E.
Collyer, T.A.
Reid, Christopher
Ernst, M.E.
Gilbertson, D.
Hay, N.
Kirpach, B.
McNeil, J.J.
Nelson, M.R.
Orchard, S.G.
Pruksawongsin, K.
Shah, R.C.
Wolfe, R.
Woods, R.L.
author_sort Lockery, J.E.
building Curtin Institutional Repository
collection Online Access
description © 2019 The Author(s). Background: Large-scale studies risk generating inaccurate and missing data due to the complexity of data collection. Technology has the potential to improve data quality by providing operational support to data collectors. However, this potential is under-explored in community-based trials. The Aspirin in reducing events in the elderly (ASPREE) trial developed a data suite that was specifically designed to support data collectors: the ASPREE Web Accessible Relational Database (AWARD). This paper describes AWARD and the impact of system design on data quality. Methods: AWARD's operational requirements, conceptual design, key challenges and design solutions for data quality are presented. Impact of design features is assessed through comparison of baseline data collected prior to implementation of key functionality (n = 1000) with data collected post implementation (n = 18,114). Overall data quality is assessed according to data category. Results: At baseline, implementation of user-driven functionality reduced staff error (from 0.3% to 0.01%), out-of-range data entry (from 0.14% to 0.04%) and protocol deviations (from 0.4% to 0.08%). In the longitudinal data set, which contained more than 39 million data values collected within AWARD, 96.6% of data values were entered within specified query range or found to be accurate upon querying. The remaining data were missing (3.4%). Participant non-attendance at scheduled study activity was the most common cause of missing data. Costs associated with cleaning data in ASPREE were lower than expected compared with reports from other trials. Conclusions: Clinical trials undertake complex operational activity in order to collect data, but technology rarely provides sufficient support. We find the AWARD suite provides proof of principle that designing technology to support data collectors can mitigate known causes of poor data quality and produce higher-quality data. Health information technology (IT) products that support the conduct of scheduled activity in addition to traditional data entry will enhance community-based clinical trials. A standardised framework for reporting data quality would aid comparisons across clinical trials.
first_indexed 2025-11-14T11:14:50Z
format Journal Article
id curtin-20.500.11937-80064
institution Curtin University Malaysia
institution_category Local University
language English
last_indexed 2025-11-14T11:14:50Z
publishDate 2019
publisher BMC
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-800642021-01-05T08:07:08Z Overcoming challenges to data quality in the ASPREE clinical trial Lockery, J.E. Collyer, T.A. Reid, Christopher Ernst, M.E. Gilbertson, D. Hay, N. Kirpach, B. McNeil, J.J. Nelson, M.R. Orchard, S.G. Pruksawongsin, K. Shah, R.C. Wolfe, R. Woods, R.L. Science & Technology Life Sciences & Biomedicine Medicine, Research & Experimental Research & Experimental Medicine Health data Clinical trial Data quality Health technology LOW-DOSE ASPIRIN PRIMARY PREVENTION REDUCING EVENTS RANDOMIZED-TRIAL ELDERLY ASPREE USABILITY SAFETY CARE © 2019 The Author(s). Background: Large-scale studies risk generating inaccurate and missing data due to the complexity of data collection. Technology has the potential to improve data quality by providing operational support to data collectors. However, this potential is under-explored in community-based trials. The Aspirin in reducing events in the elderly (ASPREE) trial developed a data suite that was specifically designed to support data collectors: the ASPREE Web Accessible Relational Database (AWARD). This paper describes AWARD and the impact of system design on data quality. Methods: AWARD's operational requirements, conceptual design, key challenges and design solutions for data quality are presented. Impact of design features is assessed through comparison of baseline data collected prior to implementation of key functionality (n = 1000) with data collected post implementation (n = 18,114). Overall data quality is assessed according to data category. Results: At baseline, implementation of user-driven functionality reduced staff error (from 0.3% to 0.01%), out-of-range data entry (from 0.14% to 0.04%) and protocol deviations (from 0.4% to 0.08%). In the longitudinal data set, which contained more than 39 million data values collected within AWARD, 96.6% of data values were entered within specified query range or found to be accurate upon querying. The remaining data were missing (3.4%). Participant non-attendance at scheduled study activity was the most common cause of missing data. Costs associated with cleaning data in ASPREE were lower than expected compared with reports from other trials. Conclusions: Clinical trials undertake complex operational activity in order to collect data, but technology rarely provides sufficient support. We find the AWARD suite provides proof of principle that designing technology to support data collectors can mitigate known causes of poor data quality and produce higher-quality data. Health information technology (IT) products that support the conduct of scheduled activity in addition to traditional data entry will enhance community-based clinical trials. A standardised framework for reporting data quality would aid comparisons across clinical trials. 2019 Journal Article http://hdl.handle.net/20.500.11937/80064 10.1186/s13063-019-3789-2 English http://creativecommons.org/licenses/by/4.0/ BMC fulltext
spellingShingle Science & Technology
Life Sciences & Biomedicine
Medicine, Research & Experimental
Research & Experimental Medicine
Health data
Clinical trial
Data quality
Health technology
LOW-DOSE ASPIRIN
PRIMARY PREVENTION
REDUCING EVENTS
RANDOMIZED-TRIAL
ELDERLY ASPREE
USABILITY
SAFETY
CARE
Lockery, J.E.
Collyer, T.A.
Reid, Christopher
Ernst, M.E.
Gilbertson, D.
Hay, N.
Kirpach, B.
McNeil, J.J.
Nelson, M.R.
Orchard, S.G.
Pruksawongsin, K.
Shah, R.C.
Wolfe, R.
Woods, R.L.
Overcoming challenges to data quality in the ASPREE clinical trial
title Overcoming challenges to data quality in the ASPREE clinical trial
title_full Overcoming challenges to data quality in the ASPREE clinical trial
title_fullStr Overcoming challenges to data quality in the ASPREE clinical trial
title_full_unstemmed Overcoming challenges to data quality in the ASPREE clinical trial
title_short Overcoming challenges to data quality in the ASPREE clinical trial
title_sort overcoming challenges to data quality in the aspree clinical trial
topic Science & Technology
Life Sciences & Biomedicine
Medicine, Research & Experimental
Research & Experimental Medicine
Health data
Clinical trial
Data quality
Health technology
LOW-DOSE ASPIRIN
PRIMARY PREVENTION
REDUCING EVENTS
RANDOMIZED-TRIAL
ELDERLY ASPREE
USABILITY
SAFETY
CARE
url http://hdl.handle.net/20.500.11937/80064