What you need to know before implementing a clinical research data warehouse: Comparative review of integrated data repositories in health care institutions

Background: Integrated data repositories (IDRs), also referred to as clinical data warehouses, are platforms used for the integration of several data sources through specialized analytical tools that facilitate data processing and analysis. IDRs offer several opportunities for clinical data reuse, a...

Full description

Bibliographic Details
Main Authors: Gagalova, Kristina, Angelica Leon Elizalde, M., Portales-Casamar, E., Görges, M.
Format: Journal Article
Language:English
Published: 2020
Subjects:
Online Access:http://hdl.handle.net/20.500.11937/96871
_version_ 1848766202829078528
author Gagalova, Kristina
Angelica Leon Elizalde, M.
Portales-Casamar, E.
Görges, M.
author_facet Gagalova, Kristina
Angelica Leon Elizalde, M.
Portales-Casamar, E.
Görges, M.
author_sort Gagalova, Kristina
building Curtin Institutional Repository
collection Online Access
description Background: Integrated data repositories (IDRs), also referred to as clinical data warehouses, are platforms used for the integration of several data sources through specialized analytical tools that facilitate data processing and analysis. IDRs offer several opportunities for clinical data reuse, and the number of institutions implementing an IDR has grown steadily in the past decade. Objective: The architectural choices of major IDRs are highly diverse and determining their differences can be overwhelming. This review aims to explore the underlying models and common features of IDRs, provide a high-level overview for those entering the field, and propose a set of guiding principles for small- to medium-sized health institutions embarking on IDR implementation. Methods: We reviewed manuscripts published in peer-reviewed scientific literature between 2008 and 2020, and selected those that specifically describe IDR architectures. Of 255 shortlisted articles, we found 34 articles describing 29 different architectures. The different IDRs were analyzed for common features and classified according to their data processing and integration solution choices. Results: Despite common trends in the selection of standard terminologies and data models, the IDRs examined showed heterogeneity in the underlying architecture design. We identified 4 common architecture models that use different approaches for data processing and integration. These different approaches were driven by a variety of features such as data sources, whether the IDR was for a single institution or a collaborative project, the intended primary data user, and purpose (research-only or including clinical or operational decision making). Conclusions: IDR implementations are diverse and complex undertakings, which benefit from being preceded by an evaluation of requirements and definition of scope in the early planning stage. Factors such as data source diversity and intended users of the IDR influence data flow and synchronization, both of which are crucial factors in IDR architecture planning.
first_indexed 2025-11-14T11:47:24Z
format Journal Article
id curtin-20.500.11937-96871
institution Curtin University Malaysia
institution_category Local University
language eng
last_indexed 2025-11-14T11:47:24Z
publishDate 2020
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-968712025-02-13T00:34:57Z What you need to know before implementing a clinical research data warehouse: Comparative review of integrated data repositories in health care institutions Gagalova, Kristina Angelica Leon Elizalde, M. Portales-Casamar, E. Görges, M. data aggregation data analytics data warehousing database health informatics information storage and retrieval Background: Integrated data repositories (IDRs), also referred to as clinical data warehouses, are platforms used for the integration of several data sources through specialized analytical tools that facilitate data processing and analysis. IDRs offer several opportunities for clinical data reuse, and the number of institutions implementing an IDR has grown steadily in the past decade. Objective: The architectural choices of major IDRs are highly diverse and determining their differences can be overwhelming. This review aims to explore the underlying models and common features of IDRs, provide a high-level overview for those entering the field, and propose a set of guiding principles for small- to medium-sized health institutions embarking on IDR implementation. Methods: We reviewed manuscripts published in peer-reviewed scientific literature between 2008 and 2020, and selected those that specifically describe IDR architectures. Of 255 shortlisted articles, we found 34 articles describing 29 different architectures. The different IDRs were analyzed for common features and classified according to their data processing and integration solution choices. Results: Despite common trends in the selection of standard terminologies and data models, the IDRs examined showed heterogeneity in the underlying architecture design. We identified 4 common architecture models that use different approaches for data processing and integration. These different approaches were driven by a variety of features such as data sources, whether the IDR was for a single institution or a collaborative project, the intended primary data user, and purpose (research-only or including clinical or operational decision making). Conclusions: IDR implementations are diverse and complex undertakings, which benefit from being preceded by an evaluation of requirements and definition of scope in the early planning stage. Factors such as data source diversity and intended users of the IDR influence data flow and synchronization, both of which are crucial factors in IDR architecture planning. 2020 Journal Article http://hdl.handle.net/20.500.11937/96871 10.2196/17687 eng http://creativecommons.org/licenses/by/4.0/ fulltext
spellingShingle data aggregation
data analytics
data warehousing
database
health informatics
information storage and retrieval
Gagalova, Kristina
Angelica Leon Elizalde, M.
Portales-Casamar, E.
Görges, M.
What you need to know before implementing a clinical research data warehouse: Comparative review of integrated data repositories in health care institutions
title What you need to know before implementing a clinical research data warehouse: Comparative review of integrated data repositories in health care institutions
title_full What you need to know before implementing a clinical research data warehouse: Comparative review of integrated data repositories in health care institutions
title_fullStr What you need to know before implementing a clinical research data warehouse: Comparative review of integrated data repositories in health care institutions
title_full_unstemmed What you need to know before implementing a clinical research data warehouse: Comparative review of integrated data repositories in health care institutions
title_short What you need to know before implementing a clinical research data warehouse: Comparative review of integrated data repositories in health care institutions
title_sort what you need to know before implementing a clinical research data warehouse: comparative review of integrated data repositories in health care institutions
topic data aggregation
data analytics
data warehousing
database
health informatics
information storage and retrieval
url http://hdl.handle.net/20.500.11937/96871