Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies. / Swertz, Morris; van Enckevort, Esther; Oliveira, José Luis; Fortier, Isabel; Bergeron, Julie; Thurin, Nicolas H.; Hyde, Eleanor; Kellmann, Alexander; Pahoueshnja, Romin; Sturkenboom, Miriam; Cunnington, Marianne; Nybo Andersen, Anne Marie; Marcon, Yannick; Gonçalves, Gonçalo; Gini, Rosa.

In: Yearbook of medical informatics, Vol. 31, No. 1, 2022, p. 262-272.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Swertz, M, van Enckevort, E, Oliveira, JL, Fortier, I, Bergeron, J, Thurin, NH, Hyde, E, Kellmann, A, Pahoueshnja, R, Sturkenboom, M, Cunnington, M, Nybo Andersen, AM, Marcon, Y, Gonçalves, G & Gini, R 2022, 'Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies', Yearbook of medical informatics, vol. 31, no. 1, pp. 262-272. https://doi.org/10.1055/s-0042-1742522

APA

Swertz, M., van Enckevort, E., Oliveira, J. L., Fortier, I., Bergeron, J., Thurin, N. H., Hyde, E., Kellmann, A., Pahoueshnja, R., Sturkenboom, M., Cunnington, M., Nybo Andersen, A. M., Marcon, Y., Gonçalves, G., & Gini, R. (2022). Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies. Yearbook of medical informatics, 31(1), 262-272. https://doi.org/10.1055/s-0042-1742522

Vancouver

Swertz M, van Enckevort E, Oliveira JL, Fortier I, Bergeron J, Thurin NH et al. Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies. Yearbook of medical informatics. 2022;31(1):262-272. https://doi.org/10.1055/s-0042-1742522

Author

Swertz, Morris ; van Enckevort, Esther ; Oliveira, José Luis ; Fortier, Isabel ; Bergeron, Julie ; Thurin, Nicolas H. ; Hyde, Eleanor ; Kellmann, Alexander ; Pahoueshnja, Romin ; Sturkenboom, Miriam ; Cunnington, Marianne ; Nybo Andersen, Anne Marie ; Marcon, Yannick ; Gonçalves, Gonçalo ; Gini, Rosa. / Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies. In: Yearbook of medical informatics. 2022 ; Vol. 31, No. 1. pp. 262-272.

Bibtex

@article{dec30261f4494c1abf23293cee10bb60,
title = "Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies",
abstract = "OBJECTIVES: Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e. make catalogue contents more Findable, Accessible, Interoperable and Reusable. However, their valuable diversity also created heterogeneity, which poses challenges to optimally exploit their richness. METHODS: In this opinion review, we analyse catalogues for human subject research ranging from cohort studies to surveillance, administrative and healthcare records. RESULTS: We observe that while these catalogues are heterogeneous, have various scopes, and use different terminologies, still the underlying concepts seem potentially harmonizable. We propose a unified framework to enable catalogue data sharing, with catalogues of multi-center cohorts nested as a special case in catalogues of real-world data sources. Moreover, we list recommendations to create an integrated community of metadata catalogues and an open catalogue ecosystem to sustain these efforts and maximise impact. CONCLUSIONS: We propose to embrace the autonomy of motivated catalogue teams and invest in their collaboration via minimal standardisation efforts such as clear data licensing, persistent identifiers for linking same records between catalogues, minimal metadata 'common data elements' using shared ontologies, symmetric architectures for data sharing (push/pull) with clear provenance tracks to process updates and acknowledge original contributors. And most importantly, we encourage the creation of environments for collaboration and resource sharing between catalogue developers, building on international networks such as OpenAIRE and research data alliance, as well as domain specific ESFRIs such as BBMRI and ELIXIR.",
author = "Morris Swertz and {van Enckevort}, Esther and Oliveira, {Jos{\'e} Luis} and Isabel Fortier and Julie Bergeron and Thurin, {Nicolas H.} and Eleanor Hyde and Alexander Kellmann and Romin Pahoueshnja and Miriam Sturkenboom and Marianne Cunnington and {Nybo Andersen}, {Anne Marie} and Yannick Marcon and Gon{\c c}alo Gon{\c c}alves and Rosa Gini",
note = "Publisher Copyright: IMIA and Thieme. This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.",
year = "2022",
doi = "10.1055/s-0042-1742522",
language = "English",
volume = "31",
pages = "262--272",
journal = "Yearbook of medical informatics",
issn = "0943-4747",
publisher = "International Medical Informatics Association",
number = "1",

}

RIS

TY - JOUR

T1 - Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies

AU - Swertz, Morris

AU - van Enckevort, Esther

AU - Oliveira, José Luis

AU - Fortier, Isabel

AU - Bergeron, Julie

AU - Thurin, Nicolas H.

AU - Hyde, Eleanor

AU - Kellmann, Alexander

AU - Pahoueshnja, Romin

AU - Sturkenboom, Miriam

AU - Cunnington, Marianne

AU - Nybo Andersen, Anne Marie

AU - Marcon, Yannick

AU - Gonçalves, Gonçalo

AU - Gini, Rosa

N1 - Publisher Copyright: IMIA and Thieme. This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.

PY - 2022

Y1 - 2022

N2 - OBJECTIVES: Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e. make catalogue contents more Findable, Accessible, Interoperable and Reusable. However, their valuable diversity also created heterogeneity, which poses challenges to optimally exploit their richness. METHODS: In this opinion review, we analyse catalogues for human subject research ranging from cohort studies to surveillance, administrative and healthcare records. RESULTS: We observe that while these catalogues are heterogeneous, have various scopes, and use different terminologies, still the underlying concepts seem potentially harmonizable. We propose a unified framework to enable catalogue data sharing, with catalogues of multi-center cohorts nested as a special case in catalogues of real-world data sources. Moreover, we list recommendations to create an integrated community of metadata catalogues and an open catalogue ecosystem to sustain these efforts and maximise impact. CONCLUSIONS: We propose to embrace the autonomy of motivated catalogue teams and invest in their collaboration via minimal standardisation efforts such as clear data licensing, persistent identifiers for linking same records between catalogues, minimal metadata 'common data elements' using shared ontologies, symmetric architectures for data sharing (push/pull) with clear provenance tracks to process updates and acknowledge original contributors. And most importantly, we encourage the creation of environments for collaboration and resource sharing between catalogue developers, building on international networks such as OpenAIRE and research data alliance, as well as domain specific ESFRIs such as BBMRI and ELIXIR.

AB - OBJECTIVES: Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e. make catalogue contents more Findable, Accessible, Interoperable and Reusable. However, their valuable diversity also created heterogeneity, which poses challenges to optimally exploit their richness. METHODS: In this opinion review, we analyse catalogues for human subject research ranging from cohort studies to surveillance, administrative and healthcare records. RESULTS: We observe that while these catalogues are heterogeneous, have various scopes, and use different terminologies, still the underlying concepts seem potentially harmonizable. We propose a unified framework to enable catalogue data sharing, with catalogues of multi-center cohorts nested as a special case in catalogues of real-world data sources. Moreover, we list recommendations to create an integrated community of metadata catalogues and an open catalogue ecosystem to sustain these efforts and maximise impact. CONCLUSIONS: We propose to embrace the autonomy of motivated catalogue teams and invest in their collaboration via minimal standardisation efforts such as clear data licensing, persistent identifiers for linking same records between catalogues, minimal metadata 'common data elements' using shared ontologies, symmetric architectures for data sharing (push/pull) with clear provenance tracks to process updates and acknowledge original contributors. And most importantly, we encourage the creation of environments for collaboration and resource sharing between catalogue developers, building on international networks such as OpenAIRE and research data alliance, as well as domain specific ESFRIs such as BBMRI and ELIXIR.

U2 - 10.1055/s-0042-1742522

DO - 10.1055/s-0042-1742522

M3 - Journal article

C2 - 36463884

AN - SCOPUS:85143322953

VL - 31

SP - 262

EP - 272

JO - Yearbook of medical informatics

JF - Yearbook of medical informatics

SN - 0943-4747

IS - 1

ER -

ID: 338822951