Curation by Design v11SM

The document discusses curating genomic data by design from creation to interpretation. It proposes sharing curation responsibility over the whole community through creating an independent DATA cooperative to handle exponential growth and complexity of curating individual citizen data on their behalf in a trusted and transparent manner between institutes.

Uploaded by

Peter Walgemoed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views1 page

Curation by Design v11SM

Uploaded by

Peter Walgemoed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Genomic Data Curation by Design

1 2
Peter Walgemoed, Bert Eussen
1 2
Carelliance Group BV Eindhoven NL, Clinical Genetics ErasmusMC Rotterdam NL
Sharing genomic data globally for all stakeholders from creation to interpretation is
a major challenge. The figure on the right shows this challenge and different perspective
for each stakeholder. Solutions are being developed at the institutional level.
Catalogues like Decipher and ClinVar are the repository end products of international
knowledge communities. Besides these reference datasets, clinical (HPO), imaging,
behaviour/lifestyle and diagnostic(SNOMED, LOINC) datasets are also used.
To support curation, we have developed a concept where data is tagged from the
moment of creation. and can be shared globally. Curation starts with raw data in a lab
or with the clinical work-up. The processed data is linked to a person as a research and/or
clinical subject at the institutional level. For each step of the process, standardization of
the format is required. The local institute is responsible for the master copy, as well as for
access to the privacy and sample IDs. The question is whether each institute is able to
take responsibility for curation? We propose sharing the responsibility over the whole
community, and creating a new, independent DATA co-operative.

End user client/citizen value Applications IT-infrastructure DATA

Data value creation

DATA: Co-operation between Institutes

Data Service Curation
Managed Cloud Service
Institutional service
Citizens Clients
Array service
HPO LAB service
NGS service EU Diagnostic
LAB service LAB service
LAB service
Privacy Keys
CN Analysis
Software P2G Software Software Software
Variant Analysis

FAIR (master) IT-infra DATA IT-infra DATA

IT-infra DATA Center Any Trusted
Data Asset Center Center

Independent Master Data Copies

Trusted Document System: TrustDocA

The lab is a data collection point but it is driven by its clients (researchers and clinicians). These clients have the responsibility to manage the privacy for their clients (citizens).
Therefore data curation is on behalf of the citizen. All procedures and lab services are documented in a trusted, authoring document system (TrustDocA).

Institute Services

Governance Sample
model Fraction Technical
Assay
Analysis
analysis I Interpretation
Medical analysis
Informed
Request
consent
Clinical
exame HPO Phenotype
Privacy 2
data Genotype

Data curation starts with a medical request for a citizen, and includes informed consent. A clear and uniform governance model should form the basis of this registration.
All data derived from the original request should be linked within this model. Diagnostic, research or shared processes are represented by the red and blue lines.
The governance metadata should be present in all the production files (assets) and should included in the mastercopies of the generated assets.

It will be challenging for citizens to curate their own data. It is likely to grow exponentially and it will become very complex to handle phenotypic,
laboratory, treatment, municipal and personal health and lifestyle data. Therefore a trusted and transparent co-operation between institutes is
required to curate the data on the citizen’s behalf. DATA co-operative not only includes storage and preservation but also creates value by using
the data as much as possible. Data repositories like Decipher, ClinVar and local aplications are service providers for
combined datasets and act on behalf of clinical and research clients in the DATA co-operative.
Transparent data collection systems are essential for consortia wanting to share data on behalf of their clients/
citizens as part of a FAIR data policy. Governance should be by design and citizen informed consent implies
that a data copy is curated by the DATA co-operative and should be available for future generations.

Carelliance Group BV Eindhoven & Clinical Genetics ErasmusMC Email: [email protected] [email protected] 2016

Untitled
No ratings yet
Untitled
714 pages
Genomic Data Sharing Case Studies, Challenges, and Opportunities For Precision Medicine, 1st Edition Academic PDF Download
100% (8)
Genomic Data Sharing Case Studies, Challenges, and Opportunities For Precision Medicine, 1st Edition Academic PDF Download
15 pages
Digital Medicine Bringing Digital Solutions To Medical Practice 1st Edition Accessible PDF Download
100% (15)
Digital Medicine Bringing Digital Solutions To Medical Practice 1st Edition Accessible PDF Download
16 pages
ClassXI DS Student Handbook
No ratings yet
ClassXI DS Student Handbook
109 pages
2016 Book SecondaryAnalysisOfElectronicH PDF
No ratings yet
2016 Book SecondaryAnalysisOfElectronicH PDF
435 pages
Tech Trials
No ratings yet
Tech Trials
59 pages
IoT Project Template
No ratings yet
IoT Project Template
54 pages
Big Data y Medicina Personalizada Master
No ratings yet
Big Data y Medicina Personalizada Master
45 pages
Technical Manual: John Deere
No ratings yet
Technical Manual: John Deere
478 pages
Emrys Consortium National Data Library White Paper Challenge Response For Wellcome & ESRC
No ratings yet
Emrys Consortium National Data Library White Paper Challenge Response For Wellcome & ESRC
21 pages
Major Project Report (Edited)
No ratings yet
Major Project Report (Edited)
39 pages
4 KohMingshi TRUST
No ratings yet
4 KohMingshi TRUST
28 pages
World Economic Forum
No ratings yet
World Economic Forum
26 pages
snakemake原始文献
No ratings yet
snakemake原始文献
28 pages
Data Governance Book
No ratings yet
Data Governance Book
11 pages
Biomedical Data Management
No ratings yet
Biomedical Data Management
22 pages
Interoperability Standards Oct16
No ratings yet
Interoperability Standards Oct16
24 pages
Big Data and Genomics
No ratings yet
Big Data and Genomics
17 pages
Workbook - A Guide To The Vascular System
No ratings yet
Workbook - A Guide To The Vascular System
202 pages
A Data Management Infrastructure
No ratings yet
A Data Management Infrastructure
20 pages
Published Paper Idris
No ratings yet
Published Paper Idris
17 pages
2016-12 Hortonworks Road Show - From Acquisition To Insights
No ratings yet
2016-12 Hortonworks Road Show - From Acquisition To Insights
24 pages
NIH Strategic Plan For Data Science
No ratings yet
NIH Strategic Plan For Data Science
26 pages
Midpresentation Report-2024
No ratings yet
Midpresentation Report-2024
21 pages
IBM Watson - How Cognitive Computing Can Be Applied
No ratings yet
IBM Watson - How Cognitive Computing Can Be Applied
14 pages
Clinical Genomic Data
No ratings yet
Clinical Genomic Data
14 pages
Data Handling
No ratings yet
Data Handling
15 pages
BIOINFORMATICS ASSIGNMENT - Final - DR - 01
No ratings yet
BIOINFORMATICS ASSIGNMENT - Final - DR - 01
17 pages
Scribd 4
No ratings yet
Scribd 4
14 pages
Knowledge Graphs and Their Applications in Drug
No ratings yet
Knowledge Graphs and Their Applications in Drug
14 pages
M CC EDIT - Removed
No ratings yet
M CC EDIT - Removed
20 pages
Preparing Next-Generation Scientists For Biomedical Big Data: Artificial Intelligence Approaches
No ratings yet
Preparing Next-Generation Scientists For Biomedical Big Data: Artificial Intelligence Approaches
11 pages
Sciencedirect Big Data Analytics For Personalized Medicine: Davide Cirillo and Alfonso Valencia
No ratings yet
Sciencedirect Big Data Analytics For Personalized Medicine: Davide Cirillo and Alfonso Valencia
10 pages
Recommendations To Enhance Rigor and Reproducibility in Biomedical Research
No ratings yet
Recommendations To Enhance Rigor and Reproducibility in Biomedical Research
19 pages
Science and Technology: 7.1. Indian Biological Data Center
No ratings yet
Science and Technology: 7.1. Indian Biological Data Center
11 pages
Big Data in Digital Healthcare Lessons Learnt and Recommendations
No ratings yet
Big Data in Digital Healthcare Lessons Learnt and Recommendations
10 pages
W Defa3443
No ratings yet
W Defa3443
12 pages
Bioinformatics DA 3.1
No ratings yet
Bioinformatics DA 3.1
11 pages
Health 1
No ratings yet
Health 1
11 pages
The Elixir of Life Sciences Success
No ratings yet
The Elixir of Life Sciences Success
10 pages
87 655 2 PB PDF
No ratings yet
87 655 2 PB PDF
8 pages
Ijerph 15 02796 PDF
No ratings yet
Ijerph 15 02796 PDF
9 pages
AS04
No ratings yet
AS04
10 pages
A Data Warehouse Architecture For Clinical Data Warehousing: Tony R. Sahama and Peter R. Croll
No ratings yet
A Data Warehouse Architecture For Clinical Data Warehousing: Tony R. Sahama and Peter R. Croll
6 pages
Creating Vibrant Ecosystem
No ratings yet
Creating Vibrant Ecosystem
7 pages
Poh 2014
No ratings yet
Poh 2014
6 pages
1,000,000 Dollar Genome
No ratings yet
1,000,000 Dollar Genome
5 pages
PGH 25 Años Después
No ratings yet
PGH 25 Años Después
6 pages
Shabani Et Al. (2017)
No ratings yet
Shabani Et Al. (2017)
6 pages
MT - Data Governance vs. Data Integrity
No ratings yet
MT - Data Governance vs. Data Integrity
7 pages
Genome Database Groupwork
No ratings yet
Genome Database Groupwork
5 pages
Big Data Biology - in Medicine
No ratings yet
Big Data Biology - in Medicine
4 pages
Merging Heterogeneous Clinical Data To Enable Knowledge Discovery
No ratings yet
Merging Heterogeneous Clinical Data To Enable Knowledge Discovery
5 pages
Integrative Analysis of Genomic Data Types and AI Methodologies in Healthcare Applications
No ratings yet
Integrative Analysis of Genomic Data Types and AI Methodologies in Healthcare Applications
5 pages
Europe Targeting
No ratings yet
Europe Targeting
8 pages
The National Institutes of Health 'S Big Data To Knowledge (BD2K) Initiative: Capitalizing On Biomedical Big Data
No ratings yet
The National Institutes of Health 'S Big Data To Knowledge (BD2K) Initiative: Capitalizing On Biomedical Big Data
3 pages
8th Workshop On Biomedical and Bioinformatics Challenge - 2015 - Procedia Comput
No ratings yet
8th Workshop On Biomedical and Bioinformatics Challenge - 2015 - Procedia Comput
3 pages
Developing A Safety Culture
100% (3)
Developing A Safety Culture
30 pages
The Regulations For Classification of Saudi Building Code Violations
No ratings yet
The Regulations For Classification of Saudi Building Code Violations
27 pages
Acute Ischemic Stroke Update
No ratings yet
Acute Ischemic Stroke Update
38 pages
Exam Prep - Nov-2016
No ratings yet
Exam Prep - Nov-2016
60 pages
ABG Examples ABG Exam Questions For Medical Students and PACES
No ratings yet
ABG Examples ABG Exam Questions For Medical Students and PACES
10 pages
Laboratory and Clinical Genomic Data Sharing Is Crucial To Impr - 2017 - Genetic
No ratings yet
Laboratory and Clinical Genomic Data Sharing Is Crucial To Impr - 2017 - Genetic
2 pages
FANTILLO GLENNCHARLIANE ClinDataRep
No ratings yet
FANTILLO GLENNCHARLIANE ClinDataRep
4 pages
Ventilator Graphics - Basics
No ratings yet
Ventilator Graphics - Basics
51 pages
Discussion 4
No ratings yet
Discussion 4
2 pages
2022 00532 Michael Demetriou Et Al V Michael Demetriou Et Al RESPONDENT S BRIEF 18
No ratings yet
2022 00532 Michael Demetriou Et Al V Michael Demetriou Et Al RESPONDENT S BRIEF 18
23 pages
Pe Worksheet Class 11
No ratings yet
Pe Worksheet Class 11
22 pages
BM Lejuez BATD Manual
No ratings yet
BM Lejuez BATD Manual
32 pages
Internship Report ON A Case Study On Human Resource Practices of Fair Food & Lifestyle LTD
No ratings yet
Internship Report ON A Case Study On Human Resource Practices of Fair Food & Lifestyle LTD
31 pages
Paramedical Diploma in Radiotherapy Technician
No ratings yet
Paramedical Diploma in Radiotherapy Technician
26 pages
Food Additives
No ratings yet
Food Additives
6 pages
The Bowdoin Orient - Vol. 149, No. 15 - February 7, 2020
No ratings yet
The Bowdoin Orient - Vol. 149, No. 15 - February 7, 2020
16 pages
PROZYME Probiotics
100% (1)
PROZYME Probiotics
19 pages
Life Sciences P2 May-June 2019 Memo Eng
No ratings yet
Life Sciences P2 May-June 2019 Memo Eng
13 pages
Sec 1 Chapter 3 Safe Notes 2024
No ratings yet
Sec 1 Chapter 3 Safe Notes 2024
9 pages
Reduced Thoracolumbar Fascia Shear Strain in Human Chronic Low Back Pain
No ratings yet
Reduced Thoracolumbar Fascia Shear Strain in Human Chronic Low Back Pain
12 pages
PBL Parkinsons Disease
No ratings yet
PBL Parkinsons Disease
23 pages
Suspensions: Sahara College Narowal
No ratings yet
Suspensions: Sahara College Narowal
9 pages
Bristow Laterjet Protocol PDF
No ratings yet
Bristow Laterjet Protocol PDF
10 pages
Anomalies of Nervous System
No ratings yet
Anomalies of Nervous System
19 pages
Embalmer
No ratings yet
Embalmer
14 pages
"Recycle, Reuse and Rumble" Health Communication Program HLTH 634 Chelsea Armah
No ratings yet
"Recycle, Reuse and Rumble" Health Communication Program HLTH 634 Chelsea Armah
5 pages
Global Health Benefits: Policy Holder: Policy #: Effective Date: Insured: Member #
No ratings yet
Global Health Benefits: Policy Holder: Policy #: Effective Date: Insured: Member #
1 page
Assessment Explanatio Nofthe Problem Objectives Nursing Intervention Rationale Evaluation
No ratings yet
Assessment Explanatio Nofthe Problem Objectives Nursing Intervention Rationale Evaluation
3 pages
11 MMDA V Bel-Air
No ratings yet
11 MMDA V Bel-Air
4 pages
Fast Resolution - Modification en
No ratings yet
Fast Resolution - Modification en
4 pages
Signed - GR001657 Feedback Letter March 2024
No ratings yet
Signed - GR001657 Feedback Letter March 2024
2 pages
Artificial Intelligence and Natural Algorithms
From Everand
Artificial Intelligence and Natural Algorithms
Rijwan Khan
No ratings yet
PYTHON DATA ANALYTICS: Mastering Python for Effective Data Analysis and Visualization (2024 Beginner Guide)
From Everand
PYTHON DATA ANALYTICS: Mastering Python for Effective Data Analysis and Visualization (2024 Beginner Guide)
FLOYD BAX
No ratings yet

Curation by Design v11SM

Uploaded by

Curation by Design v11SM

Uploaded by

Genomic Data Curation by Design

End user client/citizen value Applications IT-infrastructure DATA

DATA: Co-operation between Institutes

FAIR (master) IT-infra DATA IT-infra DATA

Independent Master Data Copies

You might also like