Neuroimaging, Genetics, and Clinical Data Sharing in Python Using The Cubicweb Framework

The document discusses the development of a Python-based solution using the CubicWeb framework to address technological challenges in large multi-center population imaging studies in neuroscience and psychiatry. It presents three web services for data upload, collaborative quality assessment, and publication, which facilitate efficient data sharing and management across various institutions. The framework supports complex data types, including neuroimaging and genetics, while ensuring adaptability and interoperability for evolving research needs.

Uploaded by

leizhou

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views11 pages

Neuroimaging, Genetics, and Clinical Data Sharing in Python Using The Cubicweb Framework

Uploaded by

leizhou

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Technology Report

published: 16 March 2017

doi: 10.3389/fninf.2017.00018

Antoine Grigis1*, David Goyard1, Robin Cherbonnier1, Thomas Gareau1,

Dimitri Papadopoulos Orfanos1, Nicolas Chauvat2, Adrien Di Mascio2,
Gunter Schumann3, Will Spooren4, Declan Murphy5 and Vincent Frouin1
1
UNATI, Neurospin, CEA, Université Paris-Saclay, Gif-sur-Yvette, France, 2 Logilab, Paris, France, 3 Medical Research
Council, Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience,
King’s College London, London, UK, 4 F. Hoffmann-La Roche Pharmaceuticals, Basel, Switzerland, 5 King’s College London,
London, UK

In neurosciences or psychiatry, the emergence of large multi-center population imaging

studies raises numerous technological challenges. From distributed data collection, across
different institutions and countries, to final data publication service, one must handle the
massive, heterogeneous, and complex data from genetics, imaging, demographics, or
clinical scores. These data must be both efficiently obtained and downloadable. We
present a Python solution, based on the CubicWeb open-source semantic framework,
aimed at building population imaging study repositories. In addition, we focus on the
tools developed around this framework to overcome the challenges associated with data
Edited by: sharing and collaborative requirements. We describe a set of three highly adaptive web
Daniel Marcus,
Washington University in St. Louis, services that transform the CubicWeb framework into a (1) multi-center upload platform,
USA (2) collaborative quality assessment platform, and (3) publication platform endowed with
Reviewed by: massive-download capabilities. Two major European projects, IMAGEN and EU-AIMS,
B. Nolan Nichols,
are currently supported by the described framework. We also present a Python package
SRI International, USA
David J. Just, that enables end users to remotely query neuroimaging, genetics, and clinical data from
Mayo Clinic, USA scripts.
*Correspondence:
Antoine Grigis Keywords: web service, data sharing, database, neuroimaging, genetics, medical informatics, Python
[email protected]

Received: 14 November 2016

Accepted: 22 February 2017 1. INTRODUCTION
Published: 16 March 2017

Citation:
Health research strategies using neuroimaging have shifted in recent years: the focus has moved
Grigis A, Goyard D, Cherbonnier R, from patient care only, to a combination of patient care and prevention. In the case of neuro-
Gareau T, Papadopoulos Orfanos D, degenerative and psychiatric diseases, this drives the creation of increasingly numerous massive
Chauvat N, Di Mascio A, imaging studies also known as Population Imaging (PI) surveys (Hurko et al., 2012; Poldrack and
Schumann G, Spooren W, Murphy D Gorgolewski, 2014). It should be noticed that PI studies no longer consist of image data only. The
and Frouin V (2017) Neuroimaging,
recent wide availability of high-throughput genomics has augmented the subject data with genetics,
Genetics, and Clinical Data Sharing in
Python Using the CubicWeb
epigenetics, and functional genomics. Likewise, the standardization of personality, demographics,
Framework. and deficit tests in psychiatry facilitates the acquisition of clinical/behavioral records to enrich the
Front. Neuroinform. 11:18. subject data in large population studies. Moreover, PI studies now classically encompass more than
doi: 10.3389/fninf.2017.00018 one single imaging session per subject and cover multiple-time point heterogeneous experiments.