Data Science, Intelligence & Society
March 2018
DATAIA Institute
Data Science, Artificial Intelligence & Society
Nozha Boujemaa
Director at DATAIA Institute
Research Director at Inria
nozha.boujemaa@inria.fr
Aim of Convergence Institutes
•  Structure a small number of large-scale, high-visibility centres that gather multidisciplinary
scientific task forces to tackle major challenges at the crossroads of societal and economic
issues and questions from the scientific community.
•  Advanced research-training integration.
•  Effective coupling with the socio-economic world through industry partnership.
•  DATAIA is the Convergence Institute in Data Science, AI & Society, gathering 130
affiliated researchers and targeting 300 within 3 years. Kick-off => 15 February 2018
DATAIA Institute
•  4 Overarching Challenges:
o  From Machine Learning to Artificial Intelligence,
o  From Data to Knowledge, from Data to Decision,
o  Transparency, Responsible AI & Ethics,
o  Data Protection, Regulation and Economy
•  Scientific and disciplinary foundations: Math, Computer Sciences, Management and Economy,
Social Sciences, Legal Sciences
•  Application domains: Internet of people and things, Urbanization 4.0 & Mobility, Optimal Energy
Management, Business Analytics, Health, Well being & personal nutrition, e-Sciences.
•  Roadmap for 8 years, 10M€ -180 M€ Global Budget, with 14 academic founding institutions
•  Kick-off => February 15th 2018
Founding members
•  The DATAIA Institute is hosted by Université Paris-Saclay and led by the
Inria Saclay – Île-de-France research centre.
•  The consortium brings together universities, national research institutes
and Grandes Écoles.
Industrial Affiliation Program
•  Contributions: research support, data and use cases
•  Participation in the definition, selection and monitoring of programs
•  Participation in defining the long-term strategic vision
•  Workshops, S&T work exchange sessions, brainstorming sessions (open problems), etc
•  IP will follow the rules defined in a consortium agreement
•  First look at IP.
Based on what is done in American universities (Stanford model)
International partners
•  Alan Turing Institute (UK)
•  IVADO (Canada)
•  Advanced Core Technologies for Big Data Integration (Japan)
•  DSI (Data Science Institute – Columbia University)
Data & Algorithms
« 2 sides of the same coin »
•  The rising benefits of Big Data and AI technologies have a wide impact on our economy and
social organization;
•  Transparency and trust in such algorithmic systems (data & algorithms) are becoming
competitiveness factors for the data-driven economy;
•  Data analytics is shifting from describing the past to predictive and prescriptive analytics for
decision support;
•  It is important to remedy the information asymmetry between the producer of a digital
service and its consumer, whether citizen or professional (B2C or B2B) => civil rights, competition,
sovereignty.
Algorithmic systems in everyday life
•  Some dominant platforms on the market act as "prescribers"
by directing a large share of user traffic:
•  Ranking mechanisms (search engine),
•  Recommendation mechanisms and content selection
•  Product or service recommendation: is it most appropriate for the consumer
(personalization) or the most appropriate to the seller (given the stock)?
•  Opacity of the use made of the personal data and how they are processed,
•  What about the consent? Is it always respected? Mobilitics CNIL-Inria (Privatics)
•  Credit scoring, how fair is it?
•  Predictive justice?
⇒  New discrimination between those who know how algorithms work and those who do not,
in addition to economic and geostrategic effects on persons and societies
Algorithmic Systems Bias
Mastering Big Data technologies: bias problems could affect the accuracy of data technologies
and people’s lives
Challenge 1: Data Inputs to an Algorithm
o  Poorly selected data
o  Incomplete, incorrect, or outdated data
o  Data sets that lack information on, or disproportionately represent, certain populations
o  Malicious attacks
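As an illustration of the point about data sets that disproportionately represent certain populations, here is a minimal sketch of a representation check. It compares each group's share in a sample with its share in a reference population. The function name `representation_ratios`, the group labels and the reference shares are hypothetical and chosen only for this example; they do not come from the slides.

```python
from collections import Counter


def representation_ratios(sample_groups, reference_shares):
    """Return (share in sample) / (share in reference) for each reference group.

    A ratio near 1.0 suggests the group is represented proportionally;
    well below 1.0 suggests under-representation, well above 1.0
    over-representation.
    """
    counts = Counter(sample_groups)
    total = len(sample_groups)
    ratios = {}
    for group, ref_share in reference_shares.items():
        sample_share = counts.get(group, 0) / total
        ratios[group] = sample_share / ref_share
    return ratios


# Hypothetical data: group label of each training record, and assumed
# population shares (both invented for illustration).
sample = ["A"] * 8 + ["B"] * 2
reference = {"A": 0.5, "B": 0.5}

ratios = representation_ratios(sample, reference)
for group, ratio in sorted(ratios.items()):
    print(f"group {group}: representation ratio {ratio:.2f}")
```

In this toy sample, group B is under-represented relative to the assumed reference. A real audit would also need a defensible choice of reference population, which is a domain question rather than a coding one.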
Challenge 2: The Design of Algorithmic Systems and Machine Learning
o  Poorly designed matching systems
o  Unintentional perpetuation and promotion of historical biases
o  Decision-making systems that assume correlation implies causation
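The last item, systems that assume correlation implies causation, can be illustrated with a small simulation. This is only a sketch on synthetic data: a hidden variable `z` drives both `x` and `y`, so the two are strongly correlated even though neither causes the other. All variable names, the noise levels and the seed are assumptions made for this example.

```python
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


def residuals(ys, zs):
    """Residuals of a least-squares fit of ys on zs (with intercept)."""
    n = len(ys)
    mz, my = sum(zs) / n, sum(ys) / n
    slope = sum((z - mz) * (y - my) for z, y in zip(zs, ys)) / sum(
        (z - mz) ** 2 for z in zs
    )
    return [(y - my) - slope * (z - mz) for z, y in zip(zs, ys)]


rng = random.Random(0)  # fixed seed so the run is repeatable
z = [rng.gauss(0, 1) for _ in range(5000)]   # hidden common cause
x = [zi + rng.gauss(0, 0.5) for zi in z]     # depends on z only
y = [zi + rng.gauss(0, 0.5) for zi in z]     # depends on z only

raw_corr = pearson(x, y)
# Correlation between x and y once the linear effect of z is removed.
partial_corr = pearson(residuals(x, z), residuals(y, z))

print(f"corr(x, y)     = {raw_corr:.2f}")
print(f"corr(x, y | z) = {partial_corr:.2f}")
```

The raw correlation is high, while the correlation after controlling for `z` is close to zero. A decision system that acted on `x` in order to change `y` would be misled by the first number.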
Challenges / Efforts
•  It is a mistake to assume algorithmic systems are objective simply because they are data-driven
•  Algorithms are opinions encapsulated in decision parameters and learning data
•  Mastering the accuracy and robustness of Big Data & AI techniques: bias, reproducibility,
sources of unintentional discrimination
•  Implementing “transparency-by-design”: fairness/equity, loyalty, neutrality, etc.
•  Interdisciplinary co-design of solutions: how responsible is an ML algorithm?
•  Interdisciplinary training of data scientists: law, sociology and economics; careful software reuse
=> mastering information leaks (SRE)
AI is part of the solution, not only the law!
Transparency Tools vs GDPR vs Having the Choice
Challenges / Efforts
Transparent-by-design, auditable-by-design, fairness & non-discrimination-by-design
•  Explainability, reproducibility & robustness of ML
•  Data provenance and usage monitoring
•  Progressive user-centric analytics (mix of dataviz and analytics)
•  New paradigms for information flow monitoring
•  Fact-checking requiring explicit & verifiable integration of heterogeneous data sources
Challenges / Efforts
•  Complex concepts that depend on cultural context, legal context, etc.
International collaboration is key
Transparency, Asymmetry, Accountability, Loyalty, Fairness, Equity, Intelligibility, Explainability, Traceability,
Auditability, Proof and Certification, Performance, Ethics, Responsibility
Ethical ≠ Responsible; Transparent ≠ making the source code available
•  Pedagogy and explanation, awareness, use cases (for every audience, including scientists!)
•  Auditability and building transparent-by-design tools and algorithms
ML algorithms are shared as open source, but data are NOT (governance of algorithmic systems!)
Interdisciplinary challenges
•  From Machine Learning to Artificial Intelligence
o  Innovative machine learning and AI: common sense, adaptability, generalization
o  Deep learning and adversarial learning
o  Machine learning and hyper-optimization
o  Optimization for learning, stochastic gradient method improvements, Bayesian
optimization, combinatorial optimization
o  Link between learning and modelling, integration of prior knowledge into learning
o  Repeatability and robust learning
o  Statistical Inference and Validation
o  Composition of deep architectures
Interdisciplinary challenges
•  From Data to Knowledge, from Data to Decision
o  Heterogeneous, semi-structured, complex, incomplete and/or uncertain data
o  Fast big data: new methodologies to use data
o  Online learning, methodology for massive data, efficient methods
o  Improved storage, calculation and estimation for data science
o  Modeling of interactions between agents (human or artificial) by game theory
o  Multiscale and multimodal representation and algorithms
o  Theoretical analysis of heuristic methods (complexity theory, information geometry, Markov
chain theory)
o  Human-machine co-evolution in autonomous systems: conversational agents, autonomous
systems, social robots
Interdisciplinary challenges
•  Transparency & digital trust
o  Responsibility-by-design, Explicability-by-design
o  Transparency-by-design, equity-by-design
o  Audit of algorithmic systems: non-discrimination, loyalty, technical bias, neutrality, fairness
o  Measuring digital trust and ownership
o  Progressive user-centric analytics (interactive monitoring of decision systems: dataviz,
dashboards, HMI)
o  Responsibility for information processing and decision-making: data usage control and
fact-checking
o  Causal discovery, traceability of inferences from source data, interpretability of deep
architectures
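As one concrete reading of "audit of algorithmic systems: non-discrimination", the sketch below computes per-group selection rates and their ratio from logged decisions. This is only one of several possible fairness metrics and is not stated in the slides as the institute's method. The function names, the decision log and the group labels are hypothetical.

```python
def selection_rates(decisions, groups):
    """Fraction of positive decisions (truthy values) within each group."""
    totals, positives = {}, {}
    for decision, group in zip(decisions, groups):
        totals[group] = totals.get(group, 0) + 1
        positives[group] = positives.get(group, 0) + (1 if decision else 0)
    return {group: positives[group] / totals[group] for group in totals}


def disparate_impact_ratio(rates):
    """Lowest selection rate divided by the highest (1.0 means parity)."""
    highest = max(rates.values())
    return min(rates.values()) / highest if highest > 0 else 1.0


# Hypothetical audit log: 1 = favourable decision, 0 = unfavourable.
decisions = [1, 1, 1, 0, 1, 0, 0, 0]
groups = ["A"] * 4 + ["B"] * 4

rates = selection_rates(decisions, groups)   # {'A': 0.75, 'B': 0.25}
ratio = disparate_impact_ratio(rates)

print(rates)
print(f"disparate impact ratio: {ratio:.2f}")
```

A low ratio flags a disparity worth investigating; it does not by itself establish discrimination, since the groups may differ in legitimate, decision-relevant ways.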
Interdisciplinary challenges
•  Data protection, regulation and economy
o  "Privacy-by-design", GDPR
o  Distributed Machine Learning preserving privacy
o  Development of ethically responsible methodologies and technologies to
regulate the collection, use and processing of personal data, and the
exploitation of the knowledge derived from these data.
o  Computer security of data processing chains
o  Security/crypto: blockchain and trusted third parties
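For "Distributed Machine Learning preserving privacy", the slides do not name a technique. One widely used building block is federated averaging, where each site trains locally and shares only model parameters, never raw records. The sketch below shows just the server-side aggregation step; the function name, the client parameters and the data set sizes are assumptions for illustration.

```python
def federated_average(client_weights, client_sizes):
    """Average client parameter vectors, weighted by local data set size.

    client_weights: list of equal-length lists of floats (one per client).
    client_sizes:   number of local training examples held by each client.
    """
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]


# Hypothetical round: two clients send locally trained parameters.
client_weights = [[1.0, 2.0], [3.0, 4.0]]
client_sizes = [1, 3]  # the second client holds three times more data

global_weights = federated_average(client_weights, client_sizes)
print(global_weights)  # [2.5, 3.5]
```

Keeping raw data on site is not a privacy guarantee on its own, since shared parameters can still leak information. In practice this step is usually combined with further protections such as secure aggregation or added noise.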
Training and research
•  Three doctoral schools of Université Paris-Saclay: EDMH, ED STIC & ED SHS.
•  Reinforce the mathematics-computer science crossover in data science training, with new
interdisciplinary curricula more open to the humanities and social sciences (SHS): awareness of
the responsibility of algorithmic systems, economic models, rights and uses of data.
•  Research projects: 3 years, 2 thesis scholarships (or 1 PhD + 1 post-doc/engineer).
•  International student mobility (incoming and outgoing) with 2 thesis scholarships
(excellence scholarships) per year.
•  Thematic semesters for MSc / PhD / faculty, biennial conference, annual self-assessment
symposium, workshops, challenges, junior conference, summer school.
Co-working
•  Workspaces are available for teams affiliated with the DATAIA Institute in the Alan Turing
building, an emblematic venue:
o  1800 sqm of which approximately 300 sqm for the new teams
o  8 teams on site
o  800 sqm of meeting spaces
•  Implementation of telepresence screens in progress.
•  National Scientific Platform for Transparency &
Accountability Tools and Methods for Data and
Algorithms (Fairness, Neutrality, Loyalty); B2B &
B2C.
•  Support for the new “Law for a Digital Republic”: the
right to an explanation of algorithmic decisions made by
public services (APB service stopped!)
•  Contributors: CNNum and DGCCRF, alongside academia
(Grenoble, Paris, Lille, Rennes, etc.), industry and
associations.
Objectives:
o  Resource center and empowerment tools: reports, publications,
software, controlled data sets & testing protocols;
o  Awareness raising: workshops & MOOCs;
o  Best practices recommendation & sharing;
o  Research & development programs.
Working Groups:
o  Auditability of recommendation and ranking systems;
o  Explainability, reproducibility and bias of ML;
o  Privacy, data usage control & information-flow monitoring;
o  Influence, nudging, fact-checking.
Need for interdisciplinary efforts
Thank you for your attention
nozha.boujemaa@inria.fr
Summer School
•  The DATAIA Institute co-organizes the DS3 Summer School with École polytechnique:
o  Speakers confirmed: Cédric Villani, Yann LeCun, Adrian Weller, Krishna Gummadi,
Jean-Philippe Vert …
o  Format: plenary and parallel sessions on several sites
o  Attendees: between 400 and 500 participants (students, academics and professionals)
DATA SCIENCE SUMMER SCHOOL
June 25-29, 2018, at Campus Polytechnique
Opening by Cédric VILLANI
Tutorials on:
o  Deep Learning: Yann LECUN [Facebook - New York University]
o  Interpretable Machine Learning: Adrian WELLER [University of Cambridge - Alan Turing Institute]
o  Fairness in Machine Learning: Krishna GUMMADI [Max Planck Institute]
o  Probabilistic Numerical Methods: Mark GIROLAMI [Imperial College London]
o  Online Learning Algorithms: Nicolò CESA-BIANCHI [University of Milano]
o  Non-convex Optimization: Suvrit SRA [MIT]
... other speakers will be confirmed soon
Parallel sessions on Health and Social Sciences
Practical sessions on Deep Learning, Reinforcement Learning, Recommender Systems, Precision Medicine...
Poster session
Round table discussion
Targeted at students, academics and professionals
More information to come on: www.ds3-datascience-polytechnique.fr
France-Japan Symposium
•  The DATAIA Institute is co-organizing with JST a France-Japan workshop
on Deep Learning and Artificial Intelligence, in partnership with the
French Embassy in Japan and the Ministry of Higher Education,
Research and Innovation (MESRI)
o  Dates: 11-12 July 2018
o  Location: Amphitheatre of MESRI
o  Format: Plenary sessions
o  Attendees: between 150 and 200 participants (academics and
professionals)
o  With the winners of the CREST Program (equivalent to ERC
senior) funded by the JST

DATAIA & TransAlgo
