Grid Projects in the US (an inevitably incomplete view). Ian Foster, Computation Institute, Argonne National Lab & University of Chicago
Grid Projects in the US [diagram: Resources; Resource Providers]
Grid Projects in the US [diagram: Services; Service Providers; Resources; Resource Provider]
Grid Projects in the US [diagram: Communities; Content; Service Provider; Services; Resources; Resource Provider; Software Providers]
Grid Projects in the US [diagram: Community; Content; Service Provider; Services; Resources; Software Providers; Resource Provider]
Resource Providers
  • Campus and regional grids: Purdue, Wisc, UCLA, …, …; TIGRE, UC system, …
  • Open Science Grid: 43,000 CPUs, 6 PB disk, 15,000 CPU days/day; allocations on basis of MOUs
  • TeraGrid: ~1.2 Pflop/s; National Allocation Committee
  • Amazon, Microsoft, IBM, etc.: ?? CPUs, ?? storage; fee for service
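The Open Science Grid figures above imply an average utilization, which can be worked out as a rough sketch. The assumption that every one of the 43,000 CPUs could supply one full CPU-day per calendar day is made here for illustration and is not stated on the slide; the function name is hypothetical.

```python
# Figures quoted on the slide for Open Science Grid.
OSG_CPUS = 43_000
OSG_CPU_DAYS_PER_DAY = 15_000


def average_utilization(cpu_days_per_day: float, cpus: int) -> float:
    """Fraction of nominal capacity delivered.

    Assumption (not from the slide): each CPU can supply exactly one
    CPU-day per calendar day, so nominal capacity equals the CPU count.
    """
    return cpu_days_per_day / cpus


osg_utilization = average_utilization(OSG_CPU_DAYS_PER_DAY, OSG_CPUS)
print(f"OSG average utilization: {osg_utilization:.1%}")
```

Under that assumption roughly a third of the nominal capacity is delivered on an average day; the real figure depends on how the CPU count and the CPU-day total were measured.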
Open Science Grid Sites (5/4/08): +3 in Brazil; 2 in Mexico; 2 in Taiwan; 1 in the UK. Grows by 10-20 per year.
Use by Community [chart; series: CMS, ATLAS, CDF, D0, local usage & bugs (unmapped to VO); axis marks: 1,000,000 a week, 2,000,000 a week]
TeraGrid Participants
Growing User Community [chart; source: TeraGrid Central Database]
Growing Usage [chart; source: TeraGrid Central Database]: 3.95B NUs delivered in CY2007
CY2007 Usage by Discipline (3.95B NUs delivered in CY2007): Molecular Biosciences 31%; Chemistry 17%; Physics 17%; Astronomical Sciences 12%; Materials Research 6%; Chemical, Thermal Systems 5%; Earth Sciences 3%; Atmospheric Sciences 3%; Advanced Scientific Computing 2%; All 19 Others 4%
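As a quick consistency check, the discipline shares can be summed and converted into absolute NU counts. This is a minimal sketch using only the percentages and the 3.95B total quoted on the slide; the variable names are illustrative, and because the percentages are rounded the per-discipline totals are approximate.

```python
# Total normalized units (NUs) delivered in CY2007, as quoted on the slide.
TOTAL_NUS = 3.95e9

# Percentage shares by discipline, as quoted on the slide.
SHARES_PERCENT = {
    "Molecular Biosciences": 31,
    "Chemistry": 17,
    "Physics": 17,
    "Astronomical Sciences": 12,
    "Materials Research": 6,
    "Chemical, Thermal Systems": 5,
    "Earth Sciences": 3,
    "Atmospheric Sciences": 3,
    "Advanced Scientific Computing": 2,
    "All 19 Others": 4,
}

# The rounded shares should account for the whole pie.
total_percent = sum(SHARES_PERCENT.values())

# Approximate NUs per discipline (shares are rounded to whole percents).
nus_by_discipline = {
    name: TOTAL_NUS * pct / 100 for name, pct in SHARES_PERCENT.items()
}

for name, nus in sorted(nus_by_discipline.items(), key=lambda kv: -kv[1]):
    print(f"{name:32s} {nus / 1e9:5.2f}B NUs")
print(f"Shares sum to {total_percent}%")
```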
Grid Projects in the US. For example: build and test service (Wisc); Certificate Authorities; Cancer Biology Informatics Grid; LIGO Data Grid. [diagram: Community; Service Provider; Content; Services; Resources; Software Providers; Resource Provider]
caBIG: sharing of infrastructure, applications, and data. Data Integration! Services & Cancer Biology
caBIG Under the Covers [diagram: NCICB; Research Centers; Grid-Enabled Client; Tools 1-4; Grid Data Service; Analytical Service; Grid Portal; Microarray; Gene Database; caArray; Protein Database; Image; Grid Services Infrastructure (Metadata, Registry, Query, Invocation, Security, etc.)]
LIGO Data Grid (LIGO Gravitational Wave Observatory): replicating >1 Terabyte/day to 8 sites; 770 TB replicated to date: >120 million replicas; MTBF = 1 month. Map labels: Birmingham, Cardiff, AEI/Golm. Ann Chervenak et al., ISI; Scott Koranda et al., LIGO
Grid Projects in the US. For example: Earth System Grid; Children’s Oncology Grid; Southern California Earthquake Center (SCEC); Science gateways. [diagram: Community; Service Provider; Content; Services; Resources; Software Providers; Resource Provider]
Earth System Grid
  • Main ESG Portal: 198 TB of data at four locations; 1,150 datasets; 1,032,000 files; includes the past 6 years of joint DOE/NSF climate modeling experiments; 8,000 registered users; downloads to date: 49 TB, 176,000 files
  • CMIP3 (IPCC AR4) ESG Portal: 35 TB of data at one location; 74,700 files; generated by a modeling campaign coordinated by the Intergovernmental Panel on Climate Change; data from 13 countries, representing 25 models; 1,900 registered projects; downloads to date: 387 TB, 1,300,000 files; 500 GB/day (average); 400 scientific papers published to date based on analysis of CMIP3 (IPCC AR4) data
  • ESG usage: over 500 sites worldwide [map]
  • ESG monthly download volumes [chart]
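To put the 500 GB/day average download volume in networking terms, it can be converted into an equivalent sustained bit rate. This is a minimal sketch that assumes decimal units (1 GB = 10^9 bytes) and a perfectly even spread over 24 hours, neither of which the slide states; the helper name is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mbit_per_s(gigabytes_per_day: float) -> float:
    """Convert a daily volume into an equivalent constant rate in Mbit/s.

    Assumptions (not from the slide): 1 GB = 1e9 bytes, and the traffic
    is spread evenly across the whole day.
    """
    bits_per_day = gigabytes_per_day * 1e9 * 8
    return bits_per_day / SECONDS_PER_DAY / 1e6


esg_rate = sustained_mbit_per_s(500)  # 500 GB/day average quoted on the slide
print(f"500 GB/day is about {esg_rate:.1f} Mbit/s sustained")
```

Real download traffic is bursty, so peak rates would be higher than this average.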
SCEC Community Modeling Environment: a collaboratory for system-level earthquake science [diagram]
  • Knowledge Base: Ontologies (curated taxonomies, relations & constraints); Pathway Models (pathway templates, models of simulation codes)
  • Code Repositories; Data & Simulation Products; Data Collections; FSM, RDM, AWM, SRM; Storage
  • GRID: Pathway Execution (policy, data ingest, repository access); Grid Services (compute & storage management, security)
  • DIGITAL LIBRARIES: Navigation & Queries (versioning, replication); Mediated Collections (federated access)
  • KNOWLEDGE ACQUISITION: Acquisition Interfaces (dialog planning, pathway construction strategies); Pathway Assembly (template instantiation, resource selection, constraint checking)
  • KNOWLEDGE REPRESENTATION & REASONING: Knowledge Server (knowledge base access, inference); Translation Services (syntactic & semantic translation)
  • Other labels: Pathway Instantiations; Computing; Users
Seismic Hazard Analysis. Defn: max. intensity of shaking expected at a site during a fixed time interval. Example: National seismic hazard maps (http://geohazards.cr.usgs.gov/eq/). Intensity measure: peak ground acceleration. Interval: 50 yrs. Probability of exceedance: 2%.
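The "2% in 50 years" figure can be turned into an annual exceedance rate and a return period. The sketch below assumes exceedances follow a Poisson process with a constant rate, a common convention in probabilistic seismic hazard analysis that the slide itself does not spell out; the function names are illustrative.

```python
import math


def annual_exceedance_rate(prob: float, years: float) -> float:
    """Annual rate r such that P(at least one exceedance in `years`) = prob.

    Assumes a Poisson process with constant rate (an assumption of this
    sketch, not stated on the slide): prob = 1 - exp(-r * years).
    """
    return -math.log(1.0 - prob) / years


def return_period(prob: float, years: float) -> float:
    """Mean time between exceedances, in years, under the same assumption."""
    return 1.0 / annual_exceedance_rate(prob, years)


rate = annual_exceedance_rate(0.02, 50)
period = return_period(0.02, 50)
print(f"annual rate: {rate:.6f} per year, return period: {period:.0f} years")
```

Under this assumption the mapped ground motion corresponds to a return period of a little under 2,500 years.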
SCEC Computations & Grid. Steps: prepare input to Pathway2 wave propagation code; Pathway2PGV converts output into hazard map; map is visualized. Sites shown: SDSC, USC, SCEC, PSC, TeraGrid, ISI. CPU counts shown: 12 CPUs; 1,700 CPUs; 1,200 CPUs; 1 CPU; 4 CPUs.
Children’s Oncology Grid and MEDICUS
Grid Projects in the US [diagram: Community; Service Provider; Content; Services; Resources; Resource Provider; Software Providers]
Software Providers
  • Globus [GT4.2 released July 2, 2008]
  • GRAM, GridFTP, MDS, RLS, DRS, …
  • GSI, GridShib, MyProxy, …
  • GridWay (Spain), OGSA-DAI (UK), Introduce, …
  • Condor
  • MPI-G, Swift, Pegasus, Taverna (UK), Kepler
  • caBIG: e.g., Introduce
  • Virtual Data Toolkit (includes VOMS [Italy], …)
  • SRB, iRODS, MyCluster, …
  • …
Virtual Data Toolkit (VDT): software release process [diagram: development & testing]; VDT components over time [chart]; built for 15 Linux versions
Creating Services: Introduce and gRAVI (Ohio State University and Argonne/U.Chicago)
  • Introduce: define service; create skeleton; discover types; add operations; configure security
  • gRAVI (Grid Remote Application Virtualization Infrastructure): wrap executables
  • Diagram labels: Appln Service; Introduce Container; Index service; Repository Service; Create; Store; Advertise; Discover; Invoke, get results; Transfer GAR; Deploy
Composing Services
Service Discovery: Registries
Challenges: conflicting missions; sustainability; discipline science pull. [diagram: Communities; Service Provider; Content; Services; Resources; Resource Provider; Software Providers]
The Future: NSF eXtreme Digital (XD) solicitation, aka “TeraGrid III”; DOE, NIH, etc.: what do they want?; international cooperation

Grid Projects In The US July 2008
