LOW COST SUPERCOMPUTING
USING LINUX CLUSTERS
Topics :

1.   Introduction
2.   Overview of Parallel Processing
3.   Conceptual Overview of Clusters
4.   Cluster Design
5.   Features & Benefits
6.   Application Areas
7.   References
1. INTRODUCTION
Introduction To Clusters

o Parallel processing, the method of having many small tasks solve
  one large problem, has emerged as a key enabling technology in
  modern computing .
o In past several years Parallel Processing is increasing for :
     High performance scientific computing
     General purpose applications
o It results in :
     High performance
     Low cost
     Sustain productivity
What is Clustering ?

Meaning :
o Clustering is a parallel or distributed system consisting of
  independent computers that corporate as a single system.
o Cluster offers a way to use a computer more productively in
  comparison to when number of machines working standalone.
Clustering
2. OVERVIEW OF
PARALLEL PROCESSING
Introduction to Parallelism

o Parallel Processing refers to the concept of speeding-up the
  execution of the program by dividing the program into multiple
  fragments.
o Parallel Processing operates on two levels :
   a.   Hardware Parallelism
   b.   Software Parallelism
Clustering
Parallel Processing Schemes

o There are different approaches to creating effective parallel
  computers , and all of them have different levels of effectiveness
  for different kind of problems.
o Some of the methods are :
      Symmetric Multiprocessing (SMP)
      Non-Uniform Memory Access (NUMA)
      Uniform Memory Access (UMA)
      Single Instruction Multiple Data (SIMD)
      Multiple Instruction Multiple Data (MIMD)
SMP machines
NUMA & UMA machines
SIMD machines
MIMD machines
3. CONCEPTUAL
OVERVIEW OF CLUSTERS
Brief History Of Clusters

o In the summer of 1994 Thomas Sterling and Don Becker working
  at CESDIS under sponsorship of the EES project, built a cluster of
  16 DX4 processors connected by channel bonded Ethernet .
o They called their machines Beowulf .
BEOWULF CLUSTER
Clustering
Clustering
4. CLUSTER DESIGN
Cluster Design

o Cluster design includes :
      Design Considerations
      Topology
      Cluster Style
      Hardware Specification
      Software Requirements
      Cluster Architecture
Design Issues

o Design Issues considers :
   1.   Size Scalability (physical & application)
   2.   Enhanced Availability (failure management)
   3.   Single System Image (SSI look-and-feel of same system)
   4.   Fast Communication (network & protocols)
   5.   Programmability (simple API if required)
Cluster Style



                       Cluster
                       Styles




         Homogeneous             Heterogeneous
Topology
o Currently used topologies in networking are :
      Ring
      Bus
      Star
      Line
      Mesh
      Tree etc.
o We are using star topology due to following reason :
    Failure of one node does not effect entire network
    Range provided by star topology is greater than that of bus topology
    Range can be extended by using routers
Hardware Specification

o Hardware configuration of cluster mainly consists of two
  components.
    Nodes or Workstations
 Interconnection Network
Hardware Requirements

Hardware Requirement for MASTER NODE :
o Master mode is where the users of the system will log in.
o They submit their job processed by system and look at the result
  of their work.
o It requires :
      CPU capable system as master node with fast CPU
      Greater than 128 MB RAM
      8 GB HDD or more
      10 Mbps/ 100 Mbps Ethernet adapter.
Hardware Requirement for SLAVE NODE:
o Slave nodes are useful for computation only.
o For Slave node hard disk capacity need to be very large for
  better storage capacity.
o It requires,
  CPU based system with fast CPU
    32 MB or more memory modules
    4 GB HDD or more
    10 Mbps/ 100 Mbps Ethernet adapter.
Software Requirement

o The platform (O.S.) for developing cluster is very important
  because the throughput and performance of machine is totally
  depends upon how the platform manages the whole cluster.
o Some of the O.S that supports Clustering are,
      Linux
      Unix
      Windows 2000
      Windows NT
Why Linux ?

o    Linux is generally cheaper than other O.S and is frequently less
     problematic than many commercial systems.
o    Linux is chosen because :
    1.   It is a 32-bit multitasking Operating System.
    2.   It runs on hardware ranging from low-end 386 boxes to massive ultra-
         parallel machines.
    3.   Linux has a very strong networking support & also its efficient processing
         support.
    4.   The programming environments & development tools for parallel
         programming are more mature in Linux.
Cluster Architecture

o It covers :
    Network Interface Hardware
    Communication Software
    Cluster Middleware
Clustering
5. FEATURES &
BENEFITS
Features & Benefits

o Data sharing across the interface
o Parallel processing of small tasks
o Easy server maintenance
6. APPLICATIONS
Applications

o   General high performance computing
o   Bulk disk servers
o   High performance web servers
o   Flight simulators
o   Alife
Clustering Examples

o It is used in :
 NASA uses Beowulf which was started in a project headed up by CESDIS
 NOAA uses several Clustering technologies in their project
 Google.com introduced largest ever LINUX cluster which powers their popular
  web search engine
7. REFERENCES
Book References :

o Red Hat Linux System & Network Administration
o Building Linux Clusters -David HM Spector (Oreilley Publications)
o Beginning Linux Programming - Wrox Publications
Web Reference :

 www.beowlf.org
 www.redhat.com/mirrors/LDP
 www.jics.utk.edu Parallel Computing Resources
 www.linux-mag.com/gallery.html Linux Magazine Open Source
THANK YOU

More Related Content

PPTX
WAN & LAN Cluster with Diagrams and OSI explanation
PPS
Linux12 clustering onlinux
PPTX
FIne Grain Multithreading
PDF
ODSA PoC: Network Flow Processor Overview
PPTX
Warehouse scale computer
PPTX
Nehalem
PPTX
Microkernel
PDF
Browsing Linux Kernel Source
WAN & LAN Cluster with Diagrams and OSI explanation
Linux12 clustering onlinux
FIne Grain Multithreading
ODSA PoC: Network Flow Processor Overview
Warehouse scale computer
Nehalem
Microkernel
Browsing Linux Kernel Source

What's hot (20)

PPTX
NetApp C-mode for 7 mode engineers
PDF
[TALK] Exokernel vs. Microkernel
DOCX
In a monolithic kernel
PPTX
Factored operating systems
PDF
Multicore Computers
PDF
An introduction to the Design of Warehouse-Scale Computers
PPTX
Application Performance & Flexibility on Exokernel Systems paper review
PDF
Linux kernel Architecture and Properties
PDF
Application layer
PPT
Multicore Processors
PPT
2. microkernel new
PPT
Multiprocessor Architecture for Image Processing
PPTX
Computer architecture multi core processor
PPTX
Monolithic kernel vs. Microkernel
PPTX
CA presentation of multicore processor
PPTX
Comparison of Amoeba, Mach & Chorus: DOS
PPTX
Multithreading computer architecture
PPT
KERNAL ARCHITECTURE
PDF
Architecture Of The Linux Kernel
PPT
Icg hpc-user
NetApp C-mode for 7 mode engineers
[TALK] Exokernel vs. Microkernel
In a monolithic kernel
Factored operating systems
Multicore Computers
An introduction to the Design of Warehouse-Scale Computers
Application Performance & Flexibility on Exokernel Systems paper review
Linux kernel Architecture and Properties
Application layer
Multicore Processors
2. microkernel new
Multiprocessor Architecture for Image Processing
Computer architecture multi core processor
Monolithic kernel vs. Microkernel
CA presentation of multicore processor
Comparison of Amoeba, Mach & Chorus: DOS
Multithreading computer architecture
KERNAL ARCHITECTURE
Architecture Of The Linux Kernel
Icg hpc-user
Ad

Viewers also liked (16)

PPTX
Head tutor training presentation
PDF
Java Code to Java Heap - En Français
PPTX
Info session presentation
PPTX
21 great ways to sell more, faster
PPTX
Back to school evening
PDF
JavaOne 2014: Java Debugging
PDF
Tuning IBMs Generational GC
PDF
FrenchKit: End to End Application Development with Swift
PDF
O'Reilly Software Architecture Conf: Cloud Economics
PDF
JavaOne 2014: Java vs JavaScript
PDF
High speed networks and Java (Ryan Sciampacone)
PDF
MySQL with DRBD/Pacemaker/Corosync on Linux
PDF
JavaOne2013: Securing Java in the Server Room - Tim Ellison
PDF
IBM InterConnect: Java vs JavaScript for Enterprise WebApps
PDF
JavaOne2013: Implement a High Level Parallel API - Richard Ning
PDF
Software Archaeology
Head tutor training presentation
Java Code to Java Heap - En Français
Info session presentation
21 great ways to sell more, faster
Back to school evening
JavaOne 2014: Java Debugging
Tuning IBMs Generational GC
FrenchKit: End to End Application Development with Swift
O'Reilly Software Architecture Conf: Cloud Economics
JavaOne 2014: Java vs JavaScript
High speed networks and Java (Ryan Sciampacone)
MySQL with DRBD/Pacemaker/Corosync on Linux
JavaOne2013: Securing Java in the Server Room - Tim Ellison
IBM InterConnect: Java vs JavaScript for Enterprise WebApps
JavaOne2013: Implement a High Level Parallel API - Richard Ning
Software Archaeology
Ad

Similar to Clustering (20)

PPTX
PPT
Cluster Computing
PPTX
Cluster computing
PPT
59137949-Cluster-Computing (1).ppt .....
PPT
Cluster Computers
PPTX
Cluster computer
PPTX
Cluster computing
PPTX
Cluster computing
PPTX
Clustering by AKASHMSHAH
PPT
Parallel_and_Cluster_Computing.ppt
PPT
linuxcluster.ppt
PPTX
Cluster
PDF
Cluster Computing
PPTX
Cluster Technique used in Advanced Computer Architecture.pptx
PPTX
cluster computing
PDF
Cluster computing report
DOC
Clusetrreport
PPTX
Cluster computing
PPTX
Cluster computing
PPTX
Cluster cmputing
Cluster Computing
Cluster computing
59137949-Cluster-Computing (1).ppt .....
Cluster Computers
Cluster computer
Cluster computing
Cluster computing
Clustering by AKASHMSHAH
Parallel_and_Cluster_Computing.ppt
linuxcluster.ppt
Cluster
Cluster Computing
Cluster Technique used in Advanced Computer Architecture.pptx
cluster computing
Cluster computing report
Clusetrreport
Cluster computing
Cluster computing
Cluster cmputing

Recently uploaded (20)

PDF
Diabetes Mellitus , types , clinical picture, investigation and managment
PPTX
4. Diagnosis and treatment planning in RPD.pptx
PDF
0520_Scheme_of_Work_(for_examination_from_2021).pdf
PDF
Farming Based Livelihood Systems English Notes
PPTX
UNIT_2-__LIPIDS[1].pptx.................
DOCX
EDUCATIONAL ASSESSMENT ASSIGNMENT SEMESTER MAY 2025.docx
PDF
Physical education and sports and CWSN notes
PPTX
Thinking Routines and Learning Engagements.pptx
PPTX
Reproductive system-Human anatomy and physiology
PDF
fundamentals-of-heat-and-mass-transfer-6th-edition_incropera.pdf
PPTX
Neurological complocations of systemic disease
PDF
Everyday Spelling and Grammar by Kathi Wyldeck
PPT
hemostasis and its significance, physiology
PPTX
Macbeth play - analysis .pptx english lit
PDF
CAT 2024 VARC One - Shot Revision Marathon by Shabana.pptx.pdf
PPTX
pharmaceutics-1unit-1-221214121936-550b56aa.pptx
PPTX
Cite It Right: A Compact Illustration of APA 7th Edition.pptx
PPT
hsl powerpoint resource goyloveh feb 07.ppt
PDF
Health aspects of bilberry: A review on its general benefits
PPTX
ACFE CERTIFICATION TRAINING ON LAW.pptx
Diabetes Mellitus , types , clinical picture, investigation and managment
4. Diagnosis and treatment planning in RPD.pptx
0520_Scheme_of_Work_(for_examination_from_2021).pdf
Farming Based Livelihood Systems English Notes
UNIT_2-__LIPIDS[1].pptx.................
EDUCATIONAL ASSESSMENT ASSIGNMENT SEMESTER MAY 2025.docx
Physical education and sports and CWSN notes
Thinking Routines and Learning Engagements.pptx
Reproductive system-Human anatomy and physiology
fundamentals-of-heat-and-mass-transfer-6th-edition_incropera.pdf
Neurological complocations of systemic disease
Everyday Spelling and Grammar by Kathi Wyldeck
hemostasis and its significance, physiology
Macbeth play - analysis .pptx english lit
CAT 2024 VARC One - Shot Revision Marathon by Shabana.pptx.pdf
pharmaceutics-1unit-1-221214121936-550b56aa.pptx
Cite It Right: A Compact Illustration of APA 7th Edition.pptx
hsl powerpoint resource goyloveh feb 07.ppt
Health aspects of bilberry: A review on its general benefits
ACFE CERTIFICATION TRAINING ON LAW.pptx

Clustering

  • 2. Topics : 1. Introduction 2. Overview of Parallel Processing 3. Conceptual Overview of Clusters 4. Cluster Design 5. Features & Benefits 6. Application Areas 7. References
  • 4. Introduction To Clusters o Parallel processing, the method of having many small tasks solve one large problem, has emerged as a key enabling technology in modern computing . o In past several years Parallel Processing is increasing for :  High performance scientific computing  General purpose applications o It results in :  High performance  Low cost  Sustain productivity
  • 5. What is Clustering ? Meaning : o Clustering is a parallel or distributed system consisting of independent computers that corporate as a single system. o Cluster offers a way to use a computer more productively in comparison to when number of machines working standalone.
  • 8. Introduction to Parallelism o Parallel Processing refers to the concept of speeding-up the execution of the program by dividing the program into multiple fragments. o Parallel Processing operates on two levels : a. Hardware Parallelism b. Software Parallelism
  • 10. Parallel Processing Schemes o There are different approaches to creating effective parallel computers , and all of them have different levels of effectiveness for different kind of problems. o Some of the methods are :  Symmetric Multiprocessing (SMP)  Non-Uniform Memory Access (NUMA)  Uniform Memory Access (UMA)  Single Instruction Multiple Data (SIMD)  Multiple Instruction Multiple Data (MIMD)
  • 12. NUMA & UMA machines
  • 16. Brief History Of Clusters o In the summer of 1994 Thomas Sterling and Don Becker working at CESDIS under sponsorship of the EES project, built a cluster of 16 DX4 processors connected by channel bonded Ethernet . o They called their machines Beowulf .
  • 21. Cluster Design o Cluster design includes :  Design Considerations  Topology  Cluster Style  Hardware Specification  Software Requirements  Cluster Architecture
  • 22. Design Issues o Design Issues considers : 1. Size Scalability (physical & application) 2. Enhanced Availability (failure management) 3. Single System Image (SSI look-and-feel of same system) 4. Fast Communication (network & protocols) 5. Programmability (simple API if required)
  • 23. Cluster Style Cluster Styles Homogeneous Heterogeneous
  • 24. Topology o Currently used topologies in networking are :  Ring  Bus  Star  Line  Mesh  Tree etc. o We are using star topology due to following reason :  Failure of one node does not effect entire network  Range provided by star topology is greater than that of bus topology  Range can be extended by using routers
  • 25. Hardware Specification o Hardware configuration of cluster mainly consists of two components.  Nodes or Workstations
  • 27. Hardware Requirements Hardware Requirement for MASTER NODE : o Master mode is where the users of the system will log in.
  • 28. o They submit their job processed by system and look at the result of their work. o It requires :  CPU capable system as master node with fast CPU  Greater than 128 MB RAM  8 GB HDD or more  10 Mbps/ 100 Mbps Ethernet adapter.
  • 29. Hardware Requirement for SLAVE NODE: o Slave nodes are useful for computation only. o For Slave node hard disk capacity need to be very large for better storage capacity. o It requires, CPU based system with fast CPU  32 MB or more memory modules  4 GB HDD or more  10 Mbps/ 100 Mbps Ethernet adapter.
  • 30. Software Requirement o The platform (O.S.) for developing cluster is very important because the throughput and performance of machine is totally depends upon how the platform manages the whole cluster. o Some of the O.S that supports Clustering are,  Linux  Unix  Windows 2000  Windows NT
  • 31. Why Linux ? o Linux is generally cheaper than other O.S and is frequently less problematic than many commercial systems. o Linux is chosen because : 1. It is a 32-bit multitasking Operating System. 2. It runs on hardware ranging from low-end 386 boxes to massive ultra- parallel machines. 3. Linux has a very strong networking support & also its efficient processing support. 4. The programming environments & development tools for parallel programming are more mature in Linux.
  • 32. Cluster Architecture o It covers :  Network Interface Hardware  Communication Software  Cluster Middleware
  • 35. Features & Benefits o Data sharing across the interface o Parallel processing of small tasks o Easy server maintenance
  • 37. Applications o General high performance computing o Bulk disk servers o High performance web servers o Flight simulators o Alife
  • 38. Clustering Examples o It is used in :  NASA uses Beowulf which was started in a project headed up by CESDIS  NOAA uses several Clustering technologies in their project  Google.com introduced largest ever LINUX cluster which powers their popular web search engine
  • 40. Book References : o Red Hat Linux System & Network Administration o Building Linux Clusters -David HM Spector (Oreilley Publications) o Beginning Linux Programming - Wrox Publications
  • 41. Web Reference :  www.beowlf.org  www.redhat.com/mirrors/LDP  www.jics.utk.edu Parallel Computing Resources  www.linux-mag.com/gallery.html Linux Magazine Open Source