Delivering Transformation. Together.
SITE RELIABILITY ENGINEERING & AIOPS
Murugan Muthayan
Agility Day – Noida – Aug 2019
AGILE, DEVOPS AND SRE..
2
Agile Development
• Transformed the way software being built
• Collaboration & quicker feedback loop
• Better control, early value
DevOps
• Cultural transformation focused on delivery speed
• Enable automation wherever possible
• Make development and operation process frictionless
Site Reliability Engineering
Focus to improve the reliability of software in production by implementing the best practices in engineering and operations
Tesco Transport Systems Adjustment3
SITE RELIABILITY ENGINEERING
SRE incorporates Engineering, Infrastructure and
Operation aspects to create scalable and reliable
software systems that are highly automatic and
self-healing.
SRE aims at DevOps to NoOps - “what happens
when a software engineer is tasked with what
used to be called operations.” - Ben Treynor,
Founder of Google SRE
The purpose of SRE is to achieve reliability by
implementing the best practices in engineering
and operations.
SRE can be thought of as an extreme
implementation of DevOps.
FOCUS AREA FOR SRE
4
MONITORING
(Performance
Metrics)
ALERTING
(Immediate
notifications)
INFRASTRUCTURE
(Scalability / Limitations)
ENGINEERING
(Application Design)
DEBUGGING
(Log files, code
analysis)
SECURITY
(Vulnerabilities)
BEST PRACTICES
(Documentation &
Training)
SITE RELIABILITY ENGINEER
5
The ideal site reliability engineer is either a software engineer with a good administration
background or a highly skilled system administrator with knowledge of coding and automation –
“Part systems administrator, part second tier support and part developer”
50% cap on the aggregate "ops" work for all SREs. SRE team must spend the remaining 50% of its
time actually doing development activities
An SRE team is responsible for,
• availability,
• latency,
• performance,
• efficiency,
• change management,
• monitoring,
• emergency response,
• capacity planning
6
SRE – SKILLS CHECKLIST
SRE - METRICS & MEASUREMENTS
7
Service Level Indicators that measures failures per request by calculating request latencySLI
Service Level Objectives that sets goals for System availability, performance, success ratesSLO
Service level agreements that are driven from SLO and dictate commercial penaltiesSLA
It is a measure of risk and the amount of headroom you have above the SLAError Budget
Mean time to repair is average time required to repair a failureMTTR
Predicted elapsed time between inherent failures of a system during operationMTTF
TAKE AWAY..
8
..and AIOps takes a further step from SRE towards automating IT operations using
advanced analytics !!!
COGNITIVE LEARNING – INTELLIGENT OPERATIONS (AIOps)
9
Insight Predict
Big Data Machine
Learning
Definition - What Does AIOps Mean?
10
AIOps is a methodology that is on the frontier of enterprise IT
operations. AIOps automates various aspects of IT and utilizes
the power of artificial intelligence to create self-learning
programs that help revolutionize IT services.
It is the application of advanced analytics—in the form of
machine learning (ML) and artificial intelligence (AI), towards
automating operations so that your IT Ops team can move at
the speed that your business expects today.
AIOps refers to multi-layered technology platforms that automate
and enhance IT operations by 1) using analytics and machine
learning to analyze big data collected from various IT operations
tools and devices, in order to 2) automatically spot and react to
issues in real time.
What Will Tomorrow Look Like ?
11
….Function Follows Need
Distributed Computing
Software Defined
Everything
Monitoring
Platforms
ISV Platforms
Patchwork, Open source,
Departmental
Source Events
Custom/Standard/Fixed
~ 100 – 1000 eps
Chaotic, Unstructured
~ 1000 – 100,000 eps
Configuration
Flexible
TBC ~ hours
Chaotic
TBC < 1 second
Infrastructure
Multi vendor
UNIX/IP/Windows client
server
Virtualised/Containers
Fluid/UNIX/Mobile/Micro
Digital
Transformation
Demands DevOps &
elastic
2010 2020
Current and Future Demands
12
Scale
• 105+ Moving Parts
• 106+ Notifications
• 109+ Data Points
• 1012 -> 10120+ Possible Failure Modes
+ Bounded by the estimated information content of the
universe !
Compulsion of Change
Complexity
Reduction in the Unit of compute
Mainframe → Server → VM → Container
Multiple Orders of Magnitude
Increase in Change Cycle
Fully fluid CI/CD Cycle
Traditional IT Ops caught Flat - Footed
13
Overwhelmed by DATA and a lack of INFORMATION
Siloed
teams and
tools
Too
many
alerts
No context
when an
incident
occurs
No
early
warning
DevOps
lacks
proactive
assurance
75-80%
~ 90%
> 45%
> 73%
Many Siloed
War room
IT Ops Priorities Driven by Digital Transformation
14
INCREASE frequency of change, stability and availability of IT services1
REDUCE resource operations workload and INCREASE productivity2
CONSOLDATE tools3
MIGRATE to the cloud4
SUPPORT software-defined services5
SUPPORT microservices based software architecture6
AIOps Agile and Proactive Event-to-Resolution Workflow
15
Early Detection, fewer tickets, reduced MTTR
Industrialised data
ingestion from
multiple sources
Automatically resolves
signals from alert noise
Proactively and
automatically detects
incidents and probable root
causes (reduced MTTD)
Enables collaborative
workflows (reduces
adverse business
impact)
Triggers automation
to restore services
Predictive insights
(reduced support
escalations and
MTTR)
How AIOps makes ITOps Robutst ?
16
• Determine the service health of
mission-critical services or
applications.
• Gain control and visibility to
spiraling consumption of cloud
resources.
• Accelerate MTTR with automated
incident management and real-
time configuration management
database (CMDB) updates.
• Build context-rich data lakes
integrating disparate, third-party
data sources.
AIOps makes Teams Faster, Smarter, and More
Productive
17
Level 0/NOC Operators
• Improve efficiency by consolidating related alerts together
• Reduce catch-n-dispatch activities
Support SMEs & Developers
• Pass incident resolution knowledge to lower support tiers
• Collaborate across complex multi-disciplinary incidents
IT Operations Managers
• Delivery service-level state monitoring
• Improve efficiency and job satisfaction
• Identify and address repeating mundane work with run book automation
• Investigate and problem-solve for frequently repeating P3-P5 incidents
IT Senior Management
• Achieve overall per-alert efforts reduction
• Re -purpose the savings towards business’s bottom line
THANK YOU
18
Any Questions ?

More Related Content

PPTX
Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...
PDF
HPE AIOps Expo
PDF
Doing DevOps for Big Data? What You Need to Know About AIOps
PPTX
Before You Deploy An AIOps System, Do this
PDF
Modernizing Infrastructure Monitoring and Management with AIOps
PDF
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
PDF
AIOps Roundtable Munich 2018
PDF
AIOps - The next 5 years
Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...
HPE AIOps Expo
Doing DevOps for Big Data? What You Need to Know About AIOps
Before You Deploy An AIOps System, Do this
Modernizing Infrastructure Monitoring and Management with AIOps
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
AIOps Roundtable Munich 2018
AIOps - The next 5 years

What's hot (18)

PDF
AIOps: Your DevOps Co-Pilot
PPTX
What Does Artificial Intelligence Have to Do with IT Operations?
PDF
Doing DevOps for Big Data? What You Need to Know About AIOps
PDF
Unifying IT with Outcome-Aware AIOps
PDF
No Ops? Or Yes, Ops! The Future of Operations in a DevOps World
PDF
Scale Container Operations with AIOps
PDF
Bringing AIOps to Hybrid Cloud Monitoring and Management
PDF
2019 Performance Monitoring and Management Trends and Insights
PPTX
The future of AIOps
PDF
Splunk for AIOps: Reduce IT outages through prediction with machine learning
PDF
AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...
PPTX
Context Is Critical for IT Operations - How Rich Data Yields Richer Results
PDF
Webinar Slides - How KeyBank Liberated its IT Ops from Rules-Based Event Mana...
PDF
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
PDF
The 6 Steps to Becoming a Top-Performing Organization in Managing IT Operations
PDF
AIOps: Anomalous Span Detection in Distributed Traces Using Deep Learning
PDF
Stream 3 - VMware Sponsor Presentation
PDF
AIOps Is How We Will Survive DevOps
AIOps: Your DevOps Co-Pilot
What Does Artificial Intelligence Have to Do with IT Operations?
Doing DevOps for Big Data? What You Need to Know About AIOps
Unifying IT with Outcome-Aware AIOps
No Ops? Or Yes, Ops! The Future of Operations in a DevOps World
Scale Container Operations with AIOps
Bringing AIOps to Hybrid Cloud Monitoring and Management
2019 Performance Monitoring and Management Trends and Insights
The future of AIOps
Splunk for AIOps: Reduce IT outages through prediction with machine learning
AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...
Context Is Critical for IT Operations - How Rich Data Yields Richer Results
Webinar Slides - How KeyBank Liberated its IT Ops from Rules-Based Event Mana...
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
The 6 Steps to Becoming a Top-Performing Organization in Managing IT Operations
AIOps: Anomalous Span Detection in Distributed Traces Using Deep Learning
Stream 3 - VMware Sponsor Presentation
AIOps Is How We Will Survive DevOps
Ad

Similar to Agile Network India | Agility Day @Noida | SRE & AIOps | Murugan Muthayan (20)

PDF
Complete guide to AIOps_ Automate IT Operations with AI.pdf
PPTX
AIOps-Solutions-Transforming-IT-Operations-with-Artificial-Intelligence.pptx
PDF
On the Application of AI for Failure Management: Problems, Solutions and Algo...
PDF
How AIOps (Artificial Intelligence in IT Operations) help in improving IT ope...
PDF
AIOps is Revolutionizing IT Operations Management.pdf
PDF
How Does AIOps Benefit DevOps Pipeline and Software Quality? - DevOps Next
PDF
Distributed Trace & Log Analysis using ML
PDF
Skill Up Splunk DevOps slides with AIOps MLOps
PDF
How AIOps Evolved from Monitoring Tools to Autonomous IT Operations_.pdf
PPTX
How to apply machine learning into your CI/CD pipeline
DOCX
A Comprehensive Guide to AIOps Integration in Organizations
PPTX
Know your DevOps
PDF
GCP-pdevops devops engineer exam prepearitaon guide
PPTX
SOC Lessons from DevOps and SRE by Anton Chuvakin
PDF
integrating-cognitive-services-into-your-devops-strategy
PDF
Integrating cognitive services in to your devops strategy
PDF
Driving Digital Transformation through Service-Centric AIOps
PPTX
SRE (service reliability engineer) on big DevOps platform running on the clou...
PDF
The-OpsRamp-State-of- the- AIOps-Report.pdf
PDF
Strengthen and Scale Security for a dollar or less
Complete guide to AIOps_ Automate IT Operations with AI.pdf
AIOps-Solutions-Transforming-IT-Operations-with-Artificial-Intelligence.pptx
On the Application of AI for Failure Management: Problems, Solutions and Algo...
How AIOps (Artificial Intelligence in IT Operations) help in improving IT ope...
AIOps is Revolutionizing IT Operations Management.pdf
How Does AIOps Benefit DevOps Pipeline and Software Quality? - DevOps Next
Distributed Trace & Log Analysis using ML
Skill Up Splunk DevOps slides with AIOps MLOps
How AIOps Evolved from Monitoring Tools to Autonomous IT Operations_.pdf
How to apply machine learning into your CI/CD pipeline
A Comprehensive Guide to AIOps Integration in Organizations
Know your DevOps
GCP-pdevops devops engineer exam prepearitaon guide
SOC Lessons from DevOps and SRE by Anton Chuvakin
integrating-cognitive-services-into-your-devops-strategy
Integrating cognitive services in to your devops strategy
Driving Digital Transformation through Service-Centric AIOps
SRE (service reliability engineer) on big DevOps platform running on the clou...
The-OpsRamp-State-of- the- AIOps-Report.pdf
Strengthen and Scale Security for a dollar or less
Ad

More from AgileNetwork (20)

PDF
ANIn Mumbai 2025 | Measuring Business Value during Agile Transformation by Pr...
PPTX
ANIn Ahmedabad 2025 | Quality as Foundation of Business Agility: How QA Enabl...
PPTX
ANIn Ahmedabad 2025 | Beyond Survival: Enabling Growth Mindset by Abhishek Bh...
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
PPTX
Agile Chennai 18-19 July 2025 Ideathon | Crypton- an AI driven, Quantum Resis...
PPTX
Agile Chennai 18-19 July 2025 | Leading with Integrity in the Age of AI – A C...
PDF
Agile Chennai 18-19 July 2025 | Workshop - Leadership in an Uncertain World: ...
PPTX
Agile Chennai 18-19 July 2025 | The Human Metrics of Agile: Building Resilien...
PPTX
Agile Chennai 18-19 July 2025 | Adaptive Organizations: Built to Learn, Ready...
PPTX
Agile Chennai 18-19 July 2025 | Workshop - Enhancing Agile Collaboration with...
PPTX
Agile Chennai 18-19 July 2025 | The Purpose Playbook: Building AI that Solves...
PDF
Agile Chennai 18-19 July 2025 | The Story of KM Implementation for enabling V...
PPTX
Agile Chennai 18-19 July 2025 | Beyond Survival: Resilience Through Agility a...
PPTX
Agile Chennai 18-19 July 2025 | Kanban: The Shop Floor’s Secret to Smooth Wor...
PDF
Agile Chennai 18-19 July 2025 | Unpacking OKRs: A Guide to Strategic Sophisti...
PPTX
Agile Chennai 18-19 July 2025 | Agility for Resilience - Adaptive Systems & C...
PPTX
Agile Chennai 18-19 July 2025 | Redefining Customer Centricity by Aarthi Ramesh
PDF
ANIn Bengaluru 2025 | Workshop- Innovate For Business Agility: Idea Generatio...
PPTX
ANIn Bengaluru 2025 | Working Smarter: The Fusion of Agile Mindsets and AI Mi...
ANIn Mumbai 2025 | Measuring Business Value during Agile Transformation by Pr...
ANIn Ahmedabad 2025 | Quality as Foundation of Business Agility: How QA Enabl...
ANIn Ahmedabad 2025 | Beyond Survival: Enabling Growth Mindset by Abhishek Bh...
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
Agile Chennai 18-19 July 2025 Ideathon | Crypton- an AI driven, Quantum Resis...
Agile Chennai 18-19 July 2025 | Leading with Integrity in the Age of AI – A C...
Agile Chennai 18-19 July 2025 | Workshop - Leadership in an Uncertain World: ...
Agile Chennai 18-19 July 2025 | The Human Metrics of Agile: Building Resilien...
Agile Chennai 18-19 July 2025 | Adaptive Organizations: Built to Learn, Ready...
Agile Chennai 18-19 July 2025 | Workshop - Enhancing Agile Collaboration with...
Agile Chennai 18-19 July 2025 | The Purpose Playbook: Building AI that Solves...
Agile Chennai 18-19 July 2025 | The Story of KM Implementation for enabling V...
Agile Chennai 18-19 July 2025 | Beyond Survival: Resilience Through Agility a...
Agile Chennai 18-19 July 2025 | Kanban: The Shop Floor’s Secret to Smooth Wor...
Agile Chennai 18-19 July 2025 | Unpacking OKRs: A Guide to Strategic Sophisti...
Agile Chennai 18-19 July 2025 | Agility for Resilience - Adaptive Systems & C...
Agile Chennai 18-19 July 2025 | Redefining Customer Centricity by Aarthi Ramesh
ANIn Bengaluru 2025 | Workshop- Innovate For Business Agility: Idea Generatio...
ANIn Bengaluru 2025 | Working Smarter: The Fusion of Agile Mindsets and AI Mi...

Recently uploaded (20)

PDF
Literature_Review_methods_ BRACU_MKT426 course material
PDF
Climate and Adaptation MCQs class 7 from chatgpt
PDF
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
PPTX
DRUGS USED FOR HORMONAL DISORDER, SUPPLIMENTATION, CONTRACEPTION, & MEDICAL T...
PDF
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
PPTX
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
PDF
Journal of Dental Science - UDMY (2021).pdf
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 2).pdf
PDF
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 1).pdf
PPTX
What’s under the hood: Parsing standardized learning content for AI
PDF
Skin Care and Cosmetic Ingredients Dictionary ( PDFDrive ).pdf
PPTX
Core Concepts of Personalized Learning and Virtual Learning Environments
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PDF
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
PDF
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
DOCX
Cambridge-Practice-Tests-for-IELTS-12.docx
PPTX
Introduction to pro and eukaryotes and differences.pptx
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
PDF
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
PDF
IP : I ; Unit I : Preformulation Studies
Literature_Review_methods_ BRACU_MKT426 course material
Climate and Adaptation MCQs class 7 from chatgpt
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
DRUGS USED FOR HORMONAL DISORDER, SUPPLIMENTATION, CONTRACEPTION, & MEDICAL T...
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
Journal of Dental Science - UDMY (2021).pdf
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 2).pdf
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 1).pdf
What’s under the hood: Parsing standardized learning content for AI
Skin Care and Cosmetic Ingredients Dictionary ( PDFDrive ).pdf
Core Concepts of Personalized Learning and Virtual Learning Environments
B.Sc. DS Unit 2 Software Engineering.pptx
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
Cambridge-Practice-Tests-for-IELTS-12.docx
Introduction to pro and eukaryotes and differences.pptx
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
IP : I ; Unit I : Preformulation Studies

Agile Network India | Agility Day @Noida | SRE & AIOps | Murugan Muthayan

  • 1. Delivering Transformation. Together. SITE RELIABILITY ENGINEERING & AIOPS Murugan Muthayan Agility Day – Noida – Aug 2019
  • 2. AGILE, DEVOPS AND SRE.. 2 Agile Development • Transformed the way software being built • Collaboration & quicker feedback loop • Better control, early value DevOps • Cultural transformation focused on delivery speed • Enable automation wherever possible • Make development and operation process frictionless Site Reliability Engineering Focus to improve the reliability of software in production by implementing the best practices in engineering and operations
  • 3. Tesco Transport Systems Adjustment3 SITE RELIABILITY ENGINEERING SRE incorporates Engineering, Infrastructure and Operation aspects to create scalable and reliable software systems that are highly automatic and self-healing. SRE aims at DevOps to NoOps - “what happens when a software engineer is tasked with what used to be called operations.” - Ben Treynor, Founder of Google SRE The purpose of SRE is to achieve reliability by implementing the best practices in engineering and operations. SRE can be thought of as an extreme implementation of DevOps.
  • 4. FOCUS AREA FOR SRE 4 MONITORING (Performance Metrics) ALERTING (Immediate notifications) INFRASTRUCTURE (Scalability / Limitations) ENGINEERING (Application Design) DEBUGGING (Log files, code analysis) SECURITY (Vulnerabilities) BEST PRACTICES (Documentation & Training)
  • 5. SITE RELIABILITY ENGINEER 5 The ideal site reliability engineer is either a software engineer with a good administration background or a highly skilled system administrator with knowledge of coding and automation – “Part systems administrator, part second tier support and part developer” 50% cap on the aggregate "ops" work for all SREs. SRE team must spend the remaining 50% of its time actually doing development activities An SRE team is responsible for, • availability, • latency, • performance, • efficiency, • change management, • monitoring, • emergency response, • capacity planning
  • 6. 6 SRE – SKILLS CHECKLIST
  • 7. SRE - METRICS & MEASUREMENTS 7 Service Level Indicators that measures failures per request by calculating request latencySLI Service Level Objectives that sets goals for System availability, performance, success ratesSLO Service level agreements that are driven from SLO and dictate commercial penaltiesSLA It is a measure of risk and the amount of headroom you have above the SLAError Budget Mean time to repair is average time required to repair a failureMTTR Predicted elapsed time between inherent failures of a system during operationMTTF
  • 8. TAKE AWAY.. 8 ..and AIOps takes a further step from SRE towards automating IT operations using advanced analytics !!!
  • 9. COGNITIVE LEARNING – INTELLIGENT OPERATIONS (AIOps) 9 Insight Predict Big Data Machine Learning
  • 10. Definition - What Does AIOps Mean? 10 AIOps is a methodology that is on the frontier of enterprise IT operations. AIOps automates various aspects of IT and utilizes the power of artificial intelligence to create self-learning programs that help revolutionize IT services. It is the application of advanced analytics—in the form of machine learning (ML) and artificial intelligence (AI), towards automating operations so that your IT Ops team can move at the speed that your business expects today. AIOps refers to multi-layered technology platforms that automate and enhance IT operations by 1) using analytics and machine learning to analyze big data collected from various IT operations tools and devices, in order to 2) automatically spot and react to issues in real time.
  • 11. What Will Tomorrow Look Like ? 11 ….Function Follows Need Distributed Computing Software Defined Everything Monitoring Platforms ISV Platforms Patchwork, Open source, Departmental Source Events Custom/Standard/Fixed ~ 100 – 1000 eps Chaotic, Unstructured ~ 1000 – 100,000 eps Configuration Flexible TBC ~ hours Chaotic TBC < 1 second Infrastructure Multi vendor UNIX/IP/Windows client server Virtualised/Containers Fluid/UNIX/Mobile/Micro Digital Transformation Demands DevOps & elastic 2010 2020
  • 12. Current and Future Demands 12 Scale • 105+ Moving Parts • 106+ Notifications • 109+ Data Points • 1012 -> 10120+ Possible Failure Modes + Bounded by the estimated information content of the universe ! Compulsion of Change Complexity Reduction in the Unit of compute Mainframe → Server → VM → Container Multiple Orders of Magnitude Increase in Change Cycle Fully fluid CI/CD Cycle
  • 13. Traditional IT Ops caught Flat - Footed 13 Overwhelmed by DATA and a lack of INFORMATION Siloed teams and tools Too many alerts No context when an incident occurs No early warning DevOps lacks proactive assurance 75-80% ~ 90% > 45% > 73% Many Siloed War room
  • 14. IT Ops Priorities Driven by Digital Transformation 14 INCREASE frequency of change, stability and availability of IT services1 REDUCE resource operations workload and INCREASE productivity2 CONSOLDATE tools3 MIGRATE to the cloud4 SUPPORT software-defined services5 SUPPORT microservices based software architecture6
  • 15. AIOps Agile and Proactive Event-to-Resolution Workflow 15 Early Detection, fewer tickets, reduced MTTR Industrialised data ingestion from multiple sources Automatically resolves signals from alert noise Proactively and automatically detects incidents and probable root causes (reduced MTTD) Enables collaborative workflows (reduces adverse business impact) Triggers automation to restore services Predictive insights (reduced support escalations and MTTR)
  • 16. How AIOps makes ITOps Robutst ? 16 • Determine the service health of mission-critical services or applications. • Gain control and visibility to spiraling consumption of cloud resources. • Accelerate MTTR with automated incident management and real- time configuration management database (CMDB) updates. • Build context-rich data lakes integrating disparate, third-party data sources.
  • 17. AIOps makes Teams Faster, Smarter, and More Productive 17 Level 0/NOC Operators • Improve efficiency by consolidating related alerts together • Reduce catch-n-dispatch activities Support SMEs & Developers • Pass incident resolution knowledge to lower support tiers • Collaborate across complex multi-disciplinary incidents IT Operations Managers • Delivery service-level state monitoring • Improve efficiency and job satisfaction • Identify and address repeating mundane work with run book automation • Investigate and problem-solve for frequently repeating P3-P5 incidents IT Senior Management • Achieve overall per-alert efforts reduction • Re -purpose the savings towards business’s bottom line