© 2018 IBM Corporation
Cloud Service Management and why Machine Learning is now essential
June 21, 2018
Agenda
Part 1: Cloud Service Management
Part 2: Machine Learning is essential (for adaptive automation)
Part 3: Wrap-up, Call to action & Q&A
Cloud enables digital transformation
To transform, organizations are employing App Modernization, Hybrid, and DevOps
Supporting agility at scale requires managing increasing data growth, complexity, and dynamic environments
Deliver reliable, competitive applications fast
Business Reality & needs:
• Agile Application Delivery
• End user experience & reliability
• Lean Operations Management
Ops Goal: Fewer problem tickets, faster resolution
Dev Goal: Faster time to market, reduced disruptions
[Figure: delivery pipeline Dev, Test, Stage, Prod, annotated with Shift Left and Shift Right]
• 10 yrs: virtually every application & service will incorporate AI (Gartner)
• 1/3 of the top 20 companies in every industry will be disrupted in the next 3 years
• 99% of apps must be refactored to move to cloud
LoB Executive
Application Owner
Application Developer
Chief Information Officer
IT Operations Manager
IT Operations Engineer
Business Imperatives are Driving Faster Change
Agility depends on DevOps practices and Cloud-Enabled Process Innovation
Systems of Record: Operational Excellence (Traditional Management)
Systems of Engagement: Transformation & Differentiation (Agile Management)
Dimension         Traditional Model       Agile Model
IT projects       Some, big               Many, small
Time to go live   2-3 years               2-3 months
Change rate       Lower                   Higher
Governance        Centralized             Decentralized
Tools             Cloud-ready, on-prem    Cloud-Native
Processes         ITIL, CMMI              DevOps, Lean
Hybrid Ops
Hybrid Apps
Source: The agile CIO: Mastering digital disruption. http://blog.kpmg.ch/the-agile-cio-mastering-digital-disruption/
Process, Tools and Culture
Growing an Agile organization requires adaptation across the organization.
Process, Tools & Technology, Culture:
• Adjust processes to enable Agility
• Continued High Availability and Performance
• Built-to-Manage Approach
• Integrate Cloud Service Management toolchain with existing ITSM capabilities
• Implement New Tools (ChatOps, Runbook Automation, etc.)
• Orient on Application Agility and shared success (DevOps)
• Transition to New Roles (e.g. Site Reliability Engineer, First Responder)
• Transition to Proactive monitoring (Analytics)
Enterprise DevOps Adoption
“The Future is already here, it is just unevenly distributed” – William Gibson
New DevOps Startup
• Full Stack Engineers
• Highly Collaborative
• Informal and Agile
• Focused and Independent
Enterprise Business Reality:
• Some Agile Applications
• Some Legacy Applications
• Adopting Cloud Operating Model
• Mix of Traditional and Cloud
IT Service Management (ITIL)
• Process Oriented
• Resistant to Change
Cloud Service Management
• Service Oriented
• Dynamic and Agile
Roles under IT Service Management: L1 Ops, L2 Ops, SME
Roles under Cloud Service Management: First Responder, Site Reliability Engineer, DevOps/SME
Hybrid Cloud Management enables the transformation journey
Digital transformation
Theme                 Value
Flexibility           Select and manage the right cloud path for you
Agility               Manageable, secure DevOps delivered at scale
Adaptive Automation   Recognize and respond to dynamic environments
Cognitive Data Scientist: learns, decides, improves
AI to speed problem determination
Part 2: Machine Learning is essential (for adaptive automation)
Adaptive automation
70% of IT professionals agree: we will be overwhelmed without automation.
Adaptive Automation: recognize and respond to dynamic environments
[Figure: maturity stages plotted against Scale and Complexity]
• Reactive: Real-time analytics
• Proactive: Predictive insights
• Adaptive: Cognitively enhanced workflow
Adaptive Automation
Machine learning, advanced analytics and cognitive technologies delivering automated value for Centralized IT Operations and DevOps teams
Insights to increase efficiency
• Automated noise reduction
• Automation of complex tasks
Insights to avoid outages
• Automatically detect behavioural changes
• Take action before users are impacted
Insights to reduce MTTR
• Probable cause identification
• Context in dynamic environments
Reactive: Real-time analytics
Insights to reduce MTTR
Insights from your terabytes of operational data
“Right there - visually - we saw proof that you can use machine learning to be able to identify root cause... Everyone sat there in silence for three minutes.”
David Nestic, Technical operations manager, NBN
Proactive: Predictive Insights
Insights to avoid outages
Machine Learning applied automatically to your performance data
“After testing the cognitive monitoring solution (IBM Operations Analytics Predictive Insights) ... we saw a significant reduction in server incidents ... Thanks to it we will have a platform that can help us act before an incident occurs.”
Jan Steen Olsen, Executive Vice President and CTO, Danske Bank
Adaptive: Cognitively Enhanced Workflows
Insights to increase efficiency
Automate, automate, automate with Machine Learning applied to your event and performance data. Extend with Watson.
“We live on the edge of control, trying to assure our systems and deal with ever-changing business and user requirements. To control costs, we need to keep operations lean by processing only actionable alarms ... On average, we reduced 15% of the ‘noise’ alarms.”
Operations Leader, fast-growing Canadian telco
Machine Learning for Reactive Management
Event sources (traditional events): Cisco ACI, Docker, Kubernetes, OpenStack, TADDM, NOI, VMware vCenter, ITNM, IBM ALM, DNS, REST
Netcool Ops Insight (Cognitive Event MoM):
- Event Clustering
- Seasonal Analysis and Suppression
- Weighted probable cause
Output: Correlated Event Groups
Collaboration & Automation: ChatOps, Notification, Run Books
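To make "Event Clustering" concrete, here is a minimal sketch in Python of one way to group event types that nearly always fire together. It is not the Netcool Operations Insight algorithm, which the slides do not describe. The function name, the 60-second window, the Jaccard cut-off and the sample events are all assumptions chosen for illustration.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_groups(events, window_seconds=60, min_jaccard=0.8):
    """Group event types whose firing windows overlap almost completely.

    events: iterable of (timestamp_seconds, event_type) pairs (hypothetical format).
    Returns a list of sets, one per group of related event types.
    """
    # Map each event type to the set of time buckets in which it appeared.
    buckets = defaultdict(set)
    for ts, event_type in events:
        buckets[event_type].add(int(ts // window_seconds))

    # Union-find over event types: link two types when their bucket sets
    # have a Jaccard similarity at or above the cut-off.
    parent = {t: t for t in buckets}

    def find(t):
        while parent[t] != t:
            parent[t] = parent[parent[t]]
            t = parent[t]
        return t

    for a, b in combinations(sorted(buckets), 2):
        shared = len(buckets[a] & buckets[b])
        total = len(buckets[a] | buckets[b])
        if total and shared / total >= min_jaccard:
            parent[find(a)] = find(b)

    groups = defaultdict(set)
    for t in buckets:
        groups[find(t)].add(t)
    # Singletons are not groups; only return types that co-occur with others.
    return [g for g in groups.values() if len(g) > 1]


# Made-up sample: link_down and bgp_peer_lost always arrive within the same minute.
sample = [
    (0, "link_down"), (5, "bgp_peer_lost"),
    (300, "link_down"), (310, "bgp_peer_lost"),
    (600, "link_down"), (605, "bgp_peer_lost"),
    (900, "disk_full"),
]
groups = cooccurrence_groups(sample)
```

An operator would then see one correlated group instead of two separate alarms each time the link drops. Fixed buckets are a simplification: two events a few seconds apart can fall on either side of a bucket boundary, so a production approach would more likely use sliding windows.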
Machine Learning for Reactive and Proactive Management
Data sources (traditional events and metrics): Cisco ACI, Docker, Kubernetes, OpenStack, TADDM, NOI, VMware vCenter, ITNM, IBM ALM, DNS, REST
Predictive Insights (Cognitive Performance MoM):
- AI driven Model selection
- Variance Analysis
- Dependency Determination
- Dynamic Threshold
Netcool Ops Insight (Cognitive Event MoM):
- Event Clustering
- Seasonal Analysis and Suppression
- Weighted probable cause
Outputs: Proactive Events, Correlated Event Groups
Collaboration & Automation: ChatOps, Notification, Run Books
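As an illustration of what a "Dynamic Threshold" can mean in practice, the sketch below derives alert bounds from recent history instead of a hand-set static limit. It uses the median and the median absolute deviation (MAD), a common robust choice. This is a swapped-in technique, not the method used by Predictive Insights, and the function names, the multiplier k and the latency values are assumptions.

```python
import statistics


def dynamic_threshold(history, k=3.0):
    """Return (lower, upper) bounds learned from recent metric values.

    Uses median +/- k * scaled MAD, which is far less sensitive to outliers
    than mean +/- k * standard deviation.
    """
    med = statistics.median(history)
    mad = statistics.median([abs(x - med) for x in history])
    # 1.4826 makes the MAD comparable to a standard deviation for normal data.
    spread = 1.4826 * mad
    return med - k * spread, med + k * spread


def is_anomalous(value, history, k=3.0):
    """True when value falls outside the bounds learned from history."""
    lower, upper = dynamic_threshold(history, k)
    return value < lower or value > upper


# Made-up latency samples in milliseconds; the single 250 ms spike
# barely moves the learned bounds.
latency_ms = [102, 98, 101, 99, 100, 103, 97, 100, 250, 101]
lower, upper = dynamic_threshold(latency_ms)
```

With this history the bounds land at roughly 94 ms and 107 ms, so a reading of 104 ms passes while 250 ms is flagged. If the history is constant the MAD is zero and any deviation is flagged, so a real system would add a minimum spread.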
Advanced Analytics for Rapid Context
Agile Service Manager: Dynamic Topology MoM
RESULT: Cognitive Manager of Managers across Event, Performance and Topology data
Cognitive Data Scientist: learns, decides, improves
• Sophisticated Seasonal Modelling
• Robust Statistical approaches (independent of data distribution)
• Multiple Anomaly Detection Algorithms
• Automatic Model Validation
• Long term learning (monthly/yearly patterns)
• Mathematical Relationship Discovery
• Rapid analysis of highly dynamic environments
• Automated Runbooks
• User Domain knowledge
• Alert Mgmt & Collaboration
Adaptive Automation: Incident Management Example
• Automated Early Detection
• Automated Event Suppression & Incident Correlation
• Probable Cause Identification
• Context, in highly Dynamic Environments
• Automated Remediation
Stages measured: Mean-Time-To-Identify (MTTI), Mean-Time-to-Know (MTTK), Mean-Time-to-Fix and Verify
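The stage metrics named above can be computed from incident timestamps. The Python sketch below assumes one plausible reading of the terms: MTTI runs from start to detection, MTTK from detection to diagnosis, and Mean-Time-to-Fix from diagnosis to verified fix. The slides do not define them, and the field names and incident records are invented for illustration.

```python
from datetime import datetime
from statistics import mean


def mean_minutes(incidents, start_key, end_key):
    """Average elapsed minutes between two timestamps across incidents."""
    return mean(
        (inc[end_key] - inc[start_key]).total_seconds() / 60
        for inc in incidents
    )


# Hypothetical incident records; field names are assumptions.
incidents = [
    {
        "started": datetime(2018, 6, 1, 9, 0),
        "identified": datetime(2018, 6, 1, 9, 10),
        "diagnosed": datetime(2018, 6, 1, 9, 40),
        "fixed": datetime(2018, 6, 1, 10, 0),
    },
    {
        "started": datetime(2018, 6, 2, 14, 0),
        "identified": datetime(2018, 6, 2, 14, 20),
        "diagnosed": datetime(2018, 6, 2, 14, 30),
        "fixed": datetime(2018, 6, 2, 15, 30),
    },
]

mtti = mean_minutes(incidents, "started", "identified")    # time to identify
mttk = mean_minutes(incidents, "identified", "diagnosed")  # time to know the cause
mttf = mean_minutes(incidents, "diagnosed", "fixed")       # time to fix and verify
mttr = mean_minutes(incidents, "started", "fixed")         # end-to-end resolution
```

Splitting MTTR this way shows where automation would pay off most: early detection shortens the first stage, probable cause identification the second, and automated remediation the third.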
Part 3: Wrap-up, Call to action & Q&A
Adaptive Automation: Machine Learning and leveraging user experience
Capability
Predict to Get Ahead
• Patterns of behavior with Machine Learning
• Seasonality of environment behavior
• Abnormal behaviors that precede events
Augment the Process
• Cognitive Automated Ticket Creation and Routing
• Cognitive Process Automation with robotics and Watson-guided advice
• Cognitive Process Automation for zero-touch automation with robotics and Watson-embedded advice and next steps
Simplify & Focus
• Pattern Analysis to Correlate & De-duplicate events
• Pattern Analysis for IT Operations
• Cognitive Network 360* Insights
• Real Time Federated Topology
Augment Staff
• Cognitive Incident Advisor
• Cognitive Agent Assist
• Cognitive Knowledgebase with semantic search
• Cognitive Assistant for Change
Products / Cloud Services
• Netcool Operations Insight
• Agile Service Manager
• Hadoop HDFS
• Watson Data Platform (DSX)
• Watson Explorer
• Watson Discovery
• Watson Knowledge Studio
• IBM Operations Analytics – Predictive Insights
• RPA tools
• Watson Explorer Semantic Analysis
• Dynamic Automation
• PASIR
• Watson Assistant
• Watson Conversation Services
• Speech To Text / Text To Speech
• Watson for Cyber Security
• Qradar Watson Advisor
Adaptive Automation: Call to action and Q&A
Predict to Get Ahead, Simplify & Focus, Augment the Process, Augment Staff
Find Out More
Short Videos: Predictive Capabilities
• The Value Video
• The Capability video
IBM Marketplace
• Operations Analytics
• Netcool Operations Insight
• Application Performance Management
Forrester Total Economic Impact (TEI) Studies
• The Operations Management TEI
• The Application Management TEI
IT Operations Maturity Assessment
• Questionnaire to get you thinking
Thank you
Key Capabilities: Reduce MTTR
With 2nd Gen Advanced Real-Time Event, Performance and Topology Analytics
• Groups events that always occur together, providing increased context for faster resolution
• Learns complex relationships across your applications and infrastructure and provides insights into potential root cause
• Rapidly analyses multiple sources of topology to provide up-to-date service and topology views for context
With 1st Gen capabilities for Rapid Problem Resolution
• Big data search across all operational data, supplemented by text-derived insights and log monitoring
Insights to reduce MTTR
• Probable cause identification
• Context in dynamic environments
Reactive: Real-time analytics
Insights from your terabytes of operational data
Key Capabilities: Avoid Outages
Utilise IBM’s advanced machine learning to proactively manage your critical applications and infrastructure
The solution automatically detects behavioural changes and provides insights to help identify root cause
Operations can take corrective action before critical services and users are impacted
The solution has been successfully deployed and has dramatically reduced outages
Insights to avoid outages
• Automatically detect behavioural changes
• Take action before users are impacted
Proactive: Predictive insights
Machine Learning applied automatically to your performance data
2nd Gen ITOA
Key Capabilities: Increase Efficiency
Due to advances, Machine Learning can now automate many human decisions, and it scales and adapts
– Reduces alert noise through advanced seasonal behaviour analysis
– Reduces manual effort by utilising machine learning to set and maintain thresholds
– Reduces tickets and manual effort by automatically grouping events that always occur together
Automatically analyses patterns in operational data to identify waste and automation opportunities
Insights to increase efficiency
• Automated noise reduction
• Automation of complex tasks
Increase Efficiency
Automate, automate, automate with Machine Learning applied to your event and performance data
1st and 2nd Gen ITOA
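The noise-reduction point about seasonal behaviour analysis can be illustrated with a small Python sketch: an alert that fires in the same weekday-and-hour slot in most observed weeks is treated as expected and suppressed. This is a simplified stand-in, not the product's seasonality model. The function names, the 80% cut-off and the backup-job scenario are assumptions.

```python
from collections import Counter
from datetime import datetime, timedelta


def seasonal_slots(history, weeks_observed, min_fraction=0.8):
    """Return the (weekday, hour) slots in which an alert fired in most weeks.

    history: datetimes at which one alert type fired (hypothetical input).
    """
    # Count each slot at most once per ISO week, so several firings within
    # the same hour of the same week do not inflate the score.
    seen = {(ts.isocalendar()[:2], ts.weekday(), ts.hour) for ts in history}
    per_slot = Counter((weekday, hour) for _, weekday, hour in seen)
    return {
        slot for slot, weeks in per_slot.items()
        if weeks / weeks_observed >= min_fraction
    }


def should_suppress(ts, slots):
    """True when a new firing lands in a slot already known to be seasonal."""
    return (ts.weekday(), ts.hour) in slots


# Made-up history: a CPU alert caused by a backup job every Monday around
# 02:00 for four weeks, plus one unrelated firing on a Wednesday afternoon.
first_monday = datetime(2018, 5, 7, 2, 15)
history = [first_monday + timedelta(weeks=i) for i in range(4)]
history.append(datetime(2018, 5, 16, 15, 0))

slots = seasonal_slots(history, weeks_observed=4)
```

Here only the Monday 02:00 slot qualifies, so the next Monday-morning firing can be suppressed while the same alert on a Wednesday afternoon still reaches an operator.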


Cloud Service Management: Why Machine Learning is Now Essential

  • 1. © 2018 IBM Corporation Cloud Service Management and why Machine Learning is now essential June 21 2018
  • 2. © 2018 IBM Corporation Agenda Part 1: - Cloud Service Management Part 2: - Machine Learning is essential (for adaptive automation) Part 3: - Wrap-up, Call to action & Q&A
  • 3. Cloud enables digital transformation To transform, organizations are employing App Modernization, Hybrid, and DevOps Supporting agility at scale requires managing increasing data growth, complexity, and dynamic environments
  • 4. Deliver, reliable, competitive applications Fast Business Reality & needs: • Agile Application Delivery • End user experience & reliability • Lean Operations Management Ops Goal: Fewer problem tickets, faster resolution Dev Goal: Faster time to market, reduce disruptions Dev Test Stage Prod Sto p Shift Right Shift Left virtually every application & service will incorporate AI, Gartner10yrs of the top 20 companies in every industry will be disrupted in the next 3 years1/3 of apps must be refactored to move to cloud99% LoB Executive Application Owner Application Developer Chief Information Officer IT Operations Manager IT Operations Engineer
  • 5. Business Imperatives are Driving Faster Change Agility depends on DevOps practices and Cloud-Enabled Process Innovation Systems of Record Operational Excellence Systems of Engagement Transformation & Differentiation Agile Management Traditional Management Traditional Model Agile Model Some, big IT projects Many, small 2-3 years Time to go live 2-3 months Lower Change rate Higher Centralized Governance Decentralized Cloud-ready, on-prem Tools Cloud-Native ITIL, CMMI Processes DevOps, Lean Hybrid Ops Hybrid Apps Source: The agile CIO: Mastering digital disruption. http://blog.kpmg.ch/the-agile-cio-mastering-digital-disruption/ 5
  • 6. Process, Tools and Culture Growing an Agile organization requires adaptation across the organization . Process Tools & Technology Culture • Adjust processes to enable Agility • Continued High Availability and Performance • Built-to-Manage Approach • Integrate Cloud Service Management toolchain with existing ITSM capabilities • Implement New Tools (ChatOps, Runbook Automation, etc.) • Orient on Application Agility and shared success (DevOps) • Transition to New Roles (i.e. Site Reliability Engineer, First Responder) • Transition to Proactive monitoring (Analytics) 6
  • 7. Enterprise DevOps Adoption “The Future is already here, it is just unevenly distributed” – William Gibson 7 New DevOps Startup • Full Stack Engineers • Highly Collaborative • Informal and Agile • Focused and Independent Enterprise Business Reality: • Some Agile Applications • Some Legacy Applications • Adopting Cloud Operating Model • Mix of Traditional and Cloud IT Service Management (ITIL) • Process Oriented • Resistant to Change Cloud Service Management • Service Oriented • Dynamic and Agile L1 Ops L2 Ops SME Site Reliability Engineer First Responder DevOps/SME
  • 8. Hybrid Cloud Management enables the transformation journey Theme Value Digital transformationAgility Adaptive Automation Select and manage the right cloud path for you Manageable, secure DevOps delivered at scale Recognize and respond to dynamic environments Flexibility
  • 10. © 2018 IBM Corporation Agenda Part 1: - Cloud Service Management Part 2: - Machine Learning is essential (for adaptive automation) Part 3: - Wrap-up, Call to action & Q&A
  • 11. Adaptive automation of IT professionals agree: we will be overwhelmed without automation. 70%
  • 12. Proactive Predictive insights Adaptive automation Reactive Real-time analytics Adaptive Cognitively enhanced workflow Scale Complexity Recognize and respond to dynamic environments Adaptive Automation Recognize and respond to dynamic environments
  • 13. Insights to increase efficiency • Automated noise reduction • Automation of complex tasks Insights to Avoid Outages • Automatically detect behavioural changes • Take action, before users are impacted Insights to reduce MTTR • Probable cause identification • Context, in dynamic environments 13 Adaptive Automation Machine learning, advanced analytics and cognitive technologies delivering automated value for Centralized IT Operations and DevOps teams
  • 14. Insights to increase efficiencyInsights to Avoid Outages Insights from your Terabytes of Operational Data Machine Learning applied automatically to your performance data Automate, automate, automate with Machine Learning applied your event and performance data. Extend with Watson. Reactive Real-time analytics Proactive Predictive Insights Adaptive Cognitively Enhanced Workflows Insights to reduce MTTR "“Right there - visually - we saw proof that you can use machine learning to be able to identify root cause….. Everyone sat there in silence for three minutes.” David Nestic Technical operations manager, NBN Source: "After testing the cognitive monitoring solution (IBM Operations Analytics Predictive Insights) ..we saw a significant reduction in server incidents..Thanks to it we will have a platform that can help us act before an incident occurs” Jan Steen Olsen Executive Vice President and CTO, Danske Bank Source: “We live on the edge of control, trying to assure our systems and deal with ever-changing business and user requirements. To control costs, we need to keep operations lean by processing only actionable alarms”…….On average, we reduced 15% of the “noise” alarms.” Operations Leader, Fast Growing Canadian Telco
  • 15. Correlated Event Groups Traditional Events Cisco ACI Docker Kubernetes OpenStack TADDM NOI VMware vCenter ITNM IBM ALM DNS REST Netcool Ops Insight - Event Clustering - Seasonal Analysis and Suppression - Weighted probable cause Machine Learning for Reactive Management Cisco ACI Cognitive Event MoM Collaboration & Automation ChatOps Notification Run Books
  • 16. Correlated Event Groups Traditional Events Proactive Events Metrics Cisco ACI Docker Kubernetes OpenStack TADDM NOI VMware vCenter ITNM IBM ALM DNS REST Predictive Insights Netcool Ops Insight - AI driven Model selection - Variance Analysis - Dependency Determination - Dynamic Threshold - Event Clustering - Seasonal Analysis and Suppression - Weighted probable cause Machine Learning for Reactive and Proactive Management Cisco ACI Cognitive Performance MoM Cognitive Event MoM Collaboration & Automation ChatOps Notification Run Books
  • 17. Advanced Analytics for Rapid Context 17 Agile Service Manager Dynamic Topology MoM
  • 18. RESULT: Cognitive Manager of Managers across Event, Performance and Topology data Cognitive Data Scientist learns, decides, improves Sophisticated Seasonal Modelling Robust Statistical approaches (independent of data distribution) Multiple Anomaly Detection Algorithms Automatic Model Validation Long term learning (monthly/ yearly patterns) Mathematical Relationship Discovery Rapid analysis of highly dynamic environments Automated Runbooks User Domain knowledge Alert Mgmt & Collaboration Probable Cause Identification Context, in highly Dynamic Environments Automated Remediation Mean-Time-To-Identify (MTTI) Mean-Time-to-Know (MTTK) Automated Event Suppression & Incident Correlation Automated Early Detection Mean-Time-to-Fix and Verify Adaptive Automation Incident Management Example
  • 19. © 2018 IBM Corporation Agenda Part 1: - Cloud Service Management Part 2: - Machine Learning is essential (for adaptive automation) Part 3: - Wrap-up, Call to action & Q&A
  • 20. Adaptive Automation: Machine Learning and leveraging user experience. Capability: Predict to Get Ahead: Patterns of behavior w/ Machine Learning; Seasonality of environment behavior; Abnormal behaviors that are precursors to events. Augment the Process: Cognitive Automated Ticket Creation and Routing; Cognitive Process Automation with robotics and Watson-guided advice; Cognitive Process Automation for zero-touch automation with robotics and Watson-embedded advice and next steps. Simplify & Focus: Pattern Analysis to Correlate & De-duplicate events; Pattern Analysis for IT Operations; Cognitive Network 360* Insights; Real Time Federated Topology. Augment Staff: Cognitive Incident Advisor; Cognitive Agent Assist; Cognitive Knowledgebase w/ semantic search; Cognitive Assistant for Change. Products/Cloud Services: Netcool Operations Insight; Agile Service Manager; Hadoop HDFS; Watson Data Platform (DSX); Watson Explorer; Watson Discovery; Watson Knowledge Studio; IBM Operations Analytics – Predictive Insights; Netcool Operations Insight; RPA tools; Watson Explorer Semantic Analysis; Dynamic Automation; PASIR; Watson Discovery; Watson Knowledge Studio; Watson Assistant; Watson Conversation Services; Watson Explorer; Watson Discovery; Watson Knowledge Studio; Speech To Text / Text To Speech; Watson for Cyber Security; Qradar Watson Advisor
  • 21. Call to action and Q&A. Adaptive Automation: Predict to Get Ahead; Augment the Process; Simplify & Focus; Augment Staff. Find Out More: Short Videos (Predictive Capabilities): The Value Video; The Capability Video. IBM Marketplace: Operations Analytics; Netcool Operations Insight; Application Performance Management. Forrester Total Economic Studies: The Operations Management TEI; The Application Management TEI. IT Operations Maturity Assessment: Questionnaire to get you thinking
  • 22. Thank you
  • 23. Key Capabilities: Reduce MTTR. Reactive: Real-time analytics. Insights from your terabytes of operational data. With 2nd Gen Advanced Real-Time Event, Performance and Topology Analytics: • Groups events that always occur together, providing increased context for faster resolution • Learns complex relationships across your applications and infrastructure and provides insights into potential root cause • Rapidly analyses multiple sources of topology to provide up-to-date service and topology views for context. With 1st Gen capabilities for Rapid Problem Resolution: • Big data search across all operational data, supplemented by text-derived insights and log monitoring. Insights to reduce MTTR: • Probable cause identification • Context, in dynamic environments
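The idea on slide 23 of grouping events that always occur together can be illustrated with a minimal sketch. This is not the Netcool Operations Insight algorithm, which the slides do not describe; it is a deliberately simple stand-in that assumes events arrive as (timestamp in seconds, event type) pairs and treats two event types as related when they appear in exactly the same set of time windows. The function name, the window size and the sample event names are all hypothetical.

```python
from collections import defaultdict


def group_cooccurring_events(events, window_seconds=60, min_occurrences=2):
    """Group event types that appear in exactly the same time windows.

    events: iterable of (timestamp_seconds, event_type) pairs (assumed format).
    Returns a sorted list of groups, each a sorted list of event types.
    """
    # Record which fixed-size time windows each event type was seen in.
    windows_by_type = defaultdict(set)
    for timestamp, event_type in events:
        windows_by_type[event_type].add(int(timestamp // window_seconds))

    # Event types sharing an identical window set "always occur together".
    types_by_signature = defaultdict(list)
    for event_type, windows in windows_by_type.items():
        if len(windows) >= min_occurrences:  # ignore one-off events
            types_by_signature[frozenset(windows)].append(event_type)

    return sorted(sorted(g) for g in types_by_signature.values() if len(g) > 1)


# Hypothetical sample data: two events that fire together twice,
# plus unrelated events that should not be grouped with them.
events = [
    (5, "link_down"), (12, "bgp_peer_lost"), (40, "high_latency"),
    (130, "disk_full"),
    (305, "link_down"), (310, "bgp_peer_lost"), (330, "high_latency"),
    (610, "disk_full"),
    (900, "high_latency"),
]
groups = group_cooccurring_events(events, window_seconds=60)
print(groups)
```

Exact window-set equality is a strict criterion chosen for clarity; a production system would more plausibly tolerate occasional misses, for example by scoring the overlap between window sets.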
  • 24. Key Capabilities: Avoid Outages. Proactive: Predictive insights. Machine Learning applied automatically to your performance data (2nd Gen ITOA). Utilise IBM’s advanced machine learning to proactively manage your critical applications and infrastructure. The solution automatically detects behavioural changes and provides insights to help identify root cause. Operations can take corrective action before critical services and users are impacted. The solution has been successfully deployed and has dramatically reduced outages. Insights to Avoid Outages: • Automatically detect behavioural changes • Take action before users are impacted
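Automatic detection of behavioural changes, as described on slide 24, can be sketched with a dynamic threshold built from robust statistics. The example below is an illustration only, not the Predictive Insights implementation: it assumes a single metric sampled at regular intervals and flags a sample when it falls outside median ± k scaled MADs (median absolute deviations) of the preceding samples. The function names, the baseline size and k are hypothetical choices.

```python
import statistics


def robust_bounds(history, k=3.0):
    """Return (low, high) bounds from the median and MAD of recent samples."""
    median = statistics.median(history)
    mad = statistics.median([abs(x - median) for x in history])
    # 1.4826 scales the MAD to be comparable to a standard deviation
    # for normally distributed data; the bounds collapse if the MAD is 0.
    spread = 1.4826 * mad
    return median - k * spread, median + k * spread


def detect_behaviour_changes(values, baseline_size=20, k=3.0):
    """Return indices of samples outside the bounds of their trailing window."""
    flagged = []
    for i in range(baseline_size, len(values)):
        low, high = robust_bounds(values[i - baseline_size:i], k)
        if not low <= values[i] <= high:
            flagged.append(i)
    return flagged


# Hypothetical metric: steady around 100, then a sudden jump to 150.
values = [100, 101, 99, 102, 98] * 4 + [100, 101, 150]
anomalies = detect_behaviour_changes(values, baseline_size=20, k=3.0)
print(anomalies)
```

Because the bounds are recomputed from a sliding window, the threshold follows the metric without anyone setting or maintaining it by hand, which is the property the slide attributes to machine-learned thresholds.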
  • 25. Key Capabilities: Increase Efficiency. Automate, automate, automate with Machine Learning applied to your event and performance data (1st and 2nd Gen ITOA). Due to advances, Machine Learning can now automate many human decisions, and it scales and adapts: reduces alert noise through advanced seasonal behaviour analysis; reduces manual effort by utilising machine learning to set and maintain thresholds; reduces tickets and manual effort by automatically grouping events that always occur together. Automatically analyses patterns in operational data to identify waste and automation opportunities. Insights to Increase Efficiency: • Automated noise reduction • Automation of complex tasks
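Seasonal noise reduction, mentioned on slide 25, can be illustrated with a small sketch. It is an assumption-laden simplification rather than the product's seasonal analysis: an event type is treated as seasonal for a given hour of day when it occurs in that hour on at least a chosen fraction of the observed days, and matching events are then suppressed. The function names, the 0.8 fraction and the sample events are hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def find_seasonal_slots(events, min_day_fraction=0.8):
    """Return the (event_type, hour) slots that recur on most observed days.

    events: iterable of (datetime, event_type) pairs (assumed format).
    """
    days_seen = set()
    days_by_slot = defaultdict(set)
    for ts, event_type in events:
        days_seen.add(ts.date())
        days_by_slot[(event_type, ts.hour)].add(ts.date())
    return {
        slot for slot, days in days_by_slot.items()
        if len(days) / len(days_seen) >= min_day_fraction
    }


def suppress_seasonal(events, seasonal_slots):
    """Drop events that fall in a known seasonal slot; keep everything else."""
    return [(ts, t) for ts, t in events if (t, ts.hour) not in seasonal_slots]


# Hypothetical data: a nightly backup raises the same alarm at 02:xx every
# day, while a single disk failure happens once in the afternoon.
events = [(datetime(2018, 6, day, 2, 15), "backup_cpu_high") for day in range(1, 6)]
events.append((datetime(2018, 6, 3, 14, 0), "disk_failure"))

seasonal = find_seasonal_slots(events)
remaining = suppress_seasonal(events, seasonal)
print(seasonal, remaining)
```

Only the one-off disk failure survives suppression here, which mirrors the slide's point that operators should see actionable alarms rather than recurring, expected ones.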