Operating Kubernetes at
Scale
@joerg_schad @gaunetes @dcos
Chris Gaun
PMM at Mesosphere /
Kubernetes Expert /
CNCF Ambassador
● Previously a
Gartner analyst
covering public IaaS
● In the Kubernetes
community for 3 years
Jörg Schad
Technical Community
Lead / Developer
● Core Mesos
developer at
Mesosphere
● Passions are deep
learning, distributed
data systems, and
data analytics
Bootcamp: Building Kubernetes-as-a-Service at
Scale, Anywhere
● Episode 1: Building Kubernetes-as-a-Service
at Scale
● Episode 2: Deploying Kubernetes at Scale
with DC/OS
● Episode 3: Kubernetes and Big Data
Services
● Episode 4: Operating Kubernetes at Scale
with DC/OS
● End-to-end components
and best practices
● Automated management
of Kubernetes
● Connecting Kubernetes
to Big Data services
● Delivering an entire
Kubernetes solution
Star / Clone Github
1. Go to Kubernetes DC/OS
quickstart
2. Search “DC/OS Kubernetes
Quickstart Github” or
https://github.com/mesosphere/dcos-kubernetes-quickstart
3. Live demo
https://github.com/dcos/demos/tree/master/flink-k8s/1.11
Sign Up For Slack
1. Slack URL: https://chat.dcos.io/
2. Join #kubernetes channel
3. OSS support / feedback
[Diagram: Kubernetes as-a-Service with one-click installation, shown with 20+ more cloud native services on a shared pool of servers]
How to operate?
© 2018 Mesosphere, Inc. All Rights Reserved.
DC/OS
Datacenter and Cloud as a Single Computing Resource
Powered by Apache Mesos
● Workloads: microservices, containers, & dev tools; data services, machine learning, & AI
● Service catalog labels: 100+ more, 20+ more
● Capabilities: Security & Compliance, Application-Aware Automation, Multitenancy, Hybrid Cloud Management
● Infrastructure: physical infrastructure, virtual machines, public clouds (datacenter, edge)
• Brings “as-a-Service”
automation to any application
technology on any
infrastructure
• Organizations run all types of
container management as-a-Service
using Mesos:
"(Netflix) launches up to 500,000
containers and 200,000
clusters/day"
- Netflix OSS, on using Titus container
management on top of Mesos
Day 2
Challenges
Day 2 Operations
Maintenance
- Cluster Upgrades
- Cluster Resizing
- Capacity Planning
- User & Package Management
- Networking Policies
- Auditing
- Backups & Disaster Recovery
Troubleshooting
- Debugging
- Services
- System
- Tracing
- Chaos engineering
DAY 2 OPERATIONS
Metrics and Monitoring
- Collecting metrics
- Downstream processing
○ Alerting
○ Dashboards
○ Storage (long-term retention)
Logging
- Scopes
- Local vs. centralized
- Security considerations
[Diagram: Kubernetes as-a-Service with zero-touch, automated self-healing, shown with 20+ more cloud native services on a shared pool of servers]
Two-level Scheduling
1. Agents advertise resources to Master
2. Master offers resources to Framework
3. Framework rejects / uses resources
4. Agent reports task status to Master
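The four steps above can be sketched as a toy simulation. This is a minimal illustration under stated assumptions, not the Mesos API: the class names `Offer`, `Framework`, and `Master`, the fixed-size tasks, and offering to frameworks in list order are all hypothetical simplifications (a real master uses its own allocation policy). Only the `TASK_RUNNING` state name is taken from Mesos.

```python
from dataclasses import dataclass, field


@dataclass
class Offer:
    agent: str
    cpus: float
    mem_mb: int


@dataclass
class Framework:
    """Hypothetical framework scheduler that launches fixed-size tasks."""
    name: str
    task_cpus: float
    task_mem_mb: int
    pending_tasks: int

    def consider(self, offer: Offer) -> bool:
        """Step 3: use the offer if a pending task fits, otherwise reject it."""
        fits = offer.cpus >= self.task_cpus and offer.mem_mb >= self.task_mem_mb
        if self.pending_tasks > 0 and fits:
            self.pending_tasks -= 1
            return True
        return False


@dataclass
class Master:
    frameworks: list
    resources: dict = field(default_factory=dict)    # agent -> (cpus, mem_mb)
    task_status: list = field(default_factory=list)  # (agent, framework, state)

    def advertise(self, agent: str, cpus: float, mem_mb: int) -> None:
        """Step 1: an agent advertises its free resources to the master."""
        self.resources[agent] = (cpus, mem_mb)

    def offer_round(self) -> None:
        """Step 2: offer each agent's remaining resources to every framework."""
        for agent, (cpus, mem_mb) in list(self.resources.items()):
            for fw in self.frameworks:
                if fw.consider(Offer(agent, cpus, mem_mb)):
                    cpus -= fw.task_cpus
                    mem_mb -= fw.task_mem_mb
                    # Step 4: the agent reports the task status to the master.
                    self.task_status.append((agent, fw.name, "TASK_RUNNING"))
            self.resources[agent] = (cpus, mem_mb)


if __name__ == "__main__":
    kafka = Framework("kafka", task_cpus=2.0, task_mem_mb=4096, pending_tasks=1)
    k8s = Framework("kubernetes", task_cpus=1.0, task_mem_mb=2048, pending_tasks=2)
    master = Master(frameworks=[kafka, k8s])
    master.advertise("agent-1", cpus=4.0, mem_mb=8192)
    master.advertise("agent-2", cpus=1.0, mem_mb=1024)
    master.offer_round()
    for status in master.task_status:
        print(status)
```

The point of the split is that the master only decides who gets offered what, while each framework decides whether an offer is good enough for its own tasks.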
MESOS ARCHITECTURE
● Mesos Masters (three shown)
● Framework schedulers: Kubernetes Scheduler, Marathon Scheduler, Kafka Scheduler
● Mesos Agent: Cassandra Executor (Cassandra Task), Spark Executor (Spark Task)
● Mesos Agent: Docker Executor (Docker Task), K8s Executor (Kubelet Task)
Plans
dcos hdfs --name=hdfs plan status deploy
deploy (serial strategy) (COMPLETE)
├─ journal (serial strategy) (COMPLETE)
│ ├─ journal-0:[node] (COMPLETE)
│ ├─ journal-1:[node] (COMPLETE)
│ └─ journal-2:[node] (COMPLETE)
├─ name (serial strategy) (COMPLETE)
│ ├─ name-0:[node, zkfc] (COMPLETE)
│ └─ name-1:[node, zkfc] (COMPLETE)
└─ data (serial strategy) (COMPLETE)
   ├─ data-0:[node] (COMPLETE)
   ├─ data-1:[node] (COMPLETE)
   └─ data-2:[node] (COMPLETE)
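A plan like the one above is a tree of phases and steps, and a serial strategy works through them one at a time, in order. The sketch below models that ordering for the HDFS deploy plan shown. It is an illustration only: the `Step`, `Phase`, and `Plan` classes and the `PENDING` and `IN_PROGRESS` status names are assumptions made for the example (the output above shows only `COMPLETE`), and this is not the scheduler's actual implementation.

```python
from dataclasses import dataclass

COMPLETE, PENDING, IN_PROGRESS = "COMPLETE", "PENDING", "IN_PROGRESS"


def rollup(statuses):
    """Derive a parent status from its children's statuses."""
    statuses = list(statuses)
    if all(s == COMPLETE for s in statuses):
        return COMPLETE
    if any(s != PENDING for s in statuses):
        return IN_PROGRESS
    return PENDING


@dataclass
class Step:
    name: str
    status: str = PENDING


@dataclass
class Phase:
    name: str
    steps: list

    @property
    def status(self):
        return rollup(s.status for s in self.steps)


@dataclass
class Plan:
    name: str
    phases: list

    @property
    def status(self):
        return rollup(p.status for p in self.phases)

    def next_step(self):
        """Serial strategy: first incomplete step of the first incomplete phase."""
        for phase in self.phases:
            for step in phase.steps:
                if step.status != COMPLETE:
                    return step
        return None


def hdfs_deploy_plan():
    """Phases and steps copied from the plan status output above."""
    return Plan("deploy", [
        Phase("journal", [Step(f"journal-{i}:[node]") for i in range(3)]),
        Phase("name", [Step(f"name-{i}:[node, zkfc]") for i in range(2)]),
        Phase("data", [Step(f"data-{i}:[node]") for i in range(3)]),
    ])


if __name__ == "__main__":
    plan = hdfs_deploy_plan()
    while (step := plan.next_step()) is not None:
        step.status = COMPLETE  # stand-in for actually launching the task
        print(f"{step.name} -> {plan.status}")
```

With this ordering, no name node is started before all journal nodes are complete, and no data node before both name nodes.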
TROUBLESHOOTING
Debugging Dashboard
Demo
- Failure Handling
- Configuration Updates
Download Now
https://mesosphere.com/resource/category/ebook/
THANK YOU!
ANY QUESTIONS?
@dcos
users@dcos.io
/groups/8295652
/dcos
/dcos/examples
/dcos/demos
chat.dcos.io
https://github.com/mesosphere/dcos-kubernetes-quickstart
https://mesosphere.com/blog/another-kubernetes-service/
MapReduce is
crunching Data
Ancient
Times...
But then business
demanded
FAST DATA
We need to turn faster!
Today...
Fast Data
● Processing styles, slowest to fastest: Batch, Micro-Batch, Event Processing
● Time scale: Days, Hours, Minutes, Seconds, Microseconds
● Slower end: reports what has happened using descriptive analytics
● Faster end: solves problems using predictive and prescriptive analytics
● Example uses: Billing, Chargeback; Product recommendations; Real-time Pricing and Routing; Real-time Advertising; Predictive User Interface
The SMACK Stack
● EVENTS: ubiquitous data streams from connected devices (sensors, devices, clients)
● INGEST: Apache Kafka, ingests millions of events per second
● STORE: Apache Cassandra, a distributed & highly scalable database
● ANALYZE: Apache Spark, real-time and batch data processing
● ACT: Akka, visualize data and build data-driven applications
● Runs on Apache Mesos / DC/OS
Episode 4: Operating Kubernetes at Scale with DC/OS
The SMACK Stack
● EVENTS: ubiquitous data streams from connected devices (sensors, devices, clients)
● INGEST: Apache Kafka, ingests millions of events per second
● STORE: Apache Cassandra, a distributed & highly scalable database
● ANALYZE: Apache Flink, real-time and batch data processing
● ACT: Akka, visualize data and build data-driven applications
● Runs on Apache Mesos / DC/OS
Datacenter
Typical Datacenter
siloed, over-provisioned servers,
low utilization
Kubernetes
Jenkins
Kafka
Spark
Cassandra
3 AM
Typical Datacenter
siloed, over-provisioned servers,
low utilization
Kubernetes
Jenkins
Kafka
Spark
Cassandra
DC1
DC2
DC3
3 AM
Datacenter
Typical Datacenter
siloed, over-provisioned servers,
low utilization
Mesos/ DC/OS
automated schedulers, workload multiplexing onto the
same machines
Kubernetes
Jenkins
Kafka
Spark
Cassandra
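The contrast between siloed and multiplexed capacity can be made concrete with a little arithmetic. In the sketch below every number is hypothetical, chosen only to illustrate the idea: each service's demand (in servers) is sampled at three made-up times of day, with the batch-style workload peaking at 3 AM while the others are quiet. A siloed datacenter must provision each service for its own peak; a shared pool only needs to cover the peak of the combined demand.

```python
# Hypothetical demand in servers at three sample times: [midday, 3 AM, evening].
DEMAND = {
    "Kubernetes": [8, 2, 6],
    "Jenkins": [4, 1, 2],
    "Kafka": [3, 3, 3],
    "Spark": [1, 8, 2],
    "Cassandra": [4, 4, 4],
}


def siloed_capacity(demand):
    """Each service gets its own servers, sized for that service's peak."""
    return sum(max(samples) for samples in demand.values())


def multiplexed_capacity(demand):
    """All services share one pool, sized for the peak of the combined demand."""
    return max(sum(column) for column in zip(*demand.values()))


def utilization(demand, capacity):
    """Fraction of provisioned servers in use at each sample time."""
    return [sum(column) / capacity for column in zip(*demand.values())]


if __name__ == "__main__":
    siloed = siloed_capacity(DEMAND)
    shared = multiplexed_capacity(DEMAND)
    print("siloed servers:", siloed)
    print("shared servers:", shared)
    print("siloed utilization:", utilization(DEMAND, siloed))
    print("shared utilization:", utilization(DEMAND, shared))
```

The gap between the two capacities grows when the services peak at different times, which is the case the 3 AM slides point at.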
Distributed Systems are ...
HDFS Scheduler
Why {Spark, HDFS, ..} on K8s today?
Kelsey Hightower
Kubernetes Thought
Leader
Ranked #1 K8s Influencer
Staff Developer
Advocate
PM & Chief Advocate
Today Big Data on K8s is more DIY
Top
Kubernetes
Advocate
SMACK Stack
Generator, Display
1. Financial data created by generator
2. Written to Kafka topics
3. Kafka topics consumed by Spark or Flink
4. Results written back into Kafka stream (another topic)
7. Results displayed
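The data flow in the steps above can be sketched end to end with in-memory stand-ins. This is a toy model under loud assumptions: plain `deque` objects play the role of Kafka topics, a Python function plays the role of the Spark or Flink job, and the topic names, record fields, and per-account running total are all hypothetical. It uses no Kafka, Spark, or Flink API.

```python
from collections import defaultdict, deque


def make_topics():
    """In-memory stand-in for a set of Kafka topics (topic name -> queue)."""
    return defaultdict(deque)


def generate(topics, count):
    """Steps 1-2: create synthetic financial records and write them to a topic."""
    for i in range(count):
        record = {"account": f"acct-{i % 2}", "amount": 10 * (i + 1)}
        topics["transactions"].append(record)


def process(topics):
    """Steps 3-4: consume the input topic and write running totals to another topic."""
    totals = defaultdict(int)
    while topics["transactions"]:
        record = topics["transactions"].popleft()
        totals[record["account"]] += record["amount"]
        topics["totals"].append(
            {"account": record["account"], "total": totals[record["account"]]}
        )


def display(topics):
    """Final step: read the results topic and keep the latest total per account."""
    latest = {}
    while topics["totals"]:
        result = topics["totals"].popleft()
        latest[result["account"]] = result["total"]
    return latest


if __name__ == "__main__":
    topics = make_topics()
    generate(topics, count=4)
    process(topics)
    print(display(topics))
```

Keeping the results in a second topic, rather than handing them straight to the display, is what lets the generator, the processing job, and the display run and fail independently.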