SlideShare a Scribd company logo
1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Toward Better Multi-
Tenancy Support from
HDFS
Xiaoyu Yao
Email: xyao@hortonworks.com
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
About myself
⬢ Member of Technical Staff at Hortonworks since 2014
⬢ Apache Hadoop Committer and PMC member.
⬢ Currently working on HDFS.
⬢ This talk is to help better understanding of HDFS multi-tenancy support and ongoing
work for better resource management.
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
⬢ Overview
⬢ Hadoop multi-tenancy features
⬢ HDFS resources and multi-tenancy offerings
⬢ HDFS resource management via resource coupon
⬢ Q&A
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Overview
⬢ Centrally managed infrastructure
–Consolidate to simplify management and lower TCO
–Better utilization and efficiency
⬢ Requirement
–Resource Sharing
–Resource Isolation
–Resource Control
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Multi-Tenancy Support from Hadoop
Resource
Sharing
Resource
Isolation
Resource
Management
HBASE Y Namespace,
Region Server
Group
Quota
YARN Y Queue, Node Label
...
Capacity Scheduler,
...
HDFS Y Federation Quota,
FairCallQueue,
Backoff
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Resources
⬢ Capacity
–Namespace
–Storage Space
–Storage Type
⬢ Operational Resources
–Namenode
•RPC
–Datanode
•Disk & Network
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Resource Sharing/Isolation – Federation
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Capacity Management – Quota
⬢ Quota
–Namespace
–StorageSpace
–HDFS-7584 Quota by Storage Types
⬢ Limitations
–Static
–Per directory
–No per user/job control
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Operational Resource Management – Namenode RPC
Isolation (1)
⬢Internal RPC
–DN->NN block report, heartbeat, etc.
–ZKFC->NN liveness check
⬢External RPC
–Client RPCs from HDFSClients such as MR jobs/Hive queries/HBase
Client Listener
Reader
Reader
Call Queue
Handler
Handler
Handler
FSN
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Operational Resource Management – Namenode RPC
Isolation (2)
⬢Use case:
–HFDS access from normal jobs impacted by offending jobs
–Internal RPCs impacted by External RPCs
–One blocked RPC method could affect others
⬢Protect HDFS internal RPCs:
–Dedicated service RPC server/port
•Isolate DN->NN block report, heartbeat, etc.
–Dedicated lifeline RPC server/port
•Protect ZKFC->NN liveness check
⬢All external RPCs go to the default port (e.g., 8020)
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Resource Management – Name Node RPC Call Queue
⬢ In multi-tenancy scenario, call queue should play an important role like a shock
absorber to accommodate different workload, converting busty arrivals into smooth,
steady departures.
⬢ Good call queue
–queue without call bloat
–catches and handles bursts with no more than a temporary increase of queue delay
–maximum server utilization
⬢ Bad call queue
–queue that exhibits call bloat
–queue filled up and stay filled upon bursts
–low utilization and high queue latency
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Resource Management - Fair Call Queue
⬢ Before HADOOP-9640 LinkedBlockingQueue
–Single queue
–Client blocked and timeout/fail when queue is full
⬢ HADOOP-9640 - Fair Call Queue
–Multiple priority levels and call queues with different processing priority
–Each RPC is assigned a priority by scheduler
–High priority RPC calls are put into call queue with higher probability of being executed.
Scheduler
Queue 0
Queue ...
Queue 2
Multiplexer (WRR)
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Resource Management – Namenode RPC Throttling <1>
⬢ HADOOP-10597 Backoff when the call queue is full
–Send back a Retriable exception
–Let the client do exponential wait and retry instead of blocking/timeout/failed
the call.
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Resource Management – Namenode RPC Throttling <2>
⬢ HADOOP-12916 Backoff based on response time
–The basic idea: Backoff earlier to avoid call queue overload so that namenode
can recover quickly.
–Low priority calls get backed off if response time of high priority call is over
predefined threshold.
–More per user/queue metrics added for trouble shooting.
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Resource Management – Namenode RPC Throttling <3>
⬢ Abstract scheduler interface from call queue for pluggable RPC priority assignment
–DefaultRpcScheduler: all RPC calls with same priority
–DecayRpcScheduler: from original FairCallQueue priority assigned based on
previous call volumes of users.
–Other experimental schedulers: configurable list of high priority user/group for
low latency jobs, medium priority user/group for normal jobs and low priority
user/group for batch jobs.
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS resource management - QoS
⬢ Use case:
–Allow high performance QoS mechanism with minimum decoding effort on server side
⬢ HADOOP-9194 QoS support for Hadoop RPC
–One bytes in RPC header to facilitate QoS mechanism
–E.g., differentiate OLTP/OLAP, batch/streaming against the same HDFS
⬢ Limitation
–No mechanism level implementation yet
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS resource management with YARN
⬢ Use Case
–Priority inversion without centralized resource management (e.g., RPC calls from high priority
YARN jobs may be put into low priority HDFS namenode call queue)
–Identify and manage ”bad” caller effectively
⬢ Namenode – RPC handler
–FairCallQueue offers the fairness use of namenode RPC handlers
–No guarantee of differentiation
⬢ Datanode – I/O bandwidth
–No differentiation of writer/reader and bandwidth usage.
–Datanode allows static throttling balancer I/O.
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Namenode Resource Reservation
⬢ HADOOP-13128 propose HDFS namenode resource reservation via resource coupon
–From throttling to manage
–Similar to delegation token in many aspects
–Works for both Kerberos and non-Kerberos cluster
–Allows only privileged service user to request resource coupons from namenode.
–Coupon can be serialized/de-serialized for use within container.
–Coupon can be renewed for long running jobs or canceled after the intended job is finished.
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Namenode Resource Coupon
⬢ Coupon Identifier
–Finer grain owner (MR job ID, Hive Query ID) to help identify and manage “good” and “bad”
callers
–Resource type (Namenode RPC or Datanode I/O bandwidth)
–Flexible management unit for different resources.
•Min/Max percentage (e.g. Namenode RPC)
•Absolute value (Datanode I/O bandwidth)
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Namenode Resource Coupon Manager (RCM)
⬢ Grant/Renew/Cancel resource coupon
⬢ Monitor and report resource usage
⬢ Check and validate resource use requests
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Namenode Resource Pool
HDFS Namenode
Resource Pool
Fairness Pool Managed Pool
Applications supporting
Resource Coupon
(YARN/HBASE)
Legacy Applications
without Resource
Coupon
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Namenode Resource Coupon Manager (RCM)
NEW
Client
YARN
Resource
Manager
HDFS Namenode
RCM
HDFS Datanode
YARN Node Manager
YARN Container
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS Resource Management – Datanode
⬢ Use case:
–When a client writes to HDFS faster than the disk bandwidth of the DNs, it saturates the disk
bandwidth and put the DNs into an unresponsive state.
–The client only backs off by aborting / recovering the pipeline, which causes failed writes and
unnecessary pipeline recovery.
⬢ Static I/O Throttling
–HDFS-7265 Support HDFS IO throttling
–HDFS-9796 Use a throttler for replica write in datanode
–HDFS-4412 Add throttler for datanode bandwidth
–HADOOP-10410 datanode Qos via ioprio_set on DataXceiver thread
⬢ Dynamic I/O Throttling
–HDFS-7270 Add congestion signaling capability to DataNode write pipline(ECN)
⬢ Future work: I/O bandwidth reservation with resource coupon
24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Thank you!
Q&A

More Related Content

PDF
Improving HDFS Availability with Hadoop RPC Quality of Service
Ming Ma
 
PDF
Ceph scale testing with 10 Billion Objects
Karan Singh
 
PDF
Impala Architecture presentation
hadooparchbook
 
PPT
Pacemaker+DRBD
Dan Frincu
 
PPTX
Hadoop Operations - Best Practices from the Field
DataWorks Summit
 
PPTX
Introduction to Big Data and hadoop
Sandeep Patil
 
PPTX
Achieving 100k Queries per Hour on Hive on Tez
DataWorks Summit/Hadoop Summit
 
PDF
Kafka High Availability in multi data center setup with floating Observers wi...
HostedbyConfluent
 
Improving HDFS Availability with Hadoop RPC Quality of Service
Ming Ma
 
Ceph scale testing with 10 Billion Objects
Karan Singh
 
Impala Architecture presentation
hadooparchbook
 
Pacemaker+DRBD
Dan Frincu
 
Hadoop Operations - Best Practices from the Field
DataWorks Summit
 
Introduction to Big Data and hadoop
Sandeep Patil
 
Achieving 100k Queries per Hour on Hive on Tez
DataWorks Summit/Hadoop Summit
 
Kafka High Availability in multi data center setup with floating Observers wi...
HostedbyConfluent
 

What's hot (20)

PPTX
Hive + Tez: A Performance Deep Dive
DataWorks Summit
 
PDF
Facebook Messages & HBase
强 王
 
PDF
Parquet performance tuning: the missing guide
Ryan Blue
 
PDF
Neutron packet logging framework
Vietnam Open Infrastructure User Group
 
PPTX
Ozone- Object store for Apache Hadoop
Hortonworks
 
PDF
2021.02 new in Ceph Pacific Dashboard
Ceph Community
 
PPTX
Storage Basics
Murali Rajesh
 
PDF
Ceph RBD Update - June 2021
Ceph Community
 
PDF
Cassandra Introduction & Features
DataStax Academy
 
PPTX
What you need to know about ceph
Emma Haruka Iwao
 
PPTX
Prometheus - Intro, CNCF, TSDB,PromQL,Grafana
Sridhar Kumar N
 
PDF
Introduction and Overview of Apache Kafka, TriHUG July 23, 2013
mumrah
 
PDF
Build an High-Performance and High-Durable Block Storage Service Based on Ceph
Rongze Zhu
 
PPTX
Apache Tez - A New Chapter in Hadoop Data Processing
DataWorks Summit
 
PPTX
Apache Spark Architecture
Alexey Grishchenko
 
PDF
A crash course in CRUSH
Sage Weil
 
PPTX
HBase Low Latency
DataWorks Summit
 
PPTX
Getting started with YANG
CoreStack
 
PPTX
Apache sqoop with an use case
Davin Abraham
 
PPTX
Millions of Regions in HBase: Size Matters
DataWorks Summit
 
Hive + Tez: A Performance Deep Dive
DataWorks Summit
 
Facebook Messages & HBase
强 王
 
Parquet performance tuning: the missing guide
Ryan Blue
 
Neutron packet logging framework
Vietnam Open Infrastructure User Group
 
Ozone- Object store for Apache Hadoop
Hortonworks
 
2021.02 new in Ceph Pacific Dashboard
Ceph Community
 
Storage Basics
Murali Rajesh
 
Ceph RBD Update - June 2021
Ceph Community
 
Cassandra Introduction & Features
DataStax Academy
 
What you need to know about ceph
Emma Haruka Iwao
 
Prometheus - Intro, CNCF, TSDB,PromQL,Grafana
Sridhar Kumar N
 
Introduction and Overview of Apache Kafka, TriHUG July 23, 2013
mumrah
 
Build an High-Performance and High-Durable Block Storage Service Based on Ceph
Rongze Zhu
 
Apache Tez - A New Chapter in Hadoop Data Processing
DataWorks Summit
 
Apache Spark Architecture
Alexey Grishchenko
 
A crash course in CRUSH
Sage Weil
 
HBase Low Latency
DataWorks Summit
 
Getting started with YANG
CoreStack
 
Apache sqoop with an use case
Davin Abraham
 
Millions of Regions in HBase: Size Matters
DataWorks Summit
 
Ad

Viewers also liked (6)

PPTX
Real time analytics using Hadoop and Elasticsearch
Abhishek Andhavarapu
 
PPTX
Pivotal HD and Spring for Apache Hadoop
marklpollack
 
PDF
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
EMC
 
PPTX
Hadoop and Your Data Warehouse
Caserta
 
PDF
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
EMC
 
PDF
Implementing a Data Lake with Enterprise Grade Data Governance
Hortonworks
 
Real time analytics using Hadoop and Elasticsearch
Abhishek Andhavarapu
 
Pivotal HD and Spring for Apache Hadoop
marklpollack
 
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
EMC
 
Hadoop and Your Data Warehouse
Caserta
 
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
EMC
 
Implementing a Data Lake with Enterprise Grade Data Governance
Hortonworks
 
Ad

Similar to Toward Better Multi-Tenancy Support from HDFS (20)

PPTX
Taming the Elephant: Efficient and Effective Apache Hadoop Management
DataWorks Summit/Hadoop Summit
 
PPTX
Hadoop Summit - Scheduling policies in YARN - San Jose 2016
Wangda Tan
 
PPTX
Scheduling Policies in YARN
DataWorks Summit/Hadoop Summit
 
PPTX
Apache Hadoop YARN: Past, Present and Future
DataWorks Summit/Hadoop Summit
 
PPTX
Managing enterprise users in Hadoop ecosystem
DataWorks Summit
 
PPTX
Apache Hadoop YARN: Past, Present and Future
DataWorks Summit/Hadoop Summit
 
PPTX
Running Services on YARN
DataWorks Summit/Hadoop Summit
 
PPTX
Hadoop 3 in a Nutshell
DataWorks Summit/Hadoop Summit
 
PPTX
Apache Hadoop 3.0 What's new in YARN and MapReduce
DataWorks Summit/Hadoop Summit
 
PPTX
An Apache Hive Based Data Warehouse
DataWorks Summit
 
PPTX
Hive edw-dataworks summit-eu-april-2017
alanfgates
 
PDF
SAP HANA SPS09 - Multitenant Database Containers
SAP Technology
 
PPTX
Big data spain keynote nov 2016
alanfgates
 
PPTX
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3
DataWorks Summit
 
PDF
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
Big Data Spain
 
PPTX
Hdfs 2016-hadoop-summit-san-jose-v4
Chris Nauroth
 
PPTX
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Wangda Tan
 
PPTX
Apache Hadoop YARN: state of the union
DataWorks Summit
 
PPTX
Apache HBase Internals you hoped you Never Needed to Understand
Josh Elser
 
PDF
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks
 
Taming the Elephant: Efficient and Effective Apache Hadoop Management
DataWorks Summit/Hadoop Summit
 
Hadoop Summit - Scheduling policies in YARN - San Jose 2016
Wangda Tan
 
Scheduling Policies in YARN
DataWorks Summit/Hadoop Summit
 
Apache Hadoop YARN: Past, Present and Future
DataWorks Summit/Hadoop Summit
 
Managing enterprise users in Hadoop ecosystem
DataWorks Summit
 
Apache Hadoop YARN: Past, Present and Future
DataWorks Summit/Hadoop Summit
 
Running Services on YARN
DataWorks Summit/Hadoop Summit
 
Hadoop 3 in a Nutshell
DataWorks Summit/Hadoop Summit
 
Apache Hadoop 3.0 What's new in YARN and MapReduce
DataWorks Summit/Hadoop Summit
 
An Apache Hive Based Data Warehouse
DataWorks Summit
 
Hive edw-dataworks summit-eu-april-2017
alanfgates
 
SAP HANA SPS09 - Multitenant Database Containers
SAP Technology
 
Big data spain keynote nov 2016
alanfgates
 
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3
DataWorks Summit
 
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
Big Data Spain
 
Hdfs 2016-hadoop-summit-san-jose-v4
Chris Nauroth
 
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Wangda Tan
 
Apache Hadoop YARN: state of the union
DataWorks Summit
 
Apache HBase Internals you hoped you Never Needed to Understand
Josh Elser
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks
 

More from DataWorks Summit/Hadoop Summit (20)

PPT
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
 
PPT
State of Security: Apache Spark & Apache Zeppelin
DataWorks Summit/Hadoop Summit
 
PDF
Unleashing the Power of Apache Atlas with Apache Ranger
DataWorks Summit/Hadoop Summit
 
PDF
Enabling Digital Diagnostics with a Data Science Platform
DataWorks Summit/Hadoop Summit
 
PDF
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
 
PDF
Double Your Hadoop Performance with Hortonworks SmartSense
DataWorks Summit/Hadoop Summit
 
PDF
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
 
PDF
Data Science Crash Course
DataWorks Summit/Hadoop Summit
 
PDF
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
 
PDF
Dataflow with Apache NiFi
DataWorks Summit/Hadoop Summit
 
PPTX
Schema Registry - Set you Data Free
DataWorks Summit/Hadoop Summit
 
PPTX
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
DataWorks Summit/Hadoop Summit
 
PDF
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
DataWorks Summit/Hadoop Summit
 
PPTX
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
PPTX
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
 
PPTX
HBase in Practice
DataWorks Summit/Hadoop Summit
 
PPTX
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
 
PDF
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
 
PPTX
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
DataWorks Summit/Hadoop Summit
 
PPTX
Backup and Disaster Recovery in Hadoop
DataWorks Summit/Hadoop Summit
 
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
 
State of Security: Apache Spark & Apache Zeppelin
DataWorks Summit/Hadoop Summit
 
Unleashing the Power of Apache Atlas with Apache Ranger
DataWorks Summit/Hadoop Summit
 
Enabling Digital Diagnostics with a Data Science Platform
DataWorks Summit/Hadoop Summit
 
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
 
Double Your Hadoop Performance with Hortonworks SmartSense
DataWorks Summit/Hadoop Summit
 
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
 
Data Science Crash Course
DataWorks Summit/Hadoop Summit
 
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
 
Dataflow with Apache NiFi
DataWorks Summit/Hadoop Summit
 
Schema Registry - Set you Data Free
DataWorks Summit/Hadoop Summit
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
DataWorks Summit/Hadoop Summit
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
DataWorks Summit/Hadoop Summit
 
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
 
HBase in Practice
DataWorks Summit/Hadoop Summit
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
DataWorks Summit/Hadoop Summit
 
Backup and Disaster Recovery in Hadoop
DataWorks Summit/Hadoop Summit
 

Recently uploaded (20)

PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
Software Development Methodologies in 2025
KodekX
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
The Future of AI & Machine Learning.pptx
pritsen4700
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PPTX
Simple and concise overview about Quantum computing..pptx
mughal641
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Software Development Methodologies in 2025
KodekX
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
The Future of AI & Machine Learning.pptx
pritsen4700
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
Simple and concise overview about Quantum computing..pptx
mughal641
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 

Toward Better Multi-Tenancy Support from HDFS

  • 1. 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Toward Better Multi- Tenancy Support from HDFS Xiaoyu Yao Email: [email protected]
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved About myself ⬢ Member of Technical Staff at Hortonworks since 2014 ⬢ Apache Hadoop Committer and PMC member. ⬢ Currently working on HDFS. ⬢ This talk is to help better understanding of HDFS multi-tenancy support and ongoing work for better resource management.
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda ⬢ Overview ⬢ Hadoop multi-tenancy features ⬢ HDFS resources and multi-tenancy offerings ⬢ HDFS resource management via resource coupon ⬢ Q&A
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Overview ⬢ Centrally managed infrastructure –Consolidate to simplify management and lower TCO –Better utilization and efficiency ⬢ Requirement –Resource Sharing –Resource Isolation –Resource Control
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Multi-Tenancy Support from Hadoop Resource Sharing Resource Isolation Resource Management HBASE Y Namespace, Region Server Group Quota YARN Y Queue, Node Label ... Capacity Scheduler, ... HDFS Y Federation Quota, FairCallQueue, Backoff
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Resources ⬢ Capacity –Namespace –Storage Space –Storage Type ⬢ Operational Resources –Namenode •RPC –Datanode •Disk & Network
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Resource Sharing/Isolation – Federation
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Capacity Management – Quota ⬢ Quota –Namespace –StorageSpace –HDFS-7584 Quota by Storage Types ⬢ Limitations –Static –Per directory –No per user/job control
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Operational Resource Management – Namenode RPC Isolation (1) ⬢Internal RPC –DN->NN block report, heartbeat, etc. –ZKFC->NN liveness check ⬢External RPC –Client RPCs from HDFSClients such as MR jobs/Hive queries/HBase Client Listener Reader Reader Call Queue Handler Handler Handler FSN
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Operational Resource Management – Namenode RPC Isolation (2) ⬢Use case: –HFDS access from normal jobs impacted by offending jobs –Internal RPCs impacted by External RPCs –One blocked RPC method could affect others ⬢Protect HDFS internal RPCs: –Dedicated service RPC server/port •Isolate DN->NN block report, heartbeat, etc. –Dedicated lifeline RPC server/port •Protect ZKFC->NN liveness check ⬢All external RPCs go to the default port (e.g., 8020)
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Resource Management – Name Node RPC Call Queue ⬢ In multi-tenancy scenario, call queue should play an important role like a shock absorber to accommodate different workload, converting busty arrivals into smooth, steady departures. ⬢ Good call queue –queue without call bloat –catches and handles bursts with no more than a temporary increase of queue delay –maximum server utilization ⬢ Bad call queue –queue that exhibits call bloat –queue filled up and stay filled upon bursts –low utilization and high queue latency
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Resource Management - Fair Call Queue ⬢ Before HADOOP-9640 LinkedBlockingQueue –Single queue –Client blocked and timeout/fail when queue is full ⬢ HADOOP-9640 - Fair Call Queue –Multiple priority levels and call queues with different processing priority –Each RPC is assigned a priority by scheduler –High priority RPC calls are put into call queue with higher probability of being executed. Scheduler Queue 0 Queue ... Queue 2 Multiplexer (WRR)
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Resource Management – Namenode RPC Throttling <1> ⬢ HADOOP-10597 Backoff when the call queue is full –Send back a Retriable exception –Let the client do exponential wait and retry instead of blocking/timeout/failed the call.
  • 14. 14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Resource Management – Namenode RPC Throttling <2> ⬢ HADOOP-12916 Backoff based on response time –The basic idea: Backoff earlier to avoid call queue overload so that namenode can recover quickly. –Low priority calls get backed off if response time of high priority call is over predefined threshold. –More per user/queue metrics added for trouble shooting.
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Resource Management – Namenode RPC Throttling <3> ⬢ Abstract scheduler interface from call queue for pluggable RPC priority assignment –DefaultRpcScheduler: all RPC calls with same priority –DecayRpcScheduler: from original FairCallQueue priority assigned based on previous call volumes of users. –Other experimental schedulers: configurable list of high priority user/group for low latency jobs, medium priority user/group for normal jobs and low priority user/group for batch jobs.
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS resource management - QoS ⬢ Use case: –Allow high performance QoS mechanism with minimum decoding effort on server side ⬢ HADOOP-9194 QoS support for Hadoop RPC –One bytes in RPC header to facilitate QoS mechanism –E.g., differentiate OLTP/OLAP, batch/streaming against the same HDFS ⬢ Limitation –No mechanism level implementation yet
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS resource management with YARN ⬢ Use Case –Priority inversion without centralized resource management (e.g., RPC calls from high priority YARN jobs may be put into low priority HDFS namenode call queue) –Identify and manage ”bad” caller effectively ⬢ Namenode – RPC handler –FairCallQueue offers the fairness use of namenode RPC handlers –No guarantee of differentiation ⬢ Datanode – I/O bandwidth –No differentiation of writer/reader and bandwidth usage. –Datanode allows static throttling balancer I/O.
  • 18. 18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Namenode Resource Reservation ⬢ HADOOP-13128 propose HDFS namenode resource reservation via resource coupon –From throttling to manage –Similar to delegation token in many aspects –Works for both Kerberos and non-Kerberos cluster –Allows only privileged service user to request resource coupons from namenode. –Coupon can be serialized/de-serialized for use within container. –Coupon can be renewed for long running jobs or canceled after the intended job is finished.
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Namenode Resource Coupon ⬢ Coupon Identifier –Finer grain owner (MR job ID, Hive Query ID) to help identify and manage “good” and “bad” callers –Resource type (Namenode RPC or Datanode I/O bandwidth) –Flexible management unit for different resources. •Min/Max percentage (e.g. Namenode RPC) •Absolute value (Datanode I/O bandwidth)
  • 20. 20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Namenode Resource Coupon Manager (RCM) ⬢ Grant/Renew/Cancel resource coupon ⬢ Monitor and report resource usage ⬢ Check and validate resource use requests
  • 21. 21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Namenode Resource Pool HDFS Namenode Resource Pool Fairness Pool Managed Pool Applications supporting Resource Coupon (YARN/HBASE) Legacy Applications without Resource Coupon
  • 22. 22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Namenode Resource Coupon Manager (RCM) NEW Client YARN Resource Manager HDFS Namenode RCM HDFS Datanode YARN Node Manager YARN Container
  • 23. 23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS Resource Management – Datanode ⬢ Use case: –When a client writes to HDFS faster than the disk bandwidth of the DNs, it saturates the disk bandwidth and put the DNs into an unresponsive state. –The client only backs off by aborting / recovering the pipeline, which causes failed writes and unnecessary pipeline recovery. ⬢ Static I/O Throttling –HDFS-7265 Support HDFS IO throttling –HDFS-9796 Use a throttler for replica write in datanode –HDFS-4412 Add throttler for datanode bandwidth –HADOOP-10410 datanode Qos via ioprio_set on DataXceiver thread ⬢ Dynamic I/O Throttling –HDFS-7270 Add congestion signaling capability to DataNode write pipline(ECN) ⬢ Future work: I/O bandwidth reservation with resource coupon
  • 24. 24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Thank you! Q&A

Editor's Notes

  • #4: move the yarn pic here
  • #10: sever/client
  • #19: bandwidth via ioprio for dfsclient and xceiver thread maybe no standard across OS
  • #24: Reservation based dynamic throttling utilizes existing DataXceiver bandwidth throttling