SlideShare a Scribd company logo
© 2023 All Rights Reserved
1
YugabyteDB
Introduction and key features
Franck Pachot, Developer Advocate
@FranckPachot
© 2023 All Rights Reserved
Franck Pachot
Developer Advocate at Yugabyte
Past:
20+ years in databases, dev and ops, consulting
Oracle ACE Director, AWS Data Hero
Oracle Certified Master, AWS Database Specialty
2
fpachot@yugabyte.com
dev.to/FranckPachot
@FranckPachot
© 2023 All Rights Reserved
YugabyteDB
3
Distributed SQL database
PostgreSQL compatible
Open Source
⇨ Scale-Out (horizontal scalability)
⇨ Familiarity with the most popular DB for OLTP
⇨ Freedom, no lock-in, community
Founders have worked on Cassandra and HBASE at Facebook
Started based on Google Spanner paper
© 2023 All Rights Reserved
NoSQL
Monolithic RDBMS
Future of "systems of record" databases (OLTP)
Architectural resilience
Horizontal scaling
Geo-distribution
ACID transactions
SQL + RDBMS features
SQL
Distributed SQL
© 2023 All Rights Reserved
On-Premises
Datacenters
Deploy Anywhere
YugabyteDB architecture
5
Distributed, Transactional Storage Layer
Automatic
Sharding
Load
Balancing
Distributed
Transactions
Raft
Consensus
Open Source
Distributed SQL
Database
⭐ https://github.com/yugabyte/yugabyte-db
Raft consensus
protocol
geo distribution
YSQL
YCQL
Pluggable Query Layer
YCQL API
Cassandra Compatible
YSQL API
Postgres Compatible
Other APIs (future)
PostgreSQL
Application
© 2023 All Rights Reserved
Sharding
6
Breaks up large tables and indexes to smaller pieces (around 1-10 GB)
Each row in the user table (or index entry) gets mapped to exactly 1 shard
Split by hash or range on the primary key (or index keys)
Sharding in YugabyteDB is automatic, distributed across the nodes
Sharding
------------
© 2023 All Rights Reserved
Raft algorithm and LSM-Tree
7
Each tablet is a Raft group
Each write go to the Leader, is replicated to Followers, wait for quorum
Consistent read go to Leader (bounded stale reads from Followers)
In case of failure, an new leader is elected from the quorum
The tablet peer rows are stored in LSM-Tree (based on RocksDB)
© 2023 All Rights Reserved
YugabyteDB deployment
8
On any Linux: VMs, Pods, containers
Each node has a PostgreSQL (Backend, Planner, Executor)
Each node stores some tablet peers
Those are Tablet Servers with local storage and network access
An additional "master" tablet holds the cluster metadata
DocDB
distributed
storage &
transactions
YSQL
stateless
query layer
Read/
Write
Read/
Write
Read/
Write
Read/
Write
Read/
Write
Read/
Write
© 2023 All Rights Reserved
1. Single Region, Multi-Zone
Availability Zone 1
Availability Zone 2 Availability Zone 3
Consistent Across Zones
No WAN Latency But No
Region-Level Failover/Repair
2. Single Cloud, Multi-Region
Region 1
Region 2 Region 3
Consistent Across Regions
with Auto Region-Level
Failover/Repair
3. Multi-Cloud, Multi-Region
Cloud 1
Cloud 2 Cloud 3
Consistent Across Clouds
with Auto Cloud-Level
Failover/Repair
Deployment topology
9
© 2023 All Rights Reserved
Quick Start (dev environment)
10
# Start YugabyteDB (3 nodes)
docker network create -d bridge yb
docker run -d --name yb0 --hostname yb0 --net=yb -p 5433:5433 -p7000:7000 
yugabytedb/yugabyte:2.14.6.1-b4 yugabyted start --daemon=false --listen yb0
docker run -d --name yb1 --hostname yb1 --net=yb -p 5434:5433 
yugabytedb/yugabyte:2.14.6.1-b4 yugabyted start --daemon=false --listen yb1 --join yb0
docker run -d --name yb2 --hostname yb2 --net=yb -p 5435:5433 
yugabytedb/yugabyte:2.14.6.1-b4 yugabyted start --daemon=false --listen yb2 --join yb0
# Connect to any nodes
psql postgres://localhost:5433
-- Web console on http://localhost:7000
© 2023 All Rights Reserved 11
Q & A
● Slack: https://communityinviter.com/apps/yugabyte-db/register
● Github: https://github.com/yugabyte
● Newsletter: https://www.yugabyte.com/newsletter
© 2023 All Rights Reserved 12
Reason for Distributed SQL
- scale-out for elasticity and resilience
- rolling upgrades, geo-distribution
Reason for PostgreSQL compatibility
- no need to learn a new DB
Reason for a fork of PostgreSQL
- the best compatibility with advanced features
Core messages
© 2023 All Rights Reserved 13
Major difference with PostgreSQL:
- think about the primary key (hash/range sharding)
- leverage new access patterns
(primary index, skip scan, index only scan on secondary index )
- don't worry about vacuum
Core messages
© 2023 All Rights Reserved 14
When it can be an alternative to PostgreSQL:
- OLTP (with some analytics push-downs)
- cloud native (Compute instances, kubernetes pods,... all active)
- multitenant (no hardware limitation to scale)
- geo-distribution (data residency)
Core messages
© 2023 All Rights Reserved 15
E-mail:
fpachot@yugabyte.com
Blogs:
dev.to/FranckPachot
blog.yugabyte.com/author/fpachot
Twitter:
@FranckPachot
Youtube:
youtube.pachot.net
Twitch:
www.twitch.tv/franckpachot
LinkedIn:
www.linkedin.com/in/franckpachot
Q & A
● Slack: https://communityinviter.com/apps/yugabyte-db/register
● Github: https://github.com/yugabyte
● Newsletter: https://www.yugabyte.com/newsletter

More Related Content

Similar to Meetup - YugabyteDB - Introduction and key features (20)

PPT
Docker Multi Host Networking, Rachit Arora, IBM
Neependra Khare
 
PDF
Surviving KubeDoom - Even demons can't kill an intrusion of CockroachDB!
Faheem Memon
 
PPTX
Directory Write Leases in MagFS
Maginatics
 
PDF
Kubernetes for Java developers
Robert Barr
 
PDF
Openflow for Cloud Scalability
DaoliCloud Ltd
 
PPTX
OpenEBS Technical Workshop - KubeCon San Diego 2019
MayaData Inc
 
PPTX
BigData Clusters Redefined
DataWorks Summit
 
PPTX
OpenStack Dragonflow shenzhen and Hangzhou meetups
Eran Gampel
 
PPT
2007-05-23 Cecchet_PGCon2007.ppt
nadirpervez2
 
PDF
[OpenInfra Days Korea 2018] Day 1 - T4-7: "Ceph 스토리지, PaaS로 서비스 운영하기"
OpenStack Korea Community
 
PPT
Clustered Architecture Patterns Delivering Scalability And Availability
ConSanFrancisco123
 
PPTX
Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...
Patrick Chanezon
 
PDF
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Alluxio, Inc.
 
PDF
JClouds at San Francisco Java User Group
Marakana Inc.
 
PPTX
Azure DBA with IaaS
Kellyn Pot'Vin-Gorman
 
PDF
[KubeCon NA 2020] containerd: Rootless Containers 2020
Akihiro Suda
 
PDF
Learning From Real Practice of Providing Highly Available Hybrid Cloud Servic...
LF Events
 
PDF
Cloud read java with kubernetes
Cesar Tron-Lozai
 
PDF
Ivan Zhuravel and Ihor Khlaponin "DC/OS vs Kubernetes. Let the Fight Begin!"
LogeekNightUkraine
 
PPTX
Devoxx France 2015 - The Docker Orchestration Ecosystem on Azure
Patrick Chanezon
 
Docker Multi Host Networking, Rachit Arora, IBM
Neependra Khare
 
Surviving KubeDoom - Even demons can't kill an intrusion of CockroachDB!
Faheem Memon
 
Directory Write Leases in MagFS
Maginatics
 
Kubernetes for Java developers
Robert Barr
 
Openflow for Cloud Scalability
DaoliCloud Ltd
 
OpenEBS Technical Workshop - KubeCon San Diego 2019
MayaData Inc
 
BigData Clusters Redefined
DataWorks Summit
 
OpenStack Dragonflow shenzhen and Hangzhou meetups
Eran Gampel
 
2007-05-23 Cecchet_PGCon2007.ppt
nadirpervez2
 
[OpenInfra Days Korea 2018] Day 1 - T4-7: "Ceph 스토리지, PaaS로 서비스 운영하기"
OpenStack Korea Community
 
Clustered Architecture Patterns Delivering Scalability And Availability
ConSanFrancisco123
 
Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...
Patrick Chanezon
 
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Alluxio, Inc.
 
JClouds at San Francisco Java User Group
Marakana Inc.
 
Azure DBA with IaaS
Kellyn Pot'Vin-Gorman
 
[KubeCon NA 2020] containerd: Rootless Containers 2020
Akihiro Suda
 
Learning From Real Practice of Providing Highly Available Hybrid Cloud Servic...
LF Events
 
Cloud read java with kubernetes
Cesar Tron-Lozai
 
Ivan Zhuravel and Ihor Khlaponin "DC/OS vs Kubernetes. Let the Fight Begin!"
LogeekNightUkraine
 
Devoxx France 2015 - The Docker Orchestration Ecosystem on Azure
Patrick Chanezon
 

More from Franck Pachot (17)

PPTX
Oracle dbms_xplan.display_cursor format
Franck Pachot
 
PDF
19 features you will miss if you leave Oracle Database
Franck Pachot
 
PDF
Oracle Database on Docker
Franck Pachot
 
PDF
12cR2 Single-Tenant: Multitenant Features for All Editions
Franck Pachot
 
PDF
Les bases BI sont-elles différentes?
Franck Pachot
 
PDF
Oracle in-Memory Column Store for BI
Franck Pachot
 
PDF
Testing Delphix: easy data virtualization
Franck Pachot
 
PDF
12c SQL Plan Directives
Franck Pachot
 
PDF
Star Transformation, 12c Adaptive Bitmap Pruning and In-Memory option
Franck Pachot
 
PDF
CBO choice between Index and Full Scan: the good, the bad and the ugly param...
Franck Pachot
 
PDF
Oracle Parallel Distribution and 12c Adaptive Plans
Franck Pachot
 
PDF
Oracle Join Methods and 12c Adaptive Plans
Franck Pachot
 
PDF
Oracle NOLOGGING
Franck Pachot
 
PDF
Exadata X3 in action: Measuring Smart Scan efficiency with AWR
Franck Pachot
 
PDF
Oracle table lock modes
Franck Pachot
 
PDF
Dbvisit replicate: logical replication made easy
Franck Pachot
 
PDF
Reading AWR or Statspack Report - Straight to the Goal
Franck Pachot
 
Oracle dbms_xplan.display_cursor format
Franck Pachot
 
19 features you will miss if you leave Oracle Database
Franck Pachot
 
Oracle Database on Docker
Franck Pachot
 
12cR2 Single-Tenant: Multitenant Features for All Editions
Franck Pachot
 
Les bases BI sont-elles différentes?
Franck Pachot
 
Oracle in-Memory Column Store for BI
Franck Pachot
 
Testing Delphix: easy data virtualization
Franck Pachot
 
12c SQL Plan Directives
Franck Pachot
 
Star Transformation, 12c Adaptive Bitmap Pruning and In-Memory option
Franck Pachot
 
CBO choice between Index and Full Scan: the good, the bad and the ugly param...
Franck Pachot
 
Oracle Parallel Distribution and 12c Adaptive Plans
Franck Pachot
 
Oracle Join Methods and 12c Adaptive Plans
Franck Pachot
 
Oracle NOLOGGING
Franck Pachot
 
Exadata X3 in action: Measuring Smart Scan efficiency with AWR
Franck Pachot
 
Oracle table lock modes
Franck Pachot
 
Dbvisit replicate: logical replication made easy
Franck Pachot
 
Reading AWR or Statspack Report - Straight to the Goal
Franck Pachot
 
Ad

Recently uploaded (20)

PDF
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
PPTX
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
PDF
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
PDF
Zilliz Cloud Demo for performance and scale
Zilliz
 
PDF
Design Thinking basics for Engineers.pdf
CMR University
 
PPTX
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
PPTX
Product Development & DevelopmentLecture02.pptx
zeeshanwazir2
 
PDF
Halide Perovskites’ Multifunctional Properties: Coordination Engineering, Coo...
TaameBerhe2
 
PPTX
Introduction to Design of Machine Elements
PradeepKumarS27
 
PDF
AI TECHNIQUES FOR IDENTIFYING ALTERATIONS IN THE HUMAN GUT MICROBIOME IN MULT...
vidyalalltv1
 
PPTX
Element 11. ELECTRICITY safety and hazards
merrandomohandas
 
PPTX
GitOps_Without_K8s_Training_detailed git repository
DanialHabibi2
 
PDF
Reasons for the succes of MENARD PRESSUREMETER.pdf
majdiamz
 
PPTX
Worm gear strength and wear calculation as per standard VB Bhandari Databook.
shahveer210504
 
PPTX
Big Data and Data Science hype .pptx
SUNEEL37
 
PDF
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
PPTX
Thermal runway and thermal stability.pptx
godow93766
 
PPTX
Lecture 1 Shell and Tube Heat exchanger-1.pptx
mailforillegalwork
 
PDF
Electrical Engineer operation Supervisor
ssaruntatapower143
 
PPTX
2025 CGI Congres - Surviving agile v05.pptx
Derk-Jan de Grood
 
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
Zilliz Cloud Demo for performance and scale
Zilliz
 
Design Thinking basics for Engineers.pdf
CMR University
 
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
Product Development & DevelopmentLecture02.pptx
zeeshanwazir2
 
Halide Perovskites’ Multifunctional Properties: Coordination Engineering, Coo...
TaameBerhe2
 
Introduction to Design of Machine Elements
PradeepKumarS27
 
AI TECHNIQUES FOR IDENTIFYING ALTERATIONS IN THE HUMAN GUT MICROBIOME IN MULT...
vidyalalltv1
 
Element 11. ELECTRICITY safety and hazards
merrandomohandas
 
GitOps_Without_K8s_Training_detailed git repository
DanialHabibi2
 
Reasons for the succes of MENARD PRESSUREMETER.pdf
majdiamz
 
Worm gear strength and wear calculation as per standard VB Bhandari Databook.
shahveer210504
 
Big Data and Data Science hype .pptx
SUNEEL37
 
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
Thermal runway and thermal stability.pptx
godow93766
 
Lecture 1 Shell and Tube Heat exchanger-1.pptx
mailforillegalwork
 
Electrical Engineer operation Supervisor
ssaruntatapower143
 
2025 CGI Congres - Surviving agile v05.pptx
Derk-Jan de Grood
 
Ad

Meetup - YugabyteDB - Introduction and key features

  • 1. © 2023 All Rights Reserved 1 YugabyteDB Introduction and key features Franck Pachot, Developer Advocate @FranckPachot
  • 2. © 2023 All Rights Reserved Franck Pachot Developer Advocate at Yugabyte Past: 20+ years in databases, dev and ops, consulting Oracle ACE Director, AWS Data Hero Oracle Certified Master, AWS Database Specialty 2 [email protected] dev.to/FranckPachot @FranckPachot
  • 3. © 2023 All Rights Reserved YugabyteDB 3 Distributed SQL database PostgreSQL compatible Open Source ⇨ Scale-Out (horizontal scalability) ⇨ Familiarity with the most popular DB for OLTP ⇨ Freedom, no lock-in, community Founders have worked on Cassandra and HBASE at Facebook Started based on Google Spanner paper
  • 4. © 2023 All Rights Reserved NoSQL Monolithic RDBMS Future of "systems of record" databases (OLTP) Architectural resilience Horizontal scaling Geo-distribution ACID transactions SQL + RDBMS features SQL Distributed SQL
  • 5. © 2023 All Rights Reserved On-Premises Datacenters Deploy Anywhere YugabyteDB architecture 5 Distributed, Transactional Storage Layer Automatic Sharding Load Balancing Distributed Transactions Raft Consensus Open Source Distributed SQL Database ⭐ https://github.com/yugabyte/yugabyte-db Raft consensus protocol geo distribution YSQL YCQL Pluggable Query Layer YCQL API Cassandra Compatible YSQL API Postgres Compatible Other APIs (future) PostgreSQL Application
  • 6. © 2023 All Rights Reserved Sharding 6 Breaks up large tables and indexes to smaller pieces (around 1-10 GB) Each row in the user table (or index entry) gets mapped to exactly 1 shard Split by hash or range on the primary key (or index keys) Sharding in YugabyteDB is automatic, distributed across the nodes Sharding ------------
  • 7. © 2023 All Rights Reserved Raft algorithm and LSM-Tree 7 Each tablet is a Raft group Each write go to the Leader, is replicated to Followers, wait for quorum Consistent read go to Leader (bounded stale reads from Followers) In case of failure, an new leader is elected from the quorum The tablet peer rows are stored in LSM-Tree (based on RocksDB)
  • 8. © 2023 All Rights Reserved YugabyteDB deployment 8 On any Linux: VMs, Pods, containers Each node has a PostgreSQL (Backend, Planner, Executor) Each node stores some tablet peers Those are Tablet Servers with local storage and network access An additional "master" tablet holds the cluster metadata DocDB distributed storage & transactions YSQL stateless query layer Read/ Write Read/ Write Read/ Write Read/ Write Read/ Write Read/ Write
  • 9. © 2023 All Rights Reserved 1. Single Region, Multi-Zone Availability Zone 1 Availability Zone 2 Availability Zone 3 Consistent Across Zones No WAN Latency But No Region-Level Failover/Repair 2. Single Cloud, Multi-Region Region 1 Region 2 Region 3 Consistent Across Regions with Auto Region-Level Failover/Repair 3. Multi-Cloud, Multi-Region Cloud 1 Cloud 2 Cloud 3 Consistent Across Clouds with Auto Cloud-Level Failover/Repair Deployment topology 9
  • 10. © 2023 All Rights Reserved Quick Start (dev environment) 10 # Start YugabyteDB (3 nodes) docker network create -d bridge yb docker run -d --name yb0 --hostname yb0 --net=yb -p 5433:5433 -p7000:7000 yugabytedb/yugabyte:2.14.6.1-b4 yugabyted start --daemon=false --listen yb0 docker run -d --name yb1 --hostname yb1 --net=yb -p 5434:5433 yugabytedb/yugabyte:2.14.6.1-b4 yugabyted start --daemon=false --listen yb1 --join yb0 docker run -d --name yb2 --hostname yb2 --net=yb -p 5435:5433 yugabytedb/yugabyte:2.14.6.1-b4 yugabyted start --daemon=false --listen yb2 --join yb0 # Connect to any nodes psql postgres://localhost:5433 -- Web console on http://localhost:7000
  • 11. © 2023 All Rights Reserved 11 Q & A ● Slack: https://communityinviter.com/apps/yugabyte-db/register ● Github: https://github.com/yugabyte ● Newsletter: https://www.yugabyte.com/newsletter
  • 12. © 2023 All Rights Reserved 12 Reason for Distributed SQL - scale-out for elasticity and resilience - rolling upgrades, geo-distribution Reason for PostgreSQL compatibility - no need to learn a new DB Reason for a fork of PostgreSQL - the best compatibility with advanced features Core messages
  • 13. © 2023 All Rights Reserved 13 Major difference with PostgreSQL: - think about the primary key (hash/range sharding) - leverage new access patterns (primary index, skip scan, index only scan on secondary index ) - don't worry about vacuum Core messages
  • 14. © 2023 All Rights Reserved 14 When it can be an alternative to PostgreSQL: - OLTP (with some analytics push-downs) - cloud native (Compute instances, kubernetes pods,... all active) - multitenant (no hardware limitation to scale) - geo-distribution (data residency) Core messages
  • 15. © 2023 All Rights Reserved 15 E-mail: [email protected] Blogs: dev.to/FranckPachot blog.yugabyte.com/author/fpachot Twitter: @FranckPachot Youtube: youtube.pachot.net Twitch: www.twitch.tv/franckpachot LinkedIn: www.linkedin.com/in/franckpachot Q & A ● Slack: https://communityinviter.com/apps/yugabyte-db/register ● Github: https://github.com/yugabyte ● Newsletter: https://www.yugabyte.com/newsletter