SlideShare a Scribd company logo
Building PostgreSQL as a Service
with Kubernetes
PGConf.Asia 2019
2019/9/9
@tzkb
3
My Actitvities
PGConf.Asia 2018 @Tokyo
A guide of PostgreSQL on Kuberntes
- In terms of Storage -
CloudNativeDays Tokyo 2019
The Future of Database on Kubernetes
- What run with Cloud Native Storage -
 + =∞
4
Agenda
1. Recap: What is Kubernetes?
2. The Issues for Database on Kubernetes
3. How to run your PostgreSQL on K8s
4. Kubernetes becomes The Platform
6
1. Recap: What is Kubernetes?
7
What is Kubernetes?
Pod Pod
Pod
Pod Pod
• Kubernetes(K8s) is the orchestrator tool for containers.
It has 3 features below.
• Declarative config
• Auto-healing
• Immutable
Database is
not Immutable.
8
Better to handle the database system by Kubernetes?
Node Node Node
Master Slave
Replicate
• The database usually has a state that is not easy to maintain
by Kubernetes.
<Disadvantage for DB>
• Necessary to startup in
turn.
• Must never lose their
data.
• Handle the database as
pets.
9
Example of Database on Kubernetes: Vitess
VTtablet
VTtablet
VTtablet
VTgate
app
app
app
SQL
SQL
SQL
• Vitess that used on YouTube is the CNCF incubating project.
• Vitess provides MySQL
sharding in K8s.
• VTgate and VTtablet
can scale by K8s.
• When terminating a
component abnormally,
Kubernetes repair it
automatically.
10
The choice: How to manage your database
Compute
Storage
Managed
Amazon Aurora
Amazon Redshift
Amazon RDS
on Cloud on Kubernetes
• You can choose to manage the database by yourself or else.
17
2. The Issues for Database on Kubernetes
18
Kubernetes is the Distributed Systems
• Developed as following a distributed architecture.
• When doesn’t a node reply
– Network partition?
– Process failure?
– Node failure?
• If the disk resource attached,
harder to determine.
FailOver?
20
Database Architects are familiar with Clustering
 “If you don’t know the status, it’s okay.
We act on the premise of failsafe.”
 “No need to share resources. Right?”
 “Both have long been known for database
clustering.”
21
Basic: Database Clustering
HA
(Active/Standby)
1
Sharding
Replication
(Active/Active)
2or
more
Instances Redundancy
2 or
more
Shared
Disk
Log
Shipping
---
×
Scaleout?
Read
Read/
Write
Failover
(Fencing)
Availability
Promotion
(Election)
---
• There are differences to build a DB cluster with some nodes.
22
Clustering #1: HA
• With Linux-HA
• Use high-available shared
storage
<Worst Case>
• Multiple writes to storage
<Solution>
• Fencing
VIP
Linux-HA
Controller Controller
• It's been used since before Linux but helpful.
23
Note: Fencing
VIP
Linux-HA
Controller Controller
< When Detecting Node Failure >
1. Forced node power off
i. Definite processes stop
ii. Unmount storage
iii. Detach virtual IP
2. PostgreSQL starts to run on
the standby node.
• Failed node is isolated from resources = Fencing
24
Clustering #2: Replication
WAL
• The master can Read/Write,
Slaves are Read-Only.
• Data synchronization by WAL
transmission
<Worst Case>
• 2 or more Masters
<Solution>
• Leader Election
• Redundancy built into PostgreSQL = Streaming Replication
Master
SlaveSlave
25
Note: Leader Election
WAL
Be promoted as
a master,
The other is still
a slave.
• Always one master
• The former master joins as a
slave.
<Master in unknown state>
1. The remaining one slave is
elected as the leader
2. The leader is promoted as a
master.
• Algorithms such as Paxos and Raft are used.
MasterSlave
26
Clustering #3 Sharding
• Divide data between nodes
and operates as one DB.
• Dispatches queries to relevant
nodes.
• Basically no availability.
• Problems with the transaction.
• For rather scalability than availability.
Coordinator
27
3. How to Run your on Kubernetes
28
Implemetation Overview : on Kubernetes
# Category OSS used Description
ⅰ
HA
• Use Rook/Ceph as Shared
Storage.
ⅱ
• Use LINSTOR/DRBD as
Shared Storage.
ⅲ Replication
• Use Streaming Replication,
without Shared Storage.
ⅳ Operator
• Building and Operating
Replication automatically.
• We can see following four patterns.
29
• K8s manages
everything(DB,storage)
• Shared-Storage: Ceph
• Fenced by kube-fencing
< Disadvantage >
• Complicated
• Insufficient IO
HA (i):
Replicas:1
• is deployed as StatefulSet using Rook/Ceph.
kube-fencing
30
Note: Without Fencing
Replicas:1
• When a node goes down, never failover.
• To avoid network
partition.
• It is by design.
31
Note: What is
• Rook is Kubernetes Operator managing Ceph or others.
operator
agent/discover agent/discover agent/discover
osd osd osd
mon mon mon
CSI
csi-provisioner
csi-rbdplugin csi-rbdplugin csi-rbdplugin
Rook
• Rook makes easy to
build Ceph cluster.
• Also easy to deploy
CSI modules.
• CSI: Containar
Storage Interface
32
HA (ii):
Replicas:1
kube-fencing
• LINSTOR is Software-Defined Storage based on DRBD.
• K8s manages
everything(DB,storage)
• Redundancy: DRBD
• Simple, Read IO
without Network
< Disadvantage >
• Limited to Scale
33
Benchmark Results
Single(with EBS) Rook/Ceph DRBD
1nodes 5nodes 2nodes
100
37.8
77.1
• Measured by pgbench for 3 patterns.
TPS
34
Replication :
proxy proxy proxy
keeper keeper keeper
sentinel sentinel sentinel
• Builds Streaming Replication on top of Kubernetes.
• 3 types of processes
have different roles
• Without Shared-
Resources
< Disadvantage >
• Not builtin Read Off-
loading
36
Operator :
• KubeDB operates not only but also others.
kubedb-operator
-0 -1 -2
postgres snapshotdormantdabases
• Database Operator for
– PostgreSQL
– MySQL
– Redis
• Kubedb-operator
builds SR.
• Able to get/restore
snapshot easily.
37
Example : PostgreSQL Configration by KubeDB
apiVersion: kubedb.com/v1alpha1
kind: Postgres
metadata:
name: ha-postgres
namespace: demo
spec:
version: “10.6-v2"
replicas: 3
storageType: Durable
storage:
storageClassName: "standard"
accessModes:
- ReadWriteOnce
resources:
requests:
storage: 100Gi
 spec.version
– Choose PostgreSQL version.
 spec.replicas
– The number of Instances.
 spec.storage
– Define storage type/size, etc.
• Allows to define Streaming Replication with a simple YAML.
38
Example : Snapshot by KubeDB
apiVersion: kubedb.com/v1alpha1
kind: Snapshot
metadata:
name: snapshot-to-s3
labels:
kubedb.com/kind: Postgres
spec:
databaseName: ha-postgres
storageSecretName: s3-secret
s3:
endpoint: 's3.amazonaws.com'
bucket: kubedb-qa
prefix: demo
• Write declarative Snapshot settings by YAML.
• Simple backup that applies only
this YAML.
• You can select storage,
– S3
– Swift
– Kubernetes Persistent Volume
39
Note : Backup with PostgreSQL + Ceph
$ kubectl exec -it -n rook-ceph rook-ceph-tools-seq -- rbd -p replicapool ls
pvc-bdbc6e53-f6e9-11e8-b0d9-02f062df6b48
$ kubectl exec -it pg-rook-sf-0 -- psql -h localhost -U postgres -c "SELECT pg_start_backup(now()::text);"
pg_start_backup
-----------------
0/C000028
(1 row)
$ kubectl exec -it -n rook-ceph rook-ceph-tools-seq -- rbd snap create replicapool/img@snap
$ kubectl exec -it pg-rook-sf-0 -- psql -h localhost -U postgres -c "SELECT pg_stop_backup();"
NOTICE: pg_stop_backup complete, all required WAL segments have been archived
pg_stop_backup
----------------
0/D000050
(1 row)
• Need to know PostgreSQL & Ceph Commands.
41
4. Kubernetes becomes The Platform
42
To Recap
 The components of database clustering with
Kubernetes Native are already available.
 You can see some operators for DBA task
automation.
 However, it is not over yet.
Cloud Native Storage + + = ???
43
The Signs
I. Pluggable Storage
 Optimized Storage system for DB on K8s?
II. Forked and Cloud-Oriented PostgreSQL
 AWS Aurora, Azure Hyperscale
44
THE LOG IS THE DATABASE.
SQL
Transactions
Caching
Storage
Logging
Storage
Logging
Storage
Logging
CPU
Memory
Cache(SSD)
Page
Cache(SSD) Log
AWS Aurora(PostgreSQL) Azure Hyperscale
• Both divide RDBMS functions and are extended by each cloud.
45
As the platform for PostgreSQL as a Service
DBaaS by Kubernetes
STaaS by Kubernetes
What we got for DBaaS
• HA
• Streaming Replication
• DB Operator
Also for STaaS
• Simple Redundancy
• Distributed Storage
• Interoperable IF(CSI)
• Kubernetes will be "The Platform for Platforms."
46
Questions?
@tzkb
@tzkoba
47
Appendix

More Related Content

PDF
PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 - High Availability, 10 Seconds Failover - Lucky Haryadi
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 Bali - PostgreSQL on K8S at Zalando - Alexander Kukushkin
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 Bali - AppOS: PostgreSQL Extension for Scalable File I/O - K...
Equnix Business Solutions
 
PDF
A guide of PostgreSQL on Kubernetes
t8kobayashi
 
PDF
PGConf.ASIA 2019 Bali - Mission Critical Production High Availability Postgre...
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 Bali - Setup a High-Availability and Load Balancing PostgreS...
Equnix Business Solutions
 
PDF
Deploying postgre sql on amazon ec2
Denish Patel
 
PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed
Equnix Business Solutions
 
PGConf.ASIA 2019 - High Availability, 10 Seconds Failover - Lucky Haryadi
Equnix Business Solutions
 
PGConf.ASIA 2019 Bali - PostgreSQL on K8S at Zalando - Alexander Kukushkin
Equnix Business Solutions
 
PGConf.ASIA 2019 Bali - AppOS: PostgreSQL Extension for Scalable File I/O - K...
Equnix Business Solutions
 
A guide of PostgreSQL on Kubernetes
t8kobayashi
 
PGConf.ASIA 2019 Bali - Mission Critical Production High Availability Postgre...
Equnix Business Solutions
 
PGConf.ASIA 2019 Bali - Setup a High-Availability and Load Balancing PostgreS...
Equnix Business Solutions
 
Deploying postgre sql on amazon ec2
Denish Patel
 

What's hot (20)

PDF
Postgres in Amazon RDS
Denish Patel
 
PDF
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Odinot Stanislas
 
PPTX
Ceph - High Performance Without High Costs
Jonathan Long
 
PDF
PostgreSQL WAL for DBAs
PGConf APAC
 
PDF
GPGPU Accelerates PostgreSQL (English)
Kohei KaiGai
 
PDF
Patroni - HA PostgreSQL made easy
Alexander Kukushkin
 
ODP
PostgreSQL Replication in 10 Minutes - SCALE
PostgreSQL Experts, Inc.
 
PDF
Online Upgrade Using Logical Replication.
EDB
 
PDF
Data warehouse on Kubernetes - gentle intro to Clickhouse Operator, by Robert...
Altinity Ltd
 
PDF
Meet Spilo, Zalando’s HIGH-AVAILABLE POSTGRESQL CLUSTER - Feike Steenbergen
distributed matters
 
PDF
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Danielle Womboldt
 
PDF
Out of the box replication in postgres 9.4
Denish Patel
 
PDF
Out of the box replication in postgres 9.4(pg confus)
Denish Patel
 
PDF
PostgreSQL HA
haroonm
 
PPTX
MySQL Head-to-Head
Patrick McGarry
 
PDF
Operating PostgreSQL at Scale with Kubernetes
Jonathan Katz
 
ODP
Logical replication with pglogical
Umair Shahid
 
PDF
PostgreSQL Replication High Availability Methods
Mydbops
 
PDF
Spark / Mesos Cluster Optimization
ebiznext
 
PDF
PostgreSQL High Availability in a Containerized World
Jignesh Shah
 
Postgres in Amazon RDS
Denish Patel
 
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Odinot Stanislas
 
Ceph - High Performance Without High Costs
Jonathan Long
 
PostgreSQL WAL for DBAs
PGConf APAC
 
GPGPU Accelerates PostgreSQL (English)
Kohei KaiGai
 
Patroni - HA PostgreSQL made easy
Alexander Kukushkin
 
PostgreSQL Replication in 10 Minutes - SCALE
PostgreSQL Experts, Inc.
 
Online Upgrade Using Logical Replication.
EDB
 
Data warehouse on Kubernetes - gentle intro to Clickhouse Operator, by Robert...
Altinity Ltd
 
Meet Spilo, Zalando’s HIGH-AVAILABLE POSTGRESQL CLUSTER - Feike Steenbergen
distributed matters
 
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Danielle Womboldt
 
Out of the box replication in postgres 9.4
Denish Patel
 
Out of the box replication in postgres 9.4(pg confus)
Denish Patel
 
PostgreSQL HA
haroonm
 
MySQL Head-to-Head
Patrick McGarry
 
Operating PostgreSQL at Scale with Kubernetes
Jonathan Katz
 
Logical replication with pglogical
Umair Shahid
 
PostgreSQL Replication High Availability Methods
Mydbops
 
Spark / Mesos Cluster Optimization
ebiznext
 
PostgreSQL High Availability in a Containerized World
Jignesh Shah
 
Ad

Similar to PGConf.ASIA 2019 Bali - Building PostgreSQL as a Service with Kubernetes - Takahiro Kobayashi (20)

PDF
PGConf APAC 2018 - Patroni: Kubernetes-native PostgreSQL companion
PGConf APAC
 
PDF
Deploying PostgreSQL on Kubernetes
Jimmy Angelakos
 
PDF
Cloud Native PostgreSQL - APJ
EDB
 
PPTX
Cloud Native PostgreSQL
EDB
 
PDF
YugabyteDB - Distributed SQL Database on Kubernetes
DoKC
 
PDF
Postgres on Kubernetes - Dos and Donts.pdf
Christoph Engelbert
 
PDF
Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...
DoKC
 
PDF
Running PostgreSQL in a Kubernetes cluster: CloudNativePG
Nick Ivanov
 
PDF
Zero-to-Hero: Running Postgres in Kubernetes
EDB
 
PPTX
Introduction to Kubernetes
Vishal Biyani
 
PDF
Using PostgreSQL With Docker & Kubernetes - July 2018
Jonathan Katz
 
PPTX
Zero-to-hero: Running Postgres in Kubernetes, Enterprise Postgres Day
EDB
 
PPTX
On Docker and its use for LHC at CERN
Sebastien Goasguen
 
PDF
Redis Conf 2019--Container Attached Storage for Redis
OpenEBS
 
PDF
Benchmarking for postgresql workloads in kubernetes
DoKC
 
PDF
Kubernetes: My BFF
Jonathan Yu
 
PDF
Kubernetes Basics - ICP Workshop Batch II
PT Datacomm Diangraha
 
PDF
Cluster management with Kubernetes
Satnam Singh
 
PPTX
Kubernetes Immersion
Juan Larriba
 
PDF
Kubernetes
Linjith Kunnon
 
PGConf APAC 2018 - Patroni: Kubernetes-native PostgreSQL companion
PGConf APAC
 
Deploying PostgreSQL on Kubernetes
Jimmy Angelakos
 
Cloud Native PostgreSQL - APJ
EDB
 
Cloud Native PostgreSQL
EDB
 
YugabyteDB - Distributed SQL Database on Kubernetes
DoKC
 
Postgres on Kubernetes - Dos and Donts.pdf
Christoph Engelbert
 
Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG - Do...
DoKC
 
Running PostgreSQL in a Kubernetes cluster: CloudNativePG
Nick Ivanov
 
Zero-to-Hero: Running Postgres in Kubernetes
EDB
 
Introduction to Kubernetes
Vishal Biyani
 
Using PostgreSQL With Docker & Kubernetes - July 2018
Jonathan Katz
 
Zero-to-hero: Running Postgres in Kubernetes, Enterprise Postgres Day
EDB
 
On Docker and its use for LHC at CERN
Sebastien Goasguen
 
Redis Conf 2019--Container Attached Storage for Redis
OpenEBS
 
Benchmarking for postgresql workloads in kubernetes
DoKC
 
Kubernetes: My BFF
Jonathan Yu
 
Kubernetes Basics - ICP Workshop Batch II
PT Datacomm Diangraha
 
Cluster management with Kubernetes
Satnam Singh
 
Kubernetes Immersion
Juan Larriba
 
Kubernetes
Linjith Kunnon
 
Ad

More from Equnix Business Solutions (20)

PDF
Yang perlu kita ketahui Untuk memahami aspek utama IT dalam bisnis_.pdf
Equnix Business Solutions
 
PDF
Kebocoran Data_ Tindakan Hacker atau Kriminal_ Bagaimana kita mengantisipasi...
Equnix Business Solutions
 
PDF
Kuliah Tamu - Dari Proses Bisnis Menuju Struktur Data.pdf
Equnix Business Solutions
 
PDF
EWTT22_ Apakah Open Source Cocok digunakan dalam Korporasi_.pdf
Equnix Business Solutions
 
PDF
Oracle to PostgreSQL, Challenges to Opportunity.pdf
Equnix Business Solutions
 
PDF
[EWTT2022] Strategi Implementasi Database dalam Microservice Architecture.pdf
Equnix Business Solutions
 
PDF
PostgreSQL as Enterprise Solution v1.1.pdf
Equnix Business Solutions
 
PDF
Webinar2021 - Does HA Can Help You Balance Your Load-.pdf
Equnix Business Solutions
 
PDF
Webinar2021 - In-Memory Database, is it really faster-.pdf
Equnix Business Solutions
 
PDF
EQUNIX - PPT 11DB-Postgres™.pdf
Equnix Business Solutions
 
PPTX
equpos - General Presentation v20230420.pptx
Equnix Business Solutions
 
PDF
Equnix Appliance- Jawaban terbaik untuk kebutuhan komputasi yang mumpuni.pdf
Equnix Business Solutions
 
PDF
OSPX - Professional PostgreSQL Certification Scheme v20201111.pdf
Equnix Business Solutions
 
PDF
Equnix Company Profile v20230329.pdf
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 - The Future of TDEforPG - Taiki Kondo
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 - PGSpider High Performance Cluster Engine - Shigeo Hirose
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 Bali - Keynote Speech 3 - Kohei KaiGai
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 Bali - Keynote Speech 2 - Ivan Pachenko
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 Bali - Keynote Speech 1 - Bruce Momjian
Equnix Business Solutions
 
PDF
PGConf.ASIA 2019 Bali - Modern PostgreSQL Monitoring & Diagnostics - Mahadeva...
Equnix Business Solutions
 
Yang perlu kita ketahui Untuk memahami aspek utama IT dalam bisnis_.pdf
Equnix Business Solutions
 
Kebocoran Data_ Tindakan Hacker atau Kriminal_ Bagaimana kita mengantisipasi...
Equnix Business Solutions
 
Kuliah Tamu - Dari Proses Bisnis Menuju Struktur Data.pdf
Equnix Business Solutions
 
EWTT22_ Apakah Open Source Cocok digunakan dalam Korporasi_.pdf
Equnix Business Solutions
 
Oracle to PostgreSQL, Challenges to Opportunity.pdf
Equnix Business Solutions
 
[EWTT2022] Strategi Implementasi Database dalam Microservice Architecture.pdf
Equnix Business Solutions
 
PostgreSQL as Enterprise Solution v1.1.pdf
Equnix Business Solutions
 
Webinar2021 - Does HA Can Help You Balance Your Load-.pdf
Equnix Business Solutions
 
Webinar2021 - In-Memory Database, is it really faster-.pdf
Equnix Business Solutions
 
EQUNIX - PPT 11DB-Postgres™.pdf
Equnix Business Solutions
 
equpos - General Presentation v20230420.pptx
Equnix Business Solutions
 
Equnix Appliance- Jawaban terbaik untuk kebutuhan komputasi yang mumpuni.pdf
Equnix Business Solutions
 
OSPX - Professional PostgreSQL Certification Scheme v20201111.pdf
Equnix Business Solutions
 
Equnix Company Profile v20230329.pdf
Equnix Business Solutions
 
PGConf.ASIA 2019 - The Future of TDEforPG - Taiki Kondo
Equnix Business Solutions
 
PGConf.ASIA 2019 - PGSpider High Performance Cluster Engine - Shigeo Hirose
Equnix Business Solutions
 
PGConf.ASIA 2019 Bali - Keynote Speech 3 - Kohei KaiGai
Equnix Business Solutions
 
PGConf.ASIA 2019 Bali - Keynote Speech 2 - Ivan Pachenko
Equnix Business Solutions
 
PGConf.ASIA 2019 Bali - Keynote Speech 1 - Bruce Momjian
Equnix Business Solutions
 
PGConf.ASIA 2019 Bali - Modern PostgreSQL Monitoring & Diagnostics - Mahadeva...
Equnix Business Solutions
 

Recently uploaded (20)

PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
Software Development Methodologies in 2025
KodekX
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
Software Development Methodologies in 2025
KodekX
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 

PGConf.ASIA 2019 Bali - Building PostgreSQL as a Service with Kubernetes - Takahiro Kobayashi

  • 1. Building PostgreSQL as a Service with Kubernetes PGConf.Asia 2019 2019/9/9 @tzkb
  • 2. 3 My Actitvities PGConf.Asia 2018 @Tokyo A guide of PostgreSQL on Kuberntes - In terms of Storage - CloudNativeDays Tokyo 2019 The Future of Database on Kubernetes - What run with Cloud Native Storage -  + =∞
  • 3. 4 Agenda 1. Recap: What is Kubernetes? 2. The Issues for Database on Kubernetes 3. How to run your PostgreSQL on K8s 4. Kubernetes becomes The Platform
  • 4. 6 1. Recap: What is Kubernetes?
  • 5. 7 What is Kubernetes? Pod Pod Pod Pod Pod • Kubernetes(K8s) is the orchestrator tool for containers. It has 3 features below. • Declarative config • Auto-healing • Immutable Database is not Immutable.
  • 6. 8 Better to handle the database system by Kubernetes? Node Node Node Master Slave Replicate • The database usually has a state that is not easy to maintain by Kubernetes. <Disadvantage for DB> • Necessary to startup in turn. • Must never lose their data. • Handle the database as pets.
  • 7. 9 Example of Database on Kubernetes: Vitess VTtablet VTtablet VTtablet VTgate app app app SQL SQL SQL • Vitess that used on YouTube is the CNCF incubating project. • Vitess provides MySQL sharding in K8s. • VTgate and VTtablet can scale by K8s. • When terminating a component abnormally, Kubernetes repair it automatically.
  • 8. 10 The choice: How to manage your database Compute Storage Managed Amazon Aurora Amazon Redshift Amazon RDS on Cloud on Kubernetes • You can choose to manage the database by yourself or else.
  • 9. 17 2. The Issues for Database on Kubernetes
  • 10. 18 Kubernetes is the Distributed Systems • Developed as following a distributed architecture. • When doesn’t a node reply – Network partition? – Process failure? – Node failure? • If the disk resource attached, harder to determine. FailOver?
  • 11. 20 Database Architects are familiar with Clustering  “If you don’t know the status, it’s okay. We act on the premise of failsafe.”  “No need to share resources. Right?”  “Both have long been known for database clustering.”
  • 12. 21 Basic: Database Clustering HA (Active/Standby) 1 Sharding Replication (Active/Active) 2or more Instances Redundancy 2 or more Shared Disk Log Shipping --- × Scaleout? Read Read/ Write Failover (Fencing) Availability Promotion (Election) --- • There are differences to build a DB cluster with some nodes.
  • 13. 22 Clustering #1: HA • With Linux-HA • Use high-available shared storage <Worst Case> • Multiple writes to storage <Solution> • Fencing VIP Linux-HA Controller Controller • It's been used since before Linux but helpful.
  • 14. 23 Note: Fencing VIP Linux-HA Controller Controller < When Detecting Node Failure > 1. Forced node power off i. Definite processes stop ii. Unmount storage iii. Detach virtual IP 2. PostgreSQL starts to run on the standby node. • Failed node is isolated from resources = Fencing
  • 15. 24 Clustering #2: Replication WAL • The master can Read/Write, Slaves are Read-Only. • Data synchronization by WAL transmission <Worst Case> • 2 or more Masters <Solution> • Leader Election • Redundancy built into PostgreSQL = Streaming Replication Master SlaveSlave
  • 16. 25 Note: Leader Election WAL Be promoted as a master, The other is still a slave. • Always one master • The former master joins as a slave. <Master in unknown state> 1. The remaining one slave is elected as the leader 2. The leader is promoted as a master. • Algorithms such as Paxos and Raft are used. MasterSlave
  • 17. 26 Clustering #3 Sharding • Divide data between nodes and operates as one DB. • Dispatches queries to relevant nodes. • Basically no availability. • Problems with the transaction. • For rather scalability than availability. Coordinator
  • 18. 27 3. How to Run your on Kubernetes
  • 19. 28 Implemetation Overview : on Kubernetes # Category OSS used Description ⅰ HA • Use Rook/Ceph as Shared Storage. ⅱ • Use LINSTOR/DRBD as Shared Storage. ⅲ Replication • Use Streaming Replication, without Shared Storage. ⅳ Operator • Building and Operating Replication automatically. • We can see following four patterns.
  • 20. 29 • K8s manages everything(DB,storage) • Shared-Storage: Ceph • Fenced by kube-fencing < Disadvantage > • Complicated • Insufficient IO HA (i): Replicas:1 • is deployed as StatefulSet using Rook/Ceph. kube-fencing
  • 21. 30 Note: Without Fencing Replicas:1 • When a node goes down, never failover. • To avoid network partition. • It is by design.
  • 22. 31 Note: What is • Rook is Kubernetes Operator managing Ceph or others. operator agent/discover agent/discover agent/discover osd osd osd mon mon mon CSI csi-provisioner csi-rbdplugin csi-rbdplugin csi-rbdplugin Rook • Rook makes easy to build Ceph cluster. • Also easy to deploy CSI modules. • CSI: Containar Storage Interface
  • 23. 32 HA (ii): Replicas:1 kube-fencing • LINSTOR is Software-Defined Storage based on DRBD. • K8s manages everything(DB,storage) • Redundancy: DRBD • Simple, Read IO without Network < Disadvantage > • Limited to Scale
  • 24. 33 Benchmark Results Single(with EBS) Rook/Ceph DRBD 1nodes 5nodes 2nodes 100 37.8 77.1 • Measured by pgbench for 3 patterns. TPS
  • 25. 34 Replication : proxy proxy proxy keeper keeper keeper sentinel sentinel sentinel • Builds Streaming Replication on top of Kubernetes. • 3 types of processes have different roles • Without Shared- Resources < Disadvantage > • Not builtin Read Off- loading
  • 26. 36 Operator : • KubeDB operates not only but also others. kubedb-operator -0 -1 -2 postgres snapshotdormantdabases • Database Operator for – PostgreSQL – MySQL – Redis • Kubedb-operator builds SR. • Able to get/restore snapshot easily.
  • 27. 37 Example : PostgreSQL Configration by KubeDB apiVersion: kubedb.com/v1alpha1 kind: Postgres metadata: name: ha-postgres namespace: demo spec: version: “10.6-v2" replicas: 3 storageType: Durable storage: storageClassName: "standard" accessModes: - ReadWriteOnce resources: requests: storage: 100Gi  spec.version – Choose PostgreSQL version.  spec.replicas – The number of Instances.  spec.storage – Define storage type/size, etc. • Allows to define Streaming Replication with a simple YAML.
  • 28. 38 Example : Snapshot by KubeDB apiVersion: kubedb.com/v1alpha1 kind: Snapshot metadata: name: snapshot-to-s3 labels: kubedb.com/kind: Postgres spec: databaseName: ha-postgres storageSecretName: s3-secret s3: endpoint: 's3.amazonaws.com' bucket: kubedb-qa prefix: demo • Write declarative Snapshot settings by YAML. • Simple backup that applies only this YAML. • You can select storage, – S3 – Swift – Kubernetes Persistent Volume
  • 29. 39 Note : Backup with PostgreSQL + Ceph $ kubectl exec -it -n rook-ceph rook-ceph-tools-seq -- rbd -p replicapool ls pvc-bdbc6e53-f6e9-11e8-b0d9-02f062df6b48 $ kubectl exec -it pg-rook-sf-0 -- psql -h localhost -U postgres -c "SELECT pg_start_backup(now()::text);" pg_start_backup ----------------- 0/C000028 (1 row) $ kubectl exec -it -n rook-ceph rook-ceph-tools-seq -- rbd snap create replicapool/img@snap $ kubectl exec -it pg-rook-sf-0 -- psql -h localhost -U postgres -c "SELECT pg_stop_backup();" NOTICE: pg_stop_backup complete, all required WAL segments have been archived pg_stop_backup ---------------- 0/D000050 (1 row) • Need to know PostgreSQL & Ceph Commands.
  • 30. 41 4. Kubernetes becomes The Platform
  • 31. 42 To Recap  The components of database clustering with Kubernetes Native are already available.  You can see some operators for DBA task automation.  However, it is not over yet. Cloud Native Storage + + = ???
  • 32. 43 The Signs I. Pluggable Storage  Optimized Storage system for DB on K8s? II. Forked and Cloud-Oriented PostgreSQL  AWS Aurora, Azure Hyperscale
  • 33. 44 THE LOG IS THE DATABASE. SQL Transactions Caching Storage Logging Storage Logging Storage Logging CPU Memory Cache(SSD) Page Cache(SSD) Log AWS Aurora(PostgreSQL) Azure Hyperscale • Both divide RDBMS functions and are extended by each cloud.
  • 34. 45 As the platform for PostgreSQL as a Service DBaaS by Kubernetes STaaS by Kubernetes What we got for DBaaS • HA • Streaming Replication • DB Operator Also for STaaS • Simple Redundancy • Distributed Storage • Interoperable IF(CSI) • Kubernetes will be "The Platform for Platforms."