YugaByte Confidential © 2018 All rights reserved.
1. Introducing YugaByte DB
Karthik Ranganathan - CTO
Cloud Native Data Day, Nov 2018
2. Overview
• YugaByte DB – Our roots, and why they matter
• Why YugaByte DB is a better choice for cloud-native, online applications, for either SQL or NoSQL
• Advantages of partnering with YugaByte
3. About Us
Founders
• Kannan Muthukkaruppan, CEO
• Karthik Ranganathan, CTO
• Mikhail Bautin, Software Architect
Created Cassandra & HBase
Scaled data platforms for growth from 30 Million to 1.4 Billion Users at Facebook
Team members from Oracle, Google, LinkedIn, Nutanix, and MapR have built three or four databases including YugaByte
Funding From Leading VCs in Cloud and Strategic Investors
4. YugaByte story starts with… Facebook in 2007
5. Facebook in 2008-2009…
How to scale to a billion users?
Also: How to survive the week?
6. What happens at 1 billion users?
Dozens of petabytes
Billions of IOPS
Scale out frequently
Rolling upgrades – zero downtime!
7. Transformation of Facebook
How we achieved scale
• Few datacenters → Geo-distributed DCs
• Traditional servers → Containers for applications
• Monolithic applications and services → Microservices-based design
It's all about developer agility
8. How did the tech leaders simplify this?
Application Tier (Stateless Microservices)
Custom Data Platform
Transactional, Performant, Global
But there’s no general platform for the enterprise
9. Typical Stack Today
Fragile infrastructure with many moving parts
Datacenter 1
SQL Master SQL Slave
Application Tier (Stateless Microservices)
Datacenter 2
10. NoSQL + SQL Cloud Native
YugaByte DB Eliminates Complexity in a Cloud-native World
11. YugaByte DB: Developer Agility & Operational Simplicity
[Diagram labels: tablet 1’ (×3)]
Cassandra++
Redis++
BETA
Self-Healing, Fault-Tolerant
Auto Sharding & Rebalancing
Distributed ACID Transactions
Global Data Distribution
High Throughput
Deep Cloud Integration
Standard APIs
Purpose Built Data Fabric
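The slide lists auto sharding without saying how keys are assigned to tablets. As a rough illustration only, here is a minimal sketch of hash-range sharding, one common way to do it. The 16-bit hash space, the use of md5, and the names `key_hash`, `tablet_for` and `placement` are all assumptions made for this sketch, not YugaByte DB's actual implementation.

```python
import hashlib

HASH_SPACE = 0x10000  # assumed 16-bit hash space, for illustration only


def key_hash(key: str) -> int:
    """Map a key to a stable 16-bit hash (md5 is used only for determinism)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:2], "big")


def tablet_for(key: str, num_tablets: int) -> int:
    """Return the index of the tablet whose hash range contains the key."""
    # Each tablet owns a contiguous, equal-sized slice of the hash space.
    return key_hash(key) * num_tablets // HASH_SPACE


def placement(keys, num_tablets):
    """Group keys by the tablet that would own them."""
    groups = {i: [] for i in range(num_tablets)}
    for k in keys:
        groups[tablet_for(k, num_tablets)].append(k)
    return groups


if __name__ == "__main__":
    groups = placement([f"user-{i}" for i in range(1000)], num_tablets=8)
    for tablet, members in groups.items():
        print(tablet, len(members))
```

Because placement depends only on the key's hash, any node can compute where a row lives without consulting a central lookup for every request; rebalancing then amounts to moving whole tablets between nodes.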
12. Do More (or Less) With Your Cassandra Apps
13. Doing More (or Less)
More Developer Agility:
• Extending Cassandra:
  • Strong consistency
  • Consistent secondary indexes
  • JSON data
  • Distributed transactions
• 10x data per node
• Superior performance
  • 2.7x throughput
  • 50% lower P99 latency
• Streaming ingest performance without a separate SSTable load pipeline
Less Operational Complexity:
• Fewer nodes
• Expand in minutes not days
• Less time maintaining, tuning and managing:
  • No read repairs or anti-entropy
  • No tombstones or reappearing deletes
  • No garbage collection pauses
• Reduced RTO and RPO
• More frequent backups
14. YugaByte DB: 10x More Data per Node
High-density benchmark:
• 26TB over 4 YugaByte DB nodes compared to 30 nodes for Cassandra
• 385K reads/sec (0.25 ms) & 6.5K writes/sec on “Recent Data” workload
• Expand to 5 nodes (complete in 8 hours), data available in 5 minutes
• Induced node failure. Cluster rebalanced in 2.2 hours
10x Density
Minutes not days
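The node counts in the benchmark translate directly into per-node density. The short calculation below uses only the figures on the slide and assumes the 30-node Cassandra figure refers to the same 26 TB dataset, which the slide does not state explicitly. The variable names are illustrative.

```python
DATASET_TB = 26        # from the slide
YUGABYTE_NODES = 4     # from the slide
CASSANDRA_NODES = 30   # from the slide; assumed to hold the same 26 TB

# Data stored per node in each cluster.
yb_tb_per_node = DATASET_TB / YUGABYTE_NODES
cassandra_tb_per_node = DATASET_TB / CASSANDRA_NODES

# How many Cassandra nodes each YugaByte DB node replaces in this benchmark.
node_ratio = CASSANDRA_NODES / YUGABYTE_NODES

print(f"YugaByte DB: {yb_tb_per_node:.2f} TB per node")
print(f"Cassandra:   {cassandra_tb_per_node:.2f} TB per node")
print(f"Node ratio:  {node_ratio:.1f}x")
```

On these two numbers alone the ratio comes to 7.5, so the rounded "10x" headline presumably rests on further sizing assumptions that are not shown on the slide.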
15. YugaByte DB: Improved Apache Cassandra Performance
[Chart: per-workload throughput gains of 2.6x, 2.2x, 1.9x, 3.9x, 3.0x and 2.3x]
2.7x Throughput
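The slide does not say how the 2.7x headline is derived from the six per-workload figures. One plausible reading, offered here as an assumption, is that it is their arithmetic mean rounded up; the check below uses only the numbers from the chart.

```python
from statistics import mean

# Per-workload throughput gains as they appear on the slide.
gains = [2.6, 2.2, 1.9, 3.9, 3.0, 2.3]

average_gain = mean(gains)                 # roughly 2.65
smallest, largest = min(gains), max(gains)

print(f"mean gain: {average_gain:.2f}x (range {smallest}x to {largest}x)")
```

The mean of about 2.65 is consistent with the 2.7x headline, while the spread from 1.9x to 3.9x shows that the benefit depends heavily on the workload.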
16. Best Distributed SQL
17. Best Distributed SQL: Background
• Directly Re-use PostgreSQL
  • Re-use of PostgreSQL code base versus re-write in Go
  • 100% compatible with PostgreSQL
  • YugaByte will stay in sync with PostgreSQL (e.g. PostgreSQL v12)
• Core data engine built for cloud-native environments
  • Customized combination of RocksDB, Raft and PostgreSQL
  • Database written in C++
  • Superior performance
• Team members have developed and operated multiple databases
  • Developed HBase, Cassandra, Presto, Hive and now YugaByte
  • Extensive hands-on implementation experience
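Since the engine builds on Raft, it may help to recall the basic arithmetic behind Raft replication: a write is committed once a majority of replicas have acknowledged it, so a group keeps working as long as a majority is alive. The sketch below shows that arithmetic only; the function names and the replication factors tried are illustrative, not taken from the deck.

```python
def majority(replicas: int) -> int:
    """Smallest number of replicas that forms a majority."""
    if replicas < 1:
        raise ValueError("need at least one replica")
    return replicas // 2 + 1


def tolerated_failures(replicas: int) -> int:
    """How many replicas can fail while a majority is still reachable."""
    return replicas - majority(replicas)


if __name__ == "__main__":
    for rf in (3, 4, 5):
        print(rf, majority(rf), tolerated_failures(rf))
```

Going from 3 to 4 replicas does not buy any extra fault tolerance, which is why odd replication factors are the usual choice for majority-based protocols.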
18. Re-use not Re-write of PostgreSQL
CLIENT
Postmaster (Authentication, authorization)
Rewriter, Planner, Optimizer, Executor
WAL Writer, BG Writer…
DISK
Reuse stateless PostgreSQL
19. Re-use not Re-write of PostgreSQL
CLIENT
Postmaster (Authentication, authorization)
Rewriter, Planner, Optimizer, Executor
YugaByte Node, YugaByte Node, …, YugaByte Node
Replace table storage with YugaByte DB
20. Re-use not Re-write of PostgreSQL
CLIENT
Postmaster (Authentication, authorization)
Rewriter, Planner, Optimizer, Executor
YugaByte Node, YugaByte Node, …, YugaByte Node
Enhance optimizer and executor for distributed DB
21. YugaByte PostgreSQL feature-set support
Expect to support most PostgreSQL features
• All data types
• Built-in functions and expressions
• Various kinds of joins
• Constraints (primary key, foreign key, unique, not null, check)
• Secondary indexes (including multi-column and covering columns)
• Distributed transactions (Serializable and Snapshot Isolation)
• Views
• Stored Procedures
• Triggers
YugaByte DB inherits features developed in PostgreSQL. Vendors who re-write PostgreSQL will need to reimplement from scratch.
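To make a few of the listed features concrete, the sketch below exercises constraints, a multi-column secondary index, a view, a join and a transaction. It uses Python's built-in `sqlite3` module purely as a stand-in so that it runs without a server; it is not PostgreSQL or YugaByte DB, and the table, column and helper names are hypothetical.

```python
import sqlite3

# SQLite stands in for a SQL database here; all names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite needs this to enforce FKs
conn.executescript("""
CREATE TABLE users (
    id    INTEGER PRIMARY KEY,
    email TEXT NOT NULL UNIQUE
);
CREATE TABLE orders (
    id      INTEGER PRIMARY KEY,
    user_id INTEGER NOT NULL REFERENCES users(id),
    amount  INTEGER NOT NULL CHECK (amount > 0)
);
CREATE INDEX orders_by_user ON orders (user_id, amount);
CREATE VIEW user_totals AS
    SELECT u.email, SUM(o.amount) AS total
    FROM users u JOIN orders o ON o.user_id = u.id
    GROUP BY u.email;
""")

with conn:  # one transaction: all three inserts commit together
    conn.execute("INSERT INTO users (id, email) VALUES (1, 'a@example.com')")
    conn.execute("INSERT INTO orders (user_id, amount) VALUES (1, 40)")
    conn.execute("INSERT INTO orders (user_id, amount) VALUES (1, 2)")


def try_insert(sql: str) -> bool:
    """Return True if the statement is accepted, False if a constraint rejects it."""
    try:
        with conn:
            conn.execute(sql)
        return True
    except sqlite3.IntegrityError:
        return False


totals = conn.execute("SELECT email, total FROM user_totals").fetchall()
print(totals)
print(try_insert("INSERT INTO orders (user_id, amount) VALUES (1, -5)"))   # check
print(try_insert("INSERT INTO orders (user_id, amount) VALUES (99, 5)"))   # foreign key
print(try_insert("INSERT INTO users (id, email) VALUES (2, 'a@example.com')"))  # unique
```

Stored procedures and triggers are left out because their syntax differs most between engines; the point is only to show the kind of behavior an application expects the database to enforce.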
22. Best Cloud-native DB Orchestration
23. Transformation of Infrastructure & Apps
• Few datacenters → Geo-distributed DCs
• Traditional servers → Containers for applications
• Monolithic applications and services → Microservices-based design
24. Operational Data Tier has Lagged
?
Legacy databases
Open source, cloud-native databases
25. Why YugaByte DB for Cloud-native Apps?
Legacy databases
Open source, cloud-native databases
• Deep cloud integration
• K8s integration including PKS
• Intent based deployments
• Self-service development
26. Deep Cloud Integration including K8s
27. REAL-WORLD CASE STUDIES
28. Global User Identity - login, change password, view profile
Current State
  1. MySQL master-slave replication
  2. Cassandra cross-DC queue for cache updates
  3. Per-DC Couchbase for caching
29. Global User Identity - login, change password, view profile
With YugaByte DB
Unified platform
Zero data loss even on region failures
Add new regions with ease
1-click Deployment of Primary Cluster and Read Replicas
Read Replicas
30. Financial Data Service – Market Data
Current State
Redis cluster for low latency reads
Fragile (manually sharded & load balanced)
Expensive (entire dataset in memory)
On-premises only, need hybrid/public cloud scaling and distribution
DB
31. Financial Data Service
With YugaByte DB
Higher release velocity
Cost-efficient storage
Faster cloud migration
1-click Deploy of Redis as a Primary Database
32. Real-time analytics – large datasets with indexes
Crypto-Currency Fraud Detection
• Production dataset size is 4TB+
• Regular load of crypto transactions
• High throughput writes and reads
• Spark analytics for fraud detection
• Need batch loadable secondary index
• Bitcoin to wallet and reverse lookup
• Replaces
  • DataStax Enterprise (Cassandra)
33. Better user features, geo-distributed data
Financial Market Data
• Improve service SLAs for B2B users
• Low latency reads on large scans
• Fix cache inconsistency
• Multi-region replication (US and Asia)
• Dataset size is 10TB+
• Replaces
  • Redis Enterprise
  • AWS RDS MySQL
34. Why Partner with YugaByte?
• Deep Experience building High Performance, Scale-Out Databases
• Reduced Risk:
  • Open source
  • Open standards – PostgreSQL, Cassandra and Redis
  • Multi-model
  • Cloud-agnostic: Private and Hybrid
• Alignment with your eco-system (Cassandra, PostgreSQL, Redis, Multi-Cloud, PKS)
• Economic benefits:
  • Cost savings: Hardware, Software Licensing, Operational
  • Flexible Commercial model
35. Q&A

A Planet-Scale Database for Low Latency Transactional Apps by Yugabyte


Editor's Notes

  • #3: Our Roots – Why you should have confidence in YB
  • #4: Founded by a team from Facebook: 9 members of the core Data Infrastructure team at FB, from 2006 to 2013. Unique journey: started off on bare metal, moved to containers, and had to address multiple DCs in a very short time, with over 1 billion people needing low-latency reads all across the planet. Facebook Messenger (Inbox/Messages), Operations Data Store, Site Integrity, Application Fraud Detection. Needed strong consistency for Site Integrity and Fraud, so they created HBase. Determined there was a strong need for a cloud-based DB platform. Maturity of the company: went GA with 1.0 in April 2018. Added Oracle and Nutanix personnel. Just closed a round with Lightspeed and Dell Technologies Capital. Scaled from 30M to 1.2B.
  • #5: Huge amount of growth in users
  • #9: Custom tier abstracts the complexity: transactional, performant, scales, developer agility. Open APIs you already know and use. Our APIs extend the capabilities: our SQL functionality is now in NoSQL, and NoSQL functionality is now in SQL.
  • #10: Does this look like your environment? Describe the pain and process for the developers. How do they sync? By pushing it up into the app layer; you end up implementing workarounds, slowing your dev cycles. Data tier is usually behind the application tier. Application tier is usually stateless. Transactional data is written into a SQL DB that is manually sharded. CLICK. Now you need to be in 2 data centers, so you have to replicate the data, so you usually have Cassandra or MongoDB. Data starts to get siloed in the organization, and because performance isn't what it should be, you introduce a caching layer such as Redis. So you end up with 4 data stores, a lot of complexity, and your architecture becomes BRITTLE. Does moving to the cloud affect this? NO, you are just using cloud versions of these solutions. Cassandra consistency: R+W>N (N = replication factor). Quorum is easy when you are using eventual consistency. Consistency: strong vs eventual. 3 reasons why this is not agile. 3 observations why it is operationally complex.
  • #12: We built a new core engine for the data fabric. Open standards. Purpose-built data fabric: a new cloud-native database, not another legacy database. Multi-cloud. Nomenclature check.
  • #13: Cassandra friendly. Existing Cassandra apps can run against YB.
  • #15: Turvo: built an app that unifies all the functions they need to monitor; started with Mongo. Narvar: OEM for customer experience; handles the entire customer experience; started on Dynamo and ElastiCache. SQL and NoSQL requirements meant we were a good fit for them. For retailers, it is very important to scale to meet the holiday buying season.
  • #18: Credibility slide
  • #19: YB will be 100% PostgreSQL compatible
  • #27: Multi-cloud setup. Zero-downtime migrations and upgrades. Rolling upgrades. Trivial to bring on new regions. Trivial to bring on new IaaS, including database.
  • #29: This use case is good to orient you around how YugaByte provides value for an organization. This is a web conferencing application (video/audio/chat). The service using YugaByte is for user identity: user name, password, attributes. Millions of users, both individuals and corporations. Important to know where they were logging in from. Build out slide. User name needs to be strongly consistent; it can't be eventually consistent. The initial write (username and password) goes to a SQL DB (MySQL), which is replicated for redundancy in a master/slave configuration. All of the writes were happening in the US, but the reads were happening across the globe, so they replicated the data in Cassandra and used that to stage the data across the globe, and to achieve low-latency reads they were using Couchbase as a caching tier. Lots of complexity, which leads to cost, but the biggest cost was opportunity cost: whenever there was a change to the application, the developers needed to coordinate across 3 different DBs. With YugaByte, you have one cluster taking writes (a homogeneous DB), and the data is replicated to 3 data centers to achieve low-latency reads. BENEFITS: writing to 1 platform; we maintain data resiliency; feature velocity and agility for the company.
  • #31: Focuses on Redis as a DB. Redis is great as a caching tier, with the benefit of ease of use for some data types. Large website for obtaining news, stock tickers and information about companies; to persist the data, it was written to a homegrown DB. On the far right side, a large Redis cluster was used to achieve low-latency reads. Adding functionality or applications was very time consuming: it meant they had to manually shard the cluster, which made it VERY brittle (it usually took 6-9 months) and started a cycle of deploying additional nodes, resharding, and redeploying. You also had to write to 2 separate DBs, which creates additional complexity.
  • #32: Now, with YugaByte, they write into 1 cluster (using the Redis API). Reads come into the Redis tier from the app itself, and low-latency reads are served in single-digit milliseconds (6 or less). Started with a 4-node cluster. Doubling to 8 nodes took only 30 minutes (versus 6-9 months). They also wanted the ability to burst into the cloud (which we do via our read replication). Multi-cloud, multi-datacenter.
  • #35: Turvo: built an app that unifies all the functions they need to monitor; started with Mongo. Narvar: OEM for customer experience; handles the entire customer experience; started on Dynamo and ElastiCache. SQL and NoSQL requirements meant we were a good fit for them. For retailers, it is very important to scale to meet the holiday buying season.
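
The note on the typical stack cites the Cassandra consistency rule R+W>N, where N is the replication factor, W the number of replicas that must acknowledge a write, and R the number consulted on a read. When R+W>N, every read set overlaps every write set in at least one replica, so a read reaches at least one replica holding the latest acknowledged write. A minimal sketch of that check follows; the function names and the sample settings are illustrative.

```python
def quorum(n: int) -> int:
    """Replica count for a QUORUM operation with replication factor n."""
    return n // 2 + 1


def read_overlaps_write(r: int, w: int, n: int) -> bool:
    """True when every read set must share a replica with every write set."""
    if not (1 <= r <= n and 1 <= w <= n):
        raise ValueError("r and w must be between 1 and n")
    return r + w > n


if __name__ == "__main__":
    n = 3
    print(read_overlaps_write(quorum(n), quorum(n), n))  # QUORUM reads and writes
    print(read_overlaps_write(1, 1, n))                  # ONE / ONE
    print(read_overlaps_write(1, n, n))                  # read ONE, write ALL
```

With N=3, QUORUM reads and writes (2+2>3) satisfy the rule, while reading and writing at ONE (1+1) does not, which is the eventual-consistency trade-off the note alludes to.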