October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and Spark

Splice Machine Proprietary and Confidential
Open Source RDBMS
For Mixed Operational and Analytical Workloads
Monte Zweben
October 20, 2016

Who We Are
The Open Source RDBMS Powered By Hadoop & Spark
2
ANSI SQL
No retraining or rewrites for SQL-based
analysts, reports, and applications
¼ the Cost
Scales out on
commodity hardware
SQL Scale Out Speed
Transactions
Ensure reliable updates
across multiple rows
Mixed Workloads
Simultaneously support
OLTP and OLAP workloads
Elastic
Increase scale in
just a few minutes
10x Faster
Leverages Spark
in-memory technology

Life Sciences
Digital Marketing Financial Services
DECISIONS IN THE MOMENT
Supply Chain Optimization

Today’s Reality: Stale Data, Backward-Looking Decisions
4
How old is the data in your reports?
 1 day +
 1 day
 4 hours +
 1 hour +
 Real-time

Today’s Reality: Stale Data, Backward-Looking Decisions
5
24%
50%
7%
9%
9%
* Source: Webinars on 11-3-15 and 12-10-15, 237 respondents
How old is the data in your reports?
 1 day +
 1 day
 4 hours +
 1 hour +
 Real-time

Legacy ETL Architectures Unable to Keep Up
Ad Hoc
Analytics
Executive
Business Reports
Operational Reports
ERP
CRM
Supply
Chain
HR
…
Data
Warehouse
Datamart
Stream or Batch
Updates
Mixed Workload
Apps
ODS
ETL
OLTP
Systems
Extract
Transform
Load
OLAP
Systems Pain
 Separate OLTP & OLAP
systems
 Messy ETL “glue”
 Why?
 Different workloads
 Different data structures
 Hard to isolate workloads
 No longer adequate
 Can’t afford to wait days or
hours to analyze data
6

Recent Approach: Lambda Architecture
Complex to setup and maintain
7
Speed Layer
Batch Layer
Serving Layer
Developer Integrates Specialized Compute Engines

New Approach: Lambda-In-A-Box Architecture
Easy to use with SQL
8
Speed Layer
Batch Layer
SQL Optimizer Selects Pre-Integrated Compute Engines
Serving Layer

Simultaneous OLTP & OLAP Workloads
9
Unique Dual-Engine Architecture isolates workloads
Traditional RDBMSs Splice Machine
HBASE
Engine
SPARK
Engine
BOTTLENECKS, DELAYS
O L A P
WORKLOAD ISOLATION
O L T P
K E Y

Simultaneous OLTP & OLAP Workloads
10
Unique Dual-Engine Architecture isolates workloads
Traditional RDBMSs Splice Machine
As OLAP load rises,
OLTP response times increase
OLAP LOAD
OLTPRESPONSETIME
As OLAP load rises,
OLTP response times remain flat
OLAP LOAD
OLTPRESPONSETIME

Power Old and New Applications

Proven Building Blocks: Spark, Hadoop and Derby
Apache Derby
 ANSI SQL-99 RDBMS
 Java-based
 ODBC/JDBC Compliant
Apache HBase/Hadoop
 Auto-sharding
 High availability
 Scalability to 100s of PBs
Apache Spark
 Analytical engine
 Fast, in-memory technology
 Memory resilient to node failure
12

HBase: Proven Scale-Out
 Auto-sharding
 Scales with commodity hardware
 Cost-effective from GBs to PBs
 High availability thru failover and replication
 LSM-trees
13

Apache
14
Unmatched Performance
 Fastest sort of 1PB of data
Advanced In-Memory Technology
 Spill-to-disk for large datasets
 Resilient against node failures
 Pipelining for computation parallelism
Most Active Apache Community
 Almost 1000 contributors
Extensive Libraries
 Over 140 and growing
 Libraries for machine learning,
streaming and graph processing

Splice Machine: Advanced Spark Integration
15
Innovative, High-Performance
RDD Creation
 Fast access to HFiles in HDFS
 Merged with deltas from Memstore
 Avoids slower HBase API
Universal Execution Plan
and Byte Code
 Optimizer, plan and code shared across
Spark or HBase execution
•••
HBase Region Server
HDFS
•••
Region 1
Memstore
Spark Worker
•••RDD 1
HFile HFile•••
P H Y S I C A L N O D E
RDD N
HFile••• HFile•••
Region N
Memstore
HBase Region Server
HDFS
•••
Region 1
Memstore
Spark Worker
•••RDD 1
HFile HFile•••
P H Y S I C A L N O D E
RDD N
HFile••• HFile•••
Region N
Memstore

Splice Machine Architecture
1. Standard install of HBase
Cluster (HBase, HDFS,
ZooKeeper) with Spark
HBase
Co-Processor
L
E
G
E
N
D
2. Distribute Splice Machine
JAR to each region server
3. Automatically invoke co-
processors on each region
16
Cach
e
•••
Tas
k
Executor
Tas
k
HBase Region Server
•••
HDFS
SPLICE PARSER
SPLICE PLANNER
SPLICE OPTIMIZER
SPLICE EXECUTOR
• Snapshot Isolation
• Indexes
Region Region
SPLICE EXECUTOR
• Indexes
Spark Worker RDD
Spark Master
RDD
Cach
e
•••
Tas
k
Executor
Tas
k
•••
•••
•••
Cach
e
•••
Tas
k
Executor
Tas
k
HBase Region Server
HDFS
SPLICE PARSER
SPLICE PLANNER
SPLICE OPTIMIZER
SPLICE EXECUTOR
• Indexes
Region Region
SPLICE EXECUTOR
• Indexes
Spark Worker RDDRDD
Cach
e
•••
Tas
k
Executor
Tas
k
•••
•••
•••
HMasterZookeeper

Splice Machine: Query Execution
17

18
1. Parse SQL
• Generate Abstract Syntax Tree (AST)
• Bind AST to Transactional Dictionary

19
1. Parse SQL
2. Optimize query plan
• Determine access plan (e.g., base table,
index), join order and join algorithm
using cost-based statistics (e.g.,
cardinality estimates)
• Unroll nested subqueries

20
3. Generate optimal byte code
1. Parse SQL

21
OLTP Execution on HBase
4a. Execute OLTP query from
byte code
5a. Use block cache and bloom
filters to optimize data access
6a. Return results
1. Parse SQL

22
OLAP Execution on Spark
4b. Generate Spark execution plan
OLTP Execution on HBase
4a. Execute OLTP query from
byte code
5a. Use block cache and bloom
filters to optimize data access
6a. Return results
1. Parse SQL
OLAP Execution on Spark
4b. Generate Spark execution plan
5b. Submit Spark plan with byte code
6b. Fair scheduling of distributed of tasks
7b. Generate RDD from HFiles and Memstore
8b. Execute query and return results

Isolated Resource Management
23
Isolate Spark & HBase resources through Linux Cgroups

Isolated Resource Management
24
Isolate Spark & HBase resources through Linux Cgroups

Configurable Spark Resource Management
25
Prioritize Spark resources between Query, Admin & Import jobs
Custom resource pools
through XML

Spark Query Management
26
Visualization of active and completed queries

Spark Query Management (cont’d)
27
Visualization of stages for each query, plus kill function

28
Visualization of stages for query plan, plus kill function

29
Detailed metrics for tasks in each stage

30

Working With External Data and Compute Engines
31
Virtual Table Interface (VTI)
 Execute federated queries against external
files, libraries or databases
 External Databases
 Use JDBC to access data in DBs such as Oracle
and DB2
 External Libraries
 Access over 140 Spark libraries for machine
learning and streaming
 External Files
 Pre-defined or dynamic schema
 Access local FS, HDFS, AWS S3
 Sample query:
MapReduce I/O Formats
 Accept federated queries from
MapReduce, Pig, and Hive
 Register Splice Machine schema in
HCATALOG
 Merge structured (Splice) and
unstructured data in ad-hoc query
 Seamless integration to Hadoop
ecosystem

ANSI SQL-99+ Coverage
32
 Data types – e.g., INTEGER, REAL,
CHARACTER, DATE, BOOLEAN, BIGINT
 DDL – e.g., CREATE TABLE, CREATE SCHEMA,
ALTER TABLE, DELETE, UPDATE TABLE
 Predicates – e.g., IN, BETWEEN, LIKE, EXISTS
 DML – e.g., INSERT, DELETE, UPDATE, SELECT
 Query specification – e.g., GROUP BY,
HAVING
 SET functions – e.g., UNION, ABS, MOD, ALL,
INTERSECT, EXCEPT
 Aggregation functions – e.g., AVG, MAX,
COUNT
 String functions – e.g., SUBSTRING,
concatenation, UPPER, LOWER, TRIM,
LENGTH
 Constraints – e.g., PRIMARY KEY, CHECK,
FOREIGN KEY, UNIQUE, NOT NULL
 Conditional functions – e.g., CASE,
searched CASE
 Privileges – e.g., privileges for SELECT,
DELETE, INSERT, EXECUTE
 Joins – e.g., INNER JOIN, LEFT OUTER JOIN
 Transactions – e.g., COMMIT, ROLLBACK,
Snapshot Isolation
 Sub-queries
 Triggers
 User-defined functions (UDFs)
 Views – including grouped views
 Window Functions – e.g., FIRST_VALUE,
LAST_VALUE, LEAD, LAG

Splice Machine Proprietary and Confidential 33
High Concurrency, ACID transactions
Required to support OLTP applications
share_quantity share_price
TIMESTAMP VALUE TIMESTAMP VALUE
T12 4,000
“Virtual”
Snapshot
T7 $15.11
T7 2,000 T5 $15.65
T3 5,000
Transaction
@T6
T2 $15.74
T1 3,000 T0 $15.27
T3 5,000
Transaction
@T6
T2 $15.74
T5 $15.65
value_held = share_quality* share_price
@T6: value_held = 5,000 * $15.65
@T3: value_held = 5,000 * $15.74
 State-of-the-art, distributed
snapshot isolation
 Form of Multi-Version
Concurrency Control (MVCC)
 Writers do not block readers
 Fast, high concurrency
 Delivers performance for small
reads/writes & batch loads
 Extends research from Google
Percolator & Yahoo Labs
 Patent pending technology

BI and SQL tool support via ODBC/JDBC
34
No application rewrites needed

Open Source
Features Community
Edition
Enterprise
Edition
Scale-out Architecture, ANSI SQL & Concurrent ACID Transactions ✓ ✓
OLAP and OLTP Resource Isolation ✓ ✓
Distributed In-Memory Joins, Aggregations, Scans and Groupings ✓ ✓
Cost-Based Statistics, Query Optimizer, Management Console ✓ ✓
Compaction Optimization ✓ ✓
Apache Kafka-enabled Streaming ✓ ✓
Virtual Table Interfaces ✓ ✓
New Releases and Maintenance Updates ✓ ✓
Tutorials, Forums, Videos, Documentation, Community Support ✓ ✓
Backup and Restore, Column Access Control ✓
Encryption, Kerberos, LDAP Support ✓
24/7 Support via Web and Phone ✓
Complimentary Account Management Services ✓

Try it at scale immediately on AWS Sandbox
 5 Click Sand Box
 Cluster has full system deployed
 SSH for CLI
 URL to Management Consoles
 Open SQL connection on any
node
 Customize template

Community
 Slack channel - #splicecommunity
 Video and code tutorials
 GitHub

Advisory Board
41
Advisory Board includes luminaries in databases and technology
Roger Bamford
Former Principal Architect at Oracle
Father of Oracle RAC
Mike Franklin
Chair,Dept of Computer Science, UChicago
Director, UC Berkeley AMPLab
Founder of Apache Spark
Marie-Anne Neimat
Co-Founder, Times-Ten Database
Former VP, Database Eng. at Oracle
Ken Rudin
Head of Growth and Analytics for Google Search
Head of Analytics at Facebook
Abhinav Gupta
Co-Founder, Rocket Fuel
Runs 15PB HBase Cluster

Splice Machine Proprietary and Confidential 42
WE ARE HIRING

Seasoned Team
43
Monte Zweben
Co-Founder &
Chief Executive
Officer
John Leach
Co-Founder &
Chief Technology
Officer
St. Louis
Hadoop
User Group
Krishnan
Parasuraman
VP of Sales and
Business
Development
Eran Pilovsky
Chief Financial
Officer
Gene Davis
Co-Founder & VP
of Products &
Operations
Eric Kalabacos
VP of Customer
Solutions

Next Steps
44
Try Us!
splicemachine.com/get-started
GitHub • Tutorials • Sandbox

Powering Real-Time
Applications & Analytics
Enabling Decisions in the Moment
October 20, 2016

October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and Spark

More Related Content

What's hot (20)

Similar to October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and Spark (20)

More from Yahoo Developer Network (20)

Recently uploaded (20)

October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and Spark

Editor's Notes