Bringing OLTP woth OLAP: Lumos on Hadoop

Scaling ETL on Hadoop: Bridging OLTP with OLAP

Agenda
 Data Ecosystem @ LinkedIn
 Problem : Bridging OLTP with OLAP
 Solution
 Details
 Conclusion and Future Work
2

Data Ecosystem - Overview
4
Serving App
Online Stores
Espresso
Oracle
MySQL
Logs
Analytics Infra
Business
Engines
Serving
OLAP

Data Ecosystem – Data
5
 Tracking Data
 Tracks user activity at web site
 Append only
 Example: Page View
 Database Data
 Member provided data in online-stores
 Inserts, Updates and Deletes
 Example: Member Profiles, Likes, Comments

Problem
Scaling ETL on Hadoop
6

Bridging OLTP to OLAP
7
OLTP OLAP
 Integrating site-serving data stores with Hadoop
at scale with low latency.
 Critical to LinkedIn’s
 Member engagement
 Business decision making
Kafka
Engines
Serving
OLAP
Databases
Tracking Data
Espresso
Oracle
MySQL

Challenge - Scalable ETL
8
 600+ Tracking topics
 500+ Database tables
 XXX TB of Data at rest
 X TB of new data generated per day
 5000 Nodes, Several Hadoop clusters
Kafka
Engines
Serving
OLAP
Databases
Tracking Data
Espresso
Oracle
MySQL
OLTP OLAP

Challenge – Consistent Snapshot with SLA
9
 Apply updates, deletes
 Copy full tables
 But, resource overheads
 Small fraction of data changes
Kafka
Engines
Serving
OLAP
Databases
Tracking Data
Espresso
Oracle
MySQL
OLTP OLAP

Engines
Requirements
10
OLTP
Oracle Espresso
OLAP
 Refresh data on HDFS frequently
 Seamless handling of schema evolution
 Optimal resource usage
 Handle multi data centers
 Efficient change capture on source
 Ensure Last-Update semantics
 Handle deletes
Serving
OLAP
Database Data
Tracking Data

Lumos
12
Data Capture
 Can use commit logs
 Delta processing
 Latencies in minutes
 Schema agnostic framework
Databus
Others
Hadoop : Data Center
DB
Extract
Files
Data Center
Colo-1
Databases
Colo-2
Databases
Lumos
databases
(HDFS)
dbchanges
(HDFS)

Lumos – Multi-Datacenter
13
Data Capture
 Handle multi-datacenter stores
 Resolve updates via commit order
Databus
Others
Hadoop : Data Center
DB
Extract
Files
Data Center
Colo-1
Databases
Colo-2
Databases
Lumos
databases
(HDFS)
dbchanges
(HDFS)

Lumos – Data Organization
14
-
Virtual Snapshot
HDFS Layout
InputFormat
Pig&Hive
Loaders
 Database Snapshot
- Entire database on HDFS
- With added latency
 Database Virtual Snapshot
- Previous Snapshot + Delta
- Enables faster refresh
/db/table/snapshot-0
_delta
dir-1
dir-2
dir-3

Lumos - High Level Architecture
15

Virtual
Snapshot
Builder
ETL Hadoop Cluster
Staging
(internal)
Lazy
Snapshot
Builder
User
Jobs
HDFS
Published
Virtual
Snapshot
MR/Pig/Hiv
e
Loaders
Compactor
Change
Captur
e Increments
Pre-
Process
Full Drops

Alternative Approaches
 Sqoop
 Hbase
 Hive Streaming
16

Change Capture – File Based
18
 File Format
 Compressed CSV
 Metadata
 Full Drop
 Via Fast Reader (Oracle, MySQL)
 Via MySQL backups (Espresso)
 Runs for hours with Dirty reads
 Increments
 Via SQL
 Transactional
Full Drop
1am 4am
Inc
h-1
Inc
h-2
Inc
h-3
2am 3am
Prev.
HW
New
High-water mark
DB
Files
Web
Service
HDFS
HTTPS
Pulls
Inc
H-4

Change Capture – Databus Based
19
Databus
Relay
Mapper
Databus
Consumer
dbchanges
(HDFS)
Reducer
Database
Mapper
Databus
Consumer
Reducer
 Reads Database commit logs
 Multi datacenter via Databus Relay
 Runs as MR Job
 Output : date-time partitioned with multiple versions
 True change capture (including hard deletes)
Databus
RelayDatabase
Hadoop

Pre-Processing
20
 Data format conversion
 Field level transformations
 Privacy
 Cleansing – Eg. Remove recursive schema
 Metadata annotation
 Add row counts for data validation
Virtual
Snapshot
Builder
(HDFS)
Internal
Staging
Lazy
Snapshot
Builder
User Jobs
(HDFS)
Published
Virtual
Snapshot
MR/Pig/Hive
Loaders
Compactor
Change
Capture Increments
Pre-
Process
Full Drops

Snapshotting – Lazy Materializer
21
 One MR job per table, consumes full drops
 Supports dirty reads.
 Hash Partition on primary key
 Number of partitions based on data size
 Sorts on primary key
 Results published into staging directory
Virtual
Snapshot
Builder
(HDFS)
Internal
Staging
Lazy
Snapshot
Builder
User Jobs
(HDFS)
Published
Virtual
Snapshot
MR/Pig/Hive
Loaders
Compactor
Change
Capture Increments
Pre-
Process
Full Drops

Snapshotting – Virtual Snapshot Builder
22
 One MR Job for all tables
 Identifies all existing snapshots, both published and staged
 Creates appropriate delta partitions for every snapshot
 Delta partition count equals Snapshot partition count
 Club multiple partition in one file
 Outputs latest row using delta column
 Publishes staged snapshots with new deltas
 Previously published snapshots updated with new deltas
Virtual
Snapshot
Builder
(HDFS)
Internal
Staging
Lazy
Snapshot
Builder
User Jobs
(HDFS)
Published
Virtual
Snapshot
MR/Pig/Hive
Loaders
Compactor
Change
Capture Increments
Pre-
Process
Full Drops

Snapshotting – Virtual Snapshot Builder
23
(10 partitions, 10 Avro files)
_delta
inc-1
(10 partitions, 2 Avro file)
Part-0 . .
.Part-9
Index files
Inc-2
Part-0
Part-5
Part-0
 Incremental data is small
 Rolls increments
 Avoid creating small files
 Equi-partitions INC as Snapshot
 Seek and Read a partition
Partition-0
Part-0.avro File
Partition-4
Partition-5
Partition-9
Index file
Index files
Part-5
Index file
Part-5.avro File

Snapshotting – Loaders
24
 Custom InputFormat (MR)
 Uses the Index file to create Splits
 RecordReader merges partition-0 of Snapshot and
Delta
 Returns latest row from Delta if present
 Masks row if deleted
 Otherwise returns row from snapshot
 Pig Loader enables reading virtual snapshot via Pig
 Storage handler enables reading virtual snapshot via Hive

Snapshotting – Loaders (2)
25
(10 partitions, 10 Avro files)
_delta
Part-0
Part-9
Delta-1
Part-5
Part-0
Custom
InputFormat
Index files
Part-1
Part-2 . .
.
Mapper-0
Custom
InputFormat
Mapper-9
 Delta-1.Part-0 contains partitions 0 to 4
 Delta-2.Part-5 contains partitions 5 to 9
 Snapshot-0.Part-0 contains partition 0
 Both sorted on primary key

Snapshotting – Compactor
26
 Required when partition size exceeds threshold
 Materializes Virtual Snapshot to Snapshot
 With more partitions
 MR job with Reducer
Virtual
Snapshot
Builder
(HDFS)
Internal
Staging
Lazy
Snapshot
Builder
User Jobs
(HDFS)
Published
Virtual
Snapshot
MR/Pig/Hive
Loaders
Compactor
Change
Capture
Increments
Pre-
Process
Full Drops

Operating billions of rows per day
 Dude, where’s my row?
– Automatic Data validation
 When data misses the bus
– Handling late data
– Look back window
 Cluster downtime
– Restart-ability
– Active-active
– Idempotent processing
27

Conclusion and Future Work
 Conclusion
 Lumos : Scalable ETL framework
 Battle tested in production
 Future Work
 Unify Internal and External data
 Open source
28

Bringing OLTP woth OLAP: Lumos on Hadoop

More Related Content

What's hot (20)

Viewers also liked (20)

Similar to Bringing OLTP woth OLAP: Lumos on Hadoop (20)

More from DataWorks Summit (20)

Recently uploaded (20)

Bringing OLTP woth OLAP: Lumos on Hadoop

Editor's Notes