SlideShare a Scribd company logo
A ScyllaDB Community
ScyllaDB Status Report
Avi Kivity
CTO
Avi Kivity
■ Linux KVM
■ First line written
■ Seastar
■ First line written
■ ScyllaDB
■ First line written
■ 2024 Advances
■ 2025 Plans
Agenda
■ Maintenance & Operations
■ Security
■ Performance & Footprint
■ Reliability & Scaling
■ Features
Agenda
Maintenance & Operations
Maintenance & Operations
■ Native nodetool
■ Maintenance socket
■ Task Management
task_id type kind scope state sequence_number keyspace table entity shard start_time end_time
a9b96262-3544-4189-8712-ef28abe69fb5 upgrade sstables compaction node keyspace running 1303 keyspace1 0 2025-02-24T21:32:08Z
657148a9-46f1-4093-8a90-2f239ff5faf8 upgrade sstables compaction node keyspace done 1301 system_distributed_everywhere 0 2025-02-24T21:32:08Z 2025-02-24T21:32:08Z
aebdcb14-75dd-4360-a5f6-deb5e5f3d5fe upgrade sstables compaction node keyspace done 1300 system_auth 0 2025-02-24T21:32:08Z 2025-02-24T21:32:08Z
44bd1318-1c91-4f30-8596-c77a46efd30e upgrade sstables compaction node keyspace done 1275 system_schema 0 2025-02-24T21:31:43Z 2025-02-24T21:31:44Z
2be36997-5de6-4fcf-8044-0fc1d52bde4f upgrade sstables compaction node keyspace done 1270 system_distributed 0 2025-02-24T21:31:43Z 2025-02-24T21:31:43Z
Maintenance & Operations
■ Native backup/restore
■ Utilize full bandwidth via Seastar scheduler integration
■ Exercise S3 driver
■ Reduce Recovery Point Objective
■ Tablet-aware
■ Rack-aware
■ Also in load-and-stream
6TB, 6xi4i.8xlarge, with stress
test, 3TB backup size
Native ScyllaDB rclone
Backup/
Minute
00:30:32 04:02:09
P99 Latency Read 9.05 3.41
P99 Latency Write 6.04 3.17
Repair improvements
■ Small-table repair performance
■ Automatic repair scheduling
■ Avoid whack-a-mole
■ Incremental repair
■ Repair more frequently
Security
Security - Enterprise Features
■ Encryption at Rest
■ LDAP Authentication and Authorization
■ Audit
Security - New
■ Transitional Authentication
■ Alternator Role-Based Access Control (RBAC)
Performance & Footprint
LTO/PGO
■ Link-Time Optimization
■ Optimize the program as a whole rather
than a collection of modules
■ Profile-Guided Optimization
■ Optimize the program based on actual
behavior in a training run
■ 30% Improvement (combined)
■ Fewer/small nodes
■ More work!
Compression
■ Dictionary compression for RPC
■ Reduce cross-AZ cost
■ Dictionary compression for sstables
■ Reduce storage costs
90% Storage Utilization with Tablets
■ Tablets can migrate at line speed
■ Up to 10 GB/s on large instances
■ Schema-independent
■ Launch new instances just-in-time
■ Decommission them when unneeded
Time to finish Stream bandwidth Bytes on wire per
tablet
CPU Load
Mutation based
streaming
3003 seconds 100 MB/s 20090 MB 12%
File based
streaming
116 seconds 1000 MB/s 7280 MB 4%
Difference 25X 10X 2.75X 3X
Storage Tiering
■ Place cold data on S3
■ Cheap but slow
■ Data migration is transparent
■ Targeting time-series workloads
■ Using Time-Window Compaction Strategy
Reliability & Scaling
More system tables in Raft
■ Avoid repair and replication management chores
■ Avoid hot node problems
■ Avoid authentication problems during reduced availability periods
■ Moved: auth, service levels, CDC metadata
Coordinator-only (“zero-token”) nodes
■ Node that do not own any data
■ Symmetric -> two-tier architecture
■ Arbiter for two datacenters
✔ ✔ ✔
✔ ✔ ✔
✔
Non-Voting Nodes
■ Reduce Raft latency in large clusters
■ Maintain Quorum in asymmetric clusters
✔ ✔ ✔
✔ ✔ ✔
✔
Simplified Schema Healing
■ Replace merge-by-content with linear history
■ Eliminates stalls with large schemas
Materialized View Write Path Improvements
■ Improve throttling of base-table writes when view updates lag
■ Improve many-row DELETE performance
■ Capture more details
■ Roles and grants
■ Hashed passwords
■ CDC
DESCRIBE SCHEMA round-tripping
cqlsh> DESCRIBE SCHEMA
CREATE KEYSPACE ks1 WITH replication = {'class': 'org.apache.cassandra.locator.NetworkTopologyStrategy',
'datacenter1': '1'} AND durable_writes = true AND tablets = {'enabled': true};
CREATE TABLE ks1.t1 (
id int,
c int,
s1 set<int>,
s2 set<int>,
s3 frozen<set<int>>,
PRIMARY KEY (id)
) WITH bloom_filter_fp_chance = 0.01
AND caching = {'keys': 'ALL', 'rows_per_partition': 'ALL'}
AND comment = ''
AND compaction = {'class': 'IncrementalCompactionStrategy'}
AND compression = {'sstable_compression': 'org.apache.cassandra.io.compress.LZ4Compressor'}
AND crc_check_chance = 1
AND default_time_to_live = 0
AND gc_grace_seconds = 864000
AND max_index_interval = 2048
AND memtable_flush_period_in_ms = 0
AND min_index_interval = 128
AND speculative_retry = '99.0PERCENTILE'
AND tombstone_gc = {'mode': 'timeout', 'propagation_delay_in_seconds': '3600'};
CREATE TABLE ks1.t2 (
pk int,
ck int,
c int,
PRIMARY KEY (pk, ck)
) WITH CLUSTERING ORDER BY (ck ASC)
AND bloom_filter_fp_chance = 0.01
AND caching = {'keys': 'ALL', 'rows_per_partition': 'ALL'}
■ Internal addressing of nodes using UUID
■ Important when IP addresses are ephemeral
■ Kubernetes
Reduced Reliance on IP Addresses
■ Dynamically adjust Bloom filter memory
■ Drop filters during high memory demand
■ Reload filters when memory is available
■ Rebuild filter for optimal size
Bloom Filter Dynamic Memory
■ Reduces chance of bitrot
■ Mostly on non-compressed sstables
■ Compressed sstables already protected
More SSTable Checksum Validation
CQL
CQL Features
■ WHERE … NOT IN
■ SELECT … set_intersection(set1, set2)
■ SELECT … USING SERVICE LEVEL = ‘...’
vector<>
ALTER TABLE tab ADD embedding vector<float, 768>
■ Indexing = work in progress
■ Offload to dedicated search engine
■ Use GPU when available
■ Avoid scaling problems
Tablet plans
Tablets, Tablets, & Tablets
■ Complete feature integration with tablets
■ Materialized views (simple case)
■ Materialized views (complex cases)
■ Change Data Capture (CDC)
■ Lightweight Transactions (LWT)
■ Counter columns
■ Leverage fast scaling to reduce TCO
■ 90% storage utilization
■ Improve load-balancing algorithms
■ In-place migration from vnodes
Strong consistency
model
Strong consistency model
■ Leverage previous Raft work for data plane consistency
Stay in Touch
Avi Kivity
avi@scylladb.com
@AviKivity
@avikivity

More Related Content

PDF
Highload Perf Tuning
HighLoad2009
 
PDF
Leveraging Cassandra for real-time multi-datacenter public cloud analytics
Julien Anguenot
 
PDF
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
DataStax Academy
 
PPTX
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB
 
PPTX
Lightweight Transactions at Lightning Speed
ScyllaDB
 
PDF
Build an High-Performance and High-Durable Block Storage Service Based on Ceph
Rongze Zhu
 
KEY
Varnish @ Velocity Ignite
Artur Bergman
 
PDF
MEETUP - Unboxing Apache Cassandra 3.10
Erick Ramirez
 
Highload Perf Tuning
HighLoad2009
 
Leveraging Cassandra for real-time multi-datacenter public cloud analytics
Julien Anguenot
 
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
DataStax Academy
 
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB
 
Lightweight Transactions at Lightning Speed
ScyllaDB
 
Build an High-Performance and High-Durable Block Storage Service Based on Ceph
Rongze Zhu
 
Varnish @ Velocity Ignite
Artur Bergman
 
MEETUP - Unboxing Apache Cassandra 3.10
Erick Ramirez
 

Similar to ScyllaDB’s Monstrous Engineering Advances by Avi Kivity (20)

PDF
To Serverless and Beyond
ScyllaDB
 
PPTX
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...
DataWorks Summit/Hadoop Summit
 
PPTX
High Performance, High Reliability Data Loading on ClickHouse
Altinity Ltd
 
PPTX
Percona Live UK 2014 Part III
Alkin Tezuysal
 
PDF
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
ScyllaDB
 
PDF
Hochverfügbarkeitslösungen mit MariaDB
MariaDB plc
 
PDF
Oracle Database In-Memory Option in Action
Tanel Poder
 
PDF
In Memory Database In Action by Tanel Poder and Kerry Osborne
Enkitec
 
PPTX
Scylla Summit 2018: Introducing ValuStor, A Memcached Alternative Made to Run...
ScyllaDB
 
PDF
x86-less ScyllaDB: Exploring an All-ARM Cluster
ScyllaDB
 
PPTX
Apache Cassandra at the Geek2Geek Berlin
Christian Johannsen
 
PPT
11thingsabout11g 12659705398222 Phpapp01
Karam Abuataya
 
PPT
11 Things About11g
fcamachob
 
PPTX
Devops kc
Philip Thompson
 
PPTX
Meeting the challenges of OLTP Big Data with Scylla
ScyllaDB
 
PDF
Dissecting Real-World Database Performance Dilemmas
ScyllaDB
 
PDF
DataStax: Extreme Cassandra Optimization: The Sequel
DataStax Academy
 
PDF
Troubleshooting Complex Performance issues - Oracle SEG$ contention
Tanel Poder
 
PDF
Brkdct 3101
Nguyen Van Linh
 
PDF
Replacing Your Cache with ScyllaDB by Felipe Cardeneti Mendes and Tomasz Grabiec
ScyllaDB
 
To Serverless and Beyond
ScyllaDB
 
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...
DataWorks Summit/Hadoop Summit
 
High Performance, High Reliability Data Loading on ClickHouse
Altinity Ltd
 
Percona Live UK 2014 Part III
Alkin Tezuysal
 
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
ScyllaDB
 
Hochverfügbarkeitslösungen mit MariaDB
MariaDB plc
 
Oracle Database In-Memory Option in Action
Tanel Poder
 
In Memory Database In Action by Tanel Poder and Kerry Osborne
Enkitec
 
Scylla Summit 2018: Introducing ValuStor, A Memcached Alternative Made to Run...
ScyllaDB
 
x86-less ScyllaDB: Exploring an All-ARM Cluster
ScyllaDB
 
Apache Cassandra at the Geek2Geek Berlin
Christian Johannsen
 
11thingsabout11g 12659705398222 Phpapp01
Karam Abuataya
 
11 Things About11g
fcamachob
 
Devops kc
Philip Thompson
 
Meeting the challenges of OLTP Big Data with Scylla
ScyllaDB
 
Dissecting Real-World Database Performance Dilemmas
ScyllaDB
 
DataStax: Extreme Cassandra Optimization: The Sequel
DataStax Academy
 
Troubleshooting Complex Performance issues - Oracle SEG$ contention
Tanel Poder
 
Brkdct 3101
Nguyen Van Linh
 
Replacing Your Cache with ScyllaDB by Felipe Cardeneti Mendes and Tomasz Grabiec
ScyllaDB
 
Ad

More from ScyllaDB (20)

PDF
Understanding The True Cost of DynamoDB Webinar
ScyllaDB
 
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
PDF
New Ways to Reduce Database Costs with ScyllaDB
ScyllaDB
 
PDF
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
PDF
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
ScyllaDB
 
PDF
Leading a High-Stakes Database Migration
ScyllaDB
 
PDF
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
ScyllaDB
 
PDF
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
ScyllaDB
 
PDF
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
ScyllaDB
 
PDF
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB
 
PDF
ScyllaDB: 10 Years and Beyond by Dor Laor
ScyllaDB
 
PDF
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
ScyllaDB
 
PDF
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
ScyllaDB
 
PDF
Vector Search with ScyllaDB by Szymon Wasik
ScyllaDB
 
PDF
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
ScyllaDB
 
PDF
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
ScyllaDB
 
PDF
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
ScyllaDB
 
PDF
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
ScyllaDB
 
PDF
Lessons Learned from Building a Serverless Notifications System by Srushith R...
ScyllaDB
 
Understanding The True Cost of DynamoDB Webinar
ScyllaDB
 
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
New Ways to Reduce Database Costs with ScyllaDB
ScyllaDB
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
ScyllaDB
 
Leading a High-Stakes Database Migration
ScyllaDB
 
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
ScyllaDB
 
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
ScyllaDB
 
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
ScyllaDB
 
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB
 
ScyllaDB: 10 Years and Beyond by Dor Laor
ScyllaDB
 
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
ScyllaDB
 
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
ScyllaDB
 
Vector Search with ScyllaDB by Szymon Wasik
ScyllaDB
 
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
ScyllaDB
 
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
ScyllaDB
 
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
ScyllaDB
 
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
ScyllaDB
 
Lessons Learned from Building a Serverless Notifications System by Srushith R...
ScyllaDB
 
Ad

Recently uploaded (20)

PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
PDF
Software Development Methodologies in 2025
KodekX
 
PDF
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
PDF
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
Software Development Methodologies in 2025
KodekX
 
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 

ScyllaDB’s Monstrous Engineering Advances by Avi Kivity

  • 1. A ScyllaDB Community ScyllaDB Status Report Avi Kivity CTO
  • 2. Avi Kivity ■ Linux KVM ■ First line written ■ Seastar ■ First line written ■ ScyllaDB ■ First line written
  • 3. ■ 2024 Advances ■ 2025 Plans Agenda
  • 4. ■ Maintenance & Operations ■ Security ■ Performance & Footprint ■ Reliability & Scaling ■ Features Agenda
  • 6. Maintenance & Operations ■ Native nodetool ■ Maintenance socket ■ Task Management task_id type kind scope state sequence_number keyspace table entity shard start_time end_time a9b96262-3544-4189-8712-ef28abe69fb5 upgrade sstables compaction node keyspace running 1303 keyspace1 0 2025-02-24T21:32:08Z 657148a9-46f1-4093-8a90-2f239ff5faf8 upgrade sstables compaction node keyspace done 1301 system_distributed_everywhere 0 2025-02-24T21:32:08Z 2025-02-24T21:32:08Z aebdcb14-75dd-4360-a5f6-deb5e5f3d5fe upgrade sstables compaction node keyspace done 1300 system_auth 0 2025-02-24T21:32:08Z 2025-02-24T21:32:08Z 44bd1318-1c91-4f30-8596-c77a46efd30e upgrade sstables compaction node keyspace done 1275 system_schema 0 2025-02-24T21:31:43Z 2025-02-24T21:31:44Z 2be36997-5de6-4fcf-8044-0fc1d52bde4f upgrade sstables compaction node keyspace done 1270 system_distributed 0 2025-02-24T21:31:43Z 2025-02-24T21:31:43Z
  • 7. Maintenance & Operations ■ Native backup/restore ■ Utilize full bandwidth via Seastar scheduler integration ■ Exercise S3 driver ■ Reduce Recovery Point Objective ■ Tablet-aware ■ Rack-aware ■ Also in load-and-stream 6TB, 6xi4i.8xlarge, with stress test, 3TB backup size Native ScyllaDB rclone Backup/ Minute 00:30:32 04:02:09 P99 Latency Read 9.05 3.41 P99 Latency Write 6.04 3.17
  • 8. Repair improvements ■ Small-table repair performance ■ Automatic repair scheduling ■ Avoid whack-a-mole ■ Incremental repair ■ Repair more frequently
  • 10. Security - Enterprise Features ■ Encryption at Rest ■ LDAP Authentication and Authorization ■ Audit
  • 11. Security - New ■ Transitional Authentication ■ Alternator Role-Based Access Control (RBAC)
  • 13. LTO/PGO ■ Link-Time Optimization ■ Optimize the program as a whole rather than a collection of modules ■ Profile-Guided Optimization ■ Optimize the program based on actual behavior in a training run ■ 30% Improvement (combined) ■ Fewer/small nodes ■ More work!
  • 14. Compression ■ Dictionary compression for RPC ■ Reduce cross-AZ cost ■ Dictionary compression for sstables ■ Reduce storage costs
  • 15. 90% Storage Utilization with Tablets ■ Tablets can migrate at line speed ■ Up to 10 GB/s on large instances ■ Schema-independent ■ Launch new instances just-in-time ■ Decommission them when unneeded Time to finish Stream bandwidth Bytes on wire per tablet CPU Load Mutation based streaming 3003 seconds 100 MB/s 20090 MB 12% File based streaming 116 seconds 1000 MB/s 7280 MB 4% Difference 25X 10X 2.75X 3X
  • 16. Storage Tiering ■ Place cold data on S3 ■ Cheap but slow ■ Data migration is transparent ■ Targeting time-series workloads ■ Using Time-Window Compaction Strategy
  • 18. More system tables in Raft ■ Avoid repair and replication management chores ■ Avoid hot node problems ■ Avoid authentication problems during reduced availability periods ■ Moved: auth, service levels, CDC metadata
  • 19. Coordinator-only (“zero-token”) nodes ■ Node that do not own any data ■ Symmetric -> two-tier architecture ■ Arbiter for two datacenters ✔ ✔ ✔ ✔ ✔ ✔ ✔
  • 20. Non-Voting Nodes ■ Reduce Raft latency in large clusters ■ Maintain Quorum in asymmetric clusters ✔ ✔ ✔ ✔ ✔ ✔ ✔
  • 21. Simplified Schema Healing ■ Replace merge-by-content with linear history ■ Eliminates stalls with large schemas
  • 22. Materialized View Write Path Improvements ■ Improve throttling of base-table writes when view updates lag ■ Improve many-row DELETE performance
  • 23. ■ Capture more details ■ Roles and grants ■ Hashed passwords ■ CDC DESCRIBE SCHEMA round-tripping cqlsh> DESCRIBE SCHEMA CREATE KEYSPACE ks1 WITH replication = {'class': 'org.apache.cassandra.locator.NetworkTopologyStrategy', 'datacenter1': '1'} AND durable_writes = true AND tablets = {'enabled': true}; CREATE TABLE ks1.t1 ( id int, c int, s1 set<int>, s2 set<int>, s3 frozen<set<int>>, PRIMARY KEY (id) ) WITH bloom_filter_fp_chance = 0.01 AND caching = {'keys': 'ALL', 'rows_per_partition': 'ALL'} AND comment = '' AND compaction = {'class': 'IncrementalCompactionStrategy'} AND compression = {'sstable_compression': 'org.apache.cassandra.io.compress.LZ4Compressor'} AND crc_check_chance = 1 AND default_time_to_live = 0 AND gc_grace_seconds = 864000 AND max_index_interval = 2048 AND memtable_flush_period_in_ms = 0 AND min_index_interval = 128 AND speculative_retry = '99.0PERCENTILE' AND tombstone_gc = {'mode': 'timeout', 'propagation_delay_in_seconds': '3600'}; CREATE TABLE ks1.t2 ( pk int, ck int, c int, PRIMARY KEY (pk, ck) ) WITH CLUSTERING ORDER BY (ck ASC) AND bloom_filter_fp_chance = 0.01 AND caching = {'keys': 'ALL', 'rows_per_partition': 'ALL'}
  • 24. ■ Internal addressing of nodes using UUID ■ Important when IP addresses are ephemeral ■ Kubernetes Reduced Reliance on IP Addresses
  • 25. ■ Dynamically adjust Bloom filter memory ■ Drop filters during high memory demand ■ Reload filters when memory is available ■ Rebuild filter for optimal size Bloom Filter Dynamic Memory
  • 26. ■ Reduces chance of bitrot ■ Mostly on non-compressed sstables ■ Compressed sstables already protected More SSTable Checksum Validation
  • 27. CQL
  • 28. CQL Features ■ WHERE … NOT IN ■ SELECT … set_intersection(set1, set2) ■ SELECT … USING SERVICE LEVEL = ‘...’
  • 29. vector<> ALTER TABLE tab ADD embedding vector<float, 768> ■ Indexing = work in progress ■ Offload to dedicated search engine ■ Use GPU when available ■ Avoid scaling problems
  • 31. Tablets, Tablets, & Tablets ■ Complete feature integration with tablets ■ Materialized views (simple case) ■ Materialized views (complex cases) ■ Change Data Capture (CDC) ■ Lightweight Transactions (LWT) ■ Counter columns ■ Leverage fast scaling to reduce TCO ■ 90% storage utilization ■ Improve load-balancing algorithms ■ In-place migration from vnodes
  • 33. Strong consistency model ■ Leverage previous Raft work for data plane consistency
  • 34. Stay in Touch Avi Kivity [email protected] @AviKivity @avikivity