Learn about Agoda's performance tuning strategies for ScyllaDB. Worakarn shares how they optimized disk performance, fine-tuned compaction strategies, and adjusted SSTable settings to match their workload for peak efficiency.
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
1. A ScyllaDB Community
How Agoda Scaled 50x
Throughput with ScyllaDB
Worakarn Isaratham
Lead Software Engineer
2. Worakarn Isaratham (he/him)
■ Lead Software Engineer, Agoda
■ Based in Bangkok, Thailand
■ Experience in distributed computing,
software testing
■ Interested in dependable software systems
3. Presentation Agenda
■ ScyllaDB in Agoda Feature Store
■ Capacity Problem
■ Potential Solutions
5. Online Feature Serving
Components: Client SDK, cache, app servers, ScyllaDB
■ Traffic: 3.5M EPS, 1.7M EPS, and 200K EPS at successive stages
■ P99 latency: 5 ms / 8 ms
■ Average of 5 features per entity
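The serving path on this slide, a client SDK with a cache in front of ScyllaDB, can be sketched as a read-through cache. This is a minimal illustration; all class and function names here are hypothetical, not Agoda's actual SDK.

```python
# Read-through cache sketch for the feature-serving path: the client
# checks an in-process cache first and only falls through to the
# backing store (ScyllaDB in the talk) on a miss.
class FeatureClient:
    def __init__(self, store):
        self.store = store      # backing store, e.g. a ScyllaDB session wrapper
        self.cache = {}         # in-memory cache: entity_id -> features
        self.store_reads = 0    # how many requests fell through to the store

    def get_features(self, entity_id):
        if entity_id in self.cache:
            return self.cache[entity_id]
        self.store_reads += 1
        features = self.store(entity_id)   # cache miss: read from the store
        self.cache[entity_id] = features   # populate cache for later requests
        return features

# Usage: a fake store; repeated reads of the same entity hit the cache.
client = FeatureClient(store=lambda eid: {"f1": 1.0, "f2": 2.0})
client.get_features("user-42")
client.get_features("user-42")
assert client.store_reads == 1   # second read served from cache
```

This is why ScyllaDB sees far fewer events per second than the client SDK: most lookups are absorbed by the cache layer.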
6. Growth
Since the start of 2023:
■ Server traffic: 50x
Peak server traffic, on the busiest DC
7. Growth
Since the start of 2023:
■ Server traffic: 50x
■ ScyllaDB traffic: 10x (10K EPS)
Peak ScyllaDB traffic, on the busiest DC
8. A Capacity Problem
■ A new use case wanted to onboard
■ Problematic usage patterns:
■ Bursty traffic from a cold cache, hitting ScyllaDB at 120K EPS
■ Many duplicated requests in very quick succession
■ Kept retrying any failed requests
That was 12x the load then, and 2x the load now!
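The duplicated-requests part of this pattern is often mitigated client-side by coalescing identical lookups issued in quick succession. A minimal sketch, assuming a short reuse window; the class and names are illustrative, not Agoda's client.

```python
import time

class CoalescingClient:
    """Collapse duplicate requests for the same key issued in quick
    succession, so a cold-cache burst does not multiply load on the
    database. Illustrative sketch only."""
    def __init__(self, fetch, ttl=0.1):
        self.fetch = fetch
        self.ttl = ttl          # reuse window in seconds
        self.recent = {}        # key -> (timestamp, result)
        self.fetches = 0        # how many requests actually hit the backend

    def get(self, key):
        now = time.monotonic()
        hit = self.recent.get(key)
        if hit and now - hit[0] < self.ttl:
            return hit[1]       # duplicate within the window: reuse result
        self.fetches += 1
        result = self.fetch(key)
        self.recent[key] = (now, result)
        return result

# Usage: a burst of 10 identical lookups reaches the backend only once.
client = CoalescingClient(fetch=lambda k: k.upper())
for _ in range(10):
    client.get("entity-1")
assert client.fetches == 1
```

A production version would also coalesce concurrent in-flight requests and cap retries with backoff, both of which matter for the retry storm described above.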
9. A Capacity Problem
■ One DC was able to survive this load without errors.
■ The other DC had serious problems:
■ Very high error rate
■ Took 40 minutes to finish all the retries
■ Metrics pointed to slow reads on ScyllaDB nodes
10. Slow Disks
                 Bad DC            Good DC           Advantage
Disks            SATA SSD, RAID 0  NVMe SSD, RAID 0
Read IOPS        6,868             79,566            11.6x
Read bandwidth   1.5 GB/s          10.1 GB/s         6.7x
Write IOPS       6,615             41,104            6.2x
Write bandwidth  1.9 GB/s          6.3 GB/s          3.3x
11. Just Buy New Disks?
● New disks were ordered.
● Improved user-side caching reduced this load to 7K EPS.
● How long could we survive on the current capacity?
12. Cache-Avoiding Load Test
■ Use artificial, one-time-used load to avoid ScyllaDB caching:
■ Query one-time-used entities with BYPASS CACHE
■ Flush and restart ScyllaDB between runs
■ Result: 25K EPS under normal load (ScyllaDB cache) vs 5K EPS baseline for SATA
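The test above can be sketched as generating reads for freshly minted entity IDs that skip the row cache. `BYPASS CACHE` is real ScyllaDB CQL; the keyspace, table, and column names below are hypothetical.

```python
import uuid

# Build a ScyllaDB read that skips the row cache, as in the talk's
# cache-avoiding load test. Every query targets a one-time-used
# entity ID, so neither the application cache nor ScyllaDB's row
# cache can serve it.
def one_time_query(table="features.feature_store"):
    entity_id = uuid.uuid4().hex    # fresh entity, never queried before
    query = (f"SELECT * FROM {table} "
             f"WHERE entity_id = '{entity_id}' BYPASS CACHE")
    return query, entity_id

q1, e1 = one_time_query()
q2, e2 = one_time_query()
assert q1.endswith("BYPASS CACHE")
assert e1 != e2                     # every query targets a fresh entity
```

A real load generator would use prepared statements with bound parameters rather than string interpolation; the sketch only shows the shape of the query.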
13. Idea 1: Different Data Modeling
Current: one tall table
Alternative: one table per feature set
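The two models can be made concrete with hypothetical CQL schemas; the keyspace, table, and column names are illustrative, not Agoda's actual schema.

```python
# Current model: one tall table, one row per (entity, feature) pair.
# Reading all features for an entity scans one partition's rows.
tall_table = """
CREATE TABLE features.tall (
    entity_id    text,
    feature_name text,
    value        blob,
    PRIMARY KEY (entity_id, feature_name)
)"""

# Alternative model: one table per feature set, one row per entity,
# with each feature as its own column.
per_feature_set = """
CREATE TABLE features.pricing_set (
    entity_id text PRIMARY KEY,
    f1        double,
    f2        double
)"""
```

The trade-off is flexibility (the tall table accepts any feature name) versus read shape (the per-set table returns all of a set's features in a single row).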
15. Idea 2: Change Compaction Strategy
■ Our workload is "read-mostly, many updates", for which Leveled compaction is recommended.
■ Size-tiered compaction (current): slow disk reads, large SSTable files
■ Leveled compaction: prioritizes read latency
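Compaction strategy is a per-table setting changed with standard CQL. A hypothetical example, assuming a table named `features.feature_store`:

```python
# Statement to switch an existing table from Size-tiered to Leveled
# compaction. The ALTER TABLE ... WITH compaction syntax is standard
# CQL; the table name is illustrative.
alter = ("ALTER TABLE features.feature_store WITH compaction = "
         "{'class': 'LeveledCompactionStrategy'}")
assert "LeveledCompactionStrategy" in alter
```

Note that changing the strategy triggers recompaction of existing SSTables, which is itself I/O-intensive on already-slow disks.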
17. Idea 3: Increase Summary File Size
■ ScyllaDB uses summary files to help navigate to index files.
■ summary file size ≈ data file size × summary ratio
■ Higher ratio → larger summary → more efficient index lookups → less disk I/O
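The relation on the slide is simple arithmetic: the summary grows linearly with the ratio. The ratios and file size below are made-up illustrative numbers, not measurements from the talk.

```python
import math

# Estimate summary file size from the slide's relation:
#   summary_size ≈ data_size × summary_ratio
def summary_size(data_size_bytes, summary_ratio):
    return data_size_bytes * summary_ratio

data = 100 * 1024**3                   # a 100 GiB SSTable data file
small = summary_size(data, 0.00005)    # low ratio -> tiny, coarse summary
large = summary_size(data, 0.0005)     # 10x ratio -> 10x larger summary
assert math.isclose(large, 10 * small)
```

The cost is memory: a larger summary stays resident, trading RAM for fewer index-file reads on each lookup.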
20. Rollout
■ Jul 2023: new summary ratio applied
■ Oct 2023: migrated to NVMe disks
■ Leveled compaction: only applied to a new table; existing data needs migration
Focus has since shifted to other components; still trying out new ideas on ScyllaDB.
21. Recent Experiments
● Partitioned by feature set, clustered by entity
○ Disastrous! 400x worse
● All features as a blob in a single row
○ +35% throughput
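The "blob in a single row" experiment can be sketched as serializing all of an entity's features into one value, so a read is a single-row lookup instead of one row per feature. JSON here stands in for whatever binary encoding a real system would use; the names are illustrative.

```python
import json

# Pack all of an entity's features into one serialized blob for
# storage in a single ScyllaDB column, and unpack on read.
def pack(features: dict) -> bytes:
    return json.dumps(features, sort_keys=True).encode()

def unpack(blob: bytes) -> dict:
    return json.loads(blob.decode())

features = {"price": 120.0, "rating": 4.5, "rooms": 2}
blob = pack(features)
assert unpack(blob) == features   # round-trips losslessly
assert isinstance(blob, bytes)    # stored in a single blob column
```

The trade-off is that updating one feature requires rewriting the whole blob, which fits a read-mostly workload like the one described earlier.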
22. Lessons
● Fast disks are essential!
● Benchmark your own load
● Tailor your data model to your needs