SlideShare a Scribd company logo
How RightScale Architects
            Its Databases

(for Worldwide Scale, HA and DR Scenarios)

January 30, 2013
              Watch the recording of this webinar

                                                    #rightscale
2#



Your Panel Today
Presenting
• Rafael H. Saavedra, VP Engineering, RightScale
• Josep Blanquer, Chief Architect, RightScale

Q&A
• Jared Marcell, Account Manager, RightScale
• David Manriquez, Account Manager, RightScale




Please use the “Questions” window to ask questions any time!

                                                     #rightscale
3#




       Menu
        Intro
  Data Taxonomy
Data Storage Design
   Scale, HA and DR

    Conclusion




                      #rightscale
4#



Intro: Expectations and scope
                  What this is and what is not
• IS a talk about:
    • how RightScale has designed and implemented its backing datastores
    • …for a few of the most representative internal systems
    • …with the rationale behind it
• Is NOT a talk about
    • RightScale’s overall architecture
    • Nodes or hosts, it’s about Systems
    • RightScale’s data modeling

 Note: Most of the design is implemented and in production but some of the
   most advanced things that are still in beta, or are still being worked on


                                                                  #rightscale
5#



Intro: Tools and Technologies
• RightScale uses a mix of RDBMS and NoSQL technologies:
   • MySQL , Cassandra and S3 (for backups and archiving)


• Transactionality:
   • MySQL: strong ACID properties
   • Cassandra: no Atomicity, eventually Consistent, some Isolation,
     Durable
• Availability:
   • MySQL: async replication. Master-SlaveN or Master-Master
   • Cassandra: Distributed, master-less, highly-replicated (multi-DC)
• Sharding:
   • MySQL: no explicit inter-node tools. (Sharding done by application)
   • Cassandra: partitions data internally across nodes.

                                                                   #rightscale
6#



Glossary: Examples we will use
    Marketplace Assets
                             Configuration data objects that are
       RightScripts
                             user-generated, private or shared
     ServerTemplates
                             Resource data that drives automation and
           Tags
                             reporting
                             Data used to communicate recent events and
          Events
                             news feeds to users
                             Data that records actions and states of external
 Cloud Polling and Gateway
                             API-linked services
                             Data used to locate and transport messages
         Routing
                             across instances and/or our services
                             Infrastructure monitoring data recorded and
        Monitoring
                             presented on behalf of users



                                                                     #rightscale
7#



Taxonomy of RightScale’s Data
    Representative systems
 with different data semantics:
      Global Objects
          Marketplace Assets
      Dashboard Objects
          Audits
          Tags
          Recent Events
      Cloud Polling Data
      Routing Data
      Monitoring/Syslog
                                  #rightscale
8#



Taxonomy of RightScale’s Data
    Representative systems
 with different data semantics:
      Global Objects
          Marketplace Assets     Common across accounts:
                                   Users
      Dashboard Objects            Account Plans
          Audits                  Settings
                                   MultiCloud Marketplace:
          Tags
                                         Published Assets
          Recent Events                 Sharing Groups
                                         …
      Cloud Polling Data
      Routing Data
      Monitoring/Syslog
                                                              #rightscale
9#



Taxonomy of RightScale’s Data
    Representative systems
 with different data semantics:
      Global Objects
          Marketplace Assets
      Dashboard Objects
          Audits                 Private to each account:
          Tags                    Deployments
                                   Imported assets
          Recent Events           Alert Specifications
                                   Server Inputs
      Cloud Polling Data
                                     Audit
      Routing Data                   Tags
                                     User Events
      Monitoring/Syslog              …

                                                             #rightscale
10#



Taxonomy of RightScale’s Data
    Representative systems
 with different data semantics:
      Global Objects
          Marketplace Assets
      Dashboard Objects
          Audits               Private to each account:
          Tags                  Cloud resource states (cache)
                                 Cloud credentials
          Recent Events
      Cloud Polling Data
      Routing Data
      Monitoring/Syslog
                                                                  #rightscale
11#



Taxonomy of RightScale’s Data
    Representative systems
 with different data semantics:
      Global Objects
          Marketplace Assets
      Dashboard Objects
          Audits
                                  Private to each account:
          Tags                    Instance agents location
          Recent Events           Core agents location
                                   Agent action registry
      Cloud Polling Data           …

      Routing Data
      Monitoring/Syslog
                                                               #rightscale
12#



Taxonomy of RightScale’s Data
    Representative systems
 with different data semantics:
      Global Objects
          Marketplace Assets
      Dashboard Objects
          Audits
          Tags
          Recent Events
                                  Private to each account:
      Cloud Polling Data           Collected metric data
                                   Collected syslog data
      Routing Data                 …


      Monitoring/Syslog
                                                             #rightscale
13#



Taxonomy of RightScale’s Data

                                      Which data do we need?
            Global Objects
  X-acct




                                      •   Data for all accounts
               Marketplace Assets    •   Data for a single account

            Dashboard Objects         Data shared between accounts
               Audits
               Tags
                                     Data scope and containment
  Account




               Recent Events
            Cloud Polling Data        Data required within scope
                                      of a single account
            Routing Data
            Monitoring/Syslog
                                                             #rightscale
14#



Taxonomy of RightScale’s Data

                                     Who uses the data?
            Global Objects           •   Users through the Dash/API
               Marketplace Assets   •   Instances from the Cloud
Users




            Dashboard Objects
               Audits
                                     Data close to the Users
               Tags
               Recent Events
                                             Data Placement
            Cloud Polling Data
Instances




            Routing Data             Data close to the Cloud
            Monitoring/Syslog
                                                           #rightscale
15#



Taxonomy of RightScale’s Data
                      Who uses the data? Proximity to User vs. Cloud
                       Which data do we need? Scope of data available


                      Global Objects
            X-acct




                                                               Close to user
                           Marketplace Assets                 Globally accessible data
Users




                      Dashboard Objects
                           Audits                             Close to user
                           Tags                               Account-shardable data
            Account




                           Recent Events
                      Cloud Polling Data
Instances




                                                               Close to cloud resources
                      Routing Data                             Account-shardable* data

                      Monitoring/Syslog
                                                                                     #rightscale
16#

            X-Account   Account
Users
Instances




                                  #rightscale
17#

            X-Account


                                                   Why custom? More control
                                                   •   Multiple sources
Users




              global
                                                   •   Individual columns
                              Custom replication   •   Apply transformations
                                                   •   Smart re-sync features




             Global: MySQL
              • ACID semantics
              • Master-Slave replication
Instances




                                                                                #rightscale
18#

            X-Account              Account
                                                                        Data archive: S3
                                                        S3               • Low read rate
                                     tags                                • Globally accessible
Users




              global    dash                  audit

                                                         Other systems: Cassandra
                                    events                   •   Simpler Key-Value access
                                                             •   Great scalability
                                                             •   Great replica control
                                                             •   High write availability
                        Dashboard: MySQL                     •   Time-to-live expiration as cache
                        •   ACID semantics                   •   Rows tagged by account
                        •   Master-SlaveN replication
                        •   Slave reads
Instances




                        •   Rows tagged by account




                                                                                     #rightscale
19#

            X-Account          Account

                                                  S3
                                tags                                   tags
Users




              global    dash             audit            dash                    audit


                               events                                 events




                                  So we can horizontally scale our
                                  dashboard by partitioning objects
                                     based on account groups:

                                          Clusters
Instances




                                                                              #rightscale
20#

                           Account


                                              S3                                          S3                                          S3
                             tags                                       tags                                         tags
        Cluster 1




                                                   Cluster 3




                                                                                                    Cluster
                    dash              audit                    dash               audit
                                                                                               …              dash           audit




                                                                                                       N
                             event                                      event                                        event
                               s                                          s                                            s



                                                                                               Features:
Users




                                                                                                   • 1 cluster: N accounts
                                                                                                   • 1 account: 1 home
                                                               RightScale Accounts                 • Migratable accounts

                                                                                               Benefits:
                                                                                                   • Great horizontal growth
                                                                  Account Set 2                    • Better failure isolation
                      Account Set 1
                                                                                                   • Independent scale
                                                                                                   • Load rebalancing
                                                                                                   • Versionable code
                                                                                                   • Differentiated service



                                                                                                                       #rightscale
21#

            X-Account              Account

                                                       S3
                                    tags                            tags
Users




              global    dash                   audit        dash               audit


                                   events                          events




                                             polling
Instances




                               monitor


                                             routing



                                                                           #rightscale
22#

            X-Account            Account

                                                       S3
                                  tags
                        dash And partition our cloud objects based on the cloud
Users




              global                       audit
                                       the instances of an account run on:

                                 events              Islands

                                           polling                           polling
Instances




                             monitor                           monitor


                                           routing                           routing



                                                                               #rightscale
23#

                                        Account


                              polling                            polling
                                                                 polling                                 polling
            Polling Clouds: MySQL                                                    Monitoring: Custom
            •    Master-Slave replication                                            • Replicated files




                                                                                              Island N
                Island 1




                                                      Island 2
            •    Can port to NoSQL easily
                                   monitor                                 monitor
                                                                           monitor
                                                                                     • Backup to S3                monitor
            •    Mostly a resource cache                                             • Archive to S3
            •    But cloud partitionable
                              routing                            routing
                                                                 routing                                 routing
Instances




                                                                                      Features:
                                                                                        • 1 instance: 1 home island
                                                                                        • 1 Island can serve N clouds
                                                                                        • Core Agents: global data
                                                  Routing: Cassandra
                                                  •   Simpler Key-Value access        Benefits:
                                                  •   Very high availability            • Close to cloud resources
                           Services co-located    •      Services co-located
                                                      Great scalability                 • Good failure isolation
                                                                                                 Services co-located
                             with resources       •         with resources
                                                      Great replica control                   • As good resources
                                                                                                    with as cloud 
                                                  •   Plus cross DC replication*        • Good scale: global replicas
                                                                                          across Cassandra DCs
                                   Cloud 1                       Cloud 2                                Cloud N



                                                                                                                   #rightscale
24#

                                    Account


                                                           S3                                                     S3                                                         S3
                                         tags                                            tags                                                             tags
            Cluster 1




                                                                Cluster 3
Users




                                                                                                                           Cluster
                        dash                    audit                        dash                         audit
                                                                                                                       …             dash                         audit




                                                                                                                              N
                                        event                                            event                                                            event
                                          s                                                s                                                                s



                                                                  Differentif the cloud
                                                                     What Geographies
                                                                    where the cluster
                                                                     is deployed on…

                                    polling
                                                                                    Fails?
                                                                               polling                                                          polling
Instances




                                                                                                                                     Island N
                         Island 1




                                                                 Island 2




                                                 monitor                                        monitor                                                           monitor




                                    routing                                    routing                                                          routing




                                                                            Different Clouds
                                                                                                                                                            #rightscale
25#

                                    Account
                                                    Sister Clusters

                                                             S3                                         S3                                               S3
                                         tags                                           tags                                          tags
            Cluster 1




                                                                  Cluster 3
Users




                                                                                                                 Cluster
                                                audit Full   replica
                        dash                                                   dash             audit
                                                                                                             …             dash               audit




                                                                                                                    N
                                        event                                           event                                         event
                                          s                                               s                                             s




                                                                   Features:
                                                                              • Each master has an extra remote slave
                                                                              • Each cluster in a pair is a DC replica of the other’s
                                    polling                                       polling                                polling
Instances




                                                                                localring




                                                                                                                           Island N
                         Island 1




                                                                   Island 2




                                                 monitor           At Disaster Recovery time:
                                                                               monitor                                                        monitor

                                                                              • Apps are told to start serving an extra shard
                                    routing                                   • No need to provision more infrastructure to recover
                                                                                  routing                                routing

                                                                                (try to avoid since everybody is on the same boat)
                                                                              • New resources can be allocated over time to help
                                                                                offload existing ones

                                                                                                                                        #rightscale
26#



Conclusions
• Shown that RightScale uses multiple database technologies
   • RDBMS – MySQL for the ACID semantics and ‘queryability’
       • Using a Master to N-Slaves for RO scale, and quick failure recovery
       • And ReadOnly Provisioning – To increase RO availability and scale remote systems
   • NoSQL: Cassandra for Availability and Scalability
       • for higher Read/Write availability within a cluster
       • For fully replicated regions across the globe (for Read/Write!)

• Shown how RightScale uses them in different techniques
   • It partitions resource data into Islands based on cloud proximity
       • Can achieve in-cloud polling,and keep monitoring/syslog data storage next to instances
       • Can provide routing availability, colocated with instances for any world region
   • It partitions core data into Clusters based on account groups
       • To scale the core horizontally, and independently and achieve account isolation/differentiation
       • Enhances fault isolation: Assigning accounts to Clusters deployed away their cloud resources
   • It maintains cluster pairs (sister sites)
       • To recover from full cloud region failures
       • It doesn’t require massive amounts of new resources to recover
                                                                                          #rightscale
27#



Next Steps                                       Contact RightScale
                                                     (866) 720-0208
1. Learn: Building Scalable Applications          sales@rightscale.com
   in the Cloud Whitepaper                         www.rightscale.com
   www.rightscale.com/whitepapers

2. Analyze: Deployment review of your
   environment
                                           The next big RightScale Community Event!
   www.rightscale.com/contact
                                                April 25-26 in San Francisco
                                                 www.RightScaleCompute.com
3. Try: Free Edition                          •Attend technical breakout sessions
   www.rightscale.com/free                          •Get RightScale training
                                                •Talk with RightScale customers
                                                •Ask questions at the Expert Bar




                                                                        #rightscale

More Related Content

PPTX
Understanding Virtual Networking in the Cloud - RightScale Compute 2013
RightScale
 
PDF
RightScale Webinar: Considerations For Choosing Cloud Providers
RightScale
 
PPTX
Delivering SaaS Using IaaS - RightScale Compute 2013
RightScale
 
PPTX
Rackspace: Unlock Your Cloud - RightScale Compute 2013
RightScale
 
PPTX
NextGen IBM Cloud Monitoring and Logging
Nagesh Ramamoorthy
 
PDF
Building Complete Private Clouds with Apache CloudStack and Riak CS
John Burwell
 
PDF
RightScale Webinar: Successfully Deploy Your Windows Workloads
RightScale
 
PPTX
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
HostedbyConfluent
 
Understanding Virtual Networking in the Cloud - RightScale Compute 2013
RightScale
 
RightScale Webinar: Considerations For Choosing Cloud Providers
RightScale
 
Delivering SaaS Using IaaS - RightScale Compute 2013
RightScale
 
Rackspace: Unlock Your Cloud - RightScale Compute 2013
RightScale
 
NextGen IBM Cloud Monitoring and Logging
Nagesh Ramamoorthy
 
Building Complete Private Clouds with Apache CloudStack and Riak CS
John Burwell
 
RightScale Webinar: Successfully Deploy Your Windows Workloads
RightScale
 
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
HostedbyConfluent
 

What's hot (16)

PPTX
Migration to Alibaba Cloud
Alibaba Cloud
 
PDF
Designing For Multicloud, CF Summit Frankfurt 2016
Mark D'Cunha
 
PDF
How IT at Getty Images Brokers Cloud Services
RightScale
 
PPTX
Key Design Considerations Private and Hybrid Clouds - RightScale Compute 2013
RightScale
 
PDF
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
John Burwell
 
PPSX
Apache Flink, AWS Kinesis, Analytics
Araf Karsh Hamid
 
PPTX
How to Manage Clouds, VMs and Bare Metal via RightScale
RightScale
 
PDF
Aneka platform
Shyam Krishna Khadka
 
PPTX
When the Cloud is a Rockin: High Availability in Apache CloudStack
John Burwell
 
PDF
Protecting Your Big Data on the Cloud
Alibaba Cloud
 
PPTX
Cloud Bursting with A10 Lightning ADS
Akshay Mathur
 
PDF
Building Scalable Real-Time Data Pipelines with the Couchbase Kafka Connector...
HostedbyConfluent
 
PDF
Getting Started with Elasticsearch
Alibaba Cloud
 
PPTX
Power of OpenStack & Hadoop
Tuan Yang
 
PDF
Openstack Cloud Management and Automation Using Red Hat Cloudforms 4.0
Prasad Mukhedkar
 
PPTX
Identifying Workloads to Move to the Cloud
RightScale
 
Migration to Alibaba Cloud
Alibaba Cloud
 
Designing For Multicloud, CF Summit Frankfurt 2016
Mark D'Cunha
 
How IT at Getty Images Brokers Cloud Services
RightScale
 
Key Design Considerations Private and Hybrid Clouds - RightScale Compute 2013
RightScale
 
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
John Burwell
 
Apache Flink, AWS Kinesis, Analytics
Araf Karsh Hamid
 
How to Manage Clouds, VMs and Bare Metal via RightScale
RightScale
 
Aneka platform
Shyam Krishna Khadka
 
When the Cloud is a Rockin: High Availability in Apache CloudStack
John Burwell
 
Protecting Your Big Data on the Cloud
Alibaba Cloud
 
Cloud Bursting with A10 Lightning ADS
Akshay Mathur
 
Building Scalable Real-Time Data Pipelines with the Couchbase Kafka Connector...
HostedbyConfluent
 
Getting Started with Elasticsearch
Alibaba Cloud
 
Power of OpenStack & Hadoop
Tuan Yang
 
Openstack Cloud Management and Automation Using Red Hat Cloudforms 4.0
Prasad Mukhedkar
 
Identifying Workloads to Move to the Cloud
RightScale
 
Ad

Similar to RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Scale, HA and DR Scenarios) (20)

PPTX
How RightScale Architects Its Own Databases for Worldwide Scale, HA, and DR S...
RightScale
 
PPTX
Building RightScale's Globally Distributed Datastore - RightScale Compute 2013
RightScale
 
PDF
Analytics&IoT
Selvaraj Kesavan
 
PDF
Klout changing landscape of social media
DataWorks Summit
 
PPTX
Telco analytics at scale
datamantra
 
PPTX
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
AWS User Group Kochi
 
PPTX
How Klout is changing the landscape of social media with Hadoop and BI
Denny Lee
 
PDF
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
PPTX
Implementing Big Data at the Speed of Business
DataWorks Summit
 
PDF
16h00 globant - aws globant-big-data_summit2012
infolive
 
PDF
Globant and Big Data on AWS
Amazon Web Services LATAM
 
PPTX
Evolving analytics at ebay - 2012 Tableau Customer Conference
gdougan1
 
PDF
Digital_IOT_(Microsoft_Solution).pdf
ssuserd23711
 
PPTX
Data & analytics challenges in a microservice architecture
Niels Naglé
 
PDF
Archonnex at ICPSR
Harshakumar Ummerpillai
 
PPTX
Introducing Splunk – The Big Data Engine
Swiss Big Data User Group
 
PPTX
Big Data Analytics PPT - S1 working .pptx
VivekChaurasia43
 
PDF
Architecting Data Lakes on AWS
Sajith Appukuttan
 
PPTX
Introducing SQL Server Data Services
goodfriday
 
PPTX
Introducing SQL Server Data Services
goodfriday
 
How RightScale Architects Its Own Databases for Worldwide Scale, HA, and DR S...
RightScale
 
Building RightScale's Globally Distributed Datastore - RightScale Compute 2013
RightScale
 
Analytics&IoT
Selvaraj Kesavan
 
Klout changing landscape of social media
DataWorks Summit
 
Telco analytics at scale
datamantra
 
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
AWS User Group Kochi
 
How Klout is changing the landscape of social media with Hadoop and BI
Denny Lee
 
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
Implementing Big Data at the Speed of Business
DataWorks Summit
 
16h00 globant - aws globant-big-data_summit2012
infolive
 
Globant and Big Data on AWS
Amazon Web Services LATAM
 
Evolving analytics at ebay - 2012 Tableau Customer Conference
gdougan1
 
Digital_IOT_(Microsoft_Solution).pdf
ssuserd23711
 
Data & analytics challenges in a microservice architecture
Niels Naglé
 
Archonnex at ICPSR
Harshakumar Ummerpillai
 
Introducing Splunk – The Big Data Engine
Swiss Big Data User Group
 
Big Data Analytics PPT - S1 working .pptx
VivekChaurasia43
 
Architecting Data Lakes on AWS
Sajith Appukuttan
 
Introducing SQL Server Data Services
goodfriday
 
Introducing SQL Server Data Services
goodfriday
 
Ad

More from RightScale (20)

PDF
10 Must-Have Automated Cloud Policies for IT Governance
RightScale
 
PDF
Kubernetes and Terraform in the Cloud: How RightScale Does DevOps
RightScale
 
PDF
Optimize Software, SaaS, and Cloud with Flexera and RightScale
RightScale
 
PDF
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
RightScale
 
PDF
How to Set Up a Cloud Cost Optimization Process for your Enterprise
RightScale
 
PDF
Multi-Cloud Management with RightScale CMP (Demo)
RightScale
 
PDF
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
RightScale
 
PDF
How to Allocate and Report Cloud Costs with RightScale Optima
RightScale
 
PDF
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
RightScale
 
PDF
Using RightScale CMP with Cloud Provider Tools
RightScale
 
PDF
Best Practices for Multi-Cloud Security and Compliance
RightScale
 
PDF
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
RightScale
 
PDF
The 5 Stages of Cloud Management for Enterprises
RightScale
 
PDF
9 Ways to Reduce Cloud Storage Costs
RightScale
 
PDF
Serverless Comparison: AWS vs Azure vs Google vs IBM
RightScale
 
PDF
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
RightScale
 
PDF
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
RightScale
 
PDF
2018 Cloud Trends: RightScale State of the Cloud Report
RightScale
 
PDF
Got a Multi-Cloud Strategy? How RightScale CMP Helps
RightScale
 
PDF
How to Manage Cloud Costs with RightScale Optima
RightScale
 
10 Must-Have Automated Cloud Policies for IT Governance
RightScale
 
Kubernetes and Terraform in the Cloud: How RightScale Does DevOps
RightScale
 
Optimize Software, SaaS, and Cloud with Flexera and RightScale
RightScale
 
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
RightScale
 
How to Set Up a Cloud Cost Optimization Process for your Enterprise
RightScale
 
Multi-Cloud Management with RightScale CMP (Demo)
RightScale
 
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
RightScale
 
How to Allocate and Report Cloud Costs with RightScale Optima
RightScale
 
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
RightScale
 
Using RightScale CMP with Cloud Provider Tools
RightScale
 
Best Practices for Multi-Cloud Security and Compliance
RightScale
 
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
RightScale
 
The 5 Stages of Cloud Management for Enterprises
RightScale
 
9 Ways to Reduce Cloud Storage Costs
RightScale
 
Serverless Comparison: AWS vs Azure vs Google vs IBM
RightScale
 
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
RightScale
 
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
RightScale
 
2018 Cloud Trends: RightScale State of the Cloud Report
RightScale
 
Got a Multi-Cloud Strategy? How RightScale CMP Helps
RightScale
 
How to Manage Cloud Costs with RightScale Optima
RightScale
 

Recently uploaded (20)

PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PDF
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 

RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Scale, HA and DR Scenarios)

  • 1. How RightScale Architects Its Databases (for Worldwide Scale, HA and DR Scenarios) January 30, 2013 Watch the recording of this webinar #rightscale
  • 2. 2# Your Panel Today Presenting • Rafael H. Saavedra, VP Engineering, RightScale • Josep Blanquer, Chief Architect, RightScale Q&A • Jared Marcell, Account Manager, RightScale • David Manriquez, Account Manager, RightScale Please use the “Questions” window to ask questions any time! #rightscale
  • 3. 3# Menu Intro Data Taxonomy Data Storage Design Scale, HA and DR Conclusion #rightscale
  • 4. 4# Intro: Expectations and scope What this is and what is not • IS a talk about: • how RightScale has designed and implemented its backing datastores • …for a few of the most representative internal systems • …with the rationale behind it • Is NOT a talk about • RightScale’s overall architecture • Nodes or hosts, it’s about Systems • RightScale’s data modeling Note: Most of the design is implemented and in production but some of the most advanced things that are still in beta, or are still being worked on #rightscale
  • 5. 5# Intro: Tools and Technologies • RightScale uses a mix of RDBMS and NoSQL technologies: • MySQL , Cassandra and S3 (for backups and archiving) • Transactionality: • MySQL: strong ACID properties • Cassandra: no Atomicity, eventually Consistent, some Isolation, Durable • Availability: • MySQL: async replication. Master-SlaveN or Master-Master • Cassandra: Distributed, master-less, highly-replicated (multi-DC) • Sharding: • MySQL: no explicit inter-node tools. (Sharding done by application) • Cassandra: partitions data internally across nodes. #rightscale
  • 6. 6# Glossary: Examples we will use Marketplace Assets Configuration data objects that are RightScripts user-generated, private or shared ServerTemplates Resource data that drives automation and Tags reporting Data used to communicate recent events and Events news feeds to users Data that records actions and states of external Cloud Polling and Gateway API-linked services Data used to locate and transport messages Routing across instances and/or our services Infrastructure monitoring data recorded and Monitoring presented on behalf of users #rightscale
  • 7. 7# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits  Tags  Recent Events Cloud Polling Data Routing Data Monitoring/Syslog #rightscale
  • 8. 8# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Common across accounts:  Users Dashboard Objects  Account Plans  Audits  Settings  MultiCloud Marketplace:  Tags  Published Assets  Recent Events  Sharing Groups  … Cloud Polling Data Routing Data Monitoring/Syslog #rightscale
  • 9. 9# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits Private to each account:  Tags  Deployments  Imported assets  Recent Events  Alert Specifications  Server Inputs Cloud Polling Data  Audit Routing Data  Tags  User Events Monitoring/Syslog  … #rightscale
  • 10. 10# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits Private to each account:  Tags  Cloud resource states (cache)  Cloud credentials  Recent Events Cloud Polling Data Routing Data Monitoring/Syslog #rightscale
  • 11. 11# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits Private to each account:  Tags  Instance agents location  Recent Events  Core agents location  Agent action registry Cloud Polling Data  … Routing Data Monitoring/Syslog #rightscale
  • 12. 12# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits  Tags  Recent Events Private to each account: Cloud Polling Data  Collected metric data  Collected syslog data Routing Data  … Monitoring/Syslog #rightscale
  • 13. 13# Taxonomy of RightScale’s Data Which data do we need? Global Objects X-acct • Data for all accounts  Marketplace Assets • Data for a single account Dashboard Objects Data shared between accounts  Audits  Tags Data scope and containment Account  Recent Events Cloud Polling Data Data required within scope of a single account Routing Data Monitoring/Syslog #rightscale
  • 14. 14# Taxonomy of RightScale’s Data Who uses the data? Global Objects • Users through the Dash/API  Marketplace Assets • Instances from the Cloud Users Dashboard Objects  Audits Data close to the Users  Tags  Recent Events Data Placement Cloud Polling Data Instances Routing Data Data close to the Cloud Monitoring/Syslog #rightscale
  • 15. 15# Taxonomy of RightScale’s Data Who uses the data? Proximity to User vs. Cloud Which data do we need? Scope of data available Global Objects X-acct Close to user  Marketplace Assets Globally accessible data Users Dashboard Objects  Audits Close to user  Tags Account-shardable data Account  Recent Events Cloud Polling Data Instances Close to cloud resources Routing Data Account-shardable* data Monitoring/Syslog #rightscale
  • 16. 16# X-Account Account Users Instances #rightscale
  • 17. 17# X-Account Why custom? More control • Multiple sources Users global • Individual columns Custom replication • Apply transformations • Smart re-sync features Global: MySQL • ACID semantics • Master-Slave replication Instances #rightscale
  • 18. 18# X-Account Account Data archive: S3 S3 • Low read rate tags • Globally accessible Users global dash audit Other systems: Cassandra events • Simpler Key-Value access • Great scalability • Great replica control • High write availability Dashboard: MySQL • Time-to-live expiration as cache • ACID semantics • Rows tagged by account • Master-SlaveN replication • Slave reads Instances • Rows tagged by account #rightscale
  • 19. 19# X-Account Account S3 tags tags Users global dash audit dash audit events events So we can horizontally scale our dashboard by partitioning objects based on account groups: Clusters Instances #rightscale
  • 20. 20# Account S3 S3 S3 tags tags tags Cluster 1 Cluster 3 Cluster dash audit dash audit … dash audit N event event event s s s Features: Users • 1 cluster: N accounts • 1 account: 1 home RightScale Accounts • Migratable accounts Benefits: • Great horizontal growth Account Set 2 • Better failure isolation Account Set 1 • Independent scale • Load rebalancing • Versionable code • Differentiated service #rightscale
  • 21. 21# X-Account Account S3 tags tags Users global dash audit dash audit events events polling Instances monitor routing #rightscale
  • 22. 22# X-Account Account S3 tags dash And partition our cloud objects based on the cloud Users global audit the instances of an account run on: events Islands polling polling Instances monitor monitor routing routing #rightscale
  • 23. 23# Account polling polling polling polling Polling Clouds: MySQL Monitoring: Custom • Master-Slave replication • Replicated files Island N Island 1 Island 2 • Can port to NoSQL easily monitor monitor monitor • Backup to S3 monitor • Mostly a resource cache • Archive to S3 • But cloud partitionable routing routing routing routing Instances Features: • 1 instance: 1 home island • 1 Island can serve N clouds • Core Agents: global data Routing: Cassandra • Simpler Key-Value access Benefits: • Very high availability • Close to cloud resources Services co-located • Services co-located Great scalability • Good failure isolation Services co-located with resources • with resources Great replica control • As good resources with as cloud  • Plus cross DC replication* • Good scale: global replicas across Cassandra DCs Cloud 1 Cloud 2 Cloud N #rightscale
  • 24. 24# Account S3 S3 S3 tags tags tags Cluster 1 Cluster 3 Users Cluster dash audit dash audit … dash audit N event event event s s s Differentif the cloud What Geographies where the cluster is deployed on… polling Fails? polling polling Instances Island N Island 1 Island 2 monitor monitor monitor routing routing routing Different Clouds #rightscale
  • 25. 25# Account Sister Clusters S3 S3 S3 tags tags tags Cluster 1 Cluster 3 Users Cluster audit Full replica dash dash audit … dash audit N event event event s s s Features: • Each master has an extra remote slave • Each cluster in a pair is a DC replica of the other’s polling polling polling Instances localring Island N Island 1 Island 2 monitor At Disaster Recovery time: monitor monitor • Apps are told to start serving an extra shard routing • No need to provision more infrastructure to recover routing routing (try to avoid since everybody is on the same boat) • New resources can be allocated over time to help offload existing ones #rightscale
  • 26. 26# Conclusions • Shown that RightScale uses multiple database technologies • RDBMS – MySQL for the ACID semantics and ‘queryability’ • Using a Master to N-Slaves for RO scale, and quick failure recovery • And ReadOnly Provisioning – To increase RO availability and scale remote systems • NoSQL: Cassandra for Availability and Scalability • for higher Read/Write availability within a cluster • For fully replicated regions across the globe (for Read/Write!) • Shown how RightScale uses them in different techniques • It partitions resource data into Islands based on cloud proximity • Can achieve in-cloud polling,and keep monitoring/syslog data storage next to instances • Can provide routing availability, colocated with instances for any world region • It partitions core data into Clusters based on account groups • To scale the core horizontally, and independently and achieve account isolation/differentiation • Enhances fault isolation: Assigning accounts to Clusters deployed away their cloud resources • It maintains cluster pairs (sister sites) • To recover from full cloud region failures • It doesn’t require massive amounts of new resources to recover #rightscale
  • 27. 27# Next Steps Contact RightScale (866) 720-0208 1. Learn: Building Scalable Applications [email protected] in the Cloud Whitepaper www.rightscale.com www.rightscale.com/whitepapers 2. Analyze: Deployment review of your environment The next big RightScale Community Event! www.rightscale.com/contact April 25-26 in San Francisco www.RightScaleCompute.com 3. Try: Free Edition •Attend technical breakout sessions www.rightscale.com/free •Get RightScale training •Talk with RightScale customers •Ask questions at the Expert Bar #rightscale

Editor's Notes

  • #3: Telcos built point-2-point networks for their customers