MySQL HA: Using different solutions
  • Robert Krzykawski, DB Team Coordinator, bwin games
  • Anders Karlsson, Principal Sales Engineer, MySQL
Agenda
  • Who are we?
  • HA Basics – Anders
  • How we did it; Success or failure – Robert
  • Summary
  • Questions?
Anders Karlsson
  • Sales Engineer with Sun / MySQL for 5+ years
  • I have been in the RDBMS business for 20+ years
  • I have worked for many of the major vendors and with most of the vendor products
  • I’ve been in roles as Sales Engineer, Consultant, Porting engineer, Support engineer, etc.
  • Outside MySQL I build websites (www.papablues.com), develop Open Source software (MyQuery, ndbtop etc.), am a keen photographer and drive sub-standard cars, among other things.
  • Also: www.makezfsgpl.com! Right now!
Robert Krzykawski
  • DB Team Coordinator @ bwin Games AB
  • I have worked with MySQL in every way, as system admin, DBA and DBD, and am now taking a more system architectural role.
  • I have been involved in building both small and big web-based solutions using MySQL since 1998.
  • My roles throughout my professional life have varied: System administrator, Technical Sales support, DBA, DBD, Programmer, Application architect and System architect.
  • Off work I try to automate things with scripts and programs to offload myself when “on work”.
  • I am also trying to find time to snowboard and play some paintball, and a recently introduced hobby is our Maine Coon kittens.
Why do you need HA?
  • Something can break. It usually will, eventually.
  • You will need to maintain your database eventually, without shutting the whole system down.
  • Adding HA to an existing running system is difficult, much more so than providing HA from the start.
  • You want a good night’s sleep!
  • You want failover to be automatic!
HA Concepts
  • Fault tolerant architectures: These are hardware architectures with supporting software that protect against even individual component failures.
  • Single Point of Failure (SPOF): In any fault tolerant setup, you want to avoid a SPOF, as a chain is no stronger than its weakest link.
  • Fail over and Fail back: Fail over is the process of switching from a failed component to another component, which may be dormant or already active. Fail back is the process of switching back from the backup component to the original one.
Some HA Components
  • Heartbeat: Heartbeat is an HA component that checks that the services that can be failed over are alive. Heartbeat can check individual servers, software services, networking etc.
  • HA Monitor: The HA Monitor has different names in different frameworks. This is the component that allows configuration of the services, ensures proper shutdown and startup, and allows manual control.
  • Replication: Replication is a common component that ensures that the data content of managed, data-rich components is in sync.
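The heartbeat idea can be sketched in a few lines of Python. This is only an illustration of the concept, not how Linux HA's Heartbeat is implemented: the function names `is_alive` and `should_fail_over`, the TCP-connect probe and the threshold of three missed checks are all assumptions made for the example.

```python
import socket
from typing import Sequence


def is_alive(host: str, port: int, timeout: float = 1.0) -> bool:
    """Probe a service by opening a TCP connection (hypothetical check)."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Refused, unreachable and timed-out connections all count as a miss.
        return False


def should_fail_over(results: Sequence[bool], max_misses: int = 3) -> bool:
    """Declare the peer dead only after max_misses consecutive failed checks.

    Requiring several misses in a row avoids failing over on a single
    dropped packet or a short network hiccup.
    """
    if len(results) < max_misses:
        return False
    return not any(results[-max_misses:])
```

A monitor loop would append the result of `is_alive(...)` to a list at a fixed interval and trigger the failover procedure once `should_fail_over(...)` returns true.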
What should I require?
  • Don’t aim too high, aim for what is reasonable for your needs.
  • Aim to ensure that no important data is lost. What is “important data”? You decide! Different data means different “needs”!
  • Aim to ensure that the solution can be automated. You will want this eventually anyway.
  • Aim to ensure a solution that can easily be tested and administered.
  • Aim to ensure that the solution is performant and scalable.
HA with MySQL – In short
  • MySQL Replication: Easy to use and set up. Low performance impact. Asynchronous only. Failback can be difficult. Needs additional components.
  • MySQL with DRBD / ZFS / AVS: Easy to use. Low cost, software only. Synchronous. Good HA software integration. Certain performance impact. Limited data size and transaction rates.
HA with MySQL – In short
  • MySQL with Shared storage: Good performance. Eases hardware management. Good integration with HA software. Costly. SAN itself is a SPOF.
  • MySQL Cluster: Very good performance. Self contained. Very short fail-over times. Software only solution. Needs several physical servers. Not optimized for all MySQL applications.
bwin Games AB
Our goal at bwin
  • We were faced with a requirement: establish a highly available database platform.
  • We had some rules to follow from management:
  • Interruptions due to hardware failure should not require hands-on work.
  • Downtime should be minimized during interruptions.
  • Performance of the DB platform should not decrease when operating as usual.
  • Performance can decrease if a failure has occurred, but should not render the service unusable.
  • Implementation should be done by the operations department. Developers should not be involved.
What solutions did we consider?
  • Master/Master
  • Linux HA
  • HP Service Guard
  • Sun Cluster
  • Combination of the above
  • MySQL Cluster
  • Will walk through all of the above
Master/Master
  • Master/Master with two active nodes would give us a seamless switch if we have a good load balancer.
  • It would give us the ability to do schema changes “online”.
  • Not only higher availability but also better performance when both nodes are up.
  • Can eliminate the use of production slaves.
  • One entry point for the application when using a load balancer (“LB”).
Linux HA/ServiceGuard/SunCluster
  • Service IP switch will cause a glitch in service.
  • Since we are running 4.0 we can’t really do a master/master setup with service IP switching.
  • Slave integrity is important and we are running 4.0, with one master for the data. We can’t switch to a slave and hope that everything was replicated.
  • We are using SAN, so shared storage is possible.
  • One instance, two machines: one active, one standby.
  • InnoDB log size will be a problem. Timeout during recovery can cause problems during the switch.
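The worry about switching to a slave "and hoping that everything was replicated" can be made concrete with a position check. The sketch below is an assumption-laden illustration, not something from the talk: it supposes that the master's coordinates come from `SHOW MASTER STATUS` and the slave's executed coordinates from the `Relay_Master_Log_File` and `Exec_Master_Log_Pos` fields of `SHOW SLAVE STATUS`, and that binlog file names end in a numeric sequence such as `mysql-bin.000042`. The type `BinlogPos` and the function `slave_caught_up` are hypothetical names.

```python
from typing import NamedTuple


class BinlogPos(NamedTuple):
    """A binary log coordinate: file name plus byte offset in that file."""
    file: str
    offset: int


def _sequence(file: str) -> int:
    # Assumes names like "mysql-bin.000042"; the suffix is the sequence number.
    return int(file.rsplit(".", 1)[1])


def slave_caught_up(master: BinlogPos, slave_exec: BinlogPos) -> bool:
    """True if the slave has executed everything up to the master's position."""
    return (_sequence(slave_exec.file), slave_exec.offset) >= (
        _sequence(master.file),
        master.offset,
    )
```

If the master has crashed, its last position may not be readable at all, which is exactly why asynchronous replication alone cannot guarantee a lossless switch.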
MySQL Cluster
  • High availability built in, if implemented correctly.
  • Requires more hardware.
  • More complex solution.
  • Requires the application to support NDB.
  • Not the full feature set.
Obstacles
  • We are using MySQL 4.0 in our biggest database.
  • A Master/Master scenario on 4.0 requires a higher level of application awareness.
  • LinuxHA/ServiceGuard/Sun Cluster will cause a small glitch when we move resources.
  • MySQL Cluster will require even more application changes in our case.
Our Choice
  • LinuxHA, because it is GPL/LGPL. Free and not owned by an organization.
  • Fastest way to implement; it did not require any support from the development department.
  • All other ways required changes in the application.
Layout Two versions
We do..
  • Use Linux HA 2.0. Needed for setup of the “cluster”.
  • Use SAN. Shared storage is easier and faster, but expensive.
  • DRBD can be used, but it saves the same data twice and also comes with a performance decrease.
  • Heartbeat on two bonds: primary on the database interconnect network, secondary on the database service network.
  • We have LUNs presented to multiple hosts.
  • Services have rules to be run on specific hosts only.
  • We fence using RiLOE.
  • We have plans to fence on port level in the FC switches.
What’s good and what’s bad..
  • Easy and fast implementation.
  • Our config does not increase/decrease performance.
  • InnoDB log size causes long recovery times. Testing to decrease it has caused performance penalties.
  • Our solution is not foolproof because of long recovery times. It causes interruption of service.
  • We can say it’s HA, but a true HA solution would give us 100% uptime.
  • The 2nd setup is complicated. We should aim for simple, more common setups.
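To put the uptime remark in numbers, an availability target can be converted into the downtime it allows per year. The function name `downtime_minutes_per_year` and the targets used below are illustrative assumptions, not figures from bwin.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes in a non-leap year


def downtime_minutes_per_year(availability_percent: float) -> float:
    """Minutes of downtime per year permitted by an availability target."""
    return MINUTES_PER_YEAR * (1 - availability_percent / 100)


if __name__ == "__main__":
    for target in (99.0, 99.9, 99.99, 100.0):
        minutes = downtime_minutes_per_year(target)
        print(f"{target}% uptime allows about {minutes:.1f} minutes of downtime per year")
```

With a budget expressed this way, a long InnoDB recovery during each failover can be compared directly against the downtime the target leaves room for.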
What can we do better?
  • Fine tune the config for faster recovery/startup.
  • Add better fencing.
  • Monitor failover in case recovery takes long.
  • Master/Master or multi-master: if the application can reconnect, or if we have a smart load balancer, we have no outages. Upgrades or schema changes can be made “online”. No separation between writes and reads. Less complicated for developers. One entry point.
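"If the application can reconnect" is the condition that hides a service IP switch from users. A minimal sketch of such reconnect logic follows; `connect_with_retry`, its parameters and the exponential backoff are assumptions for illustration, and `connect` stands in for whatever call the application's database driver provides.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def connect_with_retry(
    connect: Callable[[], T],
    attempts: int = 5,
    delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call connect() until it succeeds, backing off between attempts.

    connect is any zero-argument callable that returns a connection or
    raises OSError (hypothetical; real drivers raise their own error types).
    """
    last_error: Optional[BaseException] = None
    for attempt in range(attempts):
        try:
            return connect()
        except OSError as exc:
            last_error = exc
            if attempt < attempts - 1:
                # Wait 0.5 s, 1 s, 2 s, ... so a failover has time to finish.
                sleep(delay * 2 ** attempt)
    raise ConnectionError(f"gave up after {attempts} attempts") from last_error
```

The total wait has to be at least as long as the failover itself, so with long recovery times the retry budget needs to be sized accordingly.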
Summary
  • Concepts
  • Components
  • Requirements
  • Technologies
  • Your goal
  • Considerations
  • Obstacles
  • How we did it @ bwin games AB
  • HA recommendations
Questions
  • “The question is not, ‘What is the answer?’ The question is, ‘What is the question?’” (Henri Poincaré)
Thank you for your time!
  • And thank you for listening so kindly.
  • We can be found on:
  • Robert Krzykawski – http://krzykawski.com
  • Anders Karlsson – http://papablues.com and http://karlssonondatabases.blogspot.com/
