SlideShare a Scribd company logo
OSDC 2013 1
Distributed Storage with GlusterFS
Dr. Udo Seidel
Linux-Strategy @ Amadeus
OSDC 2013 2
Agenda
â—Ź Introduction
â—Ź High level overview
â—Ź Storage inside
â—Ź Use cases
â—Ź Summary
OSDC 2013 3
Introduction
LinuxCon Europe 05-07/11 2012 4
Me ;-)
â—Ź Teacher of mathematics & physics
â—Ź PhD in experimental physics
â—Ź Started with Linux in 1996
â—Ź Linux/UNIX trainer
â—Ź Solution engineer in HPC and CAx environment
â—Ź Head of the Linux Strategy team @Amadeus
OSDC 2013 5
Storage: History
â—Ź Reviewing storage task responsibilities
â—Ź Block allocation
â—Ź Space management
â—Ź Extension of SCSI standard
â—Ź Object based storage
â—Ź Meta-Data handling separated from data
management
OSDC 2013 6
Object based storage
â—Ź Storage objects quite general
â—Ź Partition, file, ...
â—Ź Unique identifier
â—Ź OSD (Object based Storage Device)
â—Ź Hardware -> original trigger
â—Ź Software -> common implementation
â—Ź Main component of distributed file systems
OSDC 2013 7
Distributed storage:
Paradigm changes
â—Ź Block -> Object
â—Ź Central -> Distributed
â—Ź Few -> Many
â—Ź Big -> Small
â—Ź Server <-> Storage
OSDC 2013 8
Distributed File Systems
â—Ź 'Recent' attention on distributed storage
â—Ź Cloud hype
â—Ź Big Data
â—Ź See also CEPH talk the other day
OSDC 2013 9
Distributed storage – Now what?!?
â—Ź Several implementations
â—Ź Different functions
â—Ź Support models
â—Ź Storage vendors initiatives
â—Ź Relation to Linux distributions
Here and now ==> GlusterFS
OSDC 2013 10
High level overview
OSDC 2013 11
History
â—Ź Gluster founded in 2005
â—Ź Gluster = GNU + cluster
â—Ź Acquisition by Red Hat in 2011
â—Ź Community project
â—Ź 3.2 in 2011
â—Ź 3.3 in 2012
â—Ź Commercial product: RedHat Storage Server
OSDC 2013 12
The Client
â—Ź Native
â—Ź 'speaks' GLUSTERFS
â—Ź Not part of the Linux Kernel
â—Ź FUSE-based
â—Ź NFS
â—Ź Normal NFS client stack
â—Ź S3/Swift compatible
â—Ź Proxy needed
OSDC 2013 13
The Server
â—Ź Data
â—Ź Bricks
â—Ź Translators
â—Ź Volumes -> exported/served to the client
â—Ź Meta-Data
â—Ź No dedicated instance
â—Ź Distributed hashing approach
OSDC 2013 14
The picture
OSDC 2013 15
Storage inside
OSDC 2013 16
The Brick
â—Ź Trust each other
â—Ź Interconnect via TCP/IP or RDMA/Infiniband
â—Ź Dedicated file systems on GlusterFS server
â—Ź XFS recommended, EXT4 works too
â—Ź Extended attributes a must
â—Ź Two main processes/daemons
â—Ź glusterd
â—Ź glusterfsd
OSDC 2013 17
The Translator
â—Ź One per purpose
â—Ź Replication
â—Ź POSIX
â—Ź Quota
â—Ź I/O behaviour
â—Ź Chained -> brick graph
â—Ź Technically: configuration
OSDC 2013 18
The Volume
â—Ź Service unit
â—Ź Layer of configuration
â—Ź 'RAID level' -> distributed, replicated, striped,
combinations
â—Ź NFS
â—Ź Cache
â—Ź Permissions
â—Ź ....
OSDC 2013 19
Meta Data
â—Ź 2 kinds
â—Ź More of local file system style
â—Ź Related to distributed nature
â—Ź Some stored in backend file system
â—Ź Permissions
â—Ź Time stamps
â—Ź Distribution/replication
â—Ź Some calculated on the fly
â—Ź Brick location
OSDC 2013 20
Elastic Hash Algorithm
â—Ź Based on file names
â—Ź Name space divided
â—Ź Full brick handled via relinking
â—Ź Stored in extended attributes
â—Ź Client needs to know topology
OSDC 2013 21
Self-Healing
â—Ź On demand vs. Scheduled
â—Ź File based
â—Ź Based on extended attributes
â—Ź Split-brain
â—Ź Quorum function
â—Ź Sometimes: manual intervention
OSDC 2013 22
Geo replication
â—Ź Asynchronous
â—Ź Based on rsync/ssh
â—Ź Master-Slave
â—Ź If needed: cascading
â—Ź One way street
â—Ź Clocks in sync!
OSDC 2013 23
From files to objects
â—Ź Introduced with version 3.3
â—Ź Hard links with some hierarchy
â—Ź Re-uses GFID (inode number)
â—Ź UFO
â—Ź Unified File and Object
â—Ź Combination with RESTful API
â—Ź S3 and swift compatible
OSDC 2013 24
Operations:
Growth, shrinkage .. failures
â—Ź A Must!
â—Ź Easy
â—Ź Rebalance!
â—Ź Order of servers important
OSDC 2013 25
What else ...?
â—Ź Encryption :-|
â—Ź Compression :-(
â—Ź Snapshots :-(
â—Ź Hadoop connector :-)
â—Ź Locking granularity :-|
â—Ź File system statistics :-)
OSDC 2013 26
Use cases
OSDC 2013 27
NAS replacement
â—Ź NFS as 1:1
â—Ź Server: GlusterFS
â—Ź Client: NFS
â—Ź NFS as such
â—Ź Server: GlusterFS
â—Ź Client: GlusterFS
OSDC 2013 28
SAN replacement
â—Ź Not quite advanced (yet)
â—Ź New translator needed
â—Ź Development started
â—Ź Presenting GlusterFS as block device
â—Ź Additional items needed
â—Ź Locking
â—Ź ...
OSDC 2013 29
Storage back-end for KVM and Co
â—Ź Stacked (indirect)
â—Ź Not smart
â—Ź Workable for main hypervisors
â—Ź Direct
â—Ź QEMU
â—Ź libvirt
â—Ź oVirt/RHEV
OSDC 2013 30
Summary
OSDC 2013 31
Take aways
â—Ź Thin distributed file system layer
â—Ź Modular architecture
â—Ź Operationally ready
â—Ź Still some surprises
â—Ź Active development and community
OSDC 2013 32
References
â—Ź http://www.gluster.org
â—Ź http://www.sxc.hu (pictures)
OSDC 2013 33
Thank you!
OSDC 2013 34
Distributed Storage with GlusterFS
Dr. Udo Seidel
Linux-Strategy @ Amadeus

More Related Content

PDF
Lt2013 glusterfs.talk
Udo Seidel
 
PDF
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
NETWAYS
 
PDF
State of the_gluster_-_lceu
Gluster.org
 
ODP
Dedupe nmamit
Gluster.org
 
PDF
Gluster for sysadmins
Gluster.org
 
ODP
GlusterD - Daemon refactoring
Atin Mukherjee
 
PDF
Sdc 2012-challenges
Gluster.org
 
ODP
Gluster d thread_synchronization_using_urcu_lca2016
Gluster.org
 
Lt2013 glusterfs.talk
Udo Seidel
 
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
NETWAYS
 
State of the_gluster_-_lceu
Gluster.org
 
Dedupe nmamit
Gluster.org
 
Gluster for sysadmins
Gluster.org
 
GlusterD - Daemon refactoring
Atin Mukherjee
 
Sdc 2012-challenges
Gluster.org
 
Gluster d thread_synchronization_using_urcu_lca2016
Gluster.org
 

What's hot (19)

PDF
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Gluster.org
 
ODP
Tiering barcelona
Gluster.org
 
ODP
Gluster intro-tdose
Gluster.org
 
PDF
adp.ceph.openstack.talk
Udo Seidel
 
PDF
Osdc2012 xtfs.talk
Udo Seidel
 
ODP
GlusterD 2.0 - Managing Distributed File System Using a Centralized Store
Atin Mukherjee
 
ODP
Ostd.ksplice.talk
Udo Seidel
 
PDF
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Atin Mukherjee
 
PDF
Gluster intro-tdose
Gluster.org
 
ODP
Scale out backups-with_bareos_and_gluster
Gluster.org
 
ODP
20160130 Gluster-roadmap
Gluster.org
 
ODP
GlusterFS Containers
Mohamed Ashiq
 
ODP
20160401 Gluster-roadmap
Gluster.org
 
PDF
kpatch.kgraft
Udo Seidel
 
PDF
Cncf meetup-rook
Juraj Hantak
 
PDF
Cncf meetup-rook
Juraj Hantak
 
PDF
20160401 guster-roadmap
Gluster.org
 
ODP
YDAL Barcelona
Gluster.org
 
ODP
Developing apps and_integrating_with_gluster_fs_-_libgfapi
Gluster.org
 
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Gluster.org
 
Tiering barcelona
Gluster.org
 
Gluster intro-tdose
Gluster.org
 
adp.ceph.openstack.talk
Udo Seidel
 
Osdc2012 xtfs.talk
Udo Seidel
 
GlusterD 2.0 - Managing Distributed File System Using a Centralized Store
Atin Mukherjee
 
Ostd.ksplice.talk
Udo Seidel
 
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Atin Mukherjee
 
Gluster intro-tdose
Gluster.org
 
Scale out backups-with_bareos_and_gluster
Gluster.org
 
20160130 Gluster-roadmap
Gluster.org
 
GlusterFS Containers
Mohamed Ashiq
 
20160401 Gluster-roadmap
Gluster.org
 
kpatch.kgraft
Udo Seidel
 
Cncf meetup-rook
Juraj Hantak
 
Cncf meetup-rook
Juraj Hantak
 
20160401 guster-roadmap
Gluster.org
 
YDAL Barcelona
Gluster.org
 
Developing apps and_integrating_with_gluster_fs_-_libgfapi
Gluster.org
 
Ad

Similar to OSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo Seidel (20)

PDF
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
NETWAYS
 
PDF
Gluster.community.day.2013
Udo Seidel
 
PDF
Linuxtag.ceph.talk
Udo Seidel
 
PDF
Scaling up and accelerating Drupal 8 with NoSQL
OSInet
 
PDF
Cloud storage: the right way OSS EU 2018
Orit Wasserman
 
PDF
2021.02 new in Ceph Pacific Dashboard
Ceph Community
 
PDF
ceph openstack dream team
Udo Seidel
 
PDF
Challenges with Gluster and Persistent Memory with Dan Lambright
Gluster.org
 
PDF
Gluster d2
Gluster.org
 
ODP
Time to rethink /proc
Kir Kolyshkin
 
PDF
Initial presentation of swift (for montreal user group)
Marcos GarcĂ­a
 
PDF
Gluster wireshark niels_de_vos
Gluster.org
 
PDF
[scala.by] Launching new application fast
Denis Karpenko
 
PDF
Ceph data services in a multi- and hybrid cloud world
Sage Weil
 
ODP
Gluster d2
Gluster.org
 
PDF
High-level architecture of a complete MariaDB deployment
Federico Razzoli
 
ODP
20160401 guster-roadmap
Gluster.org
 
PDF
Gluster fs architecture_&amp;_roadmap-vijay_bellur-linuxcon_eu_2013
Gluster.org
 
PPTX
Automating using Ansible
Alok Patra
 
PDF
Elephants in the Cloud
Mike Fowler
 
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
NETWAYS
 
Gluster.community.day.2013
Udo Seidel
 
Linuxtag.ceph.talk
Udo Seidel
 
Scaling up and accelerating Drupal 8 with NoSQL
OSInet
 
Cloud storage: the right way OSS EU 2018
Orit Wasserman
 
2021.02 new in Ceph Pacific Dashboard
Ceph Community
 
ceph openstack dream team
Udo Seidel
 
Challenges with Gluster and Persistent Memory with Dan Lambright
Gluster.org
 
Gluster d2
Gluster.org
 
Time to rethink /proc
Kir Kolyshkin
 
Initial presentation of swift (for montreal user group)
Marcos GarcĂ­a
 
Gluster wireshark niels_de_vos
Gluster.org
 
[scala.by] Launching new application fast
Denis Karpenko
 
Ceph data services in a multi- and hybrid cloud world
Sage Weil
 
Gluster d2
Gluster.org
 
High-level architecture of a complete MariaDB deployment
Federico Razzoli
 
20160401 guster-roadmap
Gluster.org
 
Gluster fs architecture_&amp;_roadmap-vijay_bellur-linuxcon_eu_2013
Gluster.org
 
Automating using Ansible
Alok Patra
 
Elephants in the Cloud
Mike Fowler
 
Ad

Recently uploaded (20)

PPTX
Presentation about variables and constant.pptx
kr2589474
 
PDF
49784907924775488180_LRN2959_Data_Pump_23ai.pdf
Abilash868456
 
PDF
Summary Of Odoo 18.1 to 18.4 : The Way For Odoo 19
CandidRoot Solutions Private Limited
 
PPTX
The-Dawn-of-AI-Reshaping-Our-World.pptxx
parthbhanushali307
 
PPTX
Maximizing Revenue with Marketo Measure: A Deep Dive into Multi-Touch Attribu...
bbedford2
 
PDF
MiniTool Power Data Recovery Crack New Pre Activated Version Latest 2025
imang66g
 
PDF
WatchTraderHub - Watch Dealer software with inventory management and multi-ch...
WatchDealer Pavel
 
PDF
On Software Engineers' Productivity - Beyond Misleading Metrics
Romén Rodríguez-Gil
 
PPTX
slidesgo-unlocking-the-code-the-dynamic-dance-of-variables-and-constants-2024...
kr2589474
 
PPTX
Presentation about variables and constant.pptx
safalsingh810
 
PDF
Protecting the Digital World Cyber Securit
dnthakkar16
 
PPTX
ASSIGNMENT_1[1][1][1][1][1] (1) variables.pptx
kr2589474
 
PPTX
classification of computer and basic part of digital computer
ravisinghrajpurohit3
 
PPT
Why Reliable Server Maintenance Service in New York is Crucial for Your Business
Sam Vohra
 
PDF
An Experience-Based Look at AI Lead Generation Pricing, Features & B2B Results
Thomas albart
 
PDF
Generating Union types w/ Static Analysis
K. Matthew Dupree
 
PDF
New Download MiniTool Partition Wizard Crack Latest Version 2025
imang66g
 
PDF
Download iTop VPN Free 6.1.0.5882 Crack Full Activated Pre Latest 2025
imang66g
 
PPTX
Presentation about Database and Database Administrator
abhishekchauhan86963
 
PPTX
TRAVEL APIs | WHITE LABEL TRAVEL API | TOP TRAVEL APIs
philipnathen82
 
Presentation about variables and constant.pptx
kr2589474
 
49784907924775488180_LRN2959_Data_Pump_23ai.pdf
Abilash868456
 
Summary Of Odoo 18.1 to 18.4 : The Way For Odoo 19
CandidRoot Solutions Private Limited
 
The-Dawn-of-AI-Reshaping-Our-World.pptxx
parthbhanushali307
 
Maximizing Revenue with Marketo Measure: A Deep Dive into Multi-Touch Attribu...
bbedford2
 
MiniTool Power Data Recovery Crack New Pre Activated Version Latest 2025
imang66g
 
WatchTraderHub - Watch Dealer software with inventory management and multi-ch...
WatchDealer Pavel
 
On Software Engineers' Productivity - Beyond Misleading Metrics
Romén Rodríguez-Gil
 
slidesgo-unlocking-the-code-the-dynamic-dance-of-variables-and-constants-2024...
kr2589474
 
Presentation about variables and constant.pptx
safalsingh810
 
Protecting the Digital World Cyber Securit
dnthakkar16
 
ASSIGNMENT_1[1][1][1][1][1] (1) variables.pptx
kr2589474
 
classification of computer and basic part of digital computer
ravisinghrajpurohit3
 
Why Reliable Server Maintenance Service in New York is Crucial for Your Business
Sam Vohra
 
An Experience-Based Look at AI Lead Generation Pricing, Features & B2B Results
Thomas albart
 
Generating Union types w/ Static Analysis
K. Matthew Dupree
 
New Download MiniTool Partition Wizard Crack Latest Version 2025
imang66g
 
Download iTop VPN Free 6.1.0.5882 Crack Full Activated Pre Latest 2025
imang66g
 
Presentation about Database and Database Administrator
abhishekchauhan86963
 
TRAVEL APIs | WHITE LABEL TRAVEL API | TOP TRAVEL APIs
philipnathen82
 

OSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo Seidel

  • 1. OSDC 2013 1 Distributed Storage with GlusterFS Dr. Udo Seidel Linux-Strategy @ Amadeus
  • 2. OSDC 2013 2 Agenda â—Ź Introduction â—Ź High level overview â—Ź Storage inside â—Ź Use cases â—Ź Summary
  • 4. LinuxCon Europe 05-07/11 2012 4 Me ;-) â—Ź Teacher of mathematics & physics â—Ź PhD in experimental physics â—Ź Started with Linux in 1996 â—Ź Linux/UNIX trainer â—Ź Solution engineer in HPC and CAx environment â—Ź Head of the Linux Strategy team @Amadeus
  • 5. OSDC 2013 5 Storage: History â—Ź Reviewing storage task responsibilities â—Ź Block allocation â—Ź Space management â—Ź Extension of SCSI standard â—Ź Object based storage â—Ź Meta-Data handling separated from data management
  • 6. OSDC 2013 6 Object based storage â—Ź Storage objects quite general â—Ź Partition, file, ... â—Ź Unique identifier â—Ź OSD (Object based Storage Device) â—Ź Hardware -> original trigger â—Ź Software -> common implementation â—Ź Main component of distributed file systems
  • 7. OSDC 2013 7 Distributed storage: Paradigm changes â—Ź Block -> Object â—Ź Central -> Distributed â—Ź Few -> Many â—Ź Big -> Small â—Ź Server <-> Storage
  • 8. OSDC 2013 8 Distributed File Systems â—Ź 'Recent' attention on distributed storage â—Ź Cloud hype â—Ź Big Data â—Ź See also CEPH talk the other day
  • 9. OSDC 2013 9 Distributed storage – Now what?!? â—Ź Several implementations â—Ź Different functions â—Ź Support models â—Ź Storage vendors initiatives â—Ź Relation to Linux distributions Here and now ==> GlusterFS
  • 10. OSDC 2013 10 High level overview
  • 11. OSDC 2013 11 History â—Ź Gluster founded in 2005 â—Ź Gluster = GNU + cluster â—Ź Acquisition by Red Hat in 2011 â—Ź Community project â—Ź 3.2 in 2011 â—Ź 3.3 in 2012 â—Ź Commercial product: RedHat Storage Server
  • 12. OSDC 2013 12 The Client â—Ź Native â—Ź 'speaks' GLUSTERFS â—Ź Not part of the Linux Kernel â—Ź FUSE-based â—Ź NFS â—Ź Normal NFS client stack â—Ź S3/Swift compatible â—Ź Proxy needed
  • 13. OSDC 2013 13 The Server â—Ź Data â—Ź Bricks â—Ź Translators â—Ź Volumes -> exported/served to the client â—Ź Meta-Data â—Ź No dedicated instance â—Ź Distributed hashing approach
  • 14. OSDC 2013 14 The picture
  • 16. OSDC 2013 16 The Brick â—Ź Trust each other â—Ź Interconnect via TCP/IP or RDMA/Infiniband â—Ź Dedicated file systems on GlusterFS server â—Ź XFS recommended, EXT4 works too â—Ź Extended attributes a must â—Ź Two main processes/daemons â—Ź glusterd â—Ź glusterfsd
  • 17. OSDC 2013 17 The Translator â—Ź One per purpose â—Ź Replication â—Ź POSIX â—Ź Quota â—Ź I/O behaviour â—Ź Chained -> brick graph â—Ź Technically: configuration
  • 18. OSDC 2013 18 The Volume â—Ź Service unit â—Ź Layer of configuration â—Ź 'RAID level' -> distributed, replicated, striped, combinations â—Ź NFS â—Ź Cache â—Ź Permissions â—Ź ....
  • 19. OSDC 2013 19 Meta Data â—Ź 2 kinds â—Ź More of local file system style â—Ź Related to distributed nature â—Ź Some stored in backend file system â—Ź Permissions â—Ź Time stamps â—Ź Distribution/replication â—Ź Some calculated on the fly â—Ź Brick location
  • 20. OSDC 2013 20 Elastic Hash Algorithm â—Ź Based on file names â—Ź Name space divided â—Ź Full brick handled via relinking â—Ź Stored in extended attributes â—Ź Client needs to know topology
  • 21. OSDC 2013 21 Self-Healing â—Ź On demand vs. Scheduled â—Ź File based â—Ź Based on extended attributes â—Ź Split-brain â—Ź Quorum function â—Ź Sometimes: manual intervention
  • 22. OSDC 2013 22 Geo replication â—Ź Asynchronous â—Ź Based on rsync/ssh â—Ź Master-Slave â—Ź If needed: cascading â—Ź One way street â—Ź Clocks in sync!
  • 23. OSDC 2013 23 From files to objects â—Ź Introduced with version 3.3 â—Ź Hard links with some hierarchy â—Ź Re-uses GFID (inode number) â—Ź UFO â—Ź Unified File and Object â—Ź Combination with RESTful API â—Ź S3 and swift compatible
  • 24. OSDC 2013 24 Operations: Growth, shrinkage .. failures â—Ź A Must! â—Ź Easy â—Ź Rebalance! â—Ź Order of servers important
  • 25. OSDC 2013 25 What else ...? â—Ź Encryption :-| â—Ź Compression :-( â—Ź Snapshots :-( â—Ź Hadoop connector :-) â—Ź Locking granularity :-| â—Ź File system statistics :-)
  • 27. OSDC 2013 27 NAS replacement â—Ź NFS as 1:1 â—Ź Server: GlusterFS â—Ź Client: NFS â—Ź NFS as such â—Ź Server: GlusterFS â—Ź Client: GlusterFS
  • 28. OSDC 2013 28 SAN replacement â—Ź Not quite advanced (yet) â—Ź New translator needed â—Ź Development started â—Ź Presenting GlusterFS as block device â—Ź Additional items needed â—Ź Locking â—Ź ...
  • 29. OSDC 2013 29 Storage back-end for KVM and Co â—Ź Stacked (indirect) â—Ź Not smart â—Ź Workable for main hypervisors â—Ź Direct â—Ź QEMU â—Ź libvirt â—Ź oVirt/RHEV
  • 31. OSDC 2013 31 Take aways â—Ź Thin distributed file system layer â—Ź Modular architecture â—Ź Operationally ready â—Ź Still some surprises â—Ź Active development and community
  • 32. OSDC 2013 32 References â—Ź http://www.gluster.org â—Ź http://www.sxc.hu (pictures)
  • 34. OSDC 2013 34 Distributed Storage with GlusterFS Dr. Udo Seidel Linux-Strategy @ Amadeus