SlideShare a Scribd company logo
Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Spectrum Scale Replication & Stretch
Clusters
© Copyright IBM Corporation 2015
Unit objectives
After completing this unit, you should be able to:
• Describe Replication
• Describe the Pros and Cons of Replication
• Describe a Stretch Cluster
© Copyright IBM Corporation 2015
Synchronous data replication
• Allows you to synchronously replicate
– A file, set of files or the entire file system
• Gives you better replication granularity as opposed to mirroring an entire
volume, which also saves on space used
– Allows you to replicate Metadata and/or data
– Provides additional layer(s) of protection in addition to the RAID level
protection of volumes underneath
– Supports a maximum of 3 copies of the data
– Replication is Synchronous only
– Asynchronous Replication can be achieved using AFM feature
© Copyright IBM Corporation 2015
Synchronous data replication
• To replicate or not replicate?
– This is Spectrum Scale level of replication which is an availability level
"on top of" the already built-in data availability (RAID) characteristics of
the disk subsystem(s) being used
– Can be used cross site
– Some performance impact.
Writes are 50 – 67% slower with replication levels 1 & 2
Reads are the same speed
– Your storage effectively becomes more expensive since you are using
more of your usable space for duplicate copies of your data
© Copyright IBM Corporation 2015
Data Replication Warnings
• If you decide to use Replication
– Always replicate your Metadata at the minimum
– Never replicate your data and not your metadata
• If you were to do this, then in the event of a failure, you would not be able
to mount your file system to retrieve your replicated data
© Copyright IBM Corporation 2015
Replication relies on failure groups
• Failure group
– A group of disks in a storage pool that Spectrum Scale assumes are
separate from the disks other failure groups.
– Can be changed anytime (mmrestripefs to fix data)
• A file is replicated when a copy of the data blocks exist in two
failure groups
– Ensures that no two replicas of the same block will become
unavailable due to a single failure.
• Can be set either at NSD creation time using the mmcrnsd
command or later on using the mmchdisk command.
• Important to set failure groups correctly to have effective file
system replication.
• Replication is per storage pool.
© Copyright IBM Corporation 2015
The third failure group
File System Descriptor Quorum
• In addition to quorum nodes three
disks, by default (NSDs) are used
as file system descriptors disks.
• A majority of the replicas on the
subset of disks must remain
available to sustain file system
operations.
• Spectrum Scale can move them
from one disk to another in case of
failure.
• Use the mmlsdisk –L command
to see the location of the descriptor.
• Can add one by creating a
descOnly NSD.
© Copyright IBM Corporation 2008
Disk Descriptor
Quorum
Node 1
Node 2
Replication/Failure groups and storage pools
• Creation of NSD requires [ mmcrnsd ]
– O/S disk name
– NSD Server List
• Optional, but recommended
– NSD name
– Failure Group (related to Replication)
– Storage Pool (related to Policy / ILM )
• Disk Stanza
%nsd:
device=/dev/sdav2
nsd=nsd1
servers=k145n06,k145n05
usage=dataOnly
failureGroup=5
pool=poolA
© Copyright IBM Corporation 2015
Accessing replicated data
• Default operation
– Read: read from all copies
– Write: write both copies
• Control with readReplicaPolicy
– Local: Read from block device or NSD server on same subnet
– Used for read heavy workload replicated across distance
• Operation with unavailable disk
– Disk marked “down” in FS descriptor
– Read: read available copy
– Write:
• Log changes for fast recovery (possible performance impact)
• Set “missing update” flag in the inode
• Write available copy
• Recovery
– Replay only changes to restored storage
© Copyright IBM Corporation 2015
Replication examples
• Full Replication
– 2 Failure Groups
– Data and Metadata
– On failure file system all ok
• Metadata Replication
– Replicate only metadata
– On failure data missing file system
stays mounted
Failure Group 1
Failure Group 2
Failure Group 1
Failure Group 2
Failure Group 3
Failure Group 4
inode
inode
Missing Data
Metadata OK
Failure Group 3 (Desc Only)
© Copyright IBM Corporation 2015
Mixing replication with pools
• Replicate only metadata
• 3 Data pools for capacity and single namespace
© Copyright IBM Corporation 2015
• Multi-site quorum configuration
• Replicate across sites
• Bandwidth requirements based on
application
• Often called - Two sites and a laptop
• Distributed data
– data is distributed across 2 sites, 3rd site contains
quorum node for availability
• Sites A and B
– Contain the core Spectrum Scale nodes and storage
– Multiple quorum nodes in each site
• Site C
– Contains a single quorum node
– Serves as tie breaker if one of the other sites
becomes inaccessible
– File System Descriptor NSD
Reliability: Multiple site High availability
Single Spectrum Scale System
Site A
Site B
Site C
WAN
© Copyright IBM Corporation 2015
Recovering from a storage failure
• Fix replication using mmrestripefs
mmrestripefs -R
• Usage:
mmrestripefs Device {-m | -r | -b | -p | -R} [-P PoolName]
[-N {Node[,Node...] | NodeFile | NodeClass}]
© Copyright IBM Corporation 2015
Review
• Replication can be on a single file or a whole file system
• Replication is spread across failure groups
• Replication is even more important when you do not have any
RAID support underneath for your volumes
• Replication is always synchronous
• Asynchronous Replication is covered by another feature called
Active File Management (AFM)
© Copyright IBM Corporation 2015
Course materials may not be reproduced in whole or in part without the prior written permission of IBM.
Spectrum Scale Stretch Clusters
© Copyright IBM Corporation 2015
Spectrum Scale Stretch Clusters
• Stretch Clusters combine two or more clusters together to
make one giant cluster
• Stretch Clusters are intended for inter-site or close proximity
clusters, not over WAN unless the amount of data is small
• Replication is not required, but is usually the intention for a
stretch cluster
• If replication between clusters is not the goal, then you might
possibly prefer using a multi-cluster set up.
• If replication is the goal, but it’s between data centers using a
WAN, then AFM may be a better choice if synchronous
replication is NOT required.
© Copyright IBM Corporation 2015
© 2013 IBM Corporation
Spectrum Scale Architecture
(Basics)
SAN, Shared SAS,
Twin Tailed, etc.
LUN = Logical Unit Number / NSD = Network Shared Disk
1
SAN LUN
Spectrum
Scale NSD
„1:1“ Relation
© 2013 IBM Corporation
Spectrum Scale Architecture
(Basics)
SAN
LUN = Logical Unit Number / NSD = Network Shared Disk
1a
SAN (etc) LUN
Spectrum
Scale NSD
„1:1“ Relation
Twin-tailed
SAS
© 2013 IBM Corporation
Spectrum Scale Architecture
(Common)
SAN
LAN
LUN‘s
Spectrum Scale NSD Client
Spectrum Scale NSD Server
2
© 2013 IBM Corporation
Spectrum Scale Architecture (Typical)
SAN
LAN / WAN / Infiniband & any Mixture
3
+ Twin-Tailed Disks
+ Internal Disks
FPO FPO
(FPO = File Placement Optimizer)
Spectrum Scale
replication of data on
disk
One (or multiple) filesystems
Files placed on different devices under policy control
Spectrum Scale NSD Clients
Spectrum Scale NSD Server
LUNs
© 2013 IBM Corporation
LUN‘s
NSD Clients
NSD Server
(NSD = Network Shared Disk)
LAN
Infiniband
remote cluster
Remote Cluster Mount (synchronous)
local cluster
4
© 2013 IBM Corporation
LUN‘s
NSD Clients
NSD Server
Inter-site LAN
Local LAN
Stretch Cluster (synchronous)
4a
Quorum node at 3rd site
Local LAN
Spectrum Scale
replication of data
between sites
Filesystem active across both sites
Site 2Site 1
© 2013 IBM Corporation
LUN‘s
NSD Clients
NSD Server
(NSD = Network Shared Disk)
WAN
Infiniband
remote cluster
Spectrum Scale Advanced File
Management (async)local cluster
Caching (R/W)
5
Exercise 4
Replication
Exercise
© Copyright IBM Corporation 2015
Unit summary
Having completed this unit, you should be able to:
• Describe replication
• Describe a Stretch Cluster
© Copyright IBM Corporation 2015

More Related Content

What's hot (20)

PPTX
IBM Spectrum Scale Authentication for Protocols
Sandeep Patil
 
PDF
Gpfs introandsetup
asihan
 
PDF
Spectrum Scale Memory Usage
Tomer Perry
 
PDF
IBM Spectrum Scale for File and Object Storage
Tony Pearson
 
PPTX
IBM Spectrum Scale Secure- Secure Data in Motion and Rest
Sandeep Patil
 
PPT
Ibm spectrum scale fundamentals workshop for americas part 8 spectrumscale ba...
xKinAnx
 
PDF
Oracle Flex ASM - What’s New and Best Practices by Jim Williams
Markus Michalewicz
 
PPTX
Ansible presentation
Suresh Kumar
 
PPTX
IBM Spectrum Scale Authentication For Object - Deep Dive
Smita Raut
 
PPTX
IBM GPFS
Karthik V
 
PPTX
Ibm spectrum scale fundamentals workshop for americas part 1 components archi...
xKinAnx
 
PDF
왜 쿠버네티스는 systemd로 cgroup을 관리하려고 할까요
Jo Hoon
 
PDF
Advanced RAC troubleshooting: Network
Riyaj Shamsudeen
 
PDF
Optimizing MariaDB for maximum performance
MariaDB plc
 
PDF
2021.02 new in Ceph Pacific Dashboard
Ceph Community
 
PDF
Glusterfs 구성제안 및_운영가이드_v2.0
sprdd
 
PDF
Ceph RBD Update - June 2021
Ceph Community
 
PPTX
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3
DataWorks Summit
 
PDF
PostgreSQL Streaming Replication Cheatsheet
Alexey Lesovsky
 
PPTX
Maria db 이중화구성_고민하기
NeoClova
 
IBM Spectrum Scale Authentication for Protocols
Sandeep Patil
 
Gpfs introandsetup
asihan
 
Spectrum Scale Memory Usage
Tomer Perry
 
IBM Spectrum Scale for File and Object Storage
Tony Pearson
 
IBM Spectrum Scale Secure- Secure Data in Motion and Rest
Sandeep Patil
 
Ibm spectrum scale fundamentals workshop for americas part 8 spectrumscale ba...
xKinAnx
 
Oracle Flex ASM - What’s New and Best Practices by Jim Williams
Markus Michalewicz
 
Ansible presentation
Suresh Kumar
 
IBM Spectrum Scale Authentication For Object - Deep Dive
Smita Raut
 
IBM GPFS
Karthik V
 
Ibm spectrum scale fundamentals workshop for americas part 1 components archi...
xKinAnx
 
왜 쿠버네티스는 systemd로 cgroup을 관리하려고 할까요
Jo Hoon
 
Advanced RAC troubleshooting: Network
Riyaj Shamsudeen
 
Optimizing MariaDB for maximum performance
MariaDB plc
 
2021.02 new in Ceph Pacific Dashboard
Ceph Community
 
Glusterfs 구성제안 및_운영가이드_v2.0
sprdd
 
Ceph RBD Update - June 2021
Ceph Community
 
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3
DataWorks Summit
 
PostgreSQL Streaming Replication Cheatsheet
Alexey Lesovsky
 
Maria db 이중화구성_고민하기
NeoClova
 

Similar to Ibm spectrum scale fundamentals workshop for americas part 4 spectrum scale_replication_and_stretch_clusters (20)

PPTX
Ibm spectrum scale_backup_n_archive_v03_ash
Ashutosh Mate
 
PPTX
DAS RAID NAS SAN
Ghassen Smida
 
PDF
Reference Architecture: Architecting Ceph Storage Solutions
Ceph Community
 
PPTX
D81242GC20_les01.pptx
AbdallahAttia9
 
PDF
S ss0885 spectrum-scale-elastic-edge2015-v5
Tony Pearson
 
PDF
les12.pdf
VAMSICHOWDARY61
 
PPT
Storage, San And Business Continuity Overview
Alan McSweeney
 
PDF
Gluster for Geeks: Performance Tuning Tips & Tricks
GlusterFS
 
PDF
1.8 Data Protection.pdf
ssuser8b6c85
 
PDF
Hdfs architecture
Aisha Siddiqa
 
PDF
1Z0-027 Exam-Oracle Exadata Database Machine Administration, Software Release
Isabella789
 
PDF
High Availability Storage (susecon2016)
Roger Zhou 周志强
 
PPTX
Windows Server 2019.pptx
masbulosoke
 
PDF
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYA
AditiaHamid1
 
PPTX
Massstorage
mari sami
 
PPTX
GDPS and System Complex
Najmi Mansoor Ahmed
 
PPTX
Raid
Piyush Rochwani
 
DOCX
Tier 2 net app baseline design standard revised nov 2011
Accenture
 
PDF
O Racle Asm Best Practices Presentation
eraz
 
PPTX
Presentation db2 best practices for optimal performance
solarisyougood
 
Ibm spectrum scale_backup_n_archive_v03_ash
Ashutosh Mate
 
DAS RAID NAS SAN
Ghassen Smida
 
Reference Architecture: Architecting Ceph Storage Solutions
Ceph Community
 
D81242GC20_les01.pptx
AbdallahAttia9
 
S ss0885 spectrum-scale-elastic-edge2015-v5
Tony Pearson
 
les12.pdf
VAMSICHOWDARY61
 
Storage, San And Business Continuity Overview
Alan McSweeney
 
Gluster for Geeks: Performance Tuning Tips & Tricks
GlusterFS
 
1.8 Data Protection.pdf
ssuser8b6c85
 
Hdfs architecture
Aisha Siddiqa
 
1Z0-027 Exam-Oracle Exadata Database Machine Administration, Software Release
Isabella789
 
High Availability Storage (susecon2016)
Roger Zhou 周志强
 
Windows Server 2019.pptx
masbulosoke
 
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYA
AditiaHamid1
 
Massstorage
mari sami
 
GDPS and System Complex
Najmi Mansoor Ahmed
 
Tier 2 net app baseline design standard revised nov 2011
Accenture
 
O Racle Asm Best Practices Presentation
eraz
 
Presentation db2 best practices for optimal performance
solarisyougood
 
Ad

More from xKinAnx (20)

PPTX
Engage for success ibm spectrum accelerate 2
xKinAnx
 
PPTX
Accelerate with ibm storage ibm spectrum virtualize hyper swap deep dive
xKinAnx
 
PDF
Software defined storage provisioning using ibm smart cloud
xKinAnx
 
PDF
Ibm spectrum virtualize 101
xKinAnx
 
PDF
Accelerate with ibm storage ibm spectrum virtualize hyper swap deep dive dee...
xKinAnx
 
PDF
04 empalis -ibm_spectrum_protect_-_strategy_and_directions
xKinAnx
 
PPTX
Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...
xKinAnx
 
PPTX
Ibm spectrum scale fundamentals workshop for americas part 3 Information Life...
xKinAnx
 
PPTX
Ibm spectrum scale fundamentals workshop for americas part 5 spectrum scale_c...
xKinAnx
 
PPTX
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
xKinAnx
 
PPTX
Ibm spectrum scale fundamentals workshop for americas part 7 spectrumscale el...
xKinAnx
 
PDF
Presentation disaster recovery in virtualization and cloud
xKinAnx
 
PDF
Presentation disaster recovery for oracle fusion middleware with the zfs st...
xKinAnx
 
PDF
Presentation differentiated virtualization for enterprise clouds, large and...
xKinAnx
 
PDF
Presentation desktops for the cloud the view rollout
xKinAnx
 
PDF
Presentation design - key concepts and approaches for designing your deskto...
xKinAnx
 
PDF
Presentation desarrollos cloud con oracle virtualization
xKinAnx
 
PDF
Presentation deploying cloud based services
xKinAnx
 
PDF
Presentation dell™ power vault™ md3
xKinAnx
 
PDF
Presentation defend your company against cyber threats with security solutions
xKinAnx
 
Engage for success ibm spectrum accelerate 2
xKinAnx
 
Accelerate with ibm storage ibm spectrum virtualize hyper swap deep dive
xKinAnx
 
Software defined storage provisioning using ibm smart cloud
xKinAnx
 
Ibm spectrum virtualize 101
xKinAnx
 
Accelerate with ibm storage ibm spectrum virtualize hyper swap deep dive dee...
xKinAnx
 
04 empalis -ibm_spectrum_protect_-_strategy_and_directions
xKinAnx
 
Ibm spectrum scale fundamentals workshop for americas part 2 IBM Spectrum Sca...
xKinAnx
 
Ibm spectrum scale fundamentals workshop for americas part 3 Information Life...
xKinAnx
 
Ibm spectrum scale fundamentals workshop for americas part 5 spectrum scale_c...
xKinAnx
 
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
xKinAnx
 
Ibm spectrum scale fundamentals workshop for americas part 7 spectrumscale el...
xKinAnx
 
Presentation disaster recovery in virtualization and cloud
xKinAnx
 
Presentation disaster recovery for oracle fusion middleware with the zfs st...
xKinAnx
 
Presentation differentiated virtualization for enterprise clouds, large and...
xKinAnx
 
Presentation desktops for the cloud the view rollout
xKinAnx
 
Presentation design - key concepts and approaches for designing your deskto...
xKinAnx
 
Presentation desarrollos cloud con oracle virtualization
xKinAnx
 
Presentation deploying cloud based services
xKinAnx
 
Presentation dell™ power vault™ md3
xKinAnx
 
Presentation defend your company against cyber threats with security solutions
xKinAnx
 
Ad

Recently uploaded (20)

PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Biography of Daniel Podor.pdf
Daniel Podor
 
PDF
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PDF
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
PPTX
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Biography of Daniel Podor.pdf
Daniel Podor
 
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 

Ibm spectrum scale fundamentals workshop for americas part 4 spectrum scale_replication_and_stretch_clusters

  • 1. Course materials may not be reproduced in whole or in part without the prior written permission of IBM. Spectrum Scale Replication & Stretch Clusters © Copyright IBM Corporation 2015
  • 2. Unit objectives After completing this unit, you should be able to: • Describe Replication • Describe the Pros and Cons of Replication • Describe a Stretch Cluster © Copyright IBM Corporation 2015
  • 3. Synchronous data replication • Allows you to synchronously replicate – A file, set of files or the entire file system • Gives you better replication granularity as opposed to mirroring an entire volume, which also saves on space used – Allows you to replicate Metadata and/or data – Provides additional layer(s) of protection in addition to the RAID level protection of volumes underneath – Supports a maximum of 3 copies of the data – Replication is Synchronous only – Asynchronous Replication can be achieved using AFM feature © Copyright IBM Corporation 2015
  • 4. Synchronous data replication • To replicate or not replicate? – This is Spectrum Scale level of replication which is an availability level "on top of" the already built-in data availability (RAID) characteristics of the disk subsystem(s) being used – Can be used cross site – Some performance impact. Writes are 50 – 67% slower with replication levels 1 & 2 Reads are the same speed – Your storage effectively becomes more expensive since you are using more of your usable space for duplicate copies of your data © Copyright IBM Corporation 2015
  • 5. Data Replication Warnings • If you decide to use Replication – Always replicate your Metadata at the minimum – Never replicate your data and not your metadata • If you were to do this, then in the event of a failure, you would not be able to mount your file system to retrieve your replicated data © Copyright IBM Corporation 2015
  • 6. Replication relies on failure groups • Failure group – A group of disks in a storage pool that Spectrum Scale assumes are separate from the disks other failure groups. – Can be changed anytime (mmrestripefs to fix data) • A file is replicated when a copy of the data blocks exist in two failure groups – Ensures that no two replicas of the same block will become unavailable due to a single failure. • Can be set either at NSD creation time using the mmcrnsd command or later on using the mmchdisk command. • Important to set failure groups correctly to have effective file system replication. • Replication is per storage pool. © Copyright IBM Corporation 2015
  • 7. The third failure group File System Descriptor Quorum • In addition to quorum nodes three disks, by default (NSDs) are used as file system descriptors disks. • A majority of the replicas on the subset of disks must remain available to sustain file system operations. • Spectrum Scale can move them from one disk to another in case of failure. • Use the mmlsdisk –L command to see the location of the descriptor. • Can add one by creating a descOnly NSD. © Copyright IBM Corporation 2008 Disk Descriptor Quorum Node 1 Node 2
  • 8. Replication/Failure groups and storage pools • Creation of NSD requires [ mmcrnsd ] – O/S disk name – NSD Server List • Optional, but recommended – NSD name – Failure Group (related to Replication) – Storage Pool (related to Policy / ILM ) • Disk Stanza %nsd: device=/dev/sdav2 nsd=nsd1 servers=k145n06,k145n05 usage=dataOnly failureGroup=5 pool=poolA © Copyright IBM Corporation 2015
  • 9. Accessing replicated data • Default operation – Read: read from all copies – Write: write both copies • Control with readReplicaPolicy – Local: Read from block device or NSD server on same subnet – Used for read heavy workload replicated across distance • Operation with unavailable disk – Disk marked “down” in FS descriptor – Read: read available copy – Write: • Log changes for fast recovery (possible performance impact) • Set “missing update” flag in the inode • Write available copy • Recovery – Replay only changes to restored storage © Copyright IBM Corporation 2015
  • 10. Replication examples • Full Replication – 2 Failure Groups – Data and Metadata – On failure file system all ok • Metadata Replication – Replicate only metadata – On failure data missing file system stays mounted Failure Group 1 Failure Group 2 Failure Group 1 Failure Group 2 Failure Group 3 Failure Group 4 inode inode Missing Data Metadata OK Failure Group 3 (Desc Only) © Copyright IBM Corporation 2015
  • 11. Mixing replication with pools • Replicate only metadata • 3 Data pools for capacity and single namespace © Copyright IBM Corporation 2015
  • 12. • Multi-site quorum configuration • Replicate across sites • Bandwidth requirements based on application • Often called - Two sites and a laptop • Distributed data – data is distributed across 2 sites, 3rd site contains quorum node for availability • Sites A and B – Contain the core Spectrum Scale nodes and storage – Multiple quorum nodes in each site • Site C – Contains a single quorum node – Serves as tie breaker if one of the other sites becomes inaccessible – File System Descriptor NSD Reliability: Multiple site High availability Single Spectrum Scale System Site A Site B Site C WAN © Copyright IBM Corporation 2015
  • 13. Recovering from a storage failure • Fix replication using mmrestripefs mmrestripefs -R • Usage: mmrestripefs Device {-m | -r | -b | -p | -R} [-P PoolName] [-N {Node[,Node...] | NodeFile | NodeClass}] © Copyright IBM Corporation 2015
  • 14. Review • Replication can be on a single file or a whole file system • Replication is spread across failure groups • Replication is even more important when you do not have any RAID support underneath for your volumes • Replication is always synchronous • Asynchronous Replication is covered by another feature called Active File Management (AFM) © Copyright IBM Corporation 2015
  • 15. Course materials may not be reproduced in whole or in part without the prior written permission of IBM. Spectrum Scale Stretch Clusters © Copyright IBM Corporation 2015
  • 16. Spectrum Scale Stretch Clusters • Stretch Clusters combine two or more clusters together to make one giant cluster • Stretch Clusters are intended for inter-site or close proximity clusters, not over WAN unless the amount of data is small • Replication is not required, but is usually the intention for a stretch cluster • If replication between clusters is not the goal, then you might possibly prefer using a multi-cluster set up. • If replication is the goal, but it’s between data centers using a WAN, then AFM may be a better choice if synchronous replication is NOT required. © Copyright IBM Corporation 2015
  • 17. © 2013 IBM Corporation Spectrum Scale Architecture (Basics) SAN, Shared SAS, Twin Tailed, etc. LUN = Logical Unit Number / NSD = Network Shared Disk 1 SAN LUN Spectrum Scale NSD „1:1“ Relation
  • 18. © 2013 IBM Corporation Spectrum Scale Architecture (Basics) SAN LUN = Logical Unit Number / NSD = Network Shared Disk 1a SAN (etc) LUN Spectrum Scale NSD „1:1“ Relation Twin-tailed SAS
  • 19. © 2013 IBM Corporation Spectrum Scale Architecture (Common) SAN LAN LUN‘s Spectrum Scale NSD Client Spectrum Scale NSD Server 2
  • 20. © 2013 IBM Corporation Spectrum Scale Architecture (Typical) SAN LAN / WAN / Infiniband & any Mixture 3 + Twin-Tailed Disks + Internal Disks FPO FPO (FPO = File Placement Optimizer) Spectrum Scale replication of data on disk One (or multiple) filesystems Files placed on different devices under policy control Spectrum Scale NSD Clients Spectrum Scale NSD Server LUNs
  • 21. © 2013 IBM Corporation LUN‘s NSD Clients NSD Server (NSD = Network Shared Disk) LAN Infiniband remote cluster Remote Cluster Mount (synchronous) local cluster 4
  • 22. © 2013 IBM Corporation LUN‘s NSD Clients NSD Server Inter-site LAN Local LAN Stretch Cluster (synchronous) 4a Quorum node at 3rd site Local LAN Spectrum Scale replication of data between sites Filesystem active across both sites Site 2Site 1
  • 23. © 2013 IBM Corporation LUN‘s NSD Clients NSD Server (NSD = Network Shared Disk) WAN Infiniband remote cluster Spectrum Scale Advanced File Management (async)local cluster Caching (R/W) 5
  • 25. Unit summary Having completed this unit, you should be able to: • Describe replication • Describe a Stretch Cluster © Copyright IBM Corporation 2015