SlideShare a Scribd company logo
LIQUID-A Scalable Deduplication File System For Virtual Machine Images
CONTENTS
 INTRODUCTION
 ADVANTAGE OF CLOUD COMPUTING
 VIRTUAL MACHINE
 ADVANTAGES OF VIRTUAL MACHINE
 DISADVANTAGES OF VIRTUAL MACHINE
 DEDUPLICATION
 BENEFITSOFDEDUPLICATION
 EXISTING SYSTEM
 ISSUES IN VM STORAGE
 LIQUID SYSTEM ARCHITECTURE
 DEDUPLICATION IN LIQUID
CONTENTS(cont)
 OPTIMIZATION ON FINGER PRINT CALCULATION
 FILE SYSTEM LAYOUT
 COMMUNICATION AMONG COMPONENTS
 HEART BEAT PROTOCOL
 FAST CLONING FOR VM IMAGE
 FAULT TOLERANCE
 GARBAGE COLLECTIONS
 ADVANTAGES OF LIQUID
 CONCLUSION
 REFERENCES
INTRODUCTION
Cloud computing means storing and accessing
data programs over internet instead of yours
computers hard drive.
Figure 1: A Sample Cloud Computing Network[1]
ADVANTAGE CLOUD COMPUTING
Lower computer cost.
Improved performance.
Reduced software cost.
Instant software cost.
Unlimited storage capacity.
Increased data reliability.
Device independence and the “always on!,anywhere
and any place”.
Free from maintenance and the “no-need-to-know”.
VIRTUAL MACHINE
A virtual machine is a software that creates a
virtualized environment between the computer
platform and the end user in which the end user can
operate software.
Vitualization deals with extending or replacing an
existing interface so as to mimic the behavior of
another system.
Crucial component in cloud computing.
VIRTUAL MACHINE(cont)
Virtual machine – Hypothetical computer.
Execute programs like a physical machine.
Initial state of a virtual machine is stored in a file
called virtual machine image.
VIRTUAL MACHINE(cont)
Figure 2 :Virtual Machine Representation[1]
ADVANTAGES OF VIRTUAL MACHINE
Familiar interfaces
Isolation
-Each OS run seperately with its own virtual
resources.
High Availability
-If one VM server is failed then easily access data
from another one.
Scalability
-Add Or remove resources easily.
ADVANTAGES OF VIRTUAL
MACHINE(cont)
Back up with fast recovery
-Using VMDK data recovery tool.
Reduction of cost
-it save cost by running multi OS on single
machine.
-sharing of hardware.
DISADVANTAGES OF VIRTUAL
MACHINE
Difficulty in direct access to hardware.
Great RAM consumption since each virtual machine
will occupy a separate area of the same.
Greate use of disk space , since it takes all the files for
each operating system installed on each virtual
machine.
A virtual machine is less efficient than an actual
machine when it access the host hard drive indirectly.
DEDUPLICATION
Data Deduplication – data compression technology.
Eliminate duplicate copies of repeating data.
A redundant data block is replaced instead of storing
multiple times.
Improves storage utilization.
DEDUPLICATION(cont)
Figure 3: Deduplicated File System[2]
DEDUPLICATION(cont)
14
Traditional
File system
copycopy
Deduplication
copy
Idea. Eliminate
storage of data with
same content No new
blocks!
User’s point of
view
BENEFITS OF DEDUPLICATION
 Lower storage space requirements.
 Minimize the additional storage cost.
Performance increased.
Increase Network efficiency.
 Efficient Volume replication.
EXISTING SYSTEM
Hypervisors such as xen ,KVM etc.
Network Attached Storage(NAS)
Storage Area Network(SAN)
Direct Attached storage(DAS)
ISSUES IN VM STORAGE
High demand on VM storage remains a challenging
problem.
Existing systems have made efforts to reduce storage
consuptions.
Uses SAN cluster.
Cannot satisfy increasing demand due to cost
limitatations.
Hence we propose LIQUID.
LIQUID SYSTEM ARCHITECTURE
Three compononts – single meta server with hot back
up , multiple data server and multiple clients.
Runs on user – level service process.
VM images are split into fixed size data blocks.
Meta server – namespace , finger print , reference
count.
Meta server – mirrored to hot back up shadow meta
server.
LIQUID SYSTEM ARCHITECTURE(cont)
Data servers – charge of managing data blocks in VM
images.
Organized in a distributed hash table.
A liquid client provides a POSIX compatible file
system.
Client – critical component (provides deduplication).
Fault tolerance – mirroring the meta server.
Replicas of data blocks are stored.
LIQUID SYSTEM ARCHITECTURE(cont)
Figure 4: Liquid Architecture[3]
DEDUPLICATION IN LIQUID
Liquid choose fixed size chunking instead of variable
size chunking.
Better since all files stored in vm images will be aligned
on disk block boundaries.
Advantage – simplicity.
Block size choice.
Block size – balancing factor which is hard to choose.
Great impact on both deduplication and IO
performance.
DEDUPLICATION IN LIQUID(cont)
Smaller block size – more random seeks when
accessing a VM image.
Not tolerable.
A block size also not preferable , it will reduce
deduplication ratio.
Liquid choose different block size under different
situation.
Advised to use a multiplication of 4 kb between 256 kb
and 1 MB to achieve good balance between IO
performance and deduplication ratio.
DEDUPLICATION IN LIQUID(cont)
DEDUPLICATION IN LIQUID(cont)
OPTIMIZATION ON FINGER PRINT
CALCULATION
Rely on comparison of data block finger prints for
redundancy.
Finger print – collision resistant hash value calculated
from data block contents.
MD5(26) and SHA-1[12] are frequently used for this
purpose.
Finger print collision – very small , orders of
magnitude smaller than hardware error rates.
OPTIMIZATION ON FINGER PRINT
CALCULATION(cont)
So we could safely assume that two data blocks are
identical.
Finger print calculation – expensive.
Delays finger print calculation for recently modified
data blocks.
Runs deduplication lazily only when it s necessary.
Client side maintains a shared cache which contains
recently accessed data blocks.
OPTIMIZATION ON FINGER PRINT
CALCULATION(cont)
A portion of memory is used by the client side of
liquid as private cache.
Private cache hold – modified data blocks and delay
finger print calculation on them.
Modified data block ejected from shared cache and
added to private cache.
Modified data will be ejected if private cache becomes
full.
OPTIMIZATION ON FINGER PRINT
CALCULATION(cont)
And ejected based on LRU policy.
Only then will the modified data block’s finger print
be calculated.
Liquid uses multiple threads for finger print
calculation.
Multiple threads will process different data blocks
currently.
Provides good IO performance.
FILE SYSTEM LAYOUT
All file system meta data are stored on the meta server.
Organized in a file system tree.
Client side could cache portions of file systems meta
data for fast accesses.
When a VM is stopped , modified meta data and data
blocks.
Will be pushed back to meta server.
Data servers ensures modification on VM image is
visible to other client nodes.
FILE SYSTEM LAYOUT(cont)
Figure 5: Process Of Look-up by Fingerprint[4]
COMMUNICATION AMONG
COMPONENTS
 Heartbeatprotocol
 P2P Data Block Sharing
 On-Demand Data Block Fetching
HEART BEAT PROTOCOL
META SERVER – manages all data servers.
Exchange regular heart beat message with each data
server in a ROUND ROBIN FASHION.
Detect failed data servers when there are many data
servers.
To speed up failure detection data servers send an
error signal to meta server.
FAST CLONING FOR VM IMAGE
Copying large images may be time consuming.
Liquid provide efficient solution by means of fast
cloning.
VM images represented by meta data files having
reference to data blocks.
By copying meta data file and updating reference
count a clone VM image is achieved.
Modification on cloned images will not effect the
original image.
FAULT TOLERANCE
Data replication
Data migration
Hot backup of meta server
GARBAGE COLLECTIONS
Removes unused garbage data blocks when running
out of space.
Reference counting of all data blocks are maintained
by meta servers.
Garbage collection request is issued periodically to
data server.
Garbage collection is executed based on the data block
membership in the bloom filter.
ADVANTAGES OF LIQUID
Fast virtual machine deployment with peer to peer
data transfer.
Low storage consumption by means of deduplication.
Instant cloning for virtual machine images.
On demand fetching through a network caching with
local disks.
LIQUID files has no specific limit.
CONCLUSION
Presented LIQUID which is a deduplication file system
with good IO performance.
Achieve by caching frequently accessed data blocks in
memory cache.
Avoids additional disk operations.
Deduplication of VM images proved to be effective.
REFERENCES
[1] www.slideshare.net
[2]www.IJTRD1303.pdf
[3] www.cs.newpaltz.edu/~lik/publications/Xun-
Zhao-IEEE-TPDS-2014.pd.
[4]www.internationaljournalssrg.org/IJCSE/2015/Speci
al.../IJCSE-ICETSH-P118.pdf
LIQUID-A Scalable Deduplication File System For Virtual Machine Images
LIQUID-A Scalable Deduplication File System For Virtual Machine Images

More Related Content

What's hot (20)

PPTX
Warehouse scale computer
Hassan A-j
 
PPTX
Double-Take for Migrations - thinkASG University Series
thinkASG
 
PPTX
3 Hyper V
johnbakerMS
 
PDF
Xen Cloud Platform Installation Guide
Susheel Thakur
 
PPT
Lecture 4 Cluster Computing
Dr. Shaikh A.Khalique
 
PPTX
Hyper V In Windows Server 2008 R2.Son Vu
vncson
 
PPTX
Storage Virtualization Challenges
Randy Weis
 
PPTX
High performance computing
punjab engineering college, chandigarh
 
PDF
Scaling Small App
Sabbir Ahmmed
 
PPT
Cluster Computers
shopnil786
 
PPTX
Double-Take Availability - Technical Presentation
Mücahid Akçay
 
PDF
[IC Manage] Workspace Acceleration & Network Storage Reduction
Perforce
 
PDF
TSM 6.4.1 intro
Solv AS
 
PPTX
Nutanix vdi workshop presentation
He Hariyadi
 
PDF
Gridstore's Software-Defined-Storage Architecture
Gridstore
 
PDF
Tombolo
Kiran Srinivasan
 
PDF
Resumption of virtual machines after adaptive deduplication of virtual machin...
IJECEIAES
 
PDF
A Survey of Performance Comparison between Virtual Machines and Containers
prashant desai
 
PDF
Live VM Migration
Shivam Singh
 
Warehouse scale computer
Hassan A-j
 
Double-Take for Migrations - thinkASG University Series
thinkASG
 
3 Hyper V
johnbakerMS
 
Xen Cloud Platform Installation Guide
Susheel Thakur
 
Lecture 4 Cluster Computing
Dr. Shaikh A.Khalique
 
Hyper V In Windows Server 2008 R2.Son Vu
vncson
 
Storage Virtualization Challenges
Randy Weis
 
High performance computing
punjab engineering college, chandigarh
 
Scaling Small App
Sabbir Ahmmed
 
Cluster Computers
shopnil786
 
Double-Take Availability - Technical Presentation
Mücahid Akçay
 
[IC Manage] Workspace Acceleration & Network Storage Reduction
Perforce
 
TSM 6.4.1 intro
Solv AS
 
Nutanix vdi workshop presentation
He Hariyadi
 
Gridstore's Software-Defined-Storage Architecture
Gridstore
 
Resumption of virtual machines after adaptive deduplication of virtual machin...
IJECEIAES
 
A Survey of Performance Comparison between Virtual Machines and Containers
prashant desai
 
Live VM Migration
Shivam Singh
 

Similar to LIQUID-A Scalable Deduplication File System For Virtual Machine Images (20)

PPTX
Database as a Service - Tutorial @ICDE 2010
DBIS @ Ilmenau University of Technology
 
PPTX
Future prediction-ds
Muhammad Umar Farooq
 
PPTX
Windows Server 2019.pptx
masbulosoke
 
PDF
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYA
AditiaHamid1
 
PPT
DB2 for z/O S Data Sharing
Surekha Parekh
 
PDF
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
IJET - International Journal of Engineering and Techniques
 
PPTX
Final report on GOING BACK AND FORTH EFFICIENT MULTIDEPLOYMENT AND MULTI SNAP...
gnvivekananda4u
 
PPTX
NoSQL
RithikRaj25
 
PPT
Introduction to Cloud computing
Mathews Job
 
PPTX
cluster computing
anjalibhandari11011995
 
PDF
Presentation v mware performance overview
solarisyourep
 
PDF
Azure and cloud design patterns
Venkatesh Narayanan
 
PDF
Benchmarking a Scalable and Highly Available Architecture for Virtual Desktops
DataCore Software
 
PPT
Cio Breakfast Roundtable 05142009 Final Virtualization
guestc900809
 
DOCX
Whitepaper nebucom intelligent application broking and provisioning in a hybr...
Nebucom
 
PPT
Deconstructing the brian paradox
Chetan Venkatesh
 
PDF
Data Grids with Oracle Coherence
Ben Stopford
 
DOCX
School of computer & information sciences its 532 cloud c
ssusere73ce3
 
PPTX
Handling Data in Mega Scale Systems
Directi Group
 
PDF
Virtual Storage Center
IBM Danmark
 
Database as a Service - Tutorial @ICDE 2010
DBIS @ Ilmenau University of Technology
 
Future prediction-ds
Muhammad Umar Farooq
 
Windows Server 2019.pptx
masbulosoke
 
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYA
AditiaHamid1
 
DB2 for z/O S Data Sharing
Surekha Parekh
 
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
IJET - International Journal of Engineering and Techniques
 
Final report on GOING BACK AND FORTH EFFICIENT MULTIDEPLOYMENT AND MULTI SNAP...
gnvivekananda4u
 
Introduction to Cloud computing
Mathews Job
 
cluster computing
anjalibhandari11011995
 
Presentation v mware performance overview
solarisyourep
 
Azure and cloud design patterns
Venkatesh Narayanan
 
Benchmarking a Scalable and Highly Available Architecture for Virtual Desktops
DataCore Software
 
Cio Breakfast Roundtable 05142009 Final Virtualization
guestc900809
 
Whitepaper nebucom intelligent application broking and provisioning in a hybr...
Nebucom
 
Deconstructing the brian paradox
Chetan Venkatesh
 
Data Grids with Oracle Coherence
Ben Stopford
 
School of computer & information sciences its 532 cloud c
ssusere73ce3
 
Handling Data in Mega Scale Systems
Directi Group
 
Virtual Storage Center
IBM Danmark
 
Ad

Recently uploaded (20)

PDF
2025 Laurence Sigler - Advancing Decision Support. Content Management Ecommer...
Francisco Javier Mora Serrano
 
PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
PDF
4 Tier Teamcenter Installation part1.pdf
VnyKumar1
 
PPTX
sunil mishra pptmmmmmmmmmmmmmmmmmmmmmmmmm
singhamit111
 
PPTX
Ground improvement techniques-DEWATERING
DivakarSai4
 
PDF
Packaging Tips for Stainless Steel Tubes and Pipes
heavymetalsandtubes
 
DOCX
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
PDF
Zero carbon Building Design Guidelines V4
BassemOsman1
 
PPTX
FUNDAMENTALS OF ELECTRIC VEHICLES UNIT-1
MikkiliSuresh
 
PPTX
Introduction to Fluid and Thermal Engineering
Avesahemad Husainy
 
PDF
CAD-CAM U-1 Combined Notes_57761226_2025_04_22_14_40.pdf
shailendrapratap2002
 
PPTX
Information Retrieval and Extraction - Module 7
premSankar19
 
PPTX
MT Chapter 1.pptx- Magnetic particle testing
ABCAnyBodyCanRelax
 
PPTX
MULTI LEVEL DATA TRACKING USING COOJA.pptx
dollysharma12ab
 
PPTX
Inventory management chapter in automation and robotics.
atisht0104
 
PPTX
Precedence and Associativity in C prog. language
Mahendra Dheer
 
PDF
Machine Learning All topics Covers In This Single Slides
AmritTiwari19
 
PDF
Jual GPS Geodetik CHCNAV i93 IMU-RTK Lanjutan dengan Survei Visual
Budi Minds
 
PDF
Natural_Language_processing_Unit_I_notes.pdf
sanguleumeshit
 
PPTX
ETP Presentation(1000m3 Small ETP For Power Plant and industry
MD Azharul Islam
 
2025 Laurence Sigler - Advancing Decision Support. Content Management Ecommer...
Francisco Javier Mora Serrano
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
4 Tier Teamcenter Installation part1.pdf
VnyKumar1
 
sunil mishra pptmmmmmmmmmmmmmmmmmmmmmmmmm
singhamit111
 
Ground improvement techniques-DEWATERING
DivakarSai4
 
Packaging Tips for Stainless Steel Tubes and Pipes
heavymetalsandtubes
 
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
Zero carbon Building Design Guidelines V4
BassemOsman1
 
FUNDAMENTALS OF ELECTRIC VEHICLES UNIT-1
MikkiliSuresh
 
Introduction to Fluid and Thermal Engineering
Avesahemad Husainy
 
CAD-CAM U-1 Combined Notes_57761226_2025_04_22_14_40.pdf
shailendrapratap2002
 
Information Retrieval and Extraction - Module 7
premSankar19
 
MT Chapter 1.pptx- Magnetic particle testing
ABCAnyBodyCanRelax
 
MULTI LEVEL DATA TRACKING USING COOJA.pptx
dollysharma12ab
 
Inventory management chapter in automation and robotics.
atisht0104
 
Precedence and Associativity in C prog. language
Mahendra Dheer
 
Machine Learning All topics Covers In This Single Slides
AmritTiwari19
 
Jual GPS Geodetik CHCNAV i93 IMU-RTK Lanjutan dengan Survei Visual
Budi Minds
 
Natural_Language_processing_Unit_I_notes.pdf
sanguleumeshit
 
ETP Presentation(1000m3 Small ETP For Power Plant and industry
MD Azharul Islam
 
Ad

LIQUID-A Scalable Deduplication File System For Virtual Machine Images

  • 2. CONTENTS  INTRODUCTION  ADVANTAGE OF CLOUD COMPUTING  VIRTUAL MACHINE  ADVANTAGES OF VIRTUAL MACHINE  DISADVANTAGES OF VIRTUAL MACHINE  DEDUPLICATION  BENEFITSOFDEDUPLICATION  EXISTING SYSTEM  ISSUES IN VM STORAGE  LIQUID SYSTEM ARCHITECTURE  DEDUPLICATION IN LIQUID
  • 3. CONTENTS(cont)  OPTIMIZATION ON FINGER PRINT CALCULATION  FILE SYSTEM LAYOUT  COMMUNICATION AMONG COMPONENTS  HEART BEAT PROTOCOL  FAST CLONING FOR VM IMAGE  FAULT TOLERANCE  GARBAGE COLLECTIONS  ADVANTAGES OF LIQUID  CONCLUSION  REFERENCES
  • 4. INTRODUCTION Cloud computing means storing and accessing data programs over internet instead of yours computers hard drive. Figure 1: A Sample Cloud Computing Network[1]
  • 5. ADVANTAGE CLOUD COMPUTING Lower computer cost. Improved performance. Reduced software cost. Instant software cost. Unlimited storage capacity. Increased data reliability. Device independence and the “always on!,anywhere and any place”. Free from maintenance and the “no-need-to-know”.
  • 6. VIRTUAL MACHINE A virtual machine is a software that creates a virtualized environment between the computer platform and the end user in which the end user can operate software. Vitualization deals with extending or replacing an existing interface so as to mimic the behavior of another system. Crucial component in cloud computing.
  • 7. VIRTUAL MACHINE(cont) Virtual machine – Hypothetical computer. Execute programs like a physical machine. Initial state of a virtual machine is stored in a file called virtual machine image.
  • 8. VIRTUAL MACHINE(cont) Figure 2 :Virtual Machine Representation[1]
  • 9. ADVANTAGES OF VIRTUAL MACHINE Familiar interfaces Isolation -Each OS run seperately with its own virtual resources. High Availability -If one VM server is failed then easily access data from another one. Scalability -Add Or remove resources easily.
  • 10. ADVANTAGES OF VIRTUAL MACHINE(cont) Back up with fast recovery -Using VMDK data recovery tool. Reduction of cost -it save cost by running multi OS on single machine. -sharing of hardware.
  • 11. DISADVANTAGES OF VIRTUAL MACHINE Difficulty in direct access to hardware. Great RAM consumption since each virtual machine will occupy a separate area of the same. Greate use of disk space , since it takes all the files for each operating system installed on each virtual machine. A virtual machine is less efficient than an actual machine when it access the host hard drive indirectly.
  • 12. DEDUPLICATION Data Deduplication – data compression technology. Eliminate duplicate copies of repeating data. A redundant data block is replaced instead of storing multiple times. Improves storage utilization.
  • 14. DEDUPLICATION(cont) 14 Traditional File system copycopy Deduplication copy Idea. Eliminate storage of data with same content No new blocks! User’s point of view
  • 15. BENEFITS OF DEDUPLICATION  Lower storage space requirements.  Minimize the additional storage cost. Performance increased. Increase Network efficiency.  Efficient Volume replication.
  • 16. EXISTING SYSTEM Hypervisors such as xen ,KVM etc. Network Attached Storage(NAS) Storage Area Network(SAN) Direct Attached storage(DAS)
  • 17. ISSUES IN VM STORAGE High demand on VM storage remains a challenging problem. Existing systems have made efforts to reduce storage consuptions. Uses SAN cluster. Cannot satisfy increasing demand due to cost limitatations. Hence we propose LIQUID.
  • 18. LIQUID SYSTEM ARCHITECTURE Three compononts – single meta server with hot back up , multiple data server and multiple clients. Runs on user – level service process. VM images are split into fixed size data blocks. Meta server – namespace , finger print , reference count. Meta server – mirrored to hot back up shadow meta server.
  • 19. LIQUID SYSTEM ARCHITECTURE(cont) Data servers – charge of managing data blocks in VM images. Organized in a distributed hash table. A liquid client provides a POSIX compatible file system. Client – critical component (provides deduplication). Fault tolerance – mirroring the meta server. Replicas of data blocks are stored.
  • 20. LIQUID SYSTEM ARCHITECTURE(cont) Figure 4: Liquid Architecture[3]
  • 21. DEDUPLICATION IN LIQUID Liquid choose fixed size chunking instead of variable size chunking. Better since all files stored in vm images will be aligned on disk block boundaries. Advantage – simplicity. Block size choice. Block size – balancing factor which is hard to choose. Great impact on both deduplication and IO performance.
  • 22. DEDUPLICATION IN LIQUID(cont) Smaller block size – more random seeks when accessing a VM image. Not tolerable. A block size also not preferable , it will reduce deduplication ratio. Liquid choose different block size under different situation. Advised to use a multiplication of 4 kb between 256 kb and 1 MB to achieve good balance between IO performance and deduplication ratio.
  • 25. OPTIMIZATION ON FINGER PRINT CALCULATION Rely on comparison of data block finger prints for redundancy. Finger print – collision resistant hash value calculated from data block contents. MD5(26) and SHA-1[12] are frequently used for this purpose. Finger print collision – very small , orders of magnitude smaller than hardware error rates.
  • 26. OPTIMIZATION ON FINGER PRINT CALCULATION(cont) So we could safely assume that two data blocks are identical. Finger print calculation – expensive. Delays finger print calculation for recently modified data blocks. Runs deduplication lazily only when it s necessary. Client side maintains a shared cache which contains recently accessed data blocks.
  • 27. OPTIMIZATION ON FINGER PRINT CALCULATION(cont) A portion of memory is used by the client side of liquid as private cache. Private cache hold – modified data blocks and delay finger print calculation on them. Modified data block ejected from shared cache and added to private cache. Modified data will be ejected if private cache becomes full.
  • 28. OPTIMIZATION ON FINGER PRINT CALCULATION(cont) And ejected based on LRU policy. Only then will the modified data block’s finger print be calculated. Liquid uses multiple threads for finger print calculation. Multiple threads will process different data blocks currently. Provides good IO performance.
  • 29. FILE SYSTEM LAYOUT All file system meta data are stored on the meta server. Organized in a file system tree. Client side could cache portions of file systems meta data for fast accesses. When a VM is stopped , modified meta data and data blocks. Will be pushed back to meta server. Data servers ensures modification on VM image is visible to other client nodes.
  • 30. FILE SYSTEM LAYOUT(cont) Figure 5: Process Of Look-up by Fingerprint[4]
  • 31. COMMUNICATION AMONG COMPONENTS  Heartbeatprotocol  P2P Data Block Sharing  On-Demand Data Block Fetching
  • 32. HEART BEAT PROTOCOL META SERVER – manages all data servers. Exchange regular heart beat message with each data server in a ROUND ROBIN FASHION. Detect failed data servers when there are many data servers. To speed up failure detection data servers send an error signal to meta server.
  • 33. FAST CLONING FOR VM IMAGE Copying large images may be time consuming. Liquid provide efficient solution by means of fast cloning. VM images represented by meta data files having reference to data blocks. By copying meta data file and updating reference count a clone VM image is achieved. Modification on cloned images will not effect the original image.
  • 34. FAULT TOLERANCE Data replication Data migration Hot backup of meta server
  • 35. GARBAGE COLLECTIONS Removes unused garbage data blocks when running out of space. Reference counting of all data blocks are maintained by meta servers. Garbage collection request is issued periodically to data server. Garbage collection is executed based on the data block membership in the bloom filter.
  • 36. ADVANTAGES OF LIQUID Fast virtual machine deployment with peer to peer data transfer. Low storage consumption by means of deduplication. Instant cloning for virtual machine images. On demand fetching through a network caching with local disks. LIQUID files has no specific limit.
  • 37. CONCLUSION Presented LIQUID which is a deduplication file system with good IO performance. Achieve by caching frequently accessed data blocks in memory cache. Avoids additional disk operations. Deduplication of VM images proved to be effective.