Introduction to Data Gravity
By: John Tkaczewski
President of FileCatalyst
March 4, 2015
Data Gravity
• A term first coined by Dave McCrory circa 2010
• Data is difficult to move around
• Data attracts more and more
Apps, Services and other tools as it grows
An Introduction to Data Gravity by John Tkaczewski of FileCatalyst
Why is the data “stuck”?
Throughput and latency
• As apps need more throughput and lower latency to the Data, the gravitational pull
of the data mass also increases
• This forces the apps and services to move closer to the data
If the model stopped here… all apps and
services would end up in a single giant online
BLOB (the cloud) to be closer to the data
There are other forces that keep some data away…
Forces that push away
• Privacy
• Security
• Cost
• Features, Convenience
There is a balance between the gravity
and the “Forces that push away”
Real Life Scenario
USB Thumb Drive VS. Amazon S3
Amazon S3
• Unlimited flexible growing storage
• Easy Sharing with the rest of the world
USB Thumb Drive
• Security
• Convenience
• Fast Access to Data
• Practically Free
• Can be physically moved
Data Gravity on the Cloud
• Make inbound data as light as possible
• Make outbound data as heavy as possible
• Cost in VS. cost out
• Make the Context of the data proprietary (example of
a picture on Flickr from http://datagravity.org/)
Data Gravity as a computational theory
• Borrows from gravitational theory
• Similarities with the way nations negotiate trade tariffs and
trade agreements between countries and cities (ref)
• Shannon’s law: how much information can be squeezed down
a wire
• Von Neumann bottleneck: how fast the data can move from
Persistent Storage to Memory to CPU cache to CPU
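As a rough illustration of the Shannon limit mentioned above: the Shannon-Hartley theorem gives the capacity of a noisy channel as C = B · log2(1 + S/N), where B is the bandwidth in hertz and S/N is the linear signal-to-noise ratio. The sketch below is only a worked calculation. The function name and the example figures are assumptions chosen for illustration, not values from the talk.

```python
import math


def channel_capacity_bps(bandwidth_hz: float, snr_linear: float) -> float:
    """Shannon-Hartley capacity of a noisy channel, in bits per second."""
    return bandwidth_hz * math.log2(1.0 + snr_linear)


# Hypothetical example: a 1 MHz channel with a signal-to-noise ratio of 1 (0 dB).
example_capacity = channel_capacity_bps(1_000_000, 1.0)
print(f"{example_capacity:.0f} bit/s")
```

No protocol, accelerated or not, can push more than this through the wire; acceleration only helps a transfer get closer to the limit.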
How does accelerated file transfer fit
in all of this?
Traditional File Transfers
FTP, SFTP, HTTP, WebDAV, SMTP, CIFS etc…
• All use TCP
• Provides reliability, error checking, ordered packets in a stream
• Congestion control built in
• Internet could not survive without it
• Works well for most internet traffic: email, web browsing, small ad hoc
transfers
Problems with TCP
• Flow control limits the transmission window, causing dead air on high-latency links
• Backs off very aggressively in response to network congestion; cannot be tuned in the
application layer
• Result is less than ideal performance on wireless, satellite, or long haul
links
• Can be tuned but still not ideal for many-to-one or one-to-many transfers
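The "dead air" effect can be put into numbers. A TCP sender can have at most one window of unacknowledged data in flight per round trip, so a single connection's throughput is bounded by window size divided by round-trip time. The sketch below is a back-of-the-envelope calculation under that assumption; the function names and example figures are hypothetical, and it ignores packet loss and slow start.

```python
def max_tcp_throughput_bps(window_bytes: int, rtt_seconds: float) -> float:
    """Upper bound on one connection's throughput: one window per round trip."""
    return window_bytes * 8 / rtt_seconds


def window_needed_bytes(link_bps: float, rtt_seconds: float) -> float:
    """Bandwidth-delay product: bytes that must be in flight to keep the link full."""
    return link_bps * rtt_seconds / 8


# Hypothetical example: a 64 KiB window on a 100 ms round trip.
bound = max_tcp_throughput_bps(64 * 1024, 0.100)
print(f"throughput bound: {bound / 1e6:.1f} Mbit/s")

# Hypothetical example: window needed to fill a 1 Gbit/s link at the same latency.
needed = window_needed_bytes(1e9, 0.100)
print(f"window needed: {needed / 1e6:.1f} MB")
```

With these example figures the connection tops out at roughly 5 Mbit/s regardless of how fast the link is, which is why latency rather than bandwidth often dominates long haul transfers.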
File Transfer Acceleration
• Ideal for bulk file transfer
• Predictable: can send at a set target rate
• Much less affected by latency or packet loss
• Congestion Control implemented in application layer
• Tunable congestion control aggression
• Instantly detect link capacity
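One way to picture rate-based sending is packet pacing: instead of waiting on a window of acknowledgements, the sender spaces packets evenly so that they add up to a chosen target rate. The sketch below only computes such a pacing schedule. It is an illustration under that assumption, not FileCatalyst's implementation, and the function names and example figures are hypothetical.

```python
def packet_interval_seconds(payload_bytes: int, target_rate_bps: float) -> float:
    """Gap between packet departures needed to hit the target rate."""
    return payload_bytes * 8 / target_rate_bps


def send_times(num_packets: int, payload_bytes: int, target_rate_bps: float) -> list:
    """Departure time of each packet, in seconds from the start of the transfer."""
    gap = packet_interval_seconds(payload_bytes, target_rate_bps)
    return [i * gap for i in range(num_packets)]


# Hypothetical example: 1250-byte packets paced to 10 Mbit/s.
schedule = send_times(5, 1250, 10_000_000)
print(schedule)
```

Because the schedule depends only on the target rate, round-trip time does not appear in it; a real sender would still need feedback from the receiver to retransmit lost packets and to lower the rate under congestion.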
Overall the effects of Data Gravity
are reduced
(like Anti-Gravity)
• Data gravity still exists but is reduced by
eliminating the latency component
• The gravity continues to exist towards
every storage location
• With faster moving data, the owner
now has more choices about where to store it.
Cloud growth vs. geographical location of the users
• It’s not always possible to make cloud services
available near all the users
• File Transfer Acceleration can help to reach
those faraway users at a lower cost than
building a new data center
Future …
• Cloud services will continue to expand (money maker)
• Local and personal storage will continue to be needed
but merely as a cache to what’s on the cloud
• Throughput will continue to increase but the latency
will stay the same (speed of light++ anyone??)
• The need for faster file transfers will continue to grow
as the cloud, data and links get bigger.
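The claim that latency will stay the same follows from physics: no signal travels faster than light, so distance sets a floor on round-trip time. The sketch below computes that floor for a straight-line path in vacuum. The function name and the example distance are assumptions for illustration, and real links are slower because light travels more slowly in fiber and routes are not straight.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458  # speed of light in vacuum


def min_rtt_seconds(distance_km: float) -> float:
    """Lowest possible round-trip time over a straight path of the given length."""
    return 2 * distance_km * 1000 / SPEED_OF_LIGHT_M_PER_S


# Hypothetical example: two sites 10,000 km apart.
floor = min_rtt_seconds(10_000)
print(f"minimum round trip: {floor * 1000:.1f} ms")
```

For the 10,000 km example the floor is about 67 ms, however much the link's throughput grows.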
Thank you.
