SlideShare a Scribd company logo
Elephants in The Cloud
or How to Become Cloud Ready
Krzysztof Adamski, GetInData
So You Say You Don’t Use Cloud?
HR System Online Documents Mobile PhoneEmail Server
Trust as a Key Factor
Image source: https://www.forbes.com/sites/louiscolumbus/2017/04/23/2017-state-of-cloud-adoption-and-security
More Secure or Not
In the end, do you
really think you can
provide better
infrastructure security
than cloud providers
???
Migration Questions?
How fast can you start/expand your analytics initiative?
1
How often is your cluster fully busy and your employees want more computing
power right now?2
How much time you spend on maintaining your infra?
3
How much time does it take you to gracefully apply all the security patches in
your Hadoop cluster?4
Do you need hardware that you don’t have in your data-center e.g. GPU,
terrible amounts of RAM5
Hadoop Operations at Scale
Migration Goals
Transition from infrastructure engineering
towards data engineering
1
Use the best possible technology stack in the
world
2
Free your time
3
Attract the best engineers
4
Ultimate world domination ;)
5
Krzysztof Adamski
Before You Start
Be smart
with which
service you
choose
Avoid
lock-in
Try to
estimate
the costs
See what
others
are doing
Technology
choices
Yet another
migration
Hardware,
engineering, legal
Netflix, Spotify, Etsy
What’s different in
the Cloud ?
Decoupled
storage
and
processing
Different Technologies
Hadoop Ecosystem Google Cloud Platform
File System HDFS Google Cloud Storage
Key Value Store HBase, Cassandra BigTable
SQL Hive, SparkSQL, Presto BigQuery
Messaging Queue Kafka PubSub
Geo-Replicated
RDBMS
CockroachDB Spanner
Cloud
Storage
Decision
Tree
Storage
Connectors
Strong Global Consistency
Google Cloud Storage provides strong global consistency for the following
operations, including both data and metadata:
● Read-after-write
● Read-after-metadata-update
● Read-after-delete
● Bucket listing, Object listing
● Granting access to resources
Eventual Consistency
● Revoking access from resources
It typically takes about a minute for revoking access to take effect. In some
cases it may take longer.
Beware of a cache though.
Pricing
● Pay-per-second billing
Keep in mind that if you often do sub-10
minute analyses using VMs, serverless
options may be better suited since VMs
are relatively slow to boot and serverless
functions are billed at every 100ms.
I want to start.
What’s next?
Data
repository
in a good
shape
Find best
candidates
for
migration
Isolated / self-contained
applications
With mainly external
(public data)
dependencies
Global use case
Baby Steps
Prepare your hadoop cluster to interact
with object storage.
1
Look for existing operators for popular
tools like Apache Airflow.
2
Make a copy of your critical datasets to
the cloud.
3
Use both BigQuery for fast analytics and
GCS output for more advanced trials.
4
Audit costs per query.
5
Networking
High bandwidth, low
latency and consistent
network connectivity is
critical.
Pay attention to such
things like choosing the
right region, number of
cores or even TCP
window size.
But to get the full speed
dedicated interconnect /
direct peering is the way
to go.
Multiple VPN tunnels
are a good starting
point to increase
bandwidth.
Transfer appliances for
offline data migration.
Data
Transfer
Time
Package Your Deployments
● Containers (docker) for tooling.
● Deployment artifacts (Spark / MR
jars).
● Tools like Spydra can help you
executing your packages in both
worlds
$ cat examples.json
{
"client_id": "simple-spydra-test",
"cluster_type": "dataproc",
"log_bucket": "spydra-test-logs",
"region": "europe-west1",
"cluster": {
"options": {
"project": "spydra-test"
}
},
"submit": {
"job_args": [
"pi",
"8",
"100"
],
"options": {
"jar": "hadoop-mapreduce-examples.jar"
}
}
}
$ spydra submit --spydra-json example.json
Other Important Features
● Cluster pooling - using init actions to kill old clusters
● Autoscaling - based on the workload
● Preemptible instances:
○ A reasonable choice for your cluster
○ Keep in mind final resilience (idempotence)
○ Available also with GPUs
No Long-Lived Services
● No patching! - YAY
● No wasting resources
● Latest security patches
applied automatically
Predictions
Forrester predicts
SaaS vendors will de-prioritize
their platform efforts to attain
global scale.
They will compete more at the platform level by running
portions of their services on AWS, Azure, GCP or Oracle Cloud
in 2018.
”
”
Future
Interesting projects:
● Spark on k8s
● dA Platform 2
Kubernetes
There no right answer - it's tradeoff that depends on many variables
Should I Stay or Should I Go?
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetInData

More Related Content

PDF
Modeling Catastrophic Events in Spark: Spark Summit East Talk by Georg Hofman...
Spark Summit
 
PDF
Bulletproof Kafka with Fault Tree Analysis (Andrey Falko, Lyft) Kafka Summit ...
confluent
 
PPTX
Real time analytics with Kafka and SparkStreaming
Ashish Singh
 
PDF
How To Use Kafka and Druid to Tame Your Router Data (Rachel Pedreschi, Imply ...
confluent
 
PPTX
Querying multiple distributed storage systems with Apache Hive robustly
Ashish Singh
 
PDF
Elephants in the cloud or how to become cloud ready
Krzysztof Adamski
 
PPTX
Monitoring and scaling postgres at datadog
Seth Rosenblum
 
PDF
Building highly reliable data pipeline @datadog par Quentin François
Paris Data Engineers !
 
Modeling Catastrophic Events in Spark: Spark Summit East Talk by Georg Hofman...
Spark Summit
 
Bulletproof Kafka with Fault Tree Analysis (Andrey Falko, Lyft) Kafka Summit ...
confluent
 
Real time analytics with Kafka and SparkStreaming
Ashish Singh
 
How To Use Kafka and Druid to Tame Your Router Data (Rachel Pedreschi, Imply ...
confluent
 
Querying multiple distributed storage systems with Apache Hive robustly
Ashish Singh
 
Elephants in the cloud or how to become cloud ready
Krzysztof Adamski
 
Monitoring and scaling postgres at datadog
Seth Rosenblum
 
Building highly reliable data pipeline @datadog par Quentin François
Paris Data Engineers !
 

What's hot (19)

PDF
Hard Truths About Streaming and Eventing (Dan Rosanova, Microsoft) Kafka Summ...
confluent
 
PPTX
Webinar | Building Apps with the Cassandra Python Driver
DataStax Academy
 
PPTX
Using Apache Pulsar to Provide Real-Time IoT Analytics on the Edge
DataWorks Summit
 
PDF
Going from three nines to four nines using Kafka | Tejas Chopra, Netflix
HostedbyConfluent
 
PDF
Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...
Karthik Ramasamy
 
PDF
Using Pluggable Apache Spark SQL Filters to Help GridPocket Users Keep Up wit...
Spark Summit
 
PDF
Kai Wähner, Technology Evangelist at Confluent: "Development of Scalable Mac...
Dataconomy Media
 
PDF
Scaling monitoring with Datadog
alexismidon
 
PDF
Scaling graphite for application metrics
Jim Plush
 
PDF
The Future of Computing is Distributed
Alluxio, Inc.
 
PPTX
Stratio big data spain
Álvaro Agea Herradón
 
PDF
Datadog: a Real-Time Metrics Database for One Quadrillion Points/Day
C4Media
 
PDF
Consolidate Your Technical Debt With Spark Data Sources -Tools and Techniques...
Databricks
 
PDF
Solving Hybrid Cloud Data Replication with Apache Cassandra
Aaron Ploetz
 
PPTX
Msr2009 ian
SAIL_QU
 
PDF
Elastic Data Analytics Platform @Datadog
C4Media
 
PPTX
ARCHITECTING INFLUXENTERPRISE FOR SUCCESS
InfluxData
 
PDF
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
PDF
Provisioning Datadog with Terraform
Matt Spurlin
 
Hard Truths About Streaming and Eventing (Dan Rosanova, Microsoft) Kafka Summ...
confluent
 
Webinar | Building Apps with the Cassandra Python Driver
DataStax Academy
 
Using Apache Pulsar to Provide Real-Time IoT Analytics on the Edge
DataWorks Summit
 
Going from three nines to four nines using Kafka | Tejas Chopra, Netflix
HostedbyConfluent
 
Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...
Karthik Ramasamy
 
Using Pluggable Apache Spark SQL Filters to Help GridPocket Users Keep Up wit...
Spark Summit
 
Kai Wähner, Technology Evangelist at Confluent: "Development of Scalable Mac...
Dataconomy Media
 
Scaling monitoring with Datadog
alexismidon
 
Scaling graphite for application metrics
Jim Plush
 
The Future of Computing is Distributed
Alluxio, Inc.
 
Stratio big data spain
Álvaro Agea Herradón
 
Datadog: a Real-Time Metrics Database for One Quadrillion Points/Day
C4Media
 
Consolidate Your Technical Debt With Spark Data Sources -Tools and Techniques...
Databricks
 
Solving Hybrid Cloud Data Replication with Apache Cassandra
Aaron Ploetz
 
Msr2009 ian
SAIL_QU
 
Elastic Data Analytics Platform @Datadog
C4Media
 
ARCHITECTING INFLUXENTERPRISE FOR SUCCESS
InfluxData
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
Provisioning Datadog with Terraform
Matt Spurlin
 
Ad

Similar to Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetInData (20)

PDF
Six Steps to Modernize Your Data Ecosystem - Mindtree
samirandev1
 
PDF
6 Steps to Modernize Data Ecosystem with Mindtree
devraajsingh
 
PDF
Steps to Modernize Your Data Ecosystem with Mindtree Blog
sameerroshan
 
PDF
Steps to Modernize Your Data Ecosystem | Mindtree
AnikeyRoy
 
PPTX
High-Performance Analytics in the Cloud with Apache Impala
Cloudera, Inc.
 
PPTX
Big data journey to the cloud 5.30.18 asher bartch
Cloudera, Inc.
 
PPTX
Automating Cloud Cluster Deployment: Beyond the Book
Bill Havanki
 
PPTX
Building a mature foundation for life in the cloud
Impetus Technologies
 
PDF
Strategies for on premise to Google Cloud migration - Mateusz Pytel, GetInData
GetInData
 
PPTX
Leveraging the cloud for analytics and machine learning 1.29.19
Cloudera, Inc.
 
PPTX
Hadoop in the Cloud - The what, why and how from the experts
DataWorks Summit/Hadoop Summit
 
PDF
Effectively deploying hadoop to the cloud
Avinash Ramineni
 
PDF
Hadoop on Cloud: Why and How?
Cloudera, Inc.
 
PPTX
Hadoop in the Cloud – The What, Why and How from the Experts
DataWorks Summit/Hadoop Summit
 
PPTX
Big data on cloud infrastructure
PT Datacomm Diangraha
 
PPTX
Big Data on Cloud Native Platform
Sunil Govindan
 
PPTX
Big Data on Cloud Native Platform
Sunil Govindan
 
PPTX
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Tom Laszewski
 
PPTX
Big Data in the Cloud - The What, Why and How from the Experts
DataWorks Summit/Hadoop Summit
 
PPTX
Leveraging the Cloud for Big Data Analytics 12.11.18
Cloudera, Inc.
 
Six Steps to Modernize Your Data Ecosystem - Mindtree
samirandev1
 
6 Steps to Modernize Data Ecosystem with Mindtree
devraajsingh
 
Steps to Modernize Your Data Ecosystem with Mindtree Blog
sameerroshan
 
Steps to Modernize Your Data Ecosystem | Mindtree
AnikeyRoy
 
High-Performance Analytics in the Cloud with Apache Impala
Cloudera, Inc.
 
Big data journey to the cloud 5.30.18 asher bartch
Cloudera, Inc.
 
Automating Cloud Cluster Deployment: Beyond the Book
Bill Havanki
 
Building a mature foundation for life in the cloud
Impetus Technologies
 
Strategies for on premise to Google Cloud migration - Mateusz Pytel, GetInData
GetInData
 
Leveraging the cloud for analytics and machine learning 1.29.19
Cloudera, Inc.
 
Hadoop in the Cloud - The what, why and how from the experts
DataWorks Summit/Hadoop Summit
 
Effectively deploying hadoop to the cloud
Avinash Ramineni
 
Hadoop on Cloud: Why and How?
Cloudera, Inc.
 
Hadoop in the Cloud – The What, Why and How from the Experts
DataWorks Summit/Hadoop Summit
 
Big data on cloud infrastructure
PT Datacomm Diangraha
 
Big Data on Cloud Native Platform
Sunil Govindan
 
Big Data on Cloud Native Platform
Sunil Govindan
 
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Tom Laszewski
 
Big Data in the Cloud - The What, Why and How from the Experts
DataWorks Summit/Hadoop Summit
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Cloudera, Inc.
 
Ad

More from Evention (20)

PDF
The Factorization Machines algorithm for building recommendation system - Paw...
Evention
 
PDF
A/B testing powered by Big data - Saurabh Goyal, Booking.com
Evention
 
PDF
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Evention
 
PDF
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Evention
 
PDF
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Evention
 
PDF
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Evention
 
PDF
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Evention
 
PDF
Privacy by Design - Lars Albertsson, Mapflat
Evention
 
PDF
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Evention
 
PDF
Enhancing Spark - increase streaming capabilities of your applications - Kami...
Evention
 
PDF
7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...
Evention
 
PDF
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Evention
 
PDF
Stream processing with Apache Flink - Maximilian Michels Data Artisans
Evention
 
PDF
Scaling Cassandra in all directions - Jimmy Mardell Spotify
Evention
 
PDF
Big Data for unstructured data Dariusz Śliwa
Evention
 
PDF
Elastic development. Implementing Big Data search Grzegorz Kołpuć
Evention
 
PDF
H2 o deep water making deep learning accessible to everyone -jo-fai chow
Evention
 
PDF
That won’t fit into RAM - Michał Brzezicki
Evention
 
PDF
Stream Analytics with SQL on Apache Flink - Fabian Hueske
Evention
 
PDF
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Evention
 
The Factorization Machines algorithm for building recommendation system - Paw...
Evention
 
A/B testing powered by Big data - Saurabh Goyal, Booking.com
Evention
 
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Evention
 
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Evention
 
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Evention
 
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Evention
 
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Evention
 
Privacy by Design - Lars Albertsson, Mapflat
Evention
 
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Evention
 
Enhancing Spark - increase streaming capabilities of your applications - Kami...
Evention
 
7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with...
Evention
 
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Evention
 
Stream processing with Apache Flink - Maximilian Michels Data Artisans
Evention
 
Scaling Cassandra in all directions - Jimmy Mardell Spotify
Evention
 
Big Data for unstructured data Dariusz Śliwa
Evention
 
Elastic development. Implementing Big Data search Grzegorz Kołpuć
Evention
 
H2 o deep water making deep learning accessible to everyone -jo-fai chow
Evention
 
That won’t fit into RAM - Michał Brzezicki
Evention
 
Stream Analytics with SQL on Apache Flink - Fabian Hueske
Evention
 
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Evention
 

Recently uploaded (20)

PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Introduction to Data Analytics and Data Science
KavithaCIT
 
PDF
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
PPT
Grade 5 PPT_Science_Q2_W6_Methods of reproduction.ppt
AaronBaluyut
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PPTX
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
PDF
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PPTX
International-health-agency and it's work.pptx
shreehareeshgs
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PPTX
Economic Sector Performance Recovery.pptx
yulisbaso2020
 
PPTX
Azure Data management Engineer project.pptx
sumitmundhe77
 
PPTX
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
PPTX
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
PPT
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
1intro to AI.pptx AI components & composition
ssuserb993e5
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Introduction to Data Analytics and Data Science
KavithaCIT
 
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
Grade 5 PPT_Science_Q2_W6_Methods of reproduction.ppt
AaronBaluyut
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
International-health-agency and it's work.pptx
shreehareeshgs
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
Economic Sector Performance Recovery.pptx
yulisbaso2020
 
Azure Data management Engineer project.pptx
sumitmundhe77
 
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
1intro to AI.pptx AI components & composition
ssuserb993e5
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 

Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetInData

  • 1. Elephants in The Cloud or How to Become Cloud Ready Krzysztof Adamski, GetInData
  • 2. So You Say You Don’t Use Cloud? HR System Online Documents Mobile PhoneEmail Server
  • 3. Trust as a Key Factor Image source: https://www.forbes.com/sites/louiscolumbus/2017/04/23/2017-state-of-cloud-adoption-and-security
  • 4. More Secure or Not In the end, do you really think you can provide better infrastructure security than cloud providers ???
  • 5. Migration Questions? How fast can you start/expand your analytics initiative? 1 How often is your cluster fully busy and your employees want more computing power right now?2 How much time you spend on maintaining your infra? 3 How much time does it take you to gracefully apply all the security patches in your Hadoop cluster?4 Do you need hardware that you don’t have in your data-center e.g. GPU, terrible amounts of RAM5
  • 7. Migration Goals Transition from infrastructure engineering towards data engineering 1 Use the best possible technology stack in the world 2 Free your time 3 Attract the best engineers 4 Ultimate world domination ;) 5
  • 9. Before You Start Be smart with which service you choose Avoid lock-in Try to estimate the costs See what others are doing Technology choices Yet another migration Hardware, engineering, legal Netflix, Spotify, Etsy
  • 12. Different Technologies Hadoop Ecosystem Google Cloud Platform File System HDFS Google Cloud Storage Key Value Store HBase, Cassandra BigTable SQL Hive, SparkSQL, Presto BigQuery Messaging Queue Kafka PubSub Geo-Replicated RDBMS CockroachDB Spanner
  • 15. Strong Global Consistency Google Cloud Storage provides strong global consistency for the following operations, including both data and metadata: ● Read-after-write ● Read-after-metadata-update ● Read-after-delete ● Bucket listing, Object listing ● Granting access to resources
  • 16. Eventual Consistency ● Revoking access from resources It typically takes about a minute for revoking access to take effect. In some cases it may take longer. Beware of a cache though.
  • 17. Pricing ● Pay-per-second billing Keep in mind that if you often do sub-10 minute analyses using VMs, serverless options may be better suited since VMs are relatively slow to boot and serverless functions are billed at every 100ms.
  • 18. I want to start. What’s next?
  • 20. Find best candidates for migration Isolated / self-contained applications With mainly external (public data) dependencies Global use case
  • 21. Baby Steps Prepare your hadoop cluster to interact with object storage. 1 Look for existing operators for popular tools like Apache Airflow. 2 Make a copy of your critical datasets to the cloud. 3 Use both BigQuery for fast analytics and GCS output for more advanced trials. 4 Audit costs per query. 5
  • 22. Networking High bandwidth, low latency and consistent network connectivity is critical. Pay attention to such things like choosing the right region, number of cores or even TCP window size. But to get the full speed dedicated interconnect / direct peering is the way to go. Multiple VPN tunnels are a good starting point to increase bandwidth. Transfer appliances for offline data migration.
  • 24. Package Your Deployments ● Containers (docker) for tooling. ● Deployment artifacts (Spark / MR jars). ● Tools like Spydra can help you executing your packages in both worlds $ cat examples.json { "client_id": "simple-spydra-test", "cluster_type": "dataproc", "log_bucket": "spydra-test-logs", "region": "europe-west1", "cluster": { "options": { "project": "spydra-test" } }, "submit": { "job_args": [ "pi", "8", "100" ], "options": { "jar": "hadoop-mapreduce-examples.jar" } } } $ spydra submit --spydra-json example.json
  • 25. Other Important Features ● Cluster pooling - using init actions to kill old clusters ● Autoscaling - based on the workload ● Preemptible instances: ○ A reasonable choice for your cluster ○ Keep in mind final resilience (idempotence) ○ Available also with GPUs
  • 26. No Long-Lived Services ● No patching! - YAY ● No wasting resources ● Latest security patches applied automatically
  • 27. Predictions Forrester predicts SaaS vendors will de-prioritize their platform efforts to attain global scale. They will compete more at the platform level by running portions of their services on AWS, Azure, GCP or Oracle Cloud in 2018. ” ”
  • 28. Future Interesting projects: ● Spark on k8s ● dA Platform 2
  • 30. There no right answer - it's tradeoff that depends on many variables Should I Stay or Should I Go?