SlideShare a Scribd company logo
FROM SQL TO NOSQL
Timothy Shelton, T-Mobile USA
StampedeCon, 2015
1
Outline
• BUSINESS GOALS: Why do we need Big Data?
• TECHNOLOGY: What did we build?
• PRIVACY: Customer preference management
• LESSONS LEARNED: Toes were stubbed.
• Q&A
2
WHY DO WE NEED BIG DATA?
3
Business Goals
4
We want you to have a great experience with your device, and we want
to provide world-class service to quickly resolve any device problems.
Example Use Case
Symptom: “Battery keeps going dead”
Possible Root Causes:
• Bad application
• Misconfigured application
• Misconfigured device
• Incorrect charger
• Improper expectations
• Faulty hardware
5
Tackling Complexity
At the beginning of 2015, there
were an estimated 1.5 million
applications in the Google Play
Store
6
1Q14 4Q14 1Q15
Smartphones 6.9 8.0 8.0
Non-Smartphones 0.5 0.6 0.5
Mobile Broadband 0.1 0.4 0.3
Total Company 7.5 9.0 8.8
(Device Sales, Millions)
Challenge: Accurately
diagnose device problems,
educate customers, solve
problems quickly.
We sell a ton of devices:
By the numbers….
7
30,000
Average number of
technical calls received
daily
32
Average number of
customer-installed
applications per device
56,836,000
Total customers, as of
1Q2015 (combined)
100%
Percentage of
customers expecting
problem resolution
Solution: BIG DATA!
8
Device
GOAL
Provide Customer Care
Representatives visibility
into the device so they are
able to resolve problems.
REQUIREMENTS
1. Ability to gather
metrics from a device
2. Ingest the metrics
3. Store for a period of
time
4. Fast data retrieval
TECHNOLOGY
9
Technology: Version 1
10
Device SQL
Server
First attempt: RDBMS to store data.
Findings:
X Tables had to be de-normalized and indices
removed to maintain write performance
X Reads were barely achieved by batch
processing snapshots of the database
Effectively deployed a write-only database.
? ?
?
?
Technology: Version 2
11
Device
C*
Second attempt: Cassandra!
Findings:
 Stellar write performance
 Stellar read performance
 Relatively low TCO
X Application misused [beta] C* driver
X Application not designed to scale
X Heterogeneous architecture
(.NET/Linux)
Technology: Version 3
12
Device C*
Evolution: Lambda Architecture!
Goals:
 Enables multiple consumers of incoming data
 Allows for near-realtime processing (Spark Streaming)
 Archival to S3 for offline processing (Spark)
 Layered architecture takes advantage of Cloud
Culture: Fast and Lean
• Preference is rapid
prototyping and
proofs-of-concept
• We set aggressive
dates, then iterate
• Cloud deployments
align nicely with our
culture
13
Embrace automation
• CloudFormation
Templates to reduce
risk of error and
increase velocity
• Chef for deployment
and configuration
CUSTOMER PREFERENCE
MANAGEMENT
14
Transparency and Choice
• Clearly describe what
is being collected
• Be transparent about
why it is being
collected
• What are the intended
uses of the data?
• Be clear with whom
the data will be shared
(if anyone)
• Allow the customer to
opt-out
• Opt-in for Location-
Based Services
15
Transparency: How not to do it
16
“It’s in our privacy
policy!”
LESSONS LEARNED
17
Scale Thy Monitoring
18
This looks bad!
OMG, traffic is
being rejected!
What Worked What Didn’t
Chart all the things! In this instance, statsd was being
overloaded with traffic
Organize Thy Data
19
What Didn’t
No temporal organization made
querying this data more difficult every
day
What Worked
Archive all the things!
Cassandra: df -u
• Monitor your capacity
• Leave enough disk for compaction
• Disabling compactions is a terrible strategy
20
QUESTIONS?
21
Thanks!
T-Mobile is actively recruiting engineers – come
change the world with us!
22
@TimothyAShelton
http://www.t-mobile.com

More Related Content

What's hot (20)

PPTX
Data Science lifecycle with Apache Zeppelin and Spark by Moonsoo Lee
Spark Summit
 
PDF
2015 nov 27_thug_paytm_rt_ingest_brief_final
Adam Muise
 
PDF
Glassbeam: Ad-hoc Analytics on Internet of Complex Things with Apache Cassand...
DataStax Academy
 
PDF
Yahoo's Next Generation User Profile Platform
DataWorks Summit/Hadoop Summit
 
PDF
Visualizing Big Data in Realtime
DataWorks Summit
 
PPTX
Real-Time Robot Predictive Maintenance in Action
DataWorks Summit
 
PDF
High Performance Spatial-Temporal Trajectory Analysis with Spark
DataWorks Summit/Hadoop Summit
 
PDF
Big Data Computing Architecture
Gang Tao
 
PDF
Apache Eagle: Secure Hadoop in Real Time
DataWorks Summit/Hadoop Summit
 
PPTX
When Streaming Becomes Strategic
MapR Technologies
 
PPTX
Innovation in the Enterprise Rent-A-Car Data Warehouse
DataWorks Summit
 
PPTX
Building a Big Data Pipeline
Jesus Rodriguez
 
PPTX
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
DataWorks Summit
 
PDF
Lambda architecture for real time big data
Trieu Nguyen
 
PDF
Shortening the Feedback Loop: How Spotify’s Big Data Ecosystem has evolved to...
Big Data Spain
 
PPTX
Data Science in the Cloud with Spark, Zeppelin, and Cloudbreak
DataWorks Summit
 
PDF
MaaS (Model as a Service): Modern Streaming Data Science with Apache Metron (...
DataWorks Summit
 
PPTX
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
MapR Technologies
 
PDF
Spark Summit East 2015 Keynote -- Databricks CEO Ion Stoica
Databricks
 
PPTX
The key to unlocking the Value in the IoT? Managing the Data!
DataWorks Summit/Hadoop Summit
 
Data Science lifecycle with Apache Zeppelin and Spark by Moonsoo Lee
Spark Summit
 
2015 nov 27_thug_paytm_rt_ingest_brief_final
Adam Muise
 
Glassbeam: Ad-hoc Analytics on Internet of Complex Things with Apache Cassand...
DataStax Academy
 
Yahoo's Next Generation User Profile Platform
DataWorks Summit/Hadoop Summit
 
Visualizing Big Data in Realtime
DataWorks Summit
 
Real-Time Robot Predictive Maintenance in Action
DataWorks Summit
 
High Performance Spatial-Temporal Trajectory Analysis with Spark
DataWorks Summit/Hadoop Summit
 
Big Data Computing Architecture
Gang Tao
 
Apache Eagle: Secure Hadoop in Real Time
DataWorks Summit/Hadoop Summit
 
When Streaming Becomes Strategic
MapR Technologies
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
DataWorks Summit
 
Building a Big Data Pipeline
Jesus Rodriguez
 
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
DataWorks Summit
 
Lambda architecture for real time big data
Trieu Nguyen
 
Shortening the Feedback Loop: How Spotify’s Big Data Ecosystem has evolved to...
Big Data Spain
 
Data Science in the Cloud with Spark, Zeppelin, and Cloudbreak
DataWorks Summit
 
MaaS (Model as a Service): Modern Streaming Data Science with Apache Metron (...
DataWorks Summit
 
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
MapR Technologies
 
Spark Summit East 2015 Keynote -- Databricks CEO Ion Stoica
Databricks
 
The key to unlocking the Value in the IoT? Managing the Data!
DataWorks Summit/Hadoop Summit
 

Viewers also liked (20)

PDF
Interactive Visualization in Human Time -StampedeCon 2015
StampedeCon
 
PPTX
Batch and Real-time EHR updates into Hadoop - StampedeCon 2015
StampedeCon
 
PDF
Floods of Twitter Data - StampedeCon 2016
StampedeCon
 
PDF
Enabling Key Business Advantage from Big Data through Advanced Ingest Process...
StampedeCon
 
PPTX
How Big Data Will Save Planet Earth - StampedeCon 2015
StampedeCon
 
PDF
Action from Insight - Joining the 2 Percent Who are Getting Big Data Right
StampedeCon
 
PDF
Graph Database Use Cases - StampedeCon 2015
StampedeCon
 
PDF
Visualizing Big Data – The Fundamentals
StampedeCon
 
PDF
GPUs in Big Data - StampedeCon 2014
StampedeCon
 
PPTX
Introduction to Kudu - StampedeCon 2016
StampedeCon
 
PDF
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
StampedeCon
 
PPTX
Identity Fraud Protection Using Big Data Analytics - StampedeCon 2015
StampedeCon
 
PDF
What’s New in Spark 2.0: Structured Streaming and Datasets - StampedeCon 2016
StampedeCon
 
PDF
Resource Management in Impala - StampedeCon 2016
StampedeCon
 
PDF
Making Machine Learning Work in Practice - StampedeCon 2014
StampedeCon
 
PDF
Interplay of Big Data and IoT - StampedeCon 2016
StampedeCon
 
PPTX
Analyzing Time-Series Data with Apache Spark and Cassandra - StampedeCon 2016
StampedeCon
 
PDF
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
StampedeCon
 
PPTX
Creating a Data Driven Organization - StampedeCon 2016
StampedeCon
 
PDF
Innovation in the Data Warehouse - StampedeCon 2016
StampedeCon
 
Interactive Visualization in Human Time -StampedeCon 2015
StampedeCon
 
Batch and Real-time EHR updates into Hadoop - StampedeCon 2015
StampedeCon
 
Floods of Twitter Data - StampedeCon 2016
StampedeCon
 
Enabling Key Business Advantage from Big Data through Advanced Ingest Process...
StampedeCon
 
How Big Data Will Save Planet Earth - StampedeCon 2015
StampedeCon
 
Action from Insight - Joining the 2 Percent Who are Getting Big Data Right
StampedeCon
 
Graph Database Use Cases - StampedeCon 2015
StampedeCon
 
Visualizing Big Data – The Fundamentals
StampedeCon
 
GPUs in Big Data - StampedeCon 2014
StampedeCon
 
Introduction to Kudu - StampedeCon 2016
StampedeCon
 
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
StampedeCon
 
Identity Fraud Protection Using Big Data Analytics - StampedeCon 2015
StampedeCon
 
What’s New in Spark 2.0: Structured Streaming and Datasets - StampedeCon 2016
StampedeCon
 
Resource Management in Impala - StampedeCon 2016
StampedeCon
 
Making Machine Learning Work in Practice - StampedeCon 2014
StampedeCon
 
Interplay of Big Data and IoT - StampedeCon 2016
StampedeCon
 
Analyzing Time-Series Data with Apache Spark and Cassandra - StampedeCon 2016
StampedeCon
 
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
StampedeCon
 
Creating a Data Driven Organization - StampedeCon 2016
StampedeCon
 
Innovation in the Data Warehouse - StampedeCon 2016
StampedeCon
 
Ad

Similar to From SQL to NoSQL - StampedeCon 2015 (20)

PPTX
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
AshishHiwale1
 
PPTX
Lecture1 BIG DATA and Types of data in details
AbhishekKumarAgrahar2
 
PPTX
Big Data Analytics PPT - S1 working .pptx
VivekChaurasia43
 
PPTX
Lecture1
Manish Singh
 
PPT
Big data.ppt
IdontKnow66967
 
PPTX
Big Data PPT by Rohit Dubey
Rohit Dubey
 
PDF
From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...
SoftServe
 
PDF
big_data_case_studies.pdf
vishal choudhary
 
PDF
Lecture 1-big data engineering (Introduction).pdf
ahmedibrahimghnnam01
 
PPTX
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
SoftServe
 
PPTX
bigdataintro.pptx
Albert Alex
 
PPTX
Kaushal Amin & Big 5 IT trends in the world
Quang PM
 
PPTX
Technology Trends and Big Data in 2013-2014
KMS Technology
 
PPT
Choosing the Right Big Data Architecture for your Business
Chicago Hadoop Users Group
 
PDF
The Big Data Journey at Connexity - Big Data Day LA 2015
Will Gage
 
PDF
Designing the Next Generation Data Lake
Robert Chong
 
PPTX
Big Data Overview 2013-2014
KMS Technology
 
PPTX
IARE_BDBA_ PPT_0.pptx
AIMLSEMINARS
 
PDF
Big data rmoug
Gwen (Chen) Shapira
 
PPT
Big Data = Big Decisions
InnoTech
 
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
AshishHiwale1
 
Lecture1 BIG DATA and Types of data in details
AbhishekKumarAgrahar2
 
Big Data Analytics PPT - S1 working .pptx
VivekChaurasia43
 
Lecture1
Manish Singh
 
Big data.ppt
IdontKnow66967
 
Big Data PPT by Rohit Dubey
Rohit Dubey
 
From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...
SoftServe
 
big_data_case_studies.pdf
vishal choudhary
 
Lecture 1-big data engineering (Introduction).pdf
ahmedibrahimghnnam01
 
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
SoftServe
 
bigdataintro.pptx
Albert Alex
 
Kaushal Amin & Big 5 IT trends in the world
Quang PM
 
Technology Trends and Big Data in 2013-2014
KMS Technology
 
Choosing the Right Big Data Architecture for your Business
Chicago Hadoop Users Group
 
The Big Data Journey at Connexity - Big Data Day LA 2015
Will Gage
 
Designing the Next Generation Data Lake
Robert Chong
 
Big Data Overview 2013-2014
KMS Technology
 
IARE_BDBA_ PPT_0.pptx
AIMLSEMINARS
 
Big data rmoug
Gwen (Chen) Shapira
 
Big Data = Big Decisions
InnoTech
 
Ad

More from StampedeCon (20)

PDF
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
StampedeCon
 
PDF
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
StampedeCon
 
PDF
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
StampedeCon
 
PDF
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
StampedeCon
 
PDF
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
StampedeCon
 
PDF
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
StampedeCon
 
PDF
Foundations of Machine Learning - StampedeCon AI Summit 2017
StampedeCon
 
PDF
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
StampedeCon
 
PDF
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
StampedeCon
 
PDF
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
StampedeCon
 
PDF
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
StampedeCon
 
PDF
A Different Data Science Approach - StampedeCon AI Summit 2017
StampedeCon
 
PDF
Graph in Customer 360 - StampedeCon Big Data Conference 2017
StampedeCon
 
PDF
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
StampedeCon
 
PDF
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
StampedeCon
 
PDF
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
StampedeCon
 
PDF
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
StampedeCon
 
PPTX
Using The Internet of Things for Population Health Management - StampedeCon 2016
StampedeCon
 
PDF
Turn Data Into Actionable Insights - StampedeCon 2016
StampedeCon
 
PDF
How to get started in Big Data without Big Costs - StampedeCon 2016
StampedeCon
 
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
StampedeCon
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
StampedeCon
 
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
StampedeCon
 
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
StampedeCon
 
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
StampedeCon
 
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
StampedeCon
 
Foundations of Machine Learning - StampedeCon AI Summit 2017
StampedeCon
 
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
StampedeCon
 
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
StampedeCon
 
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
StampedeCon
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
StampedeCon
 
A Different Data Science Approach - StampedeCon AI Summit 2017
StampedeCon
 
Graph in Customer 360 - StampedeCon Big Data Conference 2017
StampedeCon
 
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
StampedeCon
 
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
StampedeCon
 
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
StampedeCon
 
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
StampedeCon
 
Using The Internet of Things for Population Health Management - StampedeCon 2016
StampedeCon
 
Turn Data Into Actionable Insights - StampedeCon 2016
StampedeCon
 
How to get started in Big Data without Big Costs - StampedeCon 2016
StampedeCon
 

Recently uploaded (20)

PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PDF
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
PDF
Choosing the Right Database for Indexing.pdf
Tamanna
 
PPTX
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
PPTX
Rocket-Launched-PowerPoint-Template.pptx
Arden31
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PDF
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
PDF
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
PPT
Data base management system Transactions.ppt
gandhamcharan2006
 
PDF
How to Avoid 7 Costly Mainframe Migration Mistakes
JP Infra Pvt Ltd
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PPTX
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
PDF
2_Management_of_patients_with_Reproductive_System_Disorders.pdf
motbayhonewunetu
 
PPTX
Human-Action-Recognition-Understanding-Behavior.pptx
nreddyjanga
 
PDF
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
PDF
Context Engineering vs. Prompt Engineering, A Comprehensive Guide.pdf
Tamanna
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PPTX
Hadoop_EcoSystem slide by CIDAC India.pptx
migbaruget
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PDF
Data Chunking Strategies for RAG in 2025.pdf
Tamanna
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
Choosing the Right Database for Indexing.pdf
Tamanna
 
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
Rocket-Launched-PowerPoint-Template.pptx
Arden31
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
Data base management system Transactions.ppt
gandhamcharan2006
 
How to Avoid 7 Costly Mainframe Migration Mistakes
JP Infra Pvt Ltd
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
2_Management_of_patients_with_Reproductive_System_Disorders.pdf
motbayhonewunetu
 
Human-Action-Recognition-Understanding-Behavior.pptx
nreddyjanga
 
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
Context Engineering vs. Prompt Engineering, A Comprehensive Guide.pdf
Tamanna
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
Hadoop_EcoSystem slide by CIDAC India.pptx
migbaruget
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
Data Chunking Strategies for RAG in 2025.pdf
Tamanna
 

From SQL to NoSQL - StampedeCon 2015

  • 1. FROM SQL TO NOSQL Timothy Shelton, T-Mobile USA StampedeCon, 2015 1
  • 2. Outline • BUSINESS GOALS: Why do we need Big Data? • TECHNOLOGY: What did we build? • PRIVACY: Customer preference management • LESSONS LEARNED: Toes were stubbed. • Q&A 2
  • 3. WHY DO WE NEED BIG DATA? 3
  • 4. Business Goals 4 We want you to have a great experience with your device, and we want to provide world-class service to quickly resolve any device problems.
  • 5. Example Use Case Symptom: “Battery keeps going dead” Possible Root Causes: • Bad application • Misconfigured application • Misconfigured device • Incorrect charger • Improper expectations • Faulty hardware 5
  • 6. Tackling Complexity At the beginning of 2015, there were an estimated 1.5 million applications in the Google Play Store 6 1Q14 4Q14 1Q15 Smartphones 6.9 8.0 8.0 Non-Smartphones 0.5 0.6 0.5 Mobile Broadband 0.1 0.4 0.3 Total Company 7.5 9.0 8.8 (Device Sales, Millions) Challenge: Accurately diagnose device problems, educate customers, solve problems quickly. We sell a ton of devices:
  • 7. By the numbers…. 7 30,000 Average number of technical calls received daily 32 Average number of customer-installed applications per device 56,836,000 Total customers, as of 1Q2015 (combined) 100% Percentage of customers expecting problem resolution
  • 8. Solution: BIG DATA! 8 Device GOAL Provide Customer Care Representatives visibility into the device so they are able to resolve problems. REQUIREMENTS 1. Ability to gather metrics from a device 2. Ingest the metrics 3. Store for a period of time 4. Fast data retrieval
  • 10. Technology: Version 1 10 Device SQL Server First attempt: RDBMS to store data. Findings: X Tables had to be de-normalized and indices removed to maintain write performance X Reads were barely achieved by batch processing snapshots of the database Effectively deployed a write-only database. ? ? ? ?
  • 11. Technology: Version 2 11 Device C* Second attempt: Cassandra! Findings:  Stellar write performance  Stellar read performance  Relatively low TCO X Application misused [beta] C* driver X Application not designed to scale X Heterogeneous architecture (.NET/Linux)
  • 12. Technology: Version 3 12 Device C* Evolution: Lambda Architecture! Goals:  Enables multiple consumers of incoming data  Allows for near-realtime processing (Spark Streaming)  Archival to S3 for offline processing (Spark)  Layered architecture takes advantage of Cloud
  • 13. Culture: Fast and Lean • Preference is rapid prototyping and proofs-of-concept • We set aggressive dates, then iterate • Cloud deployments align nicely with our culture 13 Embrace automation • CloudFormation Templates to reduce risk of error and increase velocity • Chef for deployment and configuration
  • 15. Transparency and Choice • Clearly describe what is being collected • Be transparent about why it is being collected • What are the intended uses of the data? • Be clear with whom the data will be shared (if anyone) • Allow the customer to opt-out • Opt-in for Location- Based Services 15
  • 16. Transparency: How not to do it 16 “It’s in our privacy policy!”
  • 18. Scale Thy Monitoring 18 This looks bad! OMG, traffic is being rejected! What Worked What Didn’t Chart all the things! In this instance, statsd was being overloaded with traffic
  • 19. Organize Thy Data 19 What Didn’t No temporal organization made querying this data more difficult every day What Worked Archive all the things!
  • 20. Cassandra: df -u • Monitor your capacity • Leave enough disk for compaction • Disabling compactions is a terrible strategy 20
  • 22. Thanks! T-Mobile is actively recruiting engineers – come change the world with us! 22 @TimothyAShelton http://www.t-mobile.com