© Copyright 2015 Glassbeam Inc.
What the Spark!
Intro and Use Cases
February 26, 2015
Big Data
Volume
Variety
Velocity
Source: Cisco, IDC, Wikibon report 2013
1980s 1990-2000s 2010 - beyond
Quick Review
Spark Intro
Why is Spark hot?
Spark SQL Intro
Spark Streaming Intro
MLlib Intro
GraphX Intro
Myths and Misconceptions
Getting Started
Questions?
