SlideShare a Scribd company logo
Big Data
& the importance of Data Science
18 december 2014
@wimvanleuven
wim@bigboards.io
1
2
http://www.slideshare.net/kuonen/big-tent-bddsunigenov2014
–Edd Dumbill
“Big data is data that exceeds the processing
capacity of conventional database systems.
The data is too big, moves too fast, or doesn’t fit
the strictures of your database architectures.”
3
http://radar.oreilly.com/2012/01/what-is-big-data.html
What is Big Data?
The 3 V’s of Big Data
4
• Volume
• Velocity
• Variety
• (Veracity)
…too big…
5
IOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOI
OIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOI
OOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOI
OIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIO
IIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOII
OIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIO
… moves to fast …
6
… doesn’t fit …
7
… what?
8
New tools and technologies to store and
process all data on a cluster of commodity
hardware so that the system acts as one, is
resilient and scales linearly.
9
What is Big Data? — revisited
So what?
10
the data lake is a large data pool
in which the schema and data requirements are not defined
until the data is queried, processed, analysed
or delivered as information to the end-user
–???
“We don’t do Hadoop because we have Big
Data; we do Big Data because we have
Hadoop.”
11
So what?
–Matt Ehrlichman
“In the years ahead, the same power that big
data awards enterprise companies will be the
norm for small business.”
12
So what?
http://blogs.wsj.com/accelerators/2014/10/31/matt-ehrlichman-big-data-for-small-firms/
13
What does Big Data enable?
• Combine data from within and without your
organisation
• Build new products and services
• Analyse all data (e.g. 5TB historic event data at rest in Oracle db)
Big Data is no panacea
14
• First decide what problem you want to solve; pick a
real business problem to add immediate value
• Start small, the technology is made for linear
scalability (a 3-node cluster is a cluster!)
• Then become lean: learn through experimentation
Big Data challenges
• Beware of hype, Big Data - washing and fad
• Tech infancy
• IT | Biz
• Data is hard
• Lack of skills!
shameless self plug: BigBoards!
15
Big Data opportunity
• Big Data is here to stay
• Vendor market is HUGE and will grow massively as
Big Data will blend in within the datacenter
• However, the Practitioner market can deliver
EXPONENTIALLY more value
16
17
It is time to band together
and build these systems that deliver this kind of value
for fun
for profit
for good
for Belgium?
Call for Action
https://www.ted.com/talks/susan_etlinger_what_do_we_do_with_all_this_big_data
“Data doesn't create meaning. We do.”
–Susan Etlinger
18
Data Science FTW

More Related Content

What's hot (19)

PPTX
Big data
Nausheen Hasan
 
PPTX
Big data ppt
AKASH SIHAG
 
PPTX
Team 2 Big Data Presentation
Matthew Urdan
 
PPTX
Big data Presentation
Aswadmehar
 
PPTX
Big Data for Beginners
Michael Perez
 
PDF
Big Data vs. Small Data...what's the difference?
Anna Kuhn
 
PPTX
Big data, Big decision
Venkatesh Balakumar
 
PPTX
Introduction of big data and analytics
Sanjeev Solanki
 
PPTX
Big Data - The 5 Vs Everyone Must Know
Bernard Marr
 
PPTX
Presentation Big Data
René Kuipers
 
PPTX
The Business of Big Data - IA Ventures
Ben Siscovick
 
PPTX
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Hritika Raj
 
PPTX
Introduction to Big Data
Karan Desai
 
PPTX
Mining Big Data in Real Time
Albert Bifet
 
ODP
Big Data Presentation
Ritika Barethia
 
PPTX
Big Data Analytics Strategy and Roadmap
Srinath Perera
 
PPTX
A Short History of Big Data
Gadi Eichhorn
 
PDF
Forecast of Big Data Trends
IMC Institute
 
Big data
Nausheen Hasan
 
Big data ppt
AKASH SIHAG
 
Team 2 Big Data Presentation
Matthew Urdan
 
Big data Presentation
Aswadmehar
 
Big Data for Beginners
Michael Perez
 
Big Data vs. Small Data...what's the difference?
Anna Kuhn
 
Big data, Big decision
Venkatesh Balakumar
 
Introduction of big data and analytics
Sanjeev Solanki
 
Big Data - The 5 Vs Everyone Must Know
Bernard Marr
 
Presentation Big Data
René Kuipers
 
The Business of Big Data - IA Ventures
Ben Siscovick
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Hritika Raj
 
Introduction to Big Data
Karan Desai
 
Mining Big Data in Real Time
Albert Bifet
 
Big Data Presentation
Ritika Barethia
 
Big Data Analytics Strategy and Roadmap
Srinath Perera
 
A Short History of Big Data
Gadi Eichhorn
 
Forecast of Big Data Trends
IMC Institute
 

Similar to Big Data & the importance of Data Science (20)

PDF
Introduction to big data for the EA course at Solvay MBA
Wim Van Leuven
 
PPTX
Big Data, NoSQL, NewSQL & The Future of Data Management
Tony Bain
 
PPTX
basic of data science and big data......
anjanasharma77573
 
PPTX
Fundamentals of Big Data
The Wisdom Daily
 
PPTX
Lunch & Learn Intro to Big Data
Melissa Hornbostel
 
PPTX
Big Data
Rohit Jain
 
PDF
What Is Big Data How Big Data Works.pdf
Pridesys IT Ltd.
 
PDF
Why Big Data is Really about Small Data
Hurwitz & Associates
 
PDF
What Is Big Data How Big Data Works.pdf
Pridesys IT Ltd.
 
PPTX
A Big Data Concept
Dharmesh Tank
 
PDF
The Data Axioms lecture-overview-big data-usama-9-2015
CMR WORLD TECH
 
PPSX
Intro to Data Science Big Data
Indu Khemchandani
 
PDF
3 джозеп курто превращаем вашу организацию в big data компанию
antishmanti
 
PDF
uae views on big data
Aravindharamanan S
 
PPTX
BigData.pptx
vidhi171881
 
PPTX
Unit-I- Introduction- Traits of Big Data-Final.pptx
subhashchandra197
 
PPTX
Usama Fayyad talk in South Africa: From BigData to Data Science
Usama Fayyad
 
PDF
Big data
ISME College
 
PPTX
Why Everything You Know About bigdata Is A Lie
Sunil Ranka
 
PPTX
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
Experfy
 
Introduction to big data for the EA course at Solvay MBA
Wim Van Leuven
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Tony Bain
 
basic of data science and big data......
anjanasharma77573
 
Fundamentals of Big Data
The Wisdom Daily
 
Lunch & Learn Intro to Big Data
Melissa Hornbostel
 
Big Data
Rohit Jain
 
What Is Big Data How Big Data Works.pdf
Pridesys IT Ltd.
 
Why Big Data is Really about Small Data
Hurwitz & Associates
 
What Is Big Data How Big Data Works.pdf
Pridesys IT Ltd.
 
A Big Data Concept
Dharmesh Tank
 
The Data Axioms lecture-overview-big data-usama-9-2015
CMR WORLD TECH
 
Intro to Data Science Big Data
Indu Khemchandani
 
3 джозеп курто превращаем вашу организацию в big data компанию
antishmanti
 
uae views on big data
Aravindharamanan S
 
BigData.pptx
vidhi171881
 
Unit-I- Introduction- Traits of Big Data-Final.pptx
subhashchandra197
 
Usama Fayyad talk in South Africa: From BigData to Data Science
Usama Fayyad
 
Big data
ISME College
 
Why Everything You Know About bigdata Is A Lie
Sunil Ranka
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
Experfy
 
Ad

Recently uploaded (20)

PPTX
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
PPTX
DATA-COLLECTION METHODS, TYPES AND SOURCES
biggdaad011
 
PDF
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
PPTX
Resmed Rady Landis May 4th - analytics.pptx
Adrian Limanto
 
PDF
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
PDF
Choosing the Right Database for Indexing.pdf
Tamanna
 
PPTX
TSM_08_0811111111111111111111111111111111111111111111111
csomonasteriomoscow
 
PDF
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
PPTX
Slide studies GC- CRC - PC - HNC baru.pptx
LLen8
 
PDF
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
PDF
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
PDF
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
PPT
01 presentation finyyyal معهد معايره.ppt
eltohamym057
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PPTX
This PowerPoint presentation titled "Data Visualization: Turning Data into In...
HemaDivyaKantamaneni
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PDF
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
PDF
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
DATA-COLLECTION METHODS, TYPES AND SOURCES
biggdaad011
 
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
Resmed Rady Landis May 4th - analytics.pptx
Adrian Limanto
 
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
Choosing the Right Database for Indexing.pdf
Tamanna
 
TSM_08_0811111111111111111111111111111111111111111111111
csomonasteriomoscow
 
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
Slide studies GC- CRC - PC - HNC baru.pptx
LLen8
 
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
01 presentation finyyyal معهد معايره.ppt
eltohamym057
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
This PowerPoint presentation titled "Data Visualization: Turning Data into In...
HemaDivyaKantamaneni
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
Ad

Big Data & the importance of Data Science