Big Data and Security

Michel Burger
0.05 ounces/ton

Gold mining is about dirt management
About 11,850 amps to generate fields of around 8.4 tesla (about 150,000 times the Earth's magnetic field), yet they operate at low voltage

A lot of what the LHC is about is electricity flow management
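
As a quick sanity check on the ratio quoted above, the field strength and the ratio together imply a value for the Earth's field. The sketch below only divides the two slide figures; the 25 to 65 microtesla reference range for the Earth's surface field is an assumption brought in for comparison and is not on the slide.

```python
# Figures taken from the slide.
lhc_field_tesla = 8.4
ratio_to_earth_field = 150_000

# Earth field implied by the slide's ratio, converted to microtesla.
implied_earth_field_microtesla = lhc_field_tesla / ratio_to_earth_field * 1e6

# Assumed reference range for the Earth's surface field (not from the slide).
EARTH_FIELD_RANGE_MICROTESLA = (25, 65)
low, high = EARTH_FIELD_RANGE_MICROTESLA
is_plausible = low <= implied_earth_field_microtesla <= high

print(f"Implied Earth field: {implied_earth_field_microtesla:.0f} microtesla")
```

The implied value is about 56 microtesla, which sits inside the assumed range, so the quoted ratio is of the right order.
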
How BIG?

Big data is like the LHC combined with gold extraction
- Huge amount of data -> 6.6 zettabytes/year by 2016 (Cisco Cloud Index)
- Big flow of data -> 400 TB/day (Facebook)
- LHC generates 10-15 petabytes/year of data for each experiment
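
The three figures above use different units and time bases, which makes them hard to compare at a glance. A minimal sketch of the conversion follows, assuming decimal (SI) byte units and a 365-day year, neither of which the slide specifies.

```python
# Decimal (SI) units are an assumption; the slide does not say which it uses.
TB = 10**12
PB = 10**15
ZB = 10**21
DAYS_PER_YEAR = 365


def per_day(bytes_per_year):
    """Convert a yearly volume to a daily volume."""
    return bytes_per_year / DAYS_PER_YEAR


def per_year(bytes_per_day):
    """Convert a daily volume to a yearly volume."""
    return bytes_per_day * DAYS_PER_YEAR


# Slide figures, expressed on common bases.
cisco_tb_per_day = per_day(6.6 * ZB) / TB          # global forecast for 2016
facebook_pb_per_year = per_year(400 * TB) / PB     # Facebook daily flow
lhc_tb_per_day = (per_day(10 * PB) / TB, per_day(15 * PB) / TB)  # per experiment

print(f"Cisco forecast: {cisco_tb_per_day:,.0f} TB/day")
print(f"Facebook:       {facebook_pb_per_year:,.0f} PB/year")
print(f"LHC experiment: {lhc_tb_per_day[0]:.0f} to {lhc_tb_per_day[1]:.0f} TB/day")
```

Under these assumptions the global forecast is roughly 18 million TB/day, the Facebook flow is about 146 PB/year, and one LHC experiment produces about 27 to 41 TB/day.
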
The essence of new service providers

[Diagram: User, Data and Service boxes, with arrows labelled "Produces", "Consumes" and "Improves experience"]
• Data: one data set and common semantic (Core Semantic Data Set, Mindmap)
• Service: many free services
• The more context, the more efficient and the more value
• Value enriched data generates revenue
• BI based revenue models (e.g. advertisement)
• Revenue from existing services will shrink
• Additional revenue from new services
• Example: Search/Information Management; Rated auction/Selling
Classic Approach
• Structured Data
• Data in the range of Gigabytes to Terabytes
• Centralized (data is imported into analytics)
• Batch based
• Data silos

Transaction -> ETL -> Relational Database -> ETL -> Data Warehouse -> ETL -> Analyse

Where is the data that answers my questions?
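
The batch chain on this slide can be sketched in a few lines. This is only an illustration under stated assumptions: the record fields, the table names (`orders`, `sales_by_customer`) and the use of an in-memory SQLite database to stand in for both the relational database and the warehouse are all hypothetical, not taken from the deck.

```python
import sqlite3

# Hypothetical transaction records; the slide names no schema.
transactions = [
    {"customer": "alice", "amount_cents": 1250},
    {"customer": "bob", "amount_cents": 400},
    {"customer": "alice", "amount_cents": 750},
]


def etl_to_relational(rows, conn):
    """First ETL step: load raw transactions into a relational table."""
    conn.execute("CREATE TABLE orders (customer TEXT, amount_cents INTEGER)")
    conn.executemany(
        "INSERT INTO orders VALUES (:customer, :amount_cents)", rows
    )


def etl_to_warehouse(conn):
    """Second ETL step: aggregate into a warehouse-style summary table."""
    conn.execute(
        "CREATE TABLE sales_by_customer AS "
        "SELECT customer, SUM(amount_cents) AS total_cents "
        "FROM orders GROUP BY customer"
    )


def analyse(conn):
    """Final step: analytics can only see what was imported for it."""
    return dict(
        conn.execute("SELECT customer, total_cents FROM sales_by_customer")
    )


conn = sqlite3.connect(":memory:")
etl_to_relational(transactions, conn)
etl_to_warehouse(conn)
totals = analyse(conn)
print(totals)
```

Each step runs as a batch over everything the previous step stored, which is the centralized, import-first behaviour the bullets describe.
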
Big Data Approach
  • Multi Structured Data
  • Data in the range of Terabytes to Petabytes
  • Distributed/Federated (Analytics grab the data)
  • Streaming based
  • Holistic Data Clusters
Streams 1, 2, 3 ... n -> Organize -> Analyse

Here are the questions and the data for the answers
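
By contrast with the batch chain, the streaming approach can be sketched as several ordered streams merged lazily and analysed as events arrive. The event tuples, source names and the choice of a running count as the "analysis" are hypothetical, chosen only to keep the example small.

```python
import heapq
from collections import Counter

# Hypothetical event streams, each already ordered by timestamp.
# Each event is (timestamp, source, kind).
stream_1 = [(1, "web", "click"), (4, "web", "purchase")]
stream_2 = [(2, "mobile", "click"), (3, "mobile", "click")]
stream_n = [(5, "sensor", "reading")]


def organize(*streams):
    """Merge ordered streams lazily into one time-ordered stream."""
    return heapq.merge(*streams)


def analyse(events):
    """Yield a running count per event kind after every event."""
    counts = Counter()
    for timestamp, _source, kind in events:
        counts[kind] += 1
        yield timestamp, dict(counts)


snapshots = list(analyse(organize(stream_1, stream_2, stream_n)))
for timestamp, counts in snapshots:
    print(timestamp, counts)
```

Nothing is imported into a central store first: the analysis takes the data from the streams and has an up-to-date answer after every event.
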
A new pattern
• Many different data structures
• Many different ways to extract the data
• Many different locations (even for the same type of data)
• Batch and real-time based
• Buffered or stream
• Correlation parameters

[Diagram with three areas: Sources, Stream, Consumption]

Sources: Knowledge References, Services, Content, API, Applications, Social Networks, RAN, Data card, Sim Card, Connected Things (Consumer, Enterprise), Connected Devices, Premise Gateway, Network Core, IT Infrastructure

Stream, real-time (low level semantic):
• Buffering, Routing, Filtering
• Structured/Unstructured store

Stream, non real-time (rich semantic):
• Event Collector
• Batch Process/Multi Structure Stream
• Multi Stage Store/Process

Storage: Cheap Storage, High Efficient Storage

Consumption: Data as a Service, Report, Statistics, Graph Network/Analysis, Neural Network/Analysis

Other diagram labels: Buffering, Graph, Neural Network, Relational

Side notes on the diagram:
• Structured
• Proprietary
• Unstructured
• Streaming
• Tapping at Source
• Tapping on Stream
• Consumption to Source
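
The "Buffering, Routing, Filtering" stage named on this slide could look like the sketch below. The event shape, the batch size of three and the two route names (`real_time`, `batch`) are hypothetical and only illustrate the idea; the deck does not describe an implementation.

```python
from collections import defaultdict, deque


def run_stage(events, keep, route, buffer_size=3):
    """Filter events, buffer them in small batches, then route each one."""
    outputs = defaultdict(list)
    buffer = deque()

    def flush():
        # Routing: send every buffered event to the destination chosen for it.
        while buffer:
            event = buffer.popleft()
            outputs[route(event)].append(event)

    for event in events:
        if not keep(event):          # filtering
            continue
        buffer.append(event)         # buffering
        if len(buffer) >= buffer_size:
            flush()
    flush()                          # drain what is left at end of stream
    return dict(outputs)


# Hypothetical events from connected devices.
events = [
    {"kind": "alarm", "value": 9},
    {"kind": "heartbeat", "value": 0},
    {"kind": "usage", "value": 3},
    {"kind": "alarm", "value": 7},
    {"kind": "usage", "value": 5},
]

routed = run_stage(
    events,
    keep=lambda e: e["kind"] != "heartbeat",
    route=lambda e: "real_time" if e["kind"] == "alarm" else "batch",
)
print(routed)
```

Alarms go to the real-time path and usage records to the batch path, mirroring the real-time and non real-time halves of the Stream area.
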
With added security

[Diagram with three areas: Sources, Stream, Consumption]

Sources: Knowledge References, API, Services, Content, Social Networks, Applications, RAN, Data card, Sim Card, Connected Things (Consumer, Enterprise), Connected Devices, Premise Gateway, Network Core, IT Infrastructure

Consumption: Data as a Service, Report, Statistics, Graph Network/Analysis, Neural Network/Analysis

• Securing the infrastructure (public, private)
     • Policy (internal/external)
     • On-going assessment (DDoS, Penetration …)
     • Data leakage
     • Migration
• Securing the identity
     • Validating ID
     • Anonymization
• Securing the access
     • Distributed permission/preference
     • 3rd party permission

• Strong access control based on industry standard (user, dev, application)
• Strong authorization control based on open standard
• Analytics applied to Analytics
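
Two of the bullets above, "Anonymization" and "3rd party permission", can be combined in a small sketch. Everything named here is hypothetical: the key, the permission table and the record fields. Note also that a keyed hash gives pseudonymization (records stay linkable per user) rather than full anonymization, so it is a stand-in for whatever technique the deck had in mind.

```python
import hashlib
import hmac

# Hypothetical key; a real deployment would fetch it from a key store.
SECRET_KEY = b"demo-key-rotate-me"

# Hypothetical permission table: user -> consumers allowed to see the data.
permissions = {
    "alice": {"billing", "partner_analytics"},
    "bob": {"billing"},
}


def pseudonymize(user_id: str) -> str:
    """Replace an identifier with a keyed hash (stable, not reversible without the key)."""
    return hmac.new(
        SECRET_KEY, user_id.encode("utf-8"), hashlib.sha256
    ).hexdigest()


def release(record, consumer):
    """Return a pseudonymized copy if the user allowed this consumer, else None."""
    if consumer not in permissions.get(record["user"], set()):
        return None
    return {
        "user": pseudonymize(record["user"]),
        "usage_mb": record["usage_mb"],
    }


allowed = release({"user": "alice", "usage_mb": 120}, "partner_analytics")
denied = release({"user": "bob", "usage_mb": 80}, "partner_analytics")
print(allowed)
print(denied)
```

The permission check runs before any data leaves, and the consumer only ever sees the hashed identifier.
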
Final thoughts
1. We need to eliminate the silos
   – Sources or Usage
2. Still very much a collection of technologies
   – The assembly is still very complex
3. Is everything about events?
4. We need to handle the CAP theorem more appropriately
5. What is the user experience (not just for the end user but also
   for the admin)?
Thank You

Vodafone xone fev142013v3 ext
