Cloud Computing y Big Data,
próxima frontera de la innovación
Cloud Computing and Big Data,
the next frontier of innovation


Jordi Torres, UPC-BSC
Madrid, 21 March 2013
HOW DID SCIENCE START?
Source: Prof. Mateo Valero, BSC-CNS 2010
HOW IS SCIENCE ADVANCING TODAY?
Source: Prof. Mateo Valero, BSC-CNS 2010
MATHEMATICAL CALCULATIONS?

         WHERE?
MN3
Compute       Cores/chip                 8
              Chips/node                 2
              Cores/node                16
              Nodes                   3028
              Total cores            48448
Performance   Freq. (GHz)              2.6
              Gflops/core             20.8
              Gflops/node            332.8
              Total Tflops          1000.0
Memory        GB/core                    2
              GB/node                   32
              Total (TB)             96.89
Network       Latency (μs)             0.7
              Bandwidth (Gb/s)          40
Storage       (TB)                    2000
Consumption   (kW)                    1080
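
The derived rows of the MN3 table follow from the per-chip and per-node figures. The short sketch below recomputes them as a sanity check. The variable names are illustrative, and reading the frequency as GHz is an assumption, since the slide gives no unit for it.

```python
# Base figures copied from the MN3 table.
cores_per_chip = 8
chips_per_node = 2
nodes = 3028
freq_ghz = 2.6           # assumption: the slide's "Freq." is in GHz
gflops_per_core = 20.8
gb_per_node = 32

# Derived figures.
cores_per_node = cores_per_chip * chips_per_node      # 16
total_cores = cores_per_node * nodes                  # 48448
gflops_per_node = cores_per_node * gflops_per_core    # about 332.8
total_tflops = total_cores * gflops_per_core / 1000   # about 1007.7
total_memory_tb = nodes * gb_per_node / 1000          # about 96.9
flops_per_cycle = gflops_per_core / freq_ghz          # about 8 per core

print(f"Cores/node:   {cores_per_node}")
print(f"Total cores:  {total_cores}")
print(f"Gflops/node:  {gflops_per_node:.1f}")
print(f"Total Tflops: {total_tflops:.1f}")
print(f"Memory (TB):  {total_memory_tb:.2f}")
print(f"Flops/cycle:  {flops_per_cycle:.1f}")
```

The product gives roughly 1007.7 Tflops, which the slide rounds to 1000.0, that is, about one petaflop.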
FOR SOME SPANISH RESEARCH GROUPS!
AND…

FOR THE REST OF THE WORLD?
GOOD NEWS!




Source: http://news.cnet.com/8301-13846_3-57349321-62/amazon-takes-supercomputing-to-the-cloud
CLOUD COMPUTING?
Source: http://www.wired.com/wiredenterprise/2011/12/nonexistent-supercomputer/all/1
Source: http://www.facebook.com/media/set/?set=a.190842620965185.47008.140375289345252




40 MW
28,000 m²
Photo: Google
HUGE DATA CENTERS
Photo: Google




more than 4 football pitches
Source: http://www.google.com/about/datacenters/gallery/images
Different IT production
Photo: J.T.
CLOUD COMPUTING:
IT as a service

   On-demand self-service
   Pay per use
   Rapid elasticity
   Ubiquitous access
   ...
Source: http://www.telegraph.co.uk/technology/reviews/9241719/Power-Ethernet-Sockets-review.html
Example of benefits (IaaS):




1 computer in a rack for 120 hours
120 computers in three racks for 1 hour

Idea: Tutorial SC2011 - Robert Grossman
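
The point of this comparison can be sketched with a little arithmetic. Under a pure pay-per-use model, both options consume the same number of machine-hours and so cost the same, but the second finishes 120 times sooner. The hourly price and the function name below are hypothetical, and the sketch assumes the work parallelizes perfectly.

```python
def rental_cost(machines: int, hours: int, price_per_machine_hour: float) -> float:
    """Cost of renting `machines` computers for `hours` hours each."""
    return machines * hours * price_per_machine_hour


# Hypothetical price; real IaaS prices vary by provider and instance type.
PRICE_PER_MACHINE_HOUR = 0.10

one_machine = rental_cost(machines=1, hours=120, price_per_machine_hour=PRICE_PER_MACHINE_HOUR)
many_machines = rental_cost(machines=120, hours=1, price_per_machine_hour=PRICE_PER_MACHINE_HOUR)

print(f"1 computer for 120 hours:  {one_machine:.2f}")
print(f"120 computers for 1 hour:  {many_machines:.2f}")
print(f"Speed-up in elapsed time:  {120 / 1:.0f}x")
```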
AND DATA?
Source: http://www.docuciencia.es/2009/05/lhc-el-acelerador-de-particulas/



“… the LHC produces 1PetaByte of data every second, big data and
lack of computing resources were becoming the European Organization
for Nuclear Research’s biggest IT challenges…”
       Source: computerweekly.com/news/2240173897/CERN-adopts-OpenStack-private-cloud-to-solve-big-data-challenges
1 Gigabyte (GB) = 1,000,000,000 bytes
1 Terabyte (TB) = 1,000 Gigabytes (GB)
1 Petabyte (PB) = 1,000,000 Gigabytes (GB)
1 Exabyte (EB) = 1,000,000,000 Gigabytes (GB)
1 Zettabyte (ZB) = 1,000,000,000,000 Gigabytes (GB)
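
The unit ladder above uses decimal (power-of-ten) prefixes. A minimal sketch of a converter built on those definitions follows; the `UNITS` table and the `convert` function are illustrative names, not part of the talk.

```python
# Decimal prefixes, expressed in bytes.
UNITS = {
    "GB": 10**9,
    "TB": 10**12,
    "PB": 10**15,
    "EB": 10**18,
    "ZB": 10**21,
}


def convert(value: float, from_unit: str, to_unit: str) -> float:
    """Convert a quantity between the decimal units listed in UNITS."""
    return value * UNITS[from_unit] / UNITS[to_unit]


for unit in ("TB", "PB", "EB", "ZB"):
    print(f"1 {unit} = {convert(1, unit, 'GB'):,.0f} GB")
```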
Deluge of data created daily




                               Source: Economist, Feb 25th, 2010 http://www.economist.com/node/15579717
Big Data?

definition?
BIG DATA?
Big Data is data that exceeds the storage,
processing and management capacity of
conventional systems.
The reason is that the data is too big,
moves too fast, or doesn’t fit the
structures of our current systems’
architectures.
Moreover, to gain value from this data, we
must change the way we analyze it.
NEW CHALLENGES
that must be addressed urgently, in order to respond
     to the needs of the advancement of science


                 1.   Storing
                 2.   Managing
                 3.   Processing
                 4.   Analyzing
Affordable Storage
But scanning disks…

assume 100 MB/sec per disk:
scanning 2 TB takes more than 5 hours

approach: massive parallelism

    assume 20,000 disks:
scanning 2 TB takes 1 second
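
The figures above can be checked with a short calculation. The sketch assumes the data is spread evenly across the disks and that every disk sustains the same sequential read rate; `scan_seconds` is an illustrative helper, not something from the talk.

```python
MB = 10**6
TB = 10**12


def scan_seconds(data_bytes: int, disks: int, bytes_per_second_per_disk: int) -> float:
    """Time to read `data_bytes` when it is split evenly across `disks` disks."""
    return data_bytes / (disks * bytes_per_second_per_disk)


single_disk = scan_seconds(2 * TB, disks=1, bytes_per_second_per_disk=100 * MB)
many_disks = scan_seconds(2 * TB, disks=20_000, bytes_per_second_per_disk=100 * MB)

print(f"1 disk:       {single_disk:,.0f} s ({single_disk / 3600:.2f} hours)")
print(f"20,000 disks: {many_disks:,.0f} s")
```

With 20,000 disks each disk only has to read 100 MB, which is where the one-second figure comes from.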




Source: http://www.google.com/about/datacenters/gallery/images/_2000/IDI_018.jpg
1 Data processing challenges




Rethinking data processing is required:
      MapReduce, Storm, S4,…
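
To make the MapReduce idea concrete, here is a minimal single-process sketch of its three steps (map, shuffle, reduce) applied to word counting. It is a toy illustration of the programming model, not the API of Hadoop or any other framework; all function names are hypothetical. In a real deployment the map and reduce calls would run in parallel on many machines.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(document: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one document."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by key."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: combine the values of one key into a single result."""
    return key, sum(values)


def word_count(documents: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle and reduce in sequence over the documents."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


counts = word_count(["big data big compute", "cloud data"])
print(counts)
```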



  Source: http://www.google.com/about/datacenters/gallery/images/_2000/IDI_018.jpg
2 Data storage challenges

New storage technologies are required

RAM vs HDD:          HDD is 100 times cheaper than RAM,
                     but 1000 times slower

Present solutions:   Solid-state drive (SSD),
                     non-volatile

Research:            Storage Class Memory (SCM)
3 Data management challenges


   Relational DB can’t support everything


Example: eventual consistency

Solution: “NoSQL systems”

Research: New management systems
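
Eventual consistency can be illustrated with a toy model: replicas accept writes independently and only agree after they exchange state. The sketch below assumes a last-write-wins rule based on timestamps, which is one common conflict-resolution strategy and not something the slides specify; the `Replica` class, the `sync` function and the replica names are all hypothetical.

```python
from typing import Dict, Optional, Tuple


class Replica:
    """A toy key-value replica that keeps the newest write per key."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.store: Dict[str, Tuple[int, str]] = {}  # key -> (timestamp, value)

    def write(self, key: str, value: str, timestamp: int) -> None:
        # Last-write-wins: keep the value with the highest timestamp.
        current = self.store.get(key)
        if current is None or timestamp > current[0]:
            self.store[key] = (timestamp, value)

    def read(self, key: str) -> Optional[str]:
        entry = self.store.get(key)
        return None if entry is None else entry[1]


def sync(a: Replica, b: Replica) -> None:
    """Exchange state in both directions so the replicas converge."""
    for source, target in ((a, b), (b, a)):
        for key, (timestamp, value) in list(source.store.items()):
            target.write(key, value, timestamp)


madrid = Replica("madrid")
barcelona = Replica("barcelona")

madrid.write("status", "draft", timestamp=1)
sync(madrid, barcelona)

barcelona.write("status", "published", timestamp=2)
stale_read = madrid.read("status")   # the update has not propagated yet
sync(madrid, barcelona)
fresh_read = madrid.read("status")   # both replicas now agree

print(stale_read, "->", fresh_read)
```

Between the write and the next synchronization a reader may see the old value. A relational database with strict consistency would not allow that; many NoSQL systems accept it in exchange for availability and scale.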
                                   Source: gigaom.com/cloud/big-data-and-nosql-march-to-the-enterprise/
4 Obtaining value from data

Information is not actionable knowledge

Data -> Information -> Knowledge
(volume decreases and value increases at each step)

Prediction using data mining &
machine learning techniques

Research: Most algorithms work well on
thousands of records, but at present they
are impractical for billions.
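
A rough calculation shows why scale matters here. The sketch assumes, purely for illustration, an algorithm that compares every pair of records (quadratic cost) and a machine that performs one billion comparisons per second. Neither assumption comes from the talk, and many real algorithms scale better or worse than this.

```python
def pairwise_comparisons(n: int) -> int:
    """Number of distinct pairs among n records."""
    return n * (n - 1) // 2


# Assumed throughput: one billion comparisons per second (hypothetical).
COMPARISONS_PER_SECOND = 10**9
SECONDS_PER_YEAR = 365 * 24 * 3600

small = pairwise_comparisons(10**3)   # thousands of records
large = pairwise_comparisons(10**9)   # billions of records

small_seconds = small / COMPARISONS_PER_SECOND
large_years = large / COMPARISONS_PER_SECOND / SECONDS_PER_YEAR

print(f"1,000 records:         {small:,} comparisons, {small_seconds:.6f} s")
print(f"1,000,000,000 records: {large:,} comparisons, about {large_years:.1f} years")
```

Under these assumptions a thousand records take a fraction of a millisecond, while a billion records take years on a single machine.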
Cloud Computing and Big Data:
the next frontier of science and innovation
Thank you for your attention

www.JordiTorres.org - @JordiTorresBCN




     www.smartcityexpo.com                 www.bsc.es/eBusiness
  Autonomic Systems and e-Business Platforms research line at BSC/UPC