SlideShare a Scribd company logo
5
Most read
8
Most read
14
Most read
Big Data and HadoopRahul Agarwalirahul.com
AmrAwadallah: http://www.sfbayacm.org/wp/wp-content/uploads/2010/01/amr-hadoop-acm-dm-sig-jan2010.pdf
Hadoop: http://hadoop.apache.org/
Computerworld: http://www.computerworld.com/s/article/350908/5_Indispensable_IT_Skills_of_the_Future
AshishTushoo: http://www.sfbayacm.org/wp/wp-content/uploads/2010/01/sig_2010_v21.pdf
Big data: http://en.wikipedia.org/wiki/Big_data
Chukwa: http://www.cca08.org/papers/Paper-13-Ariel-Rabkin.pdf
Dean, Ghemawat: http://labs.google.com/papers/mapreduce.htmlAttributions
Big Data Problem
What is Hadoop
HDFS
MapReduce
HBase
PIG
HIVE
Chukwa
ZooKeeper
Q&AAgenda

More Related Content

What's hot (20)

PPTX
Apache HBase™
Prashant Gupta
 
PPTX
Introduction to Hadoop and Hadoop component
rebeccatho
 
PPSX
Hadoop
Nishant Gandhi
 
PPTX
Hadoop technology
tipanagiriharika
 
PPT
Hadoop Technology
Atul Kushwaha
 
PPTX
HBase Tutorial For Beginners | HBase Architecture | HBase Tutorial | Hadoop T...
Simplilearn
 
PPTX
Hadoop and Big Data
Harshdeep Kaur
 
PPTX
Big Data
Subhavinolin Raja
 
PPT
Hive(ppt)
Abhinav Tyagi
 
PPTX
Introduction To Hadoop | What Is Hadoop And Big Data | Hadoop Tutorial For Be...
Simplilearn
 
PPTX
Apache hive introduction
Mahmood Reza Esmaili Zand
 
PDF
Big Data Architecture
Guido Schmutz
 
PPT
Introduction to MongoDB
Ravi Teja
 
PPTX
The Basics of MongoDB
valuebound
 
PPTX
Hadoop File system (HDFS)
Prashant Gupta
 
PPTX
Big Data Analytics
Ghulam Imaduddin
 
PPTX
Map Reduce
Prashant Gupta
 
PPTX
introduction to NOSQL Database
nehabsairam
 
PPTX
Introduction to Big Data & Hadoop Architecture - Module 1
Rohit Agrawal
 
PPT
Big data ppt
IDBI Bank Ltd.
 
Apache HBase™
Prashant Gupta
 
Introduction to Hadoop and Hadoop component
rebeccatho
 
Hadoop technology
tipanagiriharika
 
Hadoop Technology
Atul Kushwaha
 
HBase Tutorial For Beginners | HBase Architecture | HBase Tutorial | Hadoop T...
Simplilearn
 
Hadoop and Big Data
Harshdeep Kaur
 
Hive(ppt)
Abhinav Tyagi
 
Introduction To Hadoop | What Is Hadoop And Big Data | Hadoop Tutorial For Be...
Simplilearn
 
Apache hive introduction
Mahmood Reza Esmaili Zand
 
Big Data Architecture
Guido Schmutz
 
Introduction to MongoDB
Ravi Teja
 
The Basics of MongoDB
valuebound
 
Hadoop File system (HDFS)
Prashant Gupta
 
Big Data Analytics
Ghulam Imaduddin
 
Map Reduce
Prashant Gupta
 
introduction to NOSQL Database
nehabsairam
 
Introduction to Big Data & Hadoop Architecture - Module 1
Rohit Agrawal
 
Big data ppt
IDBI Bank Ltd.
 

Viewers also liked (20)

PPT
Seminar Presentation Hadoop
Varun Narang
 
PPTX
Hadoop introduction , Why and What is Hadoop ?
sudhakara st
 
PDF
Practical Problem Solving with Apache Hadoop & Pig
Milind Bhandarkar
 
PPTX
Big Data & Hadoop Tutorial
Edureka!
 
PPT
HIVE: Data Warehousing & Analytics on Hadoop
Zheng Shao
 
PDF
introduction to data processing using Hadoop and Pig
Ricardo Varela
 
PDF
Facebooks Petabyte Scale Data Warehouse using Hive and Hadoop
royans
 
PPTX
Pig, Making Hadoop Easy
Nick Dimiduk
 
KEY
Hadoop, Pig, and Twitter (NoSQL East 2009)
Kevin Weil
 
PDF
Integration of Hive and HBase
Hortonworks
 
PPT
Introduction To Map Reduce
rantav
 
PDF
Hive Quick Start Tutorial
Carl Steinbach
 
PPTX
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
Ashok Royal
 
PPTX
Designing an IT Solution
Philippe Julio
 
DOCX
Hadoop Report
Nishant Gandhi
 
DOCX
Big data hadoop titles 2015 2016
xtreamtechnologies
 
PPTX
Hadoop and big data training
agiamas
 
PDF
trng seminar
divya gupta
 
PPTX
Hadoop for beginners free course ppt
Njain85
 
PPT
Hadoop Technologies
Kannappan Sirchabesan
 
Seminar Presentation Hadoop
Varun Narang
 
Hadoop introduction , Why and What is Hadoop ?
sudhakara st
 
Practical Problem Solving with Apache Hadoop & Pig
Milind Bhandarkar
 
Big Data & Hadoop Tutorial
Edureka!
 
HIVE: Data Warehousing & Analytics on Hadoop
Zheng Shao
 
introduction to data processing using Hadoop and Pig
Ricardo Varela
 
Facebooks Petabyte Scale Data Warehouse using Hive and Hadoop
royans
 
Pig, Making Hadoop Easy
Nick Dimiduk
 
Hadoop, Pig, and Twitter (NoSQL East 2009)
Kevin Weil
 
Integration of Hive and HBase
Hortonworks
 
Introduction To Map Reduce
rantav
 
Hive Quick Start Tutorial
Carl Steinbach
 
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
Ashok Royal
 
Designing an IT Solution
Philippe Julio
 
Hadoop Report
Nishant Gandhi
 
Big data hadoop titles 2015 2016
xtreamtechnologies
 
Hadoop and big data training
agiamas
 
trng seminar
divya gupta
 
Hadoop for beginners free course ppt
Njain85
 
Hadoop Technologies
Kannappan Sirchabesan
 
Ad

Similar to Big data and Hadoop (20)

PPTX
Big Data and Hadoop Training in Chandigarh
Big Boxx Animation Academy
 
PPTX
Hands on Hadoop and pig
Sudar Muthu
 
ODP
Hadoop introduction
葵慶 李
 
PPTX
Hadoop workshop
Purna Chander
 
PDF
Lesson 1 introduction to_big_data_and_hadoop.pptx
Pankajkumar496281
 
PPT
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Chris Baglieri
 
PDF
Константин Швачко, Yahoo!, - Scaling Storage and Computation with Hadoop
Media Gorod
 
PPTX
A Glimpse of Bigdata - Introduction
saisreealekhya
 
PPTX
Hadoop An Introduction
Mohanasundaram Ponnusamy
 
PPTX
Intro to hadoop ecosystem
Grzegorz Kolpuc
 
PDF
Big data and hadoop overvew
Kunal Khanna
 
PPTX
Big data concepts
Serkan Özal
 
PDF
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2
tcloudcomputing-tw
 
PDF
Hadoop breizhjug
David Morin
 
PPTX
Modul_1_Introduction_to_Big_Data.pptx
NouhaElhaji1
 
PPTX
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
amrutupre
 
PPTX
Hadoop and their in big data analysis EcoSystem.pptx
Rahul Borate
 
PPTX
Big data and hadoop
Sri Kanth
 
PPTX
Big data
Abilash Mavila
 
Big Data and Hadoop Training in Chandigarh
Big Boxx Animation Academy
 
Hands on Hadoop and pig
Sudar Muthu
 
Hadoop introduction
葵慶 李
 
Hadoop workshop
Purna Chander
 
Lesson 1 introduction to_big_data_and_hadoop.pptx
Pankajkumar496281
 
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Chris Baglieri
 
Константин Швачко, Yahoo!, - Scaling Storage and Computation with Hadoop
Media Gorod
 
A Glimpse of Bigdata - Introduction
saisreealekhya
 
Hadoop An Introduction
Mohanasundaram Ponnusamy
 
Intro to hadoop ecosystem
Grzegorz Kolpuc
 
Big data and hadoop overvew
Kunal Khanna
 
Big data concepts
Serkan Özal
 
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q2
tcloudcomputing-tw
 
Hadoop breizhjug
David Morin
 
Modul_1_Introduction_to_Big_Data.pptx
NouhaElhaji1
 
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
amrutupre
 
Hadoop and their in big data analysis EcoSystem.pptx
Rahul Borate
 
Big data and hadoop
Sri Kanth
 
Big data
Abilash Mavila
 
Ad

Recently uploaded (20)

PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
PDF
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
PPTX
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
PDF
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PPTX
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
PDF
Kit-Works Team Study_20250627_한달만에만든사내서비스키링(양다윗).pdf
Wonjun Hwang
 
PDF
Staying Human in a Machine- Accelerated World
Catalin Jora
 
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PDF
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
Digital Circuits, important subject in CS
contactparinay1
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
Kit-Works Team Study_20250627_한달만에만든사내서비스키링(양다윗).pdf
Wonjun Hwang
 
Staying Human in a Machine- Accelerated World
Catalin Jora
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 

Big data and Hadoop

Editor's Notes

  • #5: Analyzing large amounts of data is the top predicted skill required!
  • #10: Pool commodity servers in a single hierarchical namespace.Designed for large files that are written once and read many times.Example here shows what happens with a replication factor of 3, each data block is present in at least 3 separate data nodes.Typical Hadoop node is eight cores with 16GB ram and four 1TB SATA disks.Default block size is 64MB, though most folks now set it to 128MB
  • #18: Example flow as at Facebook
  • #19: Aircraft is refined, very fast, and has a lot of addons/features. But it is pricey on a per bit basis and is expensive to maintainCargo train is rough, missing a lot of “luxury”, slow to accelerate, but it can carry almost anything and once it gets going it can move a lot of stuff very economically