Hadoop foundation for analytics

Paper name : Big Data Analytics
Staff : Mrs M. Florence Dayana M. C. A., M.Phil., (Ph.D.)
Class : II- M.Sc.(Computer Science)
Semester : IV
Unit : IV
Topic : Hadoop Foundation for Analytics

Hadoop
Foundation
for Analytics

HISTORY OF HADOOP:
• Hadoop was created by Doug Cutting and Mike Cafarella. It is
created in 2005.
• Firstly, it was developed to support the distribution for the
Nutch search engine project.
• It was named by Doug after seeing his son’s toy elephant.
• By that time, he was worked in Yahoo.
• Hadoop is an open-source distributed processing framework.
It manages data processing and also storage for big data
applications running in clustered systems.

ARCHITECTURE OF HADOOP:
Hadoop has two major layers. They are
• Processing/Computation layer (Also called as MapReduce)
• Storage layer (Hadoop Distributed File System).
The Hadoop framework application works in an environment
that provides distributed storage and computation across
group of computers. Hadoop is designed in a way to handle
single server to thousands of machines, each offering local
computation and storage.

COMPONENTS OF HADOOP:
• Hive:
Hive is an open-source data warehouse framework that
structures and queries data using a SQL like language called
HiveQL.
• Ambari:
Ambari was designed to remove difficulties of Hadoop
management by providing a simple web interface that can
manage and monitor Apache Hadoop clusters.

• HBase:
HBase is an open-source and distributed database model that
provides random, real-time read/write access to your big data. It is a
non-relational database model HBase is a NoSQL Database for Hadoop.
• Pig:
Pig is an open-source technology that enables low cost storage and
processing of large sets of data, without requiring any specific formats.
• Zookeeper:
ZooKeeper is an open-source platform that provides a centralized
infrastructure. It is used for maintaining configuration information,
naming, providing distributed synchronization, and also providing group
services.

Hadoop foundation for analytics

More Related Content

What's hot (20)

Similar to Hadoop foundation for analytics (20)

Recently uploaded (20)

Hadoop foundation for analytics