Introduction to
HiveQL
BY KRISTIN FERRIER
About Me – Kristin Ferrier
 15+ Years in IT (Software development and BI development)
 10+ years experience with SQL Server and 5+ years experience with
Oracle
 Co-founder OKCSQL
 Currently Sr. Data Analyst at an energy company
 Social Media
 Twitter: @SQLenergy
 Blog: http://www.kristinferrier.com
Agenda
 Hadoop – Very High Level
 Hive and HiveQL - High Level
 Getting started with Hive and HiveQL
 HiveQL examples
 Resources for getting started with HiveQL
Hadoop
 Open source software
 Popular for storing, processing, and analyzing large volumes of data
 For example, web logs or sensor data
 Main distributions
 Cloudera
 Hortonworks
 MapR (has some proprietary components)
Hadoop 2.0 Main Components
 Hadoop Distributed File System (HDFS)
 Handles the data storage
 MapReduce
 Handles the processing
 Works with key value pairs
 Often written in Java
 Can be written in any scripting language using the Streaming API of
Hadoop
Example MapReduce Code
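import java.io.IOException;
import java.util.StringTokenizer;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

// Word-count mapper; nest it inside a job class (for example, WordCount).
public static class Map extends Mapper<LongWritable, Text, Text, IntWritable> {
    private final static IntWritable one = new IntWritable(1);
    private Text word = new Text();

    // Emits (word, 1) for every whitespace-separated token in the input line.
    @Override
    public void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        String line = value.toString();
        StringTokenizer tokenizer = new StringTokenizer(line);
        while (tokenizer.hasMoreTokens()) {
            word.set(tokenizer.nextToken());
            context.write(word, one);
        }
    }
}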
Code from Hortonworks tutorial found at http://hortonworks.com/hadoop-tutorial/introducing-apache-hadoop-developers/
Getting Started with Hadoop
 What if I don’t know Java?
 Or one of the Scripting languages using the Streaming API of Hadoop
 Example: Python
 That’s OK. If you know SQL, then Hive and HiveQL may be a great
starting point for your Hadoop learning
Hive
Hive essentially allows us to use tables
within Hadoop
 Built on top of Apache Hadoop
 Can access files stored in HDFS or HBase
 HCatalog allows you to apply table structures to the data
 HiveQL to query the data
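As a rough illustration of applying a table structure to files that already sit in HDFS, the sketch below defines an external table over a directory of comma-delimited text. The table name, column names, and HDFS path are assumptions made up for this example; they do not come from the slides.

```sql
-- Hypothetical schema and HDFS directory, for illustration only.
-- EXTERNAL means Hive reads the files in place and does not delete
-- them if the table is later dropped.
CREATE EXTERNAL TABLE IF NOT EXISTS weather (
  station_id STRING,
  obs_date   STRING,
  temp_f     DOUBLE
)
ROW FORMAT DELIMITED
  FIELDS TERMINATED BY ','
STORED AS TEXTFILE
LOCATION '/user/hue/weather';
```

Once the table exists, the files under that directory can be queried with ordinary HiveQL.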
HiveQL
HiveQL is a SQL-like language for
querying data in Hive
 Follows some of the ANSI SQL-92 standard
 Offers its own extensions
 Implicitly turned into MapReduce jobs
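To give a feel for how much MapReduce code a single query can stand in for, here is a sketch of a word count written in HiveQL. It assumes a hypothetical table docs with one STRING column named line; both names are invented for this example. The result is similar to a word-count MapReduce job, although tokenization details (for example, leading whitespace) may differ.

```sql
-- Hypothetical table: docs(line STRING), one row per line of text.
-- split() breaks each line on whitespace, explode() turns the
-- resulting array into one row per word, and GROUP BY counts them.
SELECT word, COUNT(*) AS word_count
FROM (
  SELECT explode(split(line, '\\s+')) AS word
  FROM docs
) w
GROUP BY word;
```

Hive plans the map and reduce stages for this query itself, so no Java needs to be written.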
HiveQL – Key SQL items it has
 SELECT
 FROM
 WHERE
 GROUP BY
 HAVING
 JOINS – Some kinds
HiveQL – Key differences from SQL
 No transactions by default (ACID support is opt-in per table)
 No materialized views
 UPDATE and DELETE available only in Hive 0.14 and later, on ACID tables
 Hive 0.14 was released in November 2014
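A minimal sketch of UPDATE and DELETE in Hive 0.14 or later follows. In that release these statements work only on tables declared transactional, which must be bucketed and stored as ORC. The table and column names are hypothetical, and the sketch assumes the cluster's transaction manager has already been configured by an administrator.

```sql
-- Hypothetical table; bucketing, ORC storage, and the
-- 'transactional' property are what make UPDATE/DELETE possible.
CREATE TABLE weather_acid (
  station_id STRING,
  temp_f     DOUBLE
)
CLUSTERED BY (station_id) INTO 4 BUCKETS
STORED AS ORC
TBLPROPERTIES ('transactional' = 'true');

-- The bucketing column (station_id) cannot be updated, so only
-- temp_f is changed here.
UPDATE weather_acid SET temp_f = 72.5 WHERE station_id = 'OKC';

DELETE FROM weather_acid WHERE temp_f IS NULL;
```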
Accessing Hive
 Hue
 Web interface for Hadoop
 Beeswax
 Hive UI within Hue
Hue
Beeswax
Getting Data into Hive Tables
 One way is to import a file into Hive
 Can create the table at this time
 Can import the data at this time
 File can even come from a Windows box
Importing a file
Beeswax  Tables  Create a new table from a file
Importing a file cont.
Enter Table Name and Description → .. button
Importing a file cont.
Upload a file → Select your Windows file → Open
Importing a file cont.
After file uploads, double-click your file
Importing a file cont.
Choose a Delimiter
Importing a file cont.
Select column data types → Create Table
Importing a file cont.
Table has been created
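The import wizard steps correspond roughly to two HiveQL statements: one that creates the table and one that loads the uploaded file into it. The sketch below is an approximation, not what Hue actually generates; the table name, columns, delimiter, and file path are all assumptions for illustration.

```sql
-- Hypothetical table matching a comma-delimited upload.
CREATE TABLE IF NOT EXISTS weather_import (
  station_id STRING,
  obs_date   STRING,
  temp_f     DOUBLE
)
ROW FORMAT DELIMITED
  FIELDS TERMINATED BY ','
STORED AS TEXTFILE;

-- Moves the file from its (hypothetical) HDFS upload location
-- into the table's storage directory.
LOAD DATA INPATH '/user/hue/uploads/weather.csv'
INTO TABLE weather_import;
```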
Query Editor
 Write queries in the Query Editor
Select
SELECT * FROM WEATHER
Where, Group By, Min/Max
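The query shown on the original slide is not preserved in this transcript. As a stand-in, here is a sketch that combines WHERE, GROUP BY, MIN, and MAX against the WEATHER table; the column names and the date filter are hypothetical.

```sql
-- Hypothetical columns: station_id, obs_date, temp_f.
-- Returns the lowest and highest temperature per station
-- for observations on or after the given date.
SELECT station_id,
       MIN(temp_f) AS min_temp_f,
       MAX(temp_f) AS max_temp_f
FROM weather
WHERE obs_date >= '2014-01-01'
GROUP BY station_id;
```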
Where, Group By, Min/Max - Results
Aliasing, Ordering
 Standard SQL syntax for Aliasing
 ORDER BY works as in SQL and gives a total ordering; SORT BY is a Hive extension that orders rows only within each reducer
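A short sketch of aliasing and the two ordering clauses follows. Hive accepts both ORDER BY and SORT BY, but they behave differently. The WEATHER column names used here are hypothetical.

```sql
-- ORDER BY: one globally sorted result (a single final reducer).
SELECT w.station_id AS station, w.temp_f AS temperature
FROM weather w
ORDER BY temperature DESC
LIMIT 10;

-- SORT BY: rows are sorted within each reducer only, so the
-- overall output is not globally sorted when several reducers run.
SELECT w.station_id AS station, w.temp_f AS temperature
FROM weather w
SORT BY temperature DESC;
```

SORT BY can be cheaper on large data sets because it avoids funnelling every row through one reducer, at the cost of a total ordering.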
Aliasing, Ordering - Results
Joins
 INNER, LEFT, RIGHT, and FULL OUTER
 Equi-joins only (before Hive 2.2.0): (table1.key = table2.key) is allowed but not (table1.key <> table2.key)
 Extensions exist like LEFT SEMI JOIN
INNER JOIN
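The slide's original query is not preserved in this transcript, so the following is only a sketch of an inner equi-join. It assumes a hypothetical stations lookup table alongside WEATHER; all column names are made up for the example.

```sql
-- Hypothetical tables: weather(station_id, obs_date, temp_f)
-- and stations(station_id, station_name).
-- Only rows with a matching station_id in both tables are returned.
SELECT s.station_name, w.obs_date, w.temp_f
FROM weather w
INNER JOIN stations s
  ON (w.station_id = s.station_id);
```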
INNER JOIN - Results
LEFT SEMI JOIN
 Left Semi Joins are less necessary
starting with Hive 0.13
 As of Hive 0.13 the IN/NOT
IN/EXISTS/NOT EXISTS operators are
supported using subqueries
SELECT a.key, a.value
FROM a
WHERE a.key IN
  (SELECT b.key
   FROM b);
can be rewritten to
SELECT a.key, a.value
FROM a LEFT SEMI JOIN b ON (a.key = b.key);
Example from https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Joins
Performance
 Queries can take minutes to run. Focus is on analysis of large data
sets.
 Relational databases are still a strong solution for providing the faster
performance of CRUD (create, read, update, and delete)
operations required by OLTP systems.
Summary
 Hive essentially allows us to use tables in Hadoop
 We can query them using HiveQL, which is similar to SQL
 Knowing how to write MapReduce code is not required, as the
HiveQL will be turned into MapReduce for us
Getting Started Yourself
 Hortonworks Sandbox
 Portable Hadoop environment with tutorials
 Even though the sandbox runs Hadoop on Linux, you can run the sandbox
on your Windows machine and access it via a web browser
 Available at http://hortonworks.com/sandbox
Getting Started Yourself
 Hive DML Reference
 https://cwiki.apache.org/confluence/display/hive/languageManual+dml
 Apache’s Hive Language Manual
 https://cwiki.apache.org/confluence/display/Hive/LanguageManual
 Treasure’s HiveQL Reference
 http://docs.treasuredata.com/articles/hive
 Network World – Comparing the top Hadoop Distros
 http://www.networkworld.com/article/2369327/software/comparing-the-top-hadoop-distributions.html
Contact Info
 Social Media
 Twitter: @SQLenergy
 Blog: http://www.kristinferrier.com
