0% found this document useful (0 votes)

65 views

Hive Code

The document provides an overview of common Hive commands used for interacting with Hive databases, tables, and data. Some key points covered include: - How to start the Hive shell, check HDFS contents, create databases and tables, load and query data, and drop databases and tables. - How to create external tables in Hive, load data into different file formats like ORC and Parquet, and perform operations on tables like renaming, adding/dropping columns. - How to query data using conditions, sorting, aggregation, joins, and views.

Uploaded by

Abdul Khaliq

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

65 views

Hive Code

Uploaded by

Abdul Khaliq

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Hive Illustration : Basics

 To get started with the hive shell,

hive

 To check what is present in the HDFS,

hadoop fs – ls

 To create a directory in the current path (let’s say the name is ‘foo’),

hadoop fs - mkdir foo

 To create a database, in the hive shell (let’s say the name is ‘vin_emp’),

create database vin_emp;

 To see existing databases,

show databases;

 To start using the database,

use vin_emp;

 To check the tables present in the database,

show tables;

 To come out of the hive shell,

quit;

 to list contents of the current working directory,

 To create a directory,

mkdir myhivedata

 To navigate into that data,

cd myhivedata

 To check the present working directory

pwd

 To check the contents of a file (name of the file is employees.txt),

cat employees.txt

 To create a table within the hive shell,

create table emp_global(id int,name string,city string,continent string)

row format delimited
fields terminated by ‘ , ‘
stored as textfile ;
 To check the tables,
show tables ;

 To query the table,

select * from emp_global ;

 To load data into the table,

load local inpath ‘employees.txt’ into table emp_global;

 To drop all tables inside a database,

drop database vin_emp cascade,

 To know the schema of the table,

describe emp_global ;

 To drop a table,

drop table emp_global ;

Hive Illustration : External tables in hive

 To create a database in a certain desired location,

create database vin_emp_loc location ‘/user/cloudera/myhivedata’ ;

 To copy a file from local file system to hdfs location,

hadoop fs – put empglobal.csv empdata

 To see the contents of the file,

hadoop df – cat empdata/empglobal.csv

Hive Illustration : Loading different file formats

 How to know what the table type is, whether internal or external,

describe extended emp_global ;

 To load the data into orc table,

insert into table emp_global_orc select * from emp_global ;

 Create a table whose schema is exactly like an existing table,

create table emp_global_seq LIKE emp_global_orc stored as sequencefile ;

Hive Illustration : Loading data into Hive tables

 Create table only if another table of the same name doesn’t exist and an input multiple
values in a single column using an array,

create table if not exists sibling_data (

name string, age int, country string, siblings array<string> )
row format delimited
fields terminated by ‘ , ‘
collection items terminated by ‘#’
lines terminated by ‘\n’
sorted as textfile ;

 To create table with multiple inputs of different data type in a single column,

create table auto_details(company string, model string, fuel string,

basic_specs struct<vehicle_type : string, doors : int, gears : int>,
engine_specs struct<cc : int, bhp : double>)
row format delimited
fields terminated by ‘ , ‘
collection items terminated by ‘#’ ;

Hive Illustration : Simple Operations on Hive tables

 To rename an existing table,
alter table auto_details rename to auto_table ;
 To change the name of any column,
alter table auto_details change fuel fuel_type string ;
 To add a new column to an existing table,
alter table auto_details add columns (milage double) ;
 To drop columns,(mention columns which need to remain inside the brackets after “replace”
keyword)
alter table auto_details replace (company string, model string, fuel_type string) ;

Hive Illustration : Query Operations on Hive tables

 To create a table inside a desired pre-existing database, without navigating into the
database first
create table if not exists company.empdata (
empid int,
empname string,
salary double,
designation string,
department string,,
salary double,
designation string,
department string,
age int)
row format delimited
fields terminated by ‘ , ‘
lines terminated by ‘\n’
tblproperties(‘skip.header.line.count’ = ‘1’) ;

 To select all columns and only those rows which satisfy a certain condition,
select * from empdata where department = “HR” ;
 To select all columns and only those rows which satisfy more than one condition,
select * from empdata where department =”HR” and salary > 25000 ;
 To select only desired columns and only those rows which satisfy more than one condition,
select empname, age from empdata where department =”HR” and salary > 25000 ;
 To select all columns and sort the rows based on a desired column,
select * from empdata order by salary ;
 To select all columns and sort the rows based on a desired column in descending order,

select * from empdata order by salary desc ;

 To count the total number of rows in the dataset,
select count(*) from empdata ;
 To use ‘groupby’ to count number of rows based in each category of a certain column,
select department, count(*) from empdata group by department ;
 To select all column but only those rows which do not have null value in a desired column,
select * from empdata where salary is not null ;
 To select rows by matching a substring with a desired column value,
select * from empdata where designation rlike “Manager” or rlike “manager” or “Lead” ;
 To find the average of a desired numerical column, grouped by a categorical column,
select department, avg(salary) from empdata group by department;

Hive Illustration : Querying complex structures

 To enable join operations in the hive shell,
SET hive.auto.conveert.join = False;
 To perform a join operation,
select emp.empname, emp.salary from emp_epf pf join empdata emp on (pf.empid = emp.empid) ;
 To perform a left outer join operation,
select emp.empname, emp.salary from emp_epf pf left outer join empdata emp on (pf.empid =
emp.empid) ;
 To perform a right outer join operation,
select emp.empname, emp.salary from emp_epf pf right outer join empdata emp on (pf.empid =
emp.empid) ;
 To perform a full outer join operation,
select emp.empname, emp.salary from emp_epf pf full outer join empdata emp on (pf.empid =
emp.empid) ;

Hive Illustration : Views

 To create view,
create view if not exists high_sal as select * from empdata where salary > 50000 ;
 To query data from view,
select * from high_sal ;
 To see if view is created,
show tables ;
 To see the table type, (virtual or Managed),
describe formatted high_sal ;
 To create a table, partitioned by a desired column,
create table emp_global_part(id int, name string, city string, country string)
portioned by (continent string)
row format delimited
fields terminated by ‘ , ‘
stored as textfile ;

Standard Glossary of Terms Used in Software Testing All Terms
No ratings yet
Standard Glossary of Terms Used in Software Testing All Terms
67 pages
SPAMMING TUT Cading & Hacking Guide
100% (1)
SPAMMING TUT Cading & Hacking Guide
22 pages
Elex Event Log Explorer Guide
No ratings yet
Elex Event Log Explorer Guide
57 pages
Chapter+9+ HIVE
No ratings yet
Chapter+9+ HIVE
50 pages
Openas2 Server Application
No ratings yet
Openas2 Server Application
48 pages
Red Hat Jboss Enterprise Application Platform 7.1: Getting Started Guide
No ratings yet
Red Hat Jboss Enterprise Application Platform 7.1: Getting Started Guide
61 pages
Experiment 3: Hive: Aim: To Understand Data Processing Tool - Hive and HQL (Hive Query Language)
No ratings yet
Experiment 3: Hive: Aim: To Understand Data Processing Tool - Hive and HQL (Hive Query Language)
11 pages
DSCI 5350 - Lecture 5 PDF
No ratings yet
DSCI 5350 - Lecture 5 PDF
64 pages
HIVE
No ratings yet
HIVE
80 pages
5BDA
No ratings yet
5BDA
5 pages
Big Data Analytics and Developers Training Session 10
No ratings yet
Big Data Analytics and Developers Training Session 10
27 pages
hive
No ratings yet
hive
15 pages
Hive 2nd Practical
No ratings yet
Hive 2nd Practical
11 pages
A_3_hive
No ratings yet
A_3_hive
5 pages
Hive
No ratings yet
Hive
13 pages
Big Data Analytics: Welcome
No ratings yet
Big Data Analytics: Welcome
69 pages
Cheat Sheet: Hive Basics
No ratings yet
Cheat Sheet: Hive Basics
1 page
Hive Presentation
No ratings yet
Hive Presentation
18 pages
BDA Unit-5-PPT
No ratings yet
BDA Unit-5-PPT
39 pages
hive table session
No ratings yet
hive table session
23 pages
ABP W11-W12 Big Data Analytics Lab-HIVE
No ratings yet
ABP W11-W12 Big Data Analytics Lab-HIVE
8 pages
BDA - Exp-8 - Aarya Sawant
No ratings yet
BDA - Exp-8 - Aarya Sawant
18 pages
Hive Commands Syn
No ratings yet
Hive Commands Syn
27 pages
Week-11 - 12-Hivepdf - 2023 - 11 - 10 - 12 - 47 - 43
No ratings yet
Week-11 - 12-Hivepdf - 2023 - 11 - 10 - 12 - 47 - 43
8 pages
BDA-UNIT-IV -2020-21
100% (1)
BDA-UNIT-IV -2020-21
30 pages
Hive PPTs
No ratings yet
Hive PPTs
34 pages
Apache HIVE
No ratings yet
Apache HIVE
44 pages
HIVE Lect
No ratings yet
HIVE Lect
91 pages
Hive
No ratings yet
Hive
65 pages
Unit-4 Pig Hive
No ratings yet
Unit-4 Pig Hive
40 pages
Apache Hive: An Introduction
No ratings yet
Apache Hive: An Introduction
51 pages
Hive Notes PDF
No ratings yet
Hive Notes PDF
12 pages
Unit-5 - Hive
No ratings yet
Unit-5 - Hive
31 pages
Hive Overview
No ratings yet
Hive Overview
28 pages
Introduction to Hive
No ratings yet
Introduction to Hive
14 pages
Hive Tutorial
No ratings yet
Hive Tutorial
25 pages
Hiveppt
No ratings yet
Hiveppt
29 pages
Bigdata@master: 4.set The Environmental Variable HIVE - HOME in Bashrc File
No ratings yet
Bigdata@master: 4.set The Environmental Variable HIVE - HOME in Bashrc File
91 pages
Hive L1
No ratings yet
Hive L1
134 pages
Creating and Managing Database
No ratings yet
Creating and Managing Database
2 pages
5- HIVE
No ratings yet
5- HIVE
51 pages
Hadoop Hive
No ratings yet
Hadoop Hive
61 pages
Hive Documet
No ratings yet
Hive Documet
33 pages
Unit-5
No ratings yet
Unit-5
21 pages
Hive
No ratings yet
Hive
29 pages
Unit 2.2 Hive
No ratings yet
Unit 2.2 Hive
80 pages
module 3-1
No ratings yet
module 3-1
32 pages
Hive-Part-2
No ratings yet
Hive-Part-2
53 pages
HDFSandhivecommands
No ratings yet
HDFSandhivecommands
15 pages
HQL Cheat Sheet PDF
No ratings yet
HQL Cheat Sheet PDF
3 pages
Hive
No ratings yet
Hive
9 pages
Unit Iv Part - 1
No ratings yet
Unit Iv Part - 1
60 pages
Hive_Main
No ratings yet
Hive_Main
33 pages
Hive For SQL Users: Cheat Sheet
No ratings yet
Hive For SQL Users: Cheat Sheet
3 pages
Big Data Record 2
No ratings yet
Big Data Record 2
117 pages
Hive-Part-2
No ratings yet
Hive-Part-2
47 pages
Hive Data Manipulation
No ratings yet
Hive Data Manipulation
17 pages
Hive
No ratings yet
Hive
45 pages
Datatypes in Hive
No ratings yet
Datatypes in Hive
31 pages
Hive Final (1)
No ratings yet
Hive Final (1)
75 pages
Excel Techniques
From Everand
Excel Techniques
Online Trainees
2/5 (1)
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Smartsheet User Guide for Accelerated Learning
From Everand
Smartsheet User Guide for Accelerated Learning
Darren Mullen
No ratings yet
C++ Functions and tutorial
From Everand
C++ Functions and tutorial
Nino Paiotta
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Work From Home Accomplishment Report and Weekly Plan
No ratings yet
Work From Home Accomplishment Report and Weekly Plan
2 pages
Jmol Application Instruction Sheet English
No ratings yet
Jmol Application Instruction Sheet English
2 pages
A Study of Cyber Crime Awareness For Prevention and Its Impact
No ratings yet
A Study of Cyber Crime Awareness For Prevention and Its Impact
7 pages
Supervisor List
No ratings yet
Supervisor List
7 pages
Administration Ax 2009
No ratings yet
Administration Ax 2009
2 pages
9 - Django Rest Framework
No ratings yet
9 - Django Rest Framework
40 pages
PCI DSS v3 - 2 - 1 ROC S6 R2 Do Not Use Vendor Supplied Defaults
No ratings yet
PCI DSS v3 - 2 - 1 ROC S6 R2 Do Not Use Vendor Supplied Defaults
11 pages
Easy Access Rules For Information Security - Word File Final
No ratings yet
Easy Access Rules For Information Security - Word File Final
278 pages
Fresher Golang
No ratings yet
Fresher Golang
1 page
Knowing The Internals - Who Needs SQL Server Anyway - Mark Rasmussen
No ratings yet
Knowing The Internals - Who Needs SQL Server Anyway - Mark Rasmussen
98 pages
Intro To Back-End Dev & Node Js
No ratings yet
Intro To Back-End Dev & Node Js
19 pages
@digitalearn_official tools
No ratings yet
@digitalearn_official tools
2 pages
Offensive Security: Penetration Test Report For OSCP Exam
No ratings yet
Offensive Security: Penetration Test Report For OSCP Exam
17 pages
Mobile Technology Assignment
No ratings yet
Mobile Technology Assignment
3 pages
Important Question With Answers UNIT 1 To 5
No ratings yet
Important Question With Answers UNIT 1 To 5
145 pages
Designing A Project: Administrator User Guide 12 Series
No ratings yet
Designing A Project: Administrator User Guide 12 Series
2 pages
OOP - Slide-3 (Lec - 04 - 05)
No ratings yet
OOP - Slide-3 (Lec - 04 - 05)
22 pages
ICT Sector - Annual Monitoring Report PDF
No ratings yet
ICT Sector - Annual Monitoring Report PDF
46 pages
Data Analytics and Audit Coverage Guide
No ratings yet
Data Analytics and Audit Coverage Guide
46 pages
Making POP3 & SMTP Server Work With Windows Server 2008
No ratings yet
Making POP3 & SMTP Server Work With Windows Server 2008
5 pages
PH Home Internet Comparison
No ratings yet
PH Home Internet Comparison
2 pages
Syllabus Computer Systems Diploma and Certificate
No ratings yet
Syllabus Computer Systems Diploma and Certificate
36 pages
Offensive Security PDF
No ratings yet
Offensive Security PDF
2 pages
COSC 4926 - Assignment 2
No ratings yet
COSC 4926 - Assignment 2
2 pages
Amulya DataStag Resume
No ratings yet
Amulya DataStag Resume
4 pages