Hadoop

Uploaded by

meghana.25s.2000

hadoop architecture

HADOOP

INTRODUCTION
Hadoop is an open-source software framework
for storing and processing large sets of data. It
provides massive storage for any kind of data,
enormous processing power, and the ability to
handle virtually limitless concurrent tasks or
jobs.
Hadoop is a framework written in Java that uses a large cluster of
commodity hardware to store and maintain very large data sets. Hadoop
is built on the MapReduce programming model, which was introduced by
Google. Today many large companies use Hadoop to deal with big data,
e.g. Facebook, Yahoo, Netflix, and eBay.

The Hadoop architecture mainly consists of 4 components.

1. MapReduce
2. HDFS (Hadoop Distributed File System)
3. YARN (Yet Another Resource Negotiator)
4. Hadoop Common (the shared libraries and utilities used by the other components)
Hadoop architecture
1. MapReduce
MapReduce is a programming model for processing data. Its main feature is distributed, parallel processing across a Hadoop cluster, which is what makes Hadoop
fast. When you are dealing with Big Data, serial processing is no longer practical.
MapReduce has two main tasks, which are divided into phases:
the Map task runs in the first phase and the Reduce task runs in the next phase.
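The two phases can be sketched in plain Python with a word count, the usual introductory example. This is only an illustration of the model, not Hadoop's API: the function names map_phase, shuffle, reduce_phase, and word_count are made up for this sketch, and everything runs in one process, whereas Hadoop would run many Map and Reduce tasks in parallel on different nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, as the framework does between the two phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values seen for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    # Each line could be mapped on a different node; here it is done serially.
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = word_count(["big data big cluster", "data node"])
print(counts)
```

Because each call to map_phase depends only on its own line, and each call to reduce_phase only on its own key, both phases can be spread across machines without coordination inside a phase.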
2. HDFS
• HDFS (Hadoop Distributed File System) is the storage layer of Hadoop. It is
designed to run on commodity hardware (inexpensive devices) as a distributed
file system. HDFS favors storing data in a small number of large blocks rather
than in many small blocks.
• HDFS provides fault tolerance and high availability to the storage layer and
the other devices present in the Hadoop cluster.
Data storage nodes in HDFS:
1. Name Node (Master)
2. Data Node (Slave)
File Block in HDFS: Data in HDFS is always stored in terms of blocks. A single file
is divided into multiple blocks of 128 MB each, which is the default size; you can also change it manually.
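The block arithmetic can be worked through with a short sketch. The helper split_into_blocks is hypothetical and only computes sizes; it assumes 128 MB means 128 × 1024 × 1024 bytes and that the final block holds whatever remains, so it may be smaller than the block size.

```python
MIB = 1024 * 1024
DEFAULT_BLOCK_SIZE = 128 * MIB  # the default block size described above


def split_into_blocks(file_size, block_size=DEFAULT_BLOCK_SIZE):
    """Return the size in bytes of each block a file of file_size bytes occupies."""
    if file_size < 0 or block_size <= 0:
        raise ValueError("file_size must be >= 0 and block_size must be > 0")
    full_blocks, remainder = divmod(file_size, block_size)
    sizes = [block_size] * full_blocks
    if remainder:
        # The last block only holds the leftover bytes.
        sizes.append(remainder)
    return sizes


blocks = split_into_blocks(300 * MIB)
print([size // MIB for size in blocks])
```

For a 300 MB file this gives two full 128 MB blocks and one 44 MB block, three blocks in total.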
Name Node
• It is the single master server that exists in the HDFS cluster.
• As it is a single node, it may become a single point of failure.
• It manages the file system namespace by executing operations such as opening,
renaming, and closing files.
• It simplifies the architecture of the system.

Data Node
• The HDFS cluster contains multiple Data Nodes.
• Each Data Node contains multiple data blocks.
• These data blocks are used to store data.
• The Data Node is responsible for serving read and write requests from the file
system's clients.
• It performs block creation, deletion, and replication upon instruction from the
Name Node.
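The division of labor between the two node types can be illustrated with a toy model. It is a sketch under stated assumptions, not HDFS code: the classes NameNode and DataNode, the helper write_file, the round-robin placement, and the block naming scheme are all invented here, and the replication factor of 3 is passed in as an assumed setting. The Name Node keeps only metadata (which blocks make up a file and where the replicas live), while the Data Nodes hold the bytes.

```python
class DataNode:
    """Stores block contents; knows nothing about file names."""

    def __init__(self, name):
        self.name = name
        self.blocks = {}  # block_id -> bytes

    def store(self, block_id, data):
        self.blocks[block_id] = data


class NameNode:
    """Holds the namespace: file name -> list of (block_id, replica locations)."""

    def __init__(self, data_nodes, replication=3):
        self.data_nodes = data_nodes
        self.replication = min(replication, len(data_nodes))
        self.namespace = {}
        self._cursor = 0  # simple round-robin placement (an assumption)

    def allocate_block(self, filename):
        """Record a new block for the file and choose the Data Nodes to hold it."""
        entries = self.namespace.setdefault(filename, [])
        block_id = f"{filename}#blk{len(entries)}"
        count = len(self.data_nodes)
        targets = [self.data_nodes[(self._cursor + i) % count]
                   for i in range(self.replication)]
        self._cursor = (self._cursor + 1) % count
        entries.append((block_id, [node.name for node in targets]))
        return block_id, targets


def write_file(name_node, filename, data, block_size):
    """Client side: ask the Name Node where each block goes, then write the replicas."""
    for offset in range(0, len(data), block_size):
        block_id, targets = name_node.allocate_block(filename)
        for node in targets:
            node.store(block_id, data[offset:offset + block_size])


nodes = [DataNode(f"dn{i}") for i in range(4)]
name_node = NameNode(nodes, replication=3)
write_file(name_node, "log.txt", b"abcdefghij", block_size=4)
print(name_node.namespace["log.txt"])
```

Losing any single Data Node in this model still leaves two copies of every block, which is the idea behind the fault tolerance described earlier. The Name Node remains the single point of failure, since only it knows which blocks belong to which file.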
3. YARN (Yet Another Resource Negotiator)

YARN is the framework on which MapReduce runs. It processes job requests and manages
cluster resources.
YARN contains:
1. Resource Manager: manages all the resources that are made available for running
applications on a Hadoop cluster.
2. Node Manager: runs on each node, manages that node, and monitors its resource usage.
3. Application Manager: works as an interface between the Resource Manager and the Node Managers.
4. Container: a collection of physical resources (such as memory and CPU) on a single node.
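How these pieces relate can be sketched with a toy allocator. This is not the YARN API: the classes ResourceManager, NodeManager, and Container below are simplified stand-ins, and the first-fit placement rule and the memory and core figures are assumptions chosen for illustration. A request names the memory and cores it needs, and the Resource Manager grants a container on a node that still has room.

```python
from dataclasses import dataclass


@dataclass
class Container:
    """A bundle of resources granted on one node."""
    node: str
    memory_mb: int
    vcores: int


@dataclass
class NodeManager:
    """Tracks the free resources of a single node."""
    name: str
    free_memory_mb: int
    free_vcores: int

    def can_fit(self, memory_mb, vcores):
        return self.free_memory_mb >= memory_mb and self.free_vcores >= vcores

    def launch(self, memory_mb, vcores):
        # Reserve the resources and hand back a container describing them.
        self.free_memory_mb -= memory_mb
        self.free_vcores -= vcores
        return Container(self.name, memory_mb, vcores)


class ResourceManager:
    """Decides which node, if any, can satisfy a resource request."""

    def __init__(self, node_managers):
        self.node_managers = node_managers

    def allocate(self, memory_mb, vcores):
        """Grant a container on the first node with enough free resources, else None."""
        for node_manager in self.node_managers:
            if node_manager.can_fit(memory_mb, vcores):
                return node_manager.launch(memory_mb, vcores)
        return None


rm = ResourceManager([NodeManager("node1", 4096, 2), NodeManager("node2", 8192, 4)])
granted = [rm.allocate(2048, 1) for _ in range(3)]
too_big = rm.allocate(16384, 1)
print([container.node for container in granted], too_big)
```

The first two requests fill node1, so the third lands on node2, and a request larger than any node's free memory is refused. A real scheduler would typically queue such a request rather than return nothing.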
