HDFS Blocks

Uploaded by

The document discusses the differences between HDFS and network attached storage. HDFS is the primary storage system for Hadoop that stores very large files across a cluster of commodity hardware. In contrast, NAS provides file-level data storage on dedicated hardware. HDFS distributes blocks across all machines in a cluster, while NAS stores data separately on its own hardware. HDFS is designed to work with MapReduce to move computation to data, which NAS does not support as well.

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

HDFS Blocks

Uploaded by

sharan kommi

0% found this document useful (0 votes)

18 views2 pages

Original Title

HDFS blocks

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Download as docx, pdf, or txt

0% found this document useful (0 votes)

18 views2 pages

HDFS Blocks

Uploaded by

sharan kommi

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Download as docx, pdf, or txt

Jump to Page

You are on page 1of 2

Search inside document

Previous year last question

DFS blocks are large compared to disk blocks, because to minimize the cost of
seeks. If we have many smaller size disk blocks, the seek time would be
maximum (time spent to seek/look for an information). And also, having
multiple small sized blocks is the burden on name node/master, as ultimately
the name node stores metadata, so it has to save this disk block information.
If the Data Block is large enough, the time it takes to transfer the data from the
disk can be significantly longer than the time to seek to the start of the block.
Thus, transferring a large file made of multiple blocks operates at the disk
transfer rate.
For each block we need a Mapper. So, in the case of small-sized blocks, there
will be a lot of Mappers. Each will be processing the data, which isn’t efficient.

Diff b/w HDFS and network attacked storage

 1) HDFS is the primary storage system of Hadoop.

HDFS designs to store very large files running on a cluster
of commodity hardware.
Network-attached storage (NAS) is a file-level computer
data storage server.
NAS provides data access to a heterogeneous group of
clients.

2) HDFS distribute blocks across all the machines in a

Hadoop cluster.
NAS data stores on a dedicated hardware.

3) HDFS is designed to work

with MapReduce Framework.
In MapReduce Framework computation move to the data
instead of Data to computation.
NAS is not suitable for MapReduce, as it stores data
separately from the computations.
 September 20, 2018 at 4:03 pm#5730

DataFlair Team
1)NAS stands for Network Attached storage which is a
file-level computer data storage server connected to a
computer network providing network access to
heterogeneous group of clients
HDFS stands for Hadoop distributed file system which is
a java based file system that provides scalable and reliable
data storage and is designed to span large clusters of
commodity hardware.
2)In HDFS data blocks are distributed across the local
drives of all machines in a cluster whereas in NAS data is
stored on a dedicated server.

3)HDFS includes commodity hardware which will be

cost-effective, but NAS is a high-end storage device
which is expensive.

4)It includes features like rack-awarenessHDFS, data

locality which makes it more scalable and effective then
NAS.

Master Class Student Guide
Document160 pages
Master Class Student Guide
Garfyi
No ratings yet
Data Center Transformation With Sangfor HCI 1
Document4 pages
Data Center Transformation With Sangfor HCI 1
Iwan
No ratings yet
Fbda Unit-3
Document27 pages
Fbda Unit-3
Aruna Aruna
No ratings yet
Unit II Big Data Analytics
Document11 pages
Unit II Big Data Analytics
beelogger4321
No ratings yet
Notes
Document18 pages
Notes
nagalaxmi
88% (8)
Introduction To Hadoop
Document5 pages
Introduction To Hadoop
Hanumanthu Gouthami
No ratings yet
Unit-Iv CC&BD CS71
Document148 pages
Unit-Iv CC&BD CS71
Hael
No ratings yet
Unit-2 Introduction To Hadoop
Document19 pages
Unit-2 Introduction To Hadoop
Siva
No ratings yet
PDF Bigdata 15cs82 Vtu Module 1 2 Notes
Document17 pages
PDF Bigdata 15cs82 Vtu Module 1 2 Notes
raju
No ratings yet
IMTC634_Data Science_Chapter 14
Document22 pages
IMTC634_Data Science_Chapter 14
msmakkar.chief19
No ratings yet
CS19741-Cloud Computing-Unit 3 Notes
Document37 pages
CS19741-Cloud Computing-Unit 3 Notes
Rahul Chiranjeevi V
No ratings yet
Bigdata 15cs82 Vtu Module 1 2 Notes PDF
Document49 pages
Bigdata 15cs82 Vtu Module 1 2 Notes PDF
Shobhit Kushwaha
No ratings yet
Bigdata 15cs82 Vtu Module 1 2 Notes
Document49 pages
Bigdata 15cs82 Vtu Module 1 2 Notes
rishik jha
57% (14)
Big Data Assighmwnt 2
Document60 pages
Big Data Assighmwnt 2
sakshi soni
No ratings yet
BigData Module 1
Document17 pages
BigData Module 1
bhattsb
No ratings yet
HADOOP
Document18 pages
HADOOP
maiyi020106
No ratings yet
Top Hadoop Interview Q&A
Document25 pages
Top Hadoop Interview Q&A
P vishnu
No ratings yet
Bigdata Unit IV
Document29 pages
Bigdata Unit IV
CS BCA
No ratings yet
Big Data Lecture Presentation
Document28 pages
Big Data Lecture Presentation
zioncalvin9
No ratings yet
Bda Summer 2022 Solution
Document30 pages
Bda Summer 2022 Solution
Vivek
No ratings yet
UNIT 3 HDFS, Hadoop Environment Part 1
Document9 pages
UNIT 3 HDFS, Hadoop Environment Part 1
works8606
No ratings yet
Module-2-Introduction To HDFS and Tools
Document38 pages
Module-2-Introduction To HDFS and Tools
shreya
No ratings yet
Wa0001.
Document56 pages
Wa0001.
Lakkarsu Poojitha
No ratings yet
Introduction To Hadoop Ecosystem
Document46 pages
Introduction To Hadoop Ecosystem
Gokul J L
No ratings yet
HDFS
Document8 pages
HDFS
Deergha Tiwari
No ratings yet
High Performance Fault-Tolerant Hadoop Distributed File System
Document9 pages
High Performance Fault-Tolerant Hadoop Distributed File System
Editor IJRITCC
No ratings yet
High Performance Fault-Tolerant Hadoop Distributed File System
Document9 pages
High Performance Fault-Tolerant Hadoop Distributed File System
Editor IJRITCC
No ratings yet
File System Basics: Hadoop Distributed
Document22 pages
File System Basics: Hadoop Distributed
badeni
No ratings yet
HADOOP FRAME WORK
Document38 pages
HADOOP FRAME WORK
vinaybiradar14
No ratings yet
Distributed File Systems Leading To Hadoop File System: UNIT-2
Document12 pages
Distributed File Systems Leading To Hadoop File System: UNIT-2
Chitra Madhuri Yashoda
No ratings yet
10 Dfs
Document5 pages
10 Dfs
Shruti More
No ratings yet
bda final sem 7
Document120 pages
bda final sem 7
himanshidvyas
No ratings yet
HopsFS: Scaling Hierarchical File System Metadata Using NewSQL Databases
Document17 pages
HopsFS: Scaling Hierarchical File System Metadata Using NewSQL Databases
Jim Dowling
No ratings yet
BD Unit-IIINotes
Document17 pages
BD Unit-IIINotes
khushitikoo.work
No ratings yet
Unit 3
Document61 pages
Unit 3
Ramstage Testing
No ratings yet
Unit 3 Big Data_240516_090400
Document20 pages
Unit 3 Big Data_240516_090400
ANMOL RATAN
No ratings yet
Assignment 3 (Big Data)
Document2 pages
Assignment 3 (Big Data)
Vishal Shah
No ratings yet
Hadoop Distributed File System (HDFS)
Document6 pages
Hadoop Distributed File System (HDFS)
mytempemail2023
No ratings yet
Apex Institute of Technology: Big Data Security
Document30 pages
Apex Institute of Technology: Big Data Security
So do so
No ratings yet
Unit 4 - Data Science - Www.rgpvnotes.in
Document18 pages
Unit 4 - Data Science - Www.rgpvnotes.in
DSync
No ratings yet
Unit II-bid Data Programming
Document23 pages
Unit II-bid Data Programming
jasmine
No ratings yet
Cloud Computing - Unit 3
Document38 pages
Cloud Computing - Unit 3
lightfreezzer
No ratings yet
Unit 3
Document44 pages
Unit 3
Vaddi Kasulu
No ratings yet
Act2 - March7 - 6E - BDA - SEC
Document8 pages
Act2 - March7 - 6E - BDA - SEC
yadukrishnagiriwork
No ratings yet
Untitled
Document37 pages
Untitled
asha
No ratings yet
Unit Ii
Document39 pages
Unit Ii
021- IMRAN
No ratings yet
BDA Unit-3
Document47 pages
BDA Unit-3
Jaya Prakash
No ratings yet
BDA 3rd Unit QB
Document4 pages
BDA 3rd Unit QB
Sachin Mahale
No ratings yet
Cloud Computing
Document19 pages
Cloud Computing
Afia Faryad
No ratings yet
Unit 5 Print
Document32 pages
Unit 5 Print
sivapunithan S
No ratings yet
Hadoop
Document4 pages
Hadoop
scribd.unguided000
No ratings yet
Big Data Analytics – Unit 4
Document32 pages
Big Data Analytics – Unit 4
Prabha Joshi
No ratings yet
BDA Notes
Document25 pages
BDA Notes
mrudula.sb
No ratings yet
Computer Science Apprenticeship Bigdata Assignement3
Document3 pages
Computer Science Apprenticeship Bigdata Assignement3
abood jallad
No ratings yet
Comparing The Hadoop Distributed File System (HDFS) With The Cassandra File System (CFS)
Document13 pages
Comparing The Hadoop Distributed File System (HDFS) With The Cassandra File System (CFS)
apkarthick
No ratings yet
Hadoop Architecture - Hadoop Distributed File System (HDFS)-2
Document39 pages
Hadoop Architecture - Hadoop Distributed File System (HDFS)-2
cakvlr
No ratings yet
BDA Unit-4 Part-1 HDFS,MapReduce
Document76 pages
BDA Unit-4 Part-1 HDFS,MapReduce
Jaya Prakash
No ratings yet
Features of HDFS
Document2 pages
Features of HDFS
sampathabo
No ratings yet
Bda Unit2
Document24 pages
Bda Unit2
pubgmobilesd23
No ratings yet
Unit 2 Lecture - 04 - HDFS PDF
Document40 pages
Unit 2 Lecture - 04 - HDFS PDF
Vaibhavi Sangawar
No ratings yet
BDA Lab Assignment 2
Document18 pages
BDA Lab Assignment 2
parth shah
No ratings yet
Data Engineering Guide for Beginners: Part 2
From Everand
Data Engineering Guide for Beginners: Part 2
Allan Murray
No ratings yet
Velocity Planning
Document5 pages
Velocity Planning
sharan kommi
No ratings yet
Pstarguide
Document72 pages
Pstarguide
sharan kommi
No ratings yet
HDFS Intro
Document9 pages
HDFS Intro
sharan kommi
No ratings yet
Hadoop Features 2
Document3 pages
Hadoop Features 2
sharan kommi
No ratings yet
Pipes and Filters Pattern
Document10 pages
Pipes and Filters Pattern
sharan kommi
No ratings yet
Capacity in Sprint
Document4 pages
Capacity in Sprint
sharan kommi
No ratings yet
Definition of Done
Document3 pages
Definition of Done
sharan kommi
No ratings yet
10 Ten Key Factors For Agile Project Success
Document4 pages
10 Ten Key Factors For Agile Project Success
sharan kommi
No ratings yet
Circuit Breaker Pattern
Document10 pages
Circuit Breaker Pattern
sharan kommi
No ratings yet
Topic: - Main Idea
Document1 page
Topic: - Main Idea
sharan kommi
No ratings yet
CQRS Pattern
Document9 pages
CQRS Pattern
sharan kommi
No ratings yet
Pre Bid Meeting BLR NGT
Document1 page
Pre Bid Meeting BLR NGT
sharan kommi
No ratings yet
Message Oriented Middleware
Document5 pages
Message Oriented Middleware
sharan kommi
No ratings yet
Nellore
Document111 pages
Nellore
sharan kommi
No ratings yet
Plan Showing The Proposed Sewer Network in Pulivendula Municipality
Document1 page
Plan Showing The Proposed Sewer Network in Pulivendula Municipality
sharan kommi
No ratings yet
Main Idea
Document1 page
Main Idea
sharan kommi
No ratings yet
Pretorius 2012 Vortex Grit Official
Document21 pages
Pretorius 2012 Vortex Grit Official
sharan kommi
No ratings yet
Cloud Computing: BITS Pilani
Document8 pages
Cloud Computing: BITS Pilani
sharan kommi
No ratings yet
Topic: - Main Idea
Document1 page
Topic: - Main Idea
sharan kommi
No ratings yet
Your Needs Our Solutions
Document1 page
Your Needs Our Solutions
sharan kommi
No ratings yet
Dimension Two (B) : Identifying Implied Main Ideas: For Your Better Understanding
Document1 page
Dimension Two (B) : Identifying Implied Main Ideas: For Your Better Understanding
sharan kommi
No ratings yet
C N N C N CC y L TD L D L DC N D Dy: A) We Have To Show That at Steady Sate and y L
Document1 page
C N N C N CC y L TD L D L DC N D Dy: A) We Have To Show That at Steady Sate and y L
sharan kommi
No ratings yet
6" Borewell Submersible Pumps: For Agriculture & Domestic Applications
Document1 page
6" Borewell Submersible Pumps: For Agriculture & Domestic Applications
sharan kommi
No ratings yet
Bagepalli - 4.4 MLD and 0.55 MLD STP - SBT
Document9 pages
Bagepalli - 4.4 MLD and 0.55 MLD STP - SBT
sharan kommi
100% (1)
Disciplinary Action Company Policy
Document3 pages
Disciplinary Action Company Policy
sharan kommi
100% (2)
Jharkhand DWSD Proposal For Jal Jeevan Mission
Document14 pages
Jharkhand DWSD Proposal For Jal Jeevan Mission
sharan kommi
No ratings yet
Blower and Design Calculation
Document1 page
Blower and Design Calculation
sharan kommi
50% (2)
Fecal Sludge DPR
Document56 pages
Fecal Sludge DPR
sharan kommi
100% (2)
SBR - 6 MLD
Document38 pages
SBR - 6 MLD
sharan kommi
100% (1)
) Perational Vlaintena, Nce Manual: I UGRK Series
Document22 pages
) Perational Vlaintena, Nce Manual: I UGRK Series
sharan kommi
No ratings yet
12 Sap Saps4hana Cloud Update
Document23 pages
12 Sap Saps4hana Cloud Update
TUI Thaweesak
No ratings yet
Challenge Yourself 23
Document3 pages
Challenge Yourself 23
Keii blackhood
No ratings yet
Experienced
Document7 pages
Experienced
manju
No ratings yet
GAIT
Document30 pages
GAIT
Ram Sharma
No ratings yet
Debdeep Dan Xi-A Sci It-802 Project
Document28 pages
Debdeep Dan Xi-A Sci It-802 Project
Debdeep Dan
No ratings yet
3bti ISPs and Internet Access 21.10.2021
Document4 pages
3bti ISPs and Internet Access 21.10.2021
Kuba Kowal
No ratings yet
Kabaxagifefasile
Document3 pages
Kabaxagifefasile
Hritik Rawat
No ratings yet
Cloud Computing in Distributed System IJERTV1IS10199
Document8 pages
Cloud Computing in Distributed System IJERTV1IS10199
Nebula Oriom
No ratings yet
Synopsis
Document5 pages
Synopsis
aruba ansari
No ratings yet
Test Questions
Document3 pages
Test Questions
Asia Haani
No ratings yet
FLUTTER FAQs
Document14 pages
FLUTTER FAQs
lava bhai
No ratings yet
Scheduling Framework Migration
Document8 pages
Scheduling Framework Migration
Kannan Vanniarajan
No ratings yet
Javascript
Document25 pages
Javascript
pilli maheshchandra
No ratings yet
Scripts - PDF Download - SAP Q&A PDF
Document4 pages
Scripts - PDF Download - SAP Q&A PDF
phogat project
No ratings yet
SE (Lab # 01) 1
Document6 pages
SE (Lab # 01) 1
Usman Faizyab Khan
No ratings yet
SEO
Document33 pages
SEO
Syed Muhammad Junaid Hassan
No ratings yet
Govind Vidyalaya, Tamulia
Document14 pages
Govind Vidyalaya, Tamulia
Pratik Anand
No ratings yet
Programming Manual For SIP
Document54 pages
Programming Manual For SIP
Alejandro Daniel
No ratings yet
Data Flow Diagram (DFD)
Document12 pages
Data Flow Diagram (DFD)
yasodhaprathaban
No ratings yet
Webservices Building Blocks: N.Rajganesh
Document29 pages
Webservices Building Blocks: N.Rajganesh
xxx
No ratings yet
SAP Architecture: - Abhay Singh, Employee
Document12 pages
SAP Architecture: - Abhay Singh, Employee
Abhay Singh B
No ratings yet
Ig15 SP IT M13V1 Uplrn
Document40 pages
Ig15 SP IT M13V1 Uplrn
arun.chuvuk4421
No ratings yet
B.tech. IT Curriculam For All 4 Years
Document244 pages
B.tech. IT Curriculam For All 4 Years
Raghav
No ratings yet
Fast Lane - RH-DO180
Document3 pages
Fast Lane - RH-DO180
Charles Wei
No ratings yet
9.1.3 Packet Tracer Identify Mac and Ip Addresses Paul Valdez
Document3 pages
9.1.3 Packet Tracer Identify Mac and Ip Addresses Paul Valdez
ssf 2018
No ratings yet
NSO 5.3 Getting Started Guide: Americas Headquarters
Document76 pages
NSO 5.3 Getting Started Guide: Americas Headquarters
Ala Jebnoun
No ratings yet
VTU MCA Syllabus
Document3 pages
VTU MCA Syllabus
aarchanasingh20
No ratings yet
8392000742-CBI Academy
Document11 pages
8392000742-CBI Academy
srinivas
No ratings yet