Parallel File System

Parallel file systems:
- Provide shared access to extremely large datasets across multiple clients concurrently
- Distribute data and metadata across multiple storage nodes for high-performance parallel access
- Coordinate access to ensure consistency and follow access rules

They are designed for high-performance computing applications with massive data and bandwidth needs.

Uploaded by

Sachin Anchal


PARALLEL FILE SYSTEM

What is a parallel file system?

• Stores application data persistently
– Usually extremely large datasets that can't fit in memory
• Provides a global shared namespace (files, directories)
• Designed for parallelism
– Concurrent (often coordinated) access from many clients
• Designed for high performance
– Operates over high-speed networks (IB, Myrinet, Portals)
– Optimized I/O path for maximum bandwidth

Parallel vs. Distributed

How are parallel file systems different from distributed file systems?

• Data distribution
– Distributed file systems often store entire objects (files) on a single storage node
– Parallel file systems distribute the data of a single object across multiple storage nodes

• Symmetry
– Distributed file systems often run on architectures where the storage is co-located with the application
– Parallel file systems often run on architectures where the storage is physically separate from the compute system
(not always true here either)

• Fault tolerance
– Distributed file systems take on fault tolerance responsibilities themselves
– Parallel file systems run on enterprise shared storage, leaving fault tolerance to that layer

• Workloads
– Distributed file systems are geared for loosely coupled, distributed applications (think data intensive)
– Parallel file systems target HPC applications, which tend to perform highly coordinated I/O accesses and have massive bandwidth requirements

• Overloaded terms!
– GlusterFS and Ceph claim to be both
– PVFS is often run in symmetric environments

Parallel File Systems

• Provide a directory tree all nodes can see (the global namespace)
• Map data across many servers and drives (parallelism of access)
• Coordinate access to data so certain access rules are followed (useful semantics)

Who uses Parallel File Systems?

• Computational Science
o Use of computer simulation as a tool for greater understanding of the real world
– Complements experimentation and theory
o Problems are increasingly computationally challenging
– Large parallel machines are needed to perform calculations
– Critical to leverage parallelism in all phases
o Data access is a huge challenge
– Using parallelism to obtain performance
– Finding usable, efficient, portable interfaces
– Understanding and tuning I/O

• Large-Scale Data Sets
o Application teams are beginning to generate tens of terabytes of data in a single simulation. For example, a recent run on 29K processors on the XT4 generated over 54 Tbytes of data in a 24-hour period.
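To put the 54 Tbytes in 24 hours figure in perspective, the following minimal sketch converts it to an average sustained write rate. It assumes decimal units (1 Tbyte = 10^12 bytes) and a uniform rate over the whole run; real applications usually write in bursts, so peak demand would be higher. The function name is illustrative only.

```python
def average_bandwidth_mb_per_s(terabytes, hours):
    """Average rate in MB/s, assuming decimal units (1 TB = 10**12 bytes)."""
    total_bytes = terabytes * 10**12
    seconds = hours * 3600
    return total_bytes / seconds / 10**6


# 54 TB written over a 24-hour run
print(average_bandwidth_mb_per_s(54, 24))  # 625.0
```

Since the run produced "over" 54 Tbytes, 625 MB/s is a lower bound on the average rate the file system had to sustain.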

Application and Storage Data Models

o Applications have data models appropriate to domain
– Multidimensional typed arrays, images composed of scan lines, variable-length records
– Headers, attributes on data
o I/O systems have very simple data models
– Tree-based hierarchy of containers
– Some containers have streams of bytes (files)
– Others hold collections of other containers (directories or folders)
o High-level I/O libraries help map between these data models
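As a rough illustration of the mapping a high-level I/O library performs, the sketch below computes where one element of a multidimensional typed array lands in a file's byte stream. It assumes a row-major layout stored after a fixed-size header; the function name and the `header_bytes` parameter are hypothetical and do not describe any particular library's file format.

```python
def element_offset(index, shape, itemsize, header_bytes=0):
    """Byte offset of array element `index` in a flat byte stream.

    Assumes row-major (C-order) storage that begins right after a
    header of `header_bytes` bytes.
    """
    if len(index) != len(shape):
        raise ValueError("index and shape must have the same rank")
    flat = 0
    for i, n in zip(index, shape):
        if not 0 <= i < n:
            raise IndexError(f"index {i} out of range for dimension of size {n}")
        flat = flat * n + i          # row-major flattening
    return header_bytes + flat * itemsize
```

For a 3 x 4 array of 8-byte values behind a 16-byte header, element (1, 2) is flat element 6, so it starts at byte 16 + 6 * 8 = 64.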

Shared-file vs. File-per-process

• Scientific applications perform I/O to a parallel file system in primarily one of two ways:

• Shared-file (N-to-1): A single file is created, and all application tasks write to that file (usually to completely disjoint regions)
• Increases usability: only one file for the application to keep track of
• Can create lock contention and hinder performance on some systems

• File-per-process (N-to-N): Each application task creates a separate file and writes only to that file
• Avoids lock contention on file systems that use locks to maintain POSIX consistency
• Applications running today create as many as 100,000 tasks (and therefore as many files)
• Makes it hard to restart the application with a different number of tasks
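The two access patterns can be sketched on an ordinary local file system. This is only an illustration under simplifying assumptions: the "tasks" are simulated by a sequential loop rather than by concurrent processes (a real application would typically use MPI), and the helper names and the `out.NNNNN` file naming are hypothetical.

```python
import os
import tempfile


def payload(rank, size):
    """Dummy data for one task: `size` bytes, all equal to the rank."""
    return bytes([rank % 256]) * size


def write_shared(path, num_tasks, size):
    """N-to-1: every task writes to its own disjoint region of one file."""
    with open(path, "wb") as f:
        f.truncate(num_tasks * size)       # pre-size the shared file
    for rank in range(num_tasks):          # stands in for concurrent tasks
        with open(path, "r+b") as f:
            f.seek(rank * size)            # region chosen by task rank
            f.write(payload(rank, size))


def write_file_per_process(directory, num_tasks, size):
    """N-to-N: every task creates and writes its own file."""
    paths = []
    for rank in range(num_tasks):
        path = os.path.join(directory, f"out.{rank:05d}")
        with open(path, "wb") as f:
            f.write(payload(rank, size))
        paths.append(path)
    return paths


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as d:
        write_shared(os.path.join(d, "shared.dat"), num_tasks=4, size=8)
        write_file_per_process(d, num_tasks=4, size=8)
        print(sorted(os.listdir(d)))
```

The shared file leaves one name to manage, while the file-per-process run leaves one file per task, which is why the task count is baked into the output of the second pattern.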

Data distribution in parallel file systems

Data Distribution

• Round robin is a reasonable default solution
– Works consistently for a variety of workloads
– Works well on most systems
– Who uses it? GPFS, Lustre, PVFS...
– Can you think of a system where this might not work so well?
– What other distributions could be used?
• Clients perform writes/reads of a file at various regions
– Usually depends on application workload and number of tasks
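A round-robin distribution can be sketched as a pure function from a file offset to a storage server. The code below is a minimal illustration, assuming a fixed stripe size and servers numbered from 0; the names `locate` and `split_request` are hypothetical and are not taken from GPFS, Lustre, or PVFS.

```python
def locate(offset, stripe_size, num_servers):
    """Map a file byte offset to (server index, offset in that server's data)."""
    stripe_index = offset // stripe_size           # which stripe unit of the file
    server = stripe_index % num_servers            # round robin over the servers
    local_stripe = stripe_index // num_servers     # units already held by that server
    local_offset = local_stripe * stripe_size + offset % stripe_size
    return server, local_offset


def split_request(offset, length, stripe_size, num_servers):
    """Break a contiguous file request into (server, local offset, size) pieces."""
    pieces = []
    end = offset + length
    while offset < end:
        # Stop at the next stripe boundary or at the end of the request.
        chunk = min(stripe_size - offset % stripe_size, end - offset)
        server, local_offset = locate(offset, stripe_size, num_servers)
        pieces.append((server, local_offset, chunk))
        offset += chunk
    return pieces
```

With a 64-byte stripe over 4 servers, a 100-byte request at offset 48 touches three servers: 16 bytes on server 0, 64 bytes on server 1, and 20 bytes on server 2. Requests that are small relative to the stripe size land on a single server, which is one way the application workload affects how well the distribution performs.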

Classes of Parallel File Systems: Blocks vs. Objects

• Block-Based Parallel File Systems (AKA "Shared-disk")
– Blocks are fixed-width
– File growth requires more blocks
– Blocks distributed over storage nodes
– Suffer from block allocation issues, lock managers
– Example: GPFS

• Object-Based Parallel File Systems
– Variable-length regions of the file
– A file has a constant number of objects
– Objects are given global identifiers (object-ids, handles, etc.)
– File growth increases the size of object(s)
– Objects are easier to manage and distribute
– Space allocation is managed locally on a per-object basis
– Examples: Lustre, PVFS

Blocks vs. Objects

• Metadata for a file includes distribution information
• Block-based file systems (shared-disk) require dynamic metadata for distribution information
• Object-based file systems only need static metadata for distribution information
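The difference between dynamic and static distribution metadata can be made concrete with two toy classes. This is a sketch under stated assumptions, not how any real file system stores its metadata: `BlockFile`, `ObjectFile`, and the trivial per-server block allocator are all hypothetical.

```python
class BlockFile:
    """Block-based: metadata records one (server, block number) per block."""

    def __init__(self, block_size, num_servers):
        self.block_size = block_size
        self.num_servers = num_servers
        self.block_map = []                     # dynamic: grows with the file
        self._next_free = [0] * num_servers     # toy allocator state

    def grow_to(self, size):
        needed = -(-size // self.block_size)    # ceiling division
        while len(self.block_map) < needed:
            server = len(self.block_map) % self.num_servers
            self.block_map.append((server, self._next_free[server]))
            self._next_free[server] += 1


class ObjectFile:
    """Object-based: metadata is a fixed list of object ids plus a stripe size."""

    def __init__(self, stripe_size, object_ids):
        self.stripe_size = stripe_size
        self.object_ids = tuple(object_ids)     # static: set once at create time

    def locate(self, offset):
        stripe = offset // self.stripe_size
        count = len(self.object_ids)
        object_id = self.object_ids[stripe % count]
        local = (stripe // count) * self.stripe_size + offset % self.stripe_size
        return object_id, local
```

Growing a `BlockFile` appends entries to `block_map`, so its metadata must be updated (and kept consistent) as the file grows. An `ObjectFile` answers any `locate` call from the same two fields no matter how large the file becomes, because growth only enlarges the objects themselves.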

Metadata in Parallel File Systems

• A single metadata server creates a single point of contention (hotspot)
– Many clients try to open the same file at the same time: creates an N-to-1 pattern of lookup requests
– Many clients try to create new files at once: creates an N-to-1 pattern of create requests (requires disk access too!)
• How can metadata be distributed across metadata servers?
• Depends on underlying design (blocks vs. objects)
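One possible answer, shown here purely as an assumption and not as the scheme of any file system named above, is to hash each path name to pick a metadata server. The sketch below illustrates how that spreads a burst of create requests, and also why it does not help when every client opens the same file.

```python
import hashlib
from collections import Counter


def metadata_server_for(path, num_servers):
    """Pick a metadata server by hashing the full path (hypothetical scheme)."""
    digest = hashlib.sha256(path.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_servers


def requests_per_server(paths, num_servers):
    """Count how many requests each metadata server would receive."""
    return Counter(metadata_server_for(p, num_servers) for p in paths)


if __name__ == "__main__":
    # Many clients creating distinct files: load spreads over the servers.
    creates = [f"/scratch/run/out.{rank:05d}" for rank in range(1000)]
    print(requests_per_server(creates, 4))

    # Many clients opening the same file: still an N-to-1 pattern.
    opens = ["/scratch/run/input.dat"] * 1000
    print(requests_per_server(opens, 4))
```

Hashing by path is only one design choice; it balances creates of distinct names but leaves the shared-open hotspot in place, and it makes operations such as directory renames more expensive.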
