100% found this document useful (1 vote)

369 views24 pages

Distributed Database

A distributed database system (DDBS) is a collection of interrelated data that is spread across multiple computers or sites connected through a network. A distributed database management system (DDBMS) allows for the management of the distributed data and makes the distribution transparent to users, so that the system appears as a single database. A DDBS provides advantages like improved data sharing, availability, reliability and performance by reflecting organizational structures and allowing for modular growth. However, it also introduces complexity in areas like concurrency control, transaction management, security and integrity control.

Uploaded by

Himashree Bhuyan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

369 views24 pages

Distributed Database

Uploaded by

Himashree Bhuyan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 24

What is a Distributed Database System?

Distributed Database
A logically interrelated collection of shared data (and a description of this data), physically distributed over a computer network.

Distributed DBMS
Software system that permits the management of the distributed database and makes the distribution transparent to users.

What is not a DDBS?

A timesharing computer system A loosely or tightly coupled multiprocessor

system
A database system which resides at one of the

nodes of a network of computers - this is a centralized database on a network node

The Fundamental Principle of Distributed Database

To the user, a distributed system should look exactly like a nondistributed system.

A typical distributed database system:

New York Shanghai

Communication network

London

San Francisco

What is the 12 objectives?

Local autonomy No reliance on a central Distributed query

site Continuous operation Location independence Fragmentation independence Replication independence

processing Distributed transaction management Hardware independence Operating system independence Network independence DBMS independence

Types Of Distributed Databases

In a homogeneous distributed database
All sites have identical software
Are aware of each other and agree to cooperate in processing user

requests. Each site surrenders part of its autonomy in terms of right to change schemas or software Appears to user as a single system

In a heterogeneous distributed database

Different sites may use different schemas and software

Difference in schema is a major problem for query processing Difference in software is a major problem for transaction processing Sites may not be aware of each other and may provide only limited facilities for cooperation in transaction processing

Why use a DDBMS? (!)

Advantages:
Reflects organizational structure
Improved

shareability and local autonomy Improved availability Improved reliability Improved performance Economics Modular growth

Disadvantages: Complexity Cost Security Integrity control more difficult Lack of standards Lack of experience Database design more complex

Distributed Database Design

DATA FRAGMENTATION, REPLICATION, AND ALLOCATION TECHNIQUES FOR DISTRIBUTED DATABASE DESIGN
Fragmentation: Breaking up the database into logical units called

fragments and assigned for storage at various sites.

Data replication: The process of storing fragments in more than one site Data Allocation: The process of assigning a particular fragment to a particular
site in a distributed system.

The information concerning the data fragmentation, allocation and

replication is stored in a global directory.

12.5 Distributed Relational Database Design

Fragmentation !
Four types of fragmentation:

Horizontal:

Consists of a subset of the tuples of a relation.

- Defined using Selection operation - Determined by looking at predicates used by Ts. - Involves finding set of minimal (complete and relevant) predicates. - Set of predicates is complete, iff, any two tuples in same fragment are referenced with same probability by any application. - Predicate is relevant if there is at least one application that accesses fragments differently.

12.5 Distributed Relational Database Design

Fragmentation !
Four types of fragmentation:
2.

Other possibility is no fragmentation:

Vertical:

-If relation is small and not updated frequently, may be - Defined using Projection operation better not to fragment. - Determined by establishing affinity of one attribute to another.

subset of atts of a relation.

Mixed: horizontal fragment that is vertically fragmented, or a

vertical fragment that is horizontally fragmented. - Defined using Selection and Projection operations

Derived: horizontal fragment that is based on horizontal

fragmentation of a parent relation. - Ensures fragments frequently joined together are at same site. - Defined using Semijoin operation

Data Allocation !
Four alternative strategies regarding placement of data:

Centralized: single database and DBMS stored at one site with users distributed across the network.
Partitioned: Database partitioned into disjoint fragments, each fragment assigned to one site. Complete Replication: Consists of maintaining complete copy of database at each site. Selective Replication: Combination of partitioning, replication, and centralization.

Data Allocation

DATA REPLICATION
Fully replicated database:

* Stores multiple copies of each database fragment at multiple sites *Can be impractical due to amount of overhead Partially replicated database: *Stores multiple copies of some database fragments at multiple sites *Most DDBMSs are able to handle the partially replicated database well Unreplicated database: *Stores each database fragment at a single site *No duplicate database fragments

Advantages of Replication
Availability: failure of site containing relation r does

not result in unavailability of r is replicas exist. Parallelism: queries on r may be processed by several nodes in parallel. Reduced data transfer: relation r is available locally at each site containing a replica of r.

Disadvantages of Replication
Increased cost of updates: each replica of relation r

must be updated. Increased complexity of concurrency control: concurrent updates to distinct replicas may lead to inconsistent data unless special concurrency control mechanisms are implemented.

One solution: choose one copy as primary copy and apply concurrency control operations on primary copy.

Transparency in a DDBMS
Transparency hides implementation details from users. Overall objective: equivalence to user of DDBMs to centralised DBMS - FULL transparency not universally accepted objective

Transparency types: 1.Distribution/ Netwrok Transparency a.Location Transparency b.Naming Transparency 2.Replication Transparency 3.Fragmentation Transparency 4.Design Transparency 5.Execution Transparency

Distributed DBMS Issues

Query Processing
convert user transactions to data manipulation instructions optimization problem min{cost = data transmission + local processing} general formulation is NP-hard

Concurrency Control
synchronization of concurrent accesses consistency and isolation of transactions' effects deadlock management

Reliability
how to make the system resilient to failures
atomicity and durability

Relationship Between Issues

Directory Management

Query Processing

Distribution Design

Reliability

Concurrency Control

Deadlock Management

Concurrency Control and Recovery

Distributed Databases encounter a number of

concurrency control and recovery problems which are not present in centralized databases. Some of them are listed below.
Dealing with multiple copies of data items
Failure of individual sites Communication link failure

Distributed commit
Distributed deadlock

Slide 2520

System Failure Modes

Failures unique to distributed systems:
Failure of a site. Loss of massages

Handled by network transmission control protocols such as TCPIP Failure of a communication link Handled by network protocols, by routing messages via alternative links Network partition A network is said to be partitioned when it has been split into two or more subsystems that lack any connection between them Note: a subsystem may consist of a single node Network partitioning and site failures are generally indistinguishable.

Client-Server Database Architecture

It consists of clients running client software, a set of

servers which provide all database functionalities and a reliable communication infrastructure.
Server 1 Client 1 Client 2 Server 2 Client 3

Server n

Client n
Slide 2522

Conclusion
Todays business environment has an increasing need for distributed database and client/server applications as the desire for reliable, scalable and accessible information is steadily rising. Distributed database systems provide an improvement on communication and data processing due to its data distribution throughout different network sites. Not only is data access faster, but a singlepoint of failure is less likely to occur, and it provides local control of data for users. However, there is some complexity when attempting to manage and control distributed database systems. A distributed database allows faster local queries and can reduce network traffic. With these benefits comes the issue of maintaining data integrity. Single big server could hardly handle requirement of high availability, data warehousing and fast data storage simultaneously. The distributed database satisfies them by separating functions at low cost. The grid computing is becoming the main stream of information technology. Not only computation, we expect database grid will also be a key technology in the future.

THANK YOU

Online Restaurant Table Reservation Management System
100% (3)
Online Restaurant Table Reservation Management System
69 pages
Unit - 1 DDB
No ratings yet
Unit - 1 DDB
34 pages
Esquema Elétrico Teclado Cássio mz500
100% (2)
Esquema Elétrico Teclado Cássio mz500
92 pages
Functional Dependency (DBMS)
No ratings yet
Functional Dependency (DBMS)
17 pages
PaperCut Brother Embedded Manual
No ratings yet
PaperCut Brother Embedded Manual
29 pages
Advanced Database Chapter 6 and 7
No ratings yet
Advanced Database Chapter 6 and 7
30 pages
FDB For Exit Exam
No ratings yet
FDB For Exit Exam
284 pages
Concurrency Control Dbms
No ratings yet
Concurrency Control Dbms
49 pages
Distributed Database Systems: January 2002
No ratings yet
Distributed Database Systems: January 2002
25 pages
Chapter - 7 Distributed Database System
100% (1)
Chapter - 7 Distributed Database System
54 pages
Chapter - 6 Distributed Database System
No ratings yet
Chapter - 6 Distributed Database System
50 pages
Ch#22 TRANSACTION - MANAGEMENT
No ratings yet
Ch#22 TRANSACTION - MANAGEMENT
80 pages
Interprocess Communication and Synchronization
No ratings yet
Interprocess Communication and Synchronization
9 pages
DDBMS MCQ - 1
No ratings yet
DDBMS MCQ - 1
10 pages
Distributed Database Systems (DDBS)
No ratings yet
Distributed Database Systems (DDBS)
30 pages
Distributed Catalog Management
100% (1)
Distributed Catalog Management
12 pages
ADS Chapter 4 Concurrency Control Techniques
No ratings yet
ADS Chapter 4 Concurrency Control Techniques
36 pages
Assignment 7 DBMS JUL 2022
No ratings yet
Assignment 7 DBMS JUL 2022
10 pages
Database Design
No ratings yet
Database Design
97 pages
DBMS Unit 4
No ratings yet
DBMS Unit 4
20 pages
Functional Dependencies and Normalization For Relational Databases
100% (2)
Functional Dependencies and Normalization For Relational Databases
11 pages
04 Chapter 17 Transaction Processing Concepts
100% (1)
04 Chapter 17 Transaction Processing Concepts
59 pages
DBMS Self Notes CHP 1
No ratings yet
DBMS Self Notes CHP 1
7 pages
Hw7 Sol Motro
100% (1)
Hw7 Sol Motro
6 pages
Chapter Four: Introduction To Transaction Processing Concepts and Theory
No ratings yet
Chapter Four: Introduction To Transaction Processing Concepts and Theory
36 pages
Jeffrey A. Hoffer, Mary B. Prescott, Fred R. Mcfadden: Modern Database Management 10 Edition
No ratings yet
Jeffrey A. Hoffer, Mary B. Prescott, Fred R. Mcfadden: Modern Database Management 10 Edition
13 pages
10 Total Mark: 10 X 1 10: NPTEL Online Certification Courses Indian Institute of Technology Kharagpur
No ratings yet
10 Total Mark: 10 X 1 10: NPTEL Online Certification Courses Indian Institute of Technology Kharagpur
7 pages
Unit-1 DDBMS Architecture
No ratings yet
Unit-1 DDBMS Architecture
14 pages
Dbms Lab File
100% (1)
Dbms Lab File
30 pages
MODULE 3 Syncronization
No ratings yet
MODULE 3 Syncronization
22 pages
Disconnected Architecture in Ado
No ratings yet
Disconnected Architecture in Ado
12 pages
Advanced Database Technology: Ambo University
100% (1)
Advanced Database Technology: Ambo University
28 pages
Dbms MCQ
No ratings yet
Dbms MCQ
3 pages
Concurrency Control in DBMS
No ratings yet
Concurrency Control in DBMS
22 pages
Data Recovery Presentation
No ratings yet
Data Recovery Presentation
8 pages
Functional Dependency
No ratings yet
Functional Dependency
2 pages
Unit 3 (Distributed DBMS Architecture) : Architecture: The Architecture of A System Defines Its Structure
No ratings yet
Unit 3 (Distributed DBMS Architecture) : Architecture: The Architecture of A System Defines Its Structure
11 pages
DB Question
No ratings yet
DB Question
209 pages
Transaction Management Unit III
No ratings yet
Transaction Management Unit III
28 pages
Advanced Database System - Chapter 01
No ratings yet
Advanced Database System - Chapter 01
22 pages
Master of Computer Application: Lab Manual
No ratings yet
Master of Computer Application: Lab Manual
30 pages
Advanced Database Notes
50% (2)
Advanced Database Notes
21 pages
Concurrency Control in Distributed Databases
100% (1)
Concurrency Control in Distributed Databases
12 pages
Query Processing - Database Questions & Answers - Sanfoundry 00
No ratings yet
Query Processing - Database Questions & Answers - Sanfoundry 00
7 pages
DDBMS True False
No ratings yet
DDBMS True False
7 pages
Operating System (Questions)
No ratings yet
Operating System (Questions)
27 pages
Chapter - 7 Distributed Database System
No ratings yet
Chapter - 7 Distributed Database System
58 pages
Unit 4 DBMS R23
No ratings yet
Unit 4 DBMS R23
19 pages
Chapter 5 - Recovery Techniques
No ratings yet
Chapter 5 - Recovery Techniques
24 pages
Distributed Databases
No ratings yet
Distributed Databases
39 pages
Advanced Database Systems: Chapter 4: Transaction Management
No ratings yet
Advanced Database Systems: Chapter 4: Transaction Management
78 pages
The Relational Data Model and Relational Database Constraints
No ratings yet
The Relational Data Model and Relational Database Constraints
41 pages
DDBSmidterm Exam 2018model 1
No ratings yet
DDBSmidterm Exam 2018model 1
4 pages
Exercise: B CD (C) AB CD (D) C D (E) B A (F) BD AC (G) AD BC (H) D B (I) D C (J) C A
No ratings yet
Exercise: B CD (C) AB CD (D) C D (E) B A (F) BD AC (G) AD BC (H) D B (I) D C (J) C A
17 pages
Chapter 2
No ratings yet
Chapter 2
43 pages
Assignment Distributed Database System
No ratings yet
Assignment Distributed Database System
6 pages
Crash Recovery
No ratings yet
Crash Recovery
30 pages
Chapter 4 Concurrency Control Techniques
No ratings yet
Chapter 4 Concurrency Control Techniques
41 pages
Unit-3 Relational Data Model
No ratings yet
Unit-3 Relational Data Model
24 pages
Interrupt Vectors and The Vector Table
100% (1)
Interrupt Vectors and The Vector Table
8 pages
Distributed Databases and Client-Server Architectures
No ratings yet
Distributed Databases and Client-Server Architectures
60 pages
Chapter 4 - Distributed Database System
No ratings yet
Chapter 4 - Distributed Database System
52 pages
ch6 Distributed Database
No ratings yet
ch6 Distributed Database
25 pages
Super Starter Kit For Arduino Uno (CH340)
No ratings yet
Super Starter Kit For Arduino Uno (CH340)
172 pages
Lecture Parallel Computing
No ratings yet
Lecture Parallel Computing
6 pages
Bca Sem 4 Unit 3
No ratings yet
Bca Sem 4 Unit 3
43 pages
Smart Parking Reservation System Using Iot
No ratings yet
Smart Parking Reservation System Using Iot
16 pages
Abend Code
No ratings yet
Abend Code
7 pages
OM 2274900000 Tone-Master-Pro NL
No ratings yet
OM 2274900000 Tone-Master-Pro NL
46 pages
Solucion TLS
No ratings yet
Solucion TLS
24 pages
New-Tech-brochure-0503-print - 230503 - 193906 - 03-05-2023
No ratings yet
New-Tech-brochure-0503-print - 230503 - 193906 - 03-05-2023
9 pages
EPL1 LineManual
No ratings yet
EPL1 LineManual
82 pages
Dokumen - Tips - Subroutine Guide 56228b2f86860
No ratings yet
Dokumen - Tips - Subroutine Guide 56228b2f86860
68 pages
WT7525
No ratings yet
WT7525
16 pages
Power Supply Lecture For CpE
No ratings yet
Power Supply Lecture For CpE
82 pages
Installation and Commissioning FL WLAN 101x/201x Product Family
No ratings yet
Installation and Commissioning FL WLAN 101x/201x Product Family
46 pages
628968-23 RemoTools SDK en
No ratings yet
628968-23 RemoTools SDK en
4 pages
Manual de Usuario Tarjeta Madre GigaByte GA-990FXA-UD3 v.3.0
No ratings yet
Manual de Usuario Tarjeta Madre GigaByte GA-990FXA-UD3 v.3.0
104 pages
Unified Power Format
No ratings yet
Unified Power Format
3 pages
305 Technical Interview Questions Oracle Apps
No ratings yet
305 Technical Interview Questions Oracle Apps
29 pages
Signals & Systems MCQ
No ratings yet
Signals & Systems MCQ
17 pages
Istqb Road Map
No ratings yet
Istqb Road Map
1 page
Thunder 202 282 404
No ratings yet
Thunder 202 282 404
19 pages
Difference Between Power and Small Signal Diode
No ratings yet
Difference Between Power and Small Signal Diode
4 pages
CTE111 Page62
No ratings yet
CTE111 Page62
18 pages
AttenFace A Real Time Attendance System Using Face Recognition
No ratings yet
AttenFace A Real Time Attendance System Using Face Recognition
5 pages
Firmware Release Note: Model: CLX-4195N. FN Date: February 17, 2017
No ratings yet
Firmware Release Note: Model: CLX-4195N. FN Date: February 17, 2017
2 pages
OOP Course Outline
No ratings yet
OOP Course Outline
9 pages
MDF7N60B
No ratings yet
MDF7N60B
8 pages
CN Mian Pro 2
No ratings yet
CN Mian Pro 2
22 pages

Distributed Database

Uploaded by

Distributed Database

Uploaded by

What is a Distributed Database System?

What is not a DDBS?

nodes of a network of computers - this is a centralized database on a network node

The Fundamental Principle of Distributed Database

A typical distributed database system:

What is the 12 objectives?

site Continuous operation Location independence Fragmentation independence Replication independence

Types Of Distributed Databases

In a heterogeneous distributed database

Why use a DDBMS? (!)

Distributed Database Design

fragments and assigned for storage at various sites.

The information concerning the data fragmentation, allocation and

replication is stored in a global directory.

12.5 Distributed Relational Database Design

Consists of a subset of the tuples of a relation.

12.5 Distributed Relational Database Design

Other possibility is no fragmentation:

subset of atts of a relation.

Mixed: horizontal fragment that is vertically fragmented, or a

Derived: horizontal fragment that is based on horizontal

Distributed DBMS Issues

Relationship Between Issues

Concurrency Control and Recovery

System Failure Modes

Client-Server Database Architecture

You might also like