DBMS U4

Download as pdf or txt
Download as pdf or txt
You are on page 1of 13

CS1254 DATABASE MANAGEMENT SYSTEMS

105

UNIT IV
TRANSACTIONS

Transaction Concepts - Transaction Recovery ACID Properties System Recovery Media Recovery
Two Phase Commit - Save Points SQL Facilities for recovery Concurrency Need for Concurrency
Locking Protocols Two Phase Locking Intent Locking Deadlock- Serializability Recovery
Isolation Levels SQL Facilities for Concurrency.

1. TRANSACTION CONCEPTS
A transaction is a logical unit of work. It begins with the execution of a BEGIN TRANSACTION
operation and ends with the execution of a COMMIT or ROLLBACK operation.
A Sample Transaction (Pseudo code)
BEGIN TRANSACTION
UPDATE ACC123 (BALANCE: =BALANCE-$100);
If any error occurred THEN GOTO UNDO;
END IF;
UPDATE ACC123 (BALANCE: =BALANCE+$100);
If any error occurred THEN GOTO UNDO;
END IF;
COMMIT;
GOTO FINISH;
UNDO;
ROLLBACK;
FINISH;
RETURN;
In our example an amount of $100 is transferred from account 123 to 456.
It is not a single atomic operation, it involves two separate updates on the database.
Transaction involves a sequence of database update operation.

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

106

The purpose of this transaction is to transform a correct state of database into another incorrect state,
without preserving correctness at all intermediate points.
Transaction management guarantees a correct transaction and maintains the database in a correct
state.
It guarantees that if the transaction executes some updates and then a failure occurs before the
transaction reaches its planned termination, then those updates will be undone.
Thus the transaction either executes entirely or totally cancelled.
The system component that provides this atomicity is called transaction manager or transaction
processing monitor or TP monitor.
ROLLBACK and COMMIT are key to the way it works.
1. COMMIT:
The COMMIT operation signals successful end of transaction.
It tells the transaction manager that a logical unit of work has been successfully completed and
database is in correct state and the updates can be recorded or saved.
2. ROLLBACK:
a. By contrast, the ROLLBACK operation signals unsuccessful end of transaction.
b. It tells the transaction manager that something has gone wrong, the database might be in
incorrect state and all the updates made by the transaction should be undone.
3. IMPLICIT ROLLBACK:
Explicit ROLLBACK cannot be issued in all cases of transaction failures or errors. So the
system issues implicit ROLLBACK for any transaction failure.
If the transaction does not reach the planned termination then we ROLLBACK the transaction
else it is COMMITTED.
4. MESSAGE HANDLING:
A typical transaction will not only update the database, it will also send some kind of message
back to the end user indicating what has happened.
Example: Transfer done if the COMMIT is reached, or Errortransfer not done
5.

RECOVERY LOG:
The system maintains a log or journal or disk on which all particular about the updation is
maintained.
The values of before and after updation is also called as before and after images.
This log is used to bring the database to the previous state incase of some undo operation.
The log consist of two portions

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

107

a. an active or online portion


b.

an archive or offline portion.

The online portion is the portion used during normal system operation to record details of
updates as they are performed and it is normally kept on disk.
When the online portion becomes full, its contents are transferred to the offline portion, which
can be kept on tape.
6. STATEMENT ATOMICITY:
The system should guarantee that individual statement execution must be atomic.
7. PROGRAM EXECUTION IS A SEQUENCE OF TRANSACTIONS:
COMMIT and ROLLBACK terminate the transaction, not the application program.
A single program execution will consist of a sequence of several transactions running one after
another.
PROGRAM EXECUTION IS A SEQUENCE OF TRANSACTIONS

8. NO NESTED TRANSACTIONS:
An application program can execute a BEGIN TRANSACTION statement only when it has no
transaction currently in progress.
i.e., no transaction has other transactions nested inside itself.
9. CORRECTNESS:
Consistent means not violating any known integrity constraint.
Consistency and correctness of the system should be maintained.
If T is a transaction that transforms the database from state D1 to state D2, and if D1 is correct,
then D2 is correct as well.
10. MULTIPLE ASSIGNMENT:

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

108

Multiple assignments allow any number of individual assignments (i.e., updates) to be


performed simultaneously.
Example: UPDATE ACC 123 {BALANCE: = BALANCE - $100}
UPDATE ACC 456 {BALANCE: = BALANCE + $100}
Multiple assignments would make the statement atomic.
Current products do not support multiple assignments.
2. TRANSACTION RECOVERY
A transaction begins by executing a BEGIN TRANSACTION operation and ends by executing either a
COMMIT or a ROLLBACK operation.
COMMIT establishes a commit point or synch point.
A commit point corresponds to the successful end of a transaction and the database will be in a correct
state.
ROLLBACK rolls the database back to the previous commit point.
There will be several transactions executing in parallel in a database.
When a commit point is established:
1. When a program is committed, the change is made permanent. i.e., they are guaranteed to be
recorded in the database. Prior to the commit point updates are tentative i.e., they can be
subsequently be undone.
2. All database positioning is lost and all tuple locks are released.

Database positioning means at the time of execution each program will typically have addressability to
certain tuples in the database, this addressability is lost at a COMMIT point.
Transactions are not only a unit of work but also unit of recovery.
If a transaction successfully commits, then the system updates will be permanently recorded in the
database, even if the system crashes the very next moment.
If the system crashes before the updates are written physically to the database, the systems restart
procedure will still record those updates in the database.
The values can be discovered from the relevant records in the log.
The log must be physically written before the COMMIT processing can complete. This is called writeahead log rule.
The restart procedure helps in recovering any any transactions that completed successfully but not
physically written prior to the crash.
Implementation issues

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

109

1. Database updates are kept in buffers in main memory and not physically written to disk until the
transaction commits. That way, if the transaction terminates unsuccessfully, there will be no need
to undo any disk updates.
2. Database updates are physically written to the disk after COMMIT operation. That way, if the
system subsequently crashes, there will be no need to redo any disk updates.
If there is no enough disk space then a transaction may steal buffer space from another transaction. They
may also force updates to be written physically at the time of COMMIT.
Write ahead log rule is elaborated as follows:
1. The log record for a given database update must be physically written to the log before that update
is physically written to the database.
2. All other log records for a given transaction must be physically written to the log before the
COMMIT log record for that transaction is physically written to the log.
3. COMMIT processing for a given transaction must not complete until the COMMIT log record for
that transaction is physically written to the log.
3. ACID PROPERTIES
ACID stands for Atomicity, Correctness, Isolation and Durability.
* Atomicity: Transactions are atomic.
Consider the following example
Transaction to transfer $50 from account A to account B:
read(A)
A := A 50
write(A)
read(B)
B := B + 50
write(B)
read(X), which transfers the data item X from the database to a local buffer belonging to the
transaction that executed the read operation.
write(X), which transfers the data item X from the local buffer of the transaction that executed the
write back to the database.
Before the execution of transaction Ti the values of accounts A and B are $1000 and $2000,
respectively.
Suppose if the transaction fails due to some power failure, hardware failure and system error the
transaction Ti will not execute successfully.

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

110

If the failure happens after the write(A) operation but before the write(B) operation. The database
will have values $950 and $2000 which results in a failure.
The system destroys $50 as a result of failure and leads the system to inconsistent state.
The basic idea of atomicity is: The database system keeps track of the old values of any data on
which a transaction performs a write, if the transaction does not terminate successfully then the
database system restores the old values.
Atomicity is handled by transaction-management component.
* Correctness/ Consistency:
Transactions transform a correct state of the database into another correct state, without necessarily
preserving correctness at all intermediate points.
In our example the transaction is in consistent state if the sum of A and B is unchanged by the
execution of transaction.
*Isolation:
Transactions are isolated from one another.
Even though there are many transactions running concurrently, any given transactions updates are
concealed from all the rest, until that transaction commits.
The database will be temporarily inconsistent while the transaction is in progress.
When the amount is reduced from A and not yet incremented to B. the database will be inconsistent.
If a second concurrently running transaction reads A and B at this intermediate point and computes
A+B, it will observe an inconsistent value.

If the second transaction performs updates on A and B based on the inconsistent values that it read,
the database will remain inconsistent even after both transactions are completed.
In order to avoid this problem serial execution of transaction is preferred.
Concurrency control component maintain isolation of transaction.

*Durability:
Once a transaction commits, its updates persist in the database, even if there is a subsequent system
crash.
The computer system failure may lead to loss of data in main memory, but data written to disk are
not lost.
Durability is guaranteed by ensuring the following
o The updates carried out by the transaction should be written to the disk.

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

111

o Information stored in the disk should be sufficient to enable the database to reconstruct the
updates when the database system restarts after failure.
o Recovery management component is responsible for ensuring durability.

4. SYSTEM RECOVERY
The system must be recovered not only from purely local failures such as an individual transaction, but
also from global failures.
A local failure affects only the transaction in which the failure has actually occurred.
A global failure affects all of the transactions in progress at the time of the failure.
The failures fall into two broad categories:
1. System failures (e.g., power outage), which affect all transactions currently in progress but do not
physically damage the database. A system failure is sometimes called a soft crash.
2. Media failures (e.g., head crash on the disk), which cause damage to the database or some portion
of it. A media failure is sometimes called a hard crash.
System failure and recovery
During system failures the contents of main memory is lost.
The transaction at the time of the failure will not be successfully completed, so transactions must be
undone i.e., rolled back when the system restarts.

It is necessary to redo certain transactions at the time of restart that is not successfully completed prior
to the crash but did not manage to get their updates transferred from the buffers in main memory to the
physical database.
Whenever some prescribed number of records has been written to the log the system automatically takes
a checkpoint.
The checkpoint record contains a list of all transactions that were in progress at the time the checkpoint
was taken.
To see how a check point works consider the following

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

112

A system failure has occurred at time tf.


The most recent checkpoint prior to time tf was taken at time tc.
Transactions of type T1 completed (successfully) prior to time tc.
Transactions of type T2 started prior to time tc and completed (successfully) after time tc and before
time tf.
Transactions of type T3 also started prior to time tc but did not complete by time tf.
Transactions of type T4 started after time tc and completed (successfully) before time tf.
Finally, transactions of type T5 also started after time tc but did not complete by time tf.
The transactions of types T3 and T5must be undone, and transactions of types T2 and T4 must be redone. At
restart time, the system first goes through the following procedure
1. Start with two lists of transactions, the UNDO list and the REDO list.
2. Set the UNDO list equal to the list of all transactions given in the most recent checkpoint record
and the REDO list to empty.
3. Search forward through the log, starting from the checkpoint record.
4. If a BEGIN TRANSACTION log record is found for transaction T, add T to the UNDO list.
5. If a COMMIT log record is found for transaction T, move T from the UNDO list to the REDO list.
6. When the end of the log is reached, the UNDO and REDO lists are identified.
The system now works backward through the log, undoing the transactions in the UNDO list.
Then works forward, redoing the transactions in the REDO list.
Restoring the database to a correct state by redoing work is sometimes called forward recovery.
Restoring the database to a correct state by undoing work is called backward recovery.
When all recovery activity is complete, then the system is ready to accept new work.
ARIES

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

113

Earlier recovery system performs UNDO before REDO operations.


ARIES scheme performs REDO before UNDO operation.
ARIES operates in three broad phases:
1. Analysis: Build the REDO and UNDO lists.
2. Redo: Start from the log determined in the analysis phase and restore the database to the state it was in the
time of crash.
3. Undo: Undo the effects of transactions that failed to commit.
The name ARIES stands for Algorithms for Recovery and Isolation Exploiting Semantics.
5. TWO PHASE COMMIT
Two-phase commit is important whenever a given transaction can interact with several independent
resource managers,.
Example,
o Consider a transaction running on an IBM mainframe that updates both an IMS database and
a DB2 database. If the transaction completes successfully, then both IMS data and DB2 data
are committed.
o Conversely, if the transaction fails, then both the updates must be rolled back.
o It is not possible to commit one database update and rollback the other. If done so the
atomicity will not be maintained in the system.
o Therefore, the transaction issues a single global or system-wide COMMIT or
ROLLBACK.
o That COMMIT or ROLLBACK is handled by a system component called the coordinator.
o Coordinators task is to guarantee the resource managers commit or roll back.
o It should also guarantee even if the system fails in the middle of the process.
o The two-phase commit protocol is responsible for maintaining such a guarantee.
WORKING
Assume that the transaction has completed and a COMMIT is issued. On receiving the COMMIT
request, the coordinator goes through the following two-phase process:

Prepare:

The resource manager should get ready to go either way on the transaction.
The participant in the transaction should record all updates performed during the transaction from
temporary storage to permanent storage.
In order to perform either COMMIT or ROLLBACK as necessary.

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

114

Resource manager now replies OK to the coordinator or NOT OK based on the write operation.
Commit:

When the coordinator has received replies from all participants, it takes a decision regarding the
transaction and records it in the physical log.
If all replies were OK, that the decision is commit; if any reply was Not OK, the decision is
rollback.
The coordinator informs its decision to all the participants.
Each participant must then commit or roll back the transaction locally, as instructed by the
coordinator.
If the system fails at some point during the process, the restart procedure looks for the decision of the
coordinator.
If the decision is found then the two phase commit can start processing from where it has left off.
If the decision is not found then it assumes that the decision is ROLLBACK and the process can
complete appropriately.
If the participants are from several systems like in distributed system, then some participants should wait
for long time for the coordinators decision.
Data communication manager (DC manager) can act as a resource manager in case of a two-phase
commit process.
6. SAVEPOINTS
Transactions cannot be nested with in another transaction.
Transactions cannot be broken down into smaller subtransactions.
Transactions establish intermediate savepoints while executing.
If there is a roll back operation executed in the transaction, instead of performing roll back all the way to
the beginning we can roll back to the previous savepoint.
Savepoint is not the same as performing a COMMIT, updates made by the transaction are still not
visible to other transaction until the transaction successfully executes a COMMIT.
7. MEDIA RECOVERY
Media recovery is different from transaction and system recovery.
A media failure is a failure such as a disk head crash or a disk controller failure in which some portion
of the database has been physically destroyed.
Recovery from such a failure basically involves reloading or restoring the database from a backup or
dump copy and then using the log.
There is no need to undo transactions that were still in progress at the time of the failure.

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

115

The dump portion of that utility is used to make backup copies of the database on demand.
Such copies can be kept on tape or other archival storage, it is not necessary that they be on direct access
media.
After a media failure, the restore portion of the utility is used to recreate the database from a specified
backup copy.

8. SQL FACILITIES FOR RECOVERY

SQL supports transactions and transaction-based recovery.


All executable SQL statements are atomic except CALL and RETURN.
SQL

provides

BEGIN

TRANSACTION,

COMMIT,

and

ROLLBACK,

called

START

TRANSACTION, COMMIT WORK, and ROLLBACK WORK, respectively.


Syntax for START TRANSACTION:
START TRANSACTION < option commalist> ;
The <option commalist> specifies an access mode, an isolation level, or both
The access mode is either READ ONLY or READ WRITE.
o If neither is specified, READ WRITE is assumed. If READ WRITE is specified, the
isolation level must not be READ UNCOMMITTED.
The isolation level takes the form ISOLATION LEVEL < isolation> , where < isolation> can be READ
UNCOMMITTED, READ COMMITTED, REPEATABLE READ, or SERIALIZABLE.
The syntax for COMMIT and ROLLBACK is:
COMMIT [WORK] [AND [NO] CHAIN];
ROLLBACK [WORK] [AND [NO] CHAIN];
AND CHAIN causes a START TRANSACTION to be executed automatically after the COMMIT;
AND NO CHAIN is the default.
A CLOSE is executed automatically for every open cursor except for the cursors declared WITH
HOLD.
A cursor declared WITH HOLD is not automatically closed at COMMIT
SQL also supports savepoints.
Syntax: SAVEPOINT < savepoint name> ;
This syntax creates a savepoint with the specified user-chosen name.
Syntax for roll back : ROLLBACK TO < savepoint name> ;
This statement undoes all updates done since the specified savepoint.

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

116

Syntax for releasing savepoints: RELEASE < savepoint name> ;


This statement drops the specified savepoint. All savepoints are automatically dropped at transaction
termination.

2 Mark Questions
1. What is transaction?

2. What are the two statements regarding transaction?


3. What are the properties of transaction?
4. What is recovery management component?
5. When is a transaction rolled back?
Any changes that the aborted transaction made to the database must be undone. Once the changes caused by
an aborted transaction have been undone, then the transaction has been rolled back.
6. What are the states of transaction?
7. What is a shadow copy scheme?
8. Give the reasons for allowing concurrency?
9. What is average response time?
10. What are the two types of serializability?
11. Define lock?
12. What are the different modes of lock?
13. Define deadlock?
14. Define the phases of two phase locking protocol
15. Define upgrade and downgrade?
16. What is a database graph?
17. What are the two methods for dealing deadlock problem?
18. What is a recovery scheme?
19. What are the two types of errors?
20. What are the storage types?
21. Define blocks?
22. What is meant by Physical blocks?
23. What is meant by buffer blocks?
24. What is meant by disk buffer?
25. What is meant by log-based recovery?
26. What are uncommitted modifications?

http://engineerportal.blogspot.in/

CS1254 DATABASE MANAGEMENT SYSTEMS

117

27. Define shadow paging.


28. Define page.
29. Explain current page table and shadow page table.
30. What are the drawbacks of shadow-paging technique?
31. Define garbage collection.
32. Differentiate strict two phase locking protocol and rigorous two phase locking protocol.
33. How the time stamps are implemented
34. What are the timestamps associated with each data item?
35. Why is it necessary to have control of concurrent execution of transaction? How is it made possible?

16 Mark Questions
1. Define Serializability. Explain the types of serializability with example.

2. Explain Deadlock with example.


3. Explain in detail about Locking Protocol.
4. Explain the Need for Concurrency Control.
5. Discuss about transaction recoverability.
6. Explain Recovery isolation levels with example.
7. Explain in detail about ACID properties.

http://engineerportal.blogspot.in/

You might also like