Disjoint Sets Union Find Algorithms

Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 3

Disjoint-set data structure

In computing given a set of elements, it is often useful to break them up or partition them into a number of separate, nonoverlapping sets. A disjoint-set data structure is a data structure that keeps track of such a partitioning. A union-find algorithm is an algorithm that performs two useful operations on such a data structure: Find: Determine in which set a particular element is in. Union: Combine or merge two sets into a single set. Because it supports these two operations, a disjoint-set data structure is sometimes called a union-find data structure or merge-find set. The other important operation, MakeSet, which makes a set containing only a given element (a singleton), is generally trivial. With these three operations, many practical partitioning problems can be solved . Disjoint-Set Linked Lists A simple approach to creating a disjoint-set data structure is to create a linked list for each set. The element at the head of each list is chosen as its representative. MakeSet creates a list of one element. Union appends the two lists, a constant-time operation. The drawback of this implementation is that Find requires (n) or linear time. This can be avoided by including in each linked list node a pointer to the head of the list; then Find takes constant time. However, Union now has to update each element of the list being appended to make it point to the head of the new combined list, requiring (n) time. Disjoint-Set Forests Disjoint-set forests are a data structure where each set is represented by a tree data structure, in which each node holds a reference to its parent node In a disjoint-set forest, the representative of each set is the root of that set's tree. Find follows parent nodes until it reaches the root. Union combines two trees into one by attaching the root of one to the root of the other. One way of implementing these might be: function MakeSet(x) x.parent := x

function Find(x) if x.parent == x return x else return Find(x.parent)

function Union(x, y) xRoot := Find(x) yRoot := Find(y) xRoot.parent := yRoot however, it can be enhanced in two ways. The first way, called union by rank, is to always attach the smaller tree to the root of the larger tree, rather than vice versa. Since it is the depth of the tree that affects the running time, the tree with smaller depth gets added under the root of the deeper tree, which only increases the depth if the depths were equal. One-element trees are defined to have a rank of zero, and whenever two trees of the same rank r are united, the rank of the result is r+1. Just applying this technique alone yields an amortized running-time of O(logn) per MakeSet, Union, or Find operation Pseudocode for the improved MakeSet and Union: function MakeSet(x) x.parent := x x.rank := 0 function Union(x, y) xRoot := Find(x) yRoot := Find(y) if xRoot.rank > yRoot.rank yRoot.parent := xRoot else if xRoot != yRoot xRoot.parent := yRoot if xRoot.rank == yRoot.rank yRoot.rank := yRoot.rank + 1

// Unless x and y are already in same set, merge them

The second improvement, called path compression, is a way of flattening the structure of the tree whenever Find is used on it. The idea is that each node visited on the way to a root node may as well be attached directly to the root node; they all share the same representative. function Find(x) if x.parent == x return x else x.parent := Find(x.parent)

return x.parent Applications Disjoint-set data structures model the partitioning of a set, for example to keep track of the connected components of an undirected graph. This model can then be used to determine whether two vertices belong to the same component, or whether adding an edge between them would result in a cycle. The Union-Find algorithm is used in high-performance implementations of Unification. This data structure is used by the Boost Graph Library to implement its Incremental Connected Components functionality. It is also used for implementing Kruskal's algorithm to find the minimum spanning tree of a graph

You might also like