|
| 1 | +<!--?title Breadth First Search --> |
| 2 | + |
| 3 | +# Breadth-first search |
| 4 | +Breadth first search is one of the basic and essential algorithms on graphs. |
| 5 | + |
| 6 | +As a result of how the algorithm works, the path found by breadth first search to any node is a shortest path to that node, i.e.
| 7 | +a path that contains the smallest number of edges in an unweighted graph.
| 8 | + |
| 9 | +The algorithm works in $O(n + m)$ time, where $n$ is the number of vertices and $m$ is the number of edges.
| 10 | + |
| 11 | +## Description of the algorithm |
| 12 | + |
| 13 | +The algorithm takes as input an unweighted graph and the id of the source vertex $s$. The input graph can be directed or undirected;
| 14 | +it does not matter to the algorithm.
| 15 | + |
| 16 | +The algorithm can be understood as a fire spreading on the graph: at the zeroth step only the source $s$ is on fire. At each step, the fire burning at each vertex spreads to all of its neighbors. In one iteration of the algorithm, the "ring of |
| 17 | +fire" is expanded in width by one unit (hence the name of the algorithm). |
| 18 | + |
| 19 | +More precisely, the algorithm can be stated as follows: Create a queue $q$ which will contain the vertices to be processed and a
| 20 | +boolean array $used[]$ which indicates, for each vertex, whether it has been lit (or visited) or not.
| 21 | + |
| 22 | +Initially, push the source $s$ to the queue and set $used[s] = true$, while for all other vertices, $used[] = false$. Then, loop until the queue is empty and in each iteration, pop a vertex from the front of the queue. Iterate through all the edges going out |
| 23 | +of this vertex and if some of these edges go to vertices that are not already lit, set them on fire and place them in the queue. |
| 24 | + |
| 25 | +As a result, when the queue is empty, the "ring of fire" contains all vertices reachable from the source $s$, with each vertex
| 26 | +being reached in the shortest possible way. You can also calculate the lengths of the shortest paths (which just requires maintaining an array of path lengths $d[]$) as well as save information to restore all of these shortest paths (for this, it is
| 27 | +necessary to maintain an array of "parents" $p[]$, which stores, for each vertex, the vertex from which it was reached).
| 28 | + |
| 29 | +## Implementation |
| 30 | + |
| 31 | +We write code for the described algorithm in C++. |
| 32 | + |
| 33 | +Input data: |
| 34 | + |
| 35 | +    vector<vector<int>> g; // adjacency list representation of graph
| 36 | +    int n; // number of nodes in the graph
| 37 | +    int s; // the source vertex
| 38 | +    // take input ...
| 39 | + |
| 40 | +Breadth first search:
| 41 | + |
| 42 | +    queue<int> q;
| 43 | +    q.push (s);
| 44 | +    vector<bool> used (n);
| 45 | +    vector<int> d (n), p (n);
| 46 | +    used[s] = true;
| 47 | +    p[s] = -1;
| 48 | +    while (!q.empty()) {
| 49 | +        int v = q.front();
| 50 | +        q.pop();
| 51 | +        for (size_t i = 0; i < g[v].size(); ++i) {
| 52 | +            int to = g[v][i];
| 53 | +            if (!used[to]) {
| 54 | +                used[to] = true;
| 55 | +                q.push (to);
| 56 | +                d[to] = d[v] + 1;
| 57 | +                p[to] = v;
| 58 | +            }
| 59 | +        }
| 60 | +    }
| 61 | + |
| 62 | +If we have to restore and display the shortest path from the source to some vertex $to$, it can be done in the following |
| 63 | +manner: |
| 64 | + |
| 65 | +    if (!used[to])
| 66 | +        cout << "No path!";
| 67 | +    else {
| 68 | +        vector<int> path;
| 69 | +        for (int v=to; v!=-1; v=p[v])
| 70 | +            path.push_back (v);
| 71 | +        reverse (path.begin(), path.end());
| 72 | +        cout << "Path: ";
| 73 | +        for (size_t i=0; i<path.size(); ++i)
| 74 | +            cout << path[i] + 1 << " ";
| 75 | +    }
| 76 | + |
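The fragments above omit headers and declarations. As a minimal sketch, they could be assembled into one self-contained function as follows. The name `bfs_path` and its signature are assumptions made for this illustration and are not part of the original code; the function returns the path as a vector instead of printing it.

```cpp
#include <algorithm>
#include <queue>
#include <vector>

// Hypothetical helper (not from the article): runs BFS from s and returns a
// shortest path from s to target as a list of 0-indexed vertices.
// Returns an empty vector if target is not reachable from s.
std::vector<int> bfs_path(const std::vector<std::vector<int>>& g, int s, int target) {
    int n = static_cast<int>(g.size());
    std::vector<bool> used(n, false);
    std::vector<int> d(n, 0), p(n, -1);
    std::queue<int> q;
    q.push(s);
    used[s] = true;
    while (!q.empty()) {
        int v = q.front();
        q.pop();
        for (int to : g[v]) {
            if (!used[to]) {
                used[to] = true;
                q.push(to);
                d[to] = d[v] + 1;  // distance in edges from s
                p[to] = v;         // vertex from which `to` was reached
            }
        }
    }
    if (!used[target])
        return {};
    // Walk the parent links back from target to s, then reverse.
    std::vector<int> path;
    for (int v = target; v != -1; v = p[v])
        path.push_back(v);
    std::reverse(path.begin(), path.end());
    return path;
}
```

The number of edges on the returned path is `path.size() - 1`, which equals `d[target]` when the target is reachable.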
| 77 | +## Applications of BFS |
| 78 | + |
| 79 | +* Find the shortest path from a source to other vertices in an unweighted graph. |
| 80 | + |
| 81 | +* Find all connected components in an undirected graph in $O(n + m)$ time: To do this, we just run BFS starting from each vertex, except for vertices which have already been visited in previous runs. Thus, we perform a normal BFS from each of the vertices, but do not reset the array $used[]$ between runs. Every time we start a new BFS, we get a new connected component, and the total running time is still $O(n + m)$ (performing multiple BFS runs on the graph without zeroing the array $used[]$ is called a series of breadth first searches).
| 82 | + |
| 83 | +* Finding a solution to a problem or a game with the least number of moves, if each state of the game can be represented by a vertex of the graph, and the transitions from one state to the other are the edges of the graph.
| 84 | + |
| 85 | +* Finding the shortest path in a graph with weights 0 or 1: This requires just a little modification to normal breadth-first search: use a double-ended queue instead of a queue, and whenever traversing an edge gives a shorter distance to its endpoint than the one currently known, update that distance and add the vertex to the front of the queue if the edge has weight 0, or to the back if it has weight 1.
| 86 | + |
| 87 | +* Finding the shortest cycle in a directed unweighted graph: Start a breadth-first search from each vertex. As soon as we try to go from the current vertex back to the source vertex of that search, we have found the shortest cycle containing the source vertex, and can stop this BFS and start a new one from the next vertex. From all such cycles (at most one from each BFS), choose the shortest.
| 88 | + |
| 89 | +* Find all the edges that lie on any shortest path between a given pair of vertices $(a, b)$. To do this, run two breadth first searches: one from $a$ and one from $b$. Let $d_a []$ be the array containing shortest distances obtained from the first BFS (from $a$) and $d_b []$ be the array containing shortest distances obtained from the second BFS (from $b$). Now, for every edge $(u, v)$, it is easy to check whether that edge lies on any shortest path between $a$ and $b$: the criterion is the condition $d_a [u] + 1 + d_b [v] = d_a [b]$ (in an undirected graph the edge can be traversed in either direction, so the condition is checked for both $(u, v)$ and $(v, u)$).
| 90 | + |
| 91 | +* Find all the vertices on any shortest path between a given pair of vertices (a, b). To do this, run two breadth first searches: one from a and one from b. Let $d_a []$ be the array containing shortest distances obtained from the first BFS (from a) and $d_b []$ be the array containing shortest distances obtained from the second BFS from b. Now, for each vertex, it is easy to check whether it lies on any shortest path between a and b: the criterion is the condition $d_a [v] + d_b [v] = d_a [b]$. |
| 92 | + |
| 93 | +* Find the shortest path of even length from a start vertex to an end vertex in an unweighted graph: For this, we must construct an auxiliary graph whose vertices are the states $(v, c)$, where $v$ is the current vertex and $c = 0$ or $1$ is the current parity. Any edge $(a, b)$ of the original graph turns into two edges $((a, 0), (b, 1))$ and $((a, 1), (b, 0))$ in this new graph. After that, we run a BFS on this graph to find the shortest path from the start vertex with parity 0 to the end vertex with parity 0.
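
The auxiliary parity graph from the last item does not have to be built explicitly. The following is a minimal sketch, assuming the state $(v, c)$ is encoded as the index `2 * v + c`; the function name `shortest_even_path` is hypothetical. Note that the result may visit a vertex of the original graph twice (once with each parity).

```cpp
#include <queue>
#include <vector>

// Hypothetical helper (not from the article): length of the shortest
// even-length route from s to t in an unweighted graph g, or -1 if there is
// none. State (v, c) is stored at index 2 * v + c, where c is the parity of
// the number of edges used so far.
int shortest_even_path(const std::vector<std::vector<int>>& g, int s, int t) {
    int n = static_cast<int>(g.size());
    std::vector<int> d(2 * n, -1);  // -1 marks an unvisited state
    std::queue<int> q;
    d[2 * s] = 0;                   // start in state (s, 0)
    q.push(2 * s);
    while (!q.empty()) {
        int state = q.front();
        q.pop();
        int v = state / 2, c = state % 2;
        for (int to : g[v]) {
            int next = 2 * to + (1 - c);  // every edge flips the parity
            if (d[next] == -1) {
                d[next] = d[state] + 1;
                q.push(next);
            }
        }
    }
    return d[2 * t];                // distance to state (t, 0)
}
```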
| 94 | + |
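The modification for graphs with weights 0 or 1 mentioned in the list above can be sketched as follows. This is an illustration under stated assumptions, not the article's own code: the name `zero_one_bfs` is hypothetical, and the graph is assumed to be stored as an adjacency list `adj` of pairs `(to, w)` with `w` equal to 0 or 1.

```cpp
#include <deque>
#include <limits>
#include <utility>
#include <vector>

// Hypothetical helper (not from the article): shortest distances from s in a
// graph whose edge weights are 0 or 1. adj[v] holds pairs (to, w).
// Unreachable vertices keep the value std::numeric_limits<int>::max().
std::vector<int> zero_one_bfs(
        const std::vector<std::vector<std::pair<int, int>>>& adj, int s) {
    const int INF = std::numeric_limits<int>::max();
    std::vector<int> d(adj.size(), INF);
    std::deque<int> q;
    d[s] = 0;
    q.push_front(s);
    while (!q.empty()) {
        int v = q.front();
        q.pop_front();
        for (const auto& edge : adj[v]) {
            int to = edge.first, w = edge.second;
            if (d[v] + w < d[to]) {
                d[to] = d[v] + w;
                if (w == 0)
                    q.push_front(to);  // same distance as v: process first
                else
                    q.push_back(to);   // one step farther: process later
            }
        }
    }
    return d;
}
```

A vertex can enter the deque more than once, which is why the distance comparison is used here instead of a `used[]` array.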
| 95 | +## Practice Problems |
| 96 | + |
| 97 | +* [SPOJ: AKBAR](http://spoj.com/problems/AKBAR) |
| 98 | +* [SPOJ: NAKANJ](http://www.spoj.com/problems/NAKANJ/) |
| 99 | +* [SPOJ: WATER](http://www.spoj.com/problems/WATER) |
| 100 | + |
| 101 | + |