Advanced_Algorithm_Lab_II_Experiment_CO mapping
2. Implement Depth First Search (DFS) and Breadth First Search (BFS) for a given graph using both recursive and non-recursive techniques. (CO1)
3. Use linked lists to handle collisions and implement dictionary operations: insert, delete, and search. (CO2)
4. Implement a hash table using open addressing with linear and quadratic probing. Compare collision resolution effectiveness. (CO2)
6. Use disjoint sets to build an MST of a weighted undirected graph using Kruskal's algorithm. (CO3)
7. Implement and compare the performance (time and space) of Bubble Sort, Merge Sort, and Quick Sort. (CO4)
8. Implement insertion and deletion operations in an AVL tree and display balancing at each step. (CO4)
10. Implement the Rabin-Karp algorithm and demonstrate its efficiency with multiple patterns and large texts. (CO5)
1. Implement in-order, pre-order, and post-order traversals of a binary tree using both recursive and non-recursive (stack-based) approaches.
Recursive Traversals:
1. In-order Traversal: Left, Root, Right
2. Pre-order Traversal: Root, Left, Right
3. Post-order Traversal: Left, Right, Root
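The traversal routines below operate on a simple binary-tree node; a minimal sketch (the attribute names value, left, and right match the code that follows) is:

class Node:
    def __init__(self, value):
        self.value = value   # data stored in this node
        self.left = None     # left child
        self.right = None    # right child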
Recursive Approach:
def in_order_recursive(root):
    if root is None:
        return
    # Visit left subtree
    in_order_recursive(root.left)
    # Visit root
    print(root.value, end=' ')
    # Visit right subtree
    in_order_recursive(root.right)

def pre_order_recursive(root):
    if root is None:
        return
    # Visit root
    print(root.value, end=' ')
    # Visit left subtree
    pre_order_recursive(root.left)
    # Visit right subtree
    pre_order_recursive(root.right)

def post_order_recursive(root):
    if root is None:
        return
    # Visit left subtree
    post_order_recursive(root.left)
    # Visit right subtree
    post_order_recursive(root.right)
    # Visit root
    print(root.value, end=' ')
For non-recursive traversals, we use a stack to simulate the function call stack.
def in_order_non_recursive(root):
    stack = []
    current = root
    while current or stack:
        # Reach the leftmost node of the current subtree
        while current:
            stack.append(current)
            current = current.left
        current = stack.pop()
        print(current.value, end=' ')
        # Move to the right subtree before the next iteration
        current = current.right
def pre_order_non_recursive(root):
    if root is None:
        return
    stack = [root]
    while stack:
        node = stack.pop()
        print(node.value, end=' ')
        # Push the right child first so the left child is processed first
        if node.right: stack.append(node.right)
        if node.left: stack.append(node.left)

def post_order_non_recursive(root):
    if root is None:
        return
    # Two-stack approach: stack1 yields reverse post-order into stack2
    stack1 = [root]
    stack2 = []
    while stack1:
        node = stack1.pop()
        stack2.append(node)
        if node.left: stack1.append(node.left)
        if node.right: stack1.append(node.right)
    while stack2:
        print(stack2.pop().value, end=' ')
Example Usage:
# Create the binary tree
root = Node(1)
root.left = Node(2)
root.right = Node(3)
root.left.left = Node(4)
root.left.right = Node(5)
root.right.left = Node(6)
root.right.right = Node(7)
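The outputs listed below can then be produced with calls such as these (the print labels are assumed to match the sample output):

print("In-order Non-recursive:")
in_order_non_recursive(root)
print("\nPre-order Non-recursive:")
pre_order_non_recursive(root)
print("\nPost-order Non-recursive:")
post_order_non_recursive(root)
print()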
In-order Non-recursive:
4 2 5 1 6 3 7
Pre-order Non-recursive:
1 2 4 5 3 6 7
Post-order Non-recursive:
4 5 2 6 7 3 1
Explanation:
In both recursive and non-recursive methods, the order of traversal remains the same. The stack-based methods ensure that the traversal happens without explicit recursion, relying on a manual stack to manage the nodes.
Graph Representation:
We'll represent the graph as an adjacency list using a dictionary, where the key is a node, and
the value is a list of adjacent nodes.
class Graph:
    def __init__(self):
        self.graph = {}  # adjacency list: node -> list of neighbors
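The add_edge method was not reproduced above; a minimal sketch for an undirected, unweighted graph (to be placed inside the Graph class) might be:

    def add_edge(self, u, v):
        # Undirected edge: record each node in the other's adjacency list
        self.graph.setdefault(u, []).append(v)
        self.graph.setdefault(v, []).append(u)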
Recursive DFS:
In the recursive version, we traverse as far as possible along a branch, backtracking when we
reach a node with no unvisited neighbors.
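A recursive DFS consistent with the example usage later in this section could be sketched as follows (the visited default argument is an assumption):

def dfs_recursive(graph, node, visited=None):
    if visited is None:
        visited = set()
    if node not in visited:
        print(node, end=" ")
        visited.add(node)
        # Recurse into each unvisited neighbor, going as deep as possible
        for neighbor in graph[node]:
            if neighbor not in visited:
                dfs_recursive(graph, neighbor, visited)
    return visited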
The non-recursive version uses an explicit stack to simulate the function call stack.
def dfs_non_recursive(graph, start):
    visited = set()
    stack = [start]
    while stack:
        node = stack.pop()
        if node not in visited:
            print(node, end=" ")
            visited.add(node)
            # Add all unvisited neighbors to the stack
            # (in reverse order so they are visited in the original order)
            for neighbor in reversed(graph[node]):
                if neighbor not in visited:
                    stack.append(neighbor)
Recursive BFS:
BFS is typically implemented iteratively using a queue, but here's a recursive approach using a
helper function:
    if queue:
        # Dequeue a node
        node = queue.popleft()
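A complete recursive BFS built around this step could look like the sketch below (the queue and visited keyword arguments are assumptions; the signature matches the call bfs_recursive(graph.graph, 1) in the example usage):

from collections import deque

def bfs_recursive(graph, start, queue=None, visited=None):
    if queue is None:
        queue, visited = deque([start]), set()
    if queue:
        # Dequeue a node, process it, then recurse on the remaining queue
        node = queue.popleft()
        if node not in visited:
            print(node, end=" ")
            visited.add(node)
            queue.extend(n for n in graph[node] if n not in visited)
        bfs_recursive(graph, start, queue, visited)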
The non-recursive version uses a queue, which ensures that we explore all nodes level by level.
def bfs_non_recursive(graph, start):
    visited = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node not in visited:
            print(node, end=" ")
            visited.add(node)
            # Enqueue all unvisited neighbors so they are explored level by level
            queue.extend(n for n in graph[node] if n not in visited)
Example Usage:
Let's create a sample graph and perform both DFS and BFS traversals.
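The construction of the sample graph was not shown; one edge list that reproduces the expected output below is (an assumption inferred from the printed traversal orders):

graph = Graph()
for u, v in [(1, 2), (1, 3), (2, 4), (2, 5), (3, 6), (3, 7)]:
    graph.add_edge(u, v)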
# DFS Recursive
print("DFS Recursive:")
dfs_recursive(graph.graph, 1) # Start from node 1
print()
# DFS Non-recursive
print("DFS Non-recursive:")
dfs_non_recursive(graph.graph, 1)
print()
# BFS Recursive
print("BFS Recursive:")
bfs_recursive(graph.graph, 1) # Start from node 1
print()
# BFS Non-recursive
print("BFS Non-recursive:")
bfs_non_recursive(graph.graph, 1)
Expected Output:
DFS Recursive:
1 2 4 5 3 6 7
DFS Non-recursive:
1 2 4 5 3 6 7
BFS Recursive:
1 2 3 4 5 6 7
BFS Non-recursive:
1 2 3 4 5 6 7
1. DFS Recursive: This depth-first search starts at node 1, visits the leftmost branch (node
2 and its children), then backtracks to node 1 and explores the right side (node 3 and its
children).
2. DFS Non-recursive: This version mimics the recursive DFS but uses an explicit stack.
The stack ensures that nodes are explored in depth-first order.
3. BFS Recursive: This version uses a queue to explore nodes level by level, starting from
node 1. We visit all the neighbors of a node before moving to the next level.
4. BFS Non-recursive: This is the standard BFS implementation, where we use a queue to
manage the nodes to visit next. It explores nodes in a breadth-first manner.
Differences between DFS and BFS:
DFS explores as deep as possible along a branch before backtracking, whereas BFS
explores all nodes at the current depth before moving on to nodes at the next depth level.
DFS is typically implemented using a stack (or recursion), and BFS is typically
implemented using a queue.
Both DFS and BFS can be applied to both directed and undirected graphs, and they can be
modified to handle weighted graphs or graphs with cycles.
We'll define a hash table where each bucket is a linked list, and each node in the list stores a key-value pair.
Each node will store the key-value pair and a reference to the next node in the chain.
class Node:
    def __init__(self, key, value):
        self.key = key
        self.value = value
        self.next = None
The hash table will have an array of linked list buckets, and we will define methods for inserting,
deleting, and searching elements.
class HashTable:
    def __init__(self, size=10):
        # Initialize the hash table with a fixed size and empty buckets
        self.size = size
        self.table = [None] * self.size

The insert, delete, and search methods each hash the key to a bucket index and then walk the chain starting at self.table[index], keeping current (and, for deletion, prev) references while traversing; a sketch of these methods follows the display method below.
    def display(self):
        for i in range(self.size):
            print(f"Bucket {i}:", end=" ")
            current = self.table[i]
            while current:
                print(f"({current.key}: {current.value})", end=" -> ")
                current = current.next
            print("None")
Now we can test the insert, delete, and search operations in the hash table.
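For example, the insertions behind the sample output could be issued like this (key/value pairs are taken from the expected output; the table size is an assumption):

hash_table = HashTable(size=10)
for key, value in [("apple", 1), ("banana", 2), ("grape", 3), ("cherry", 4), ("mango", 5)]:
    hash_table.insert(key, value)

print("Hash Table after insertions:")
hash_table.display()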
# Delete a key
print("\nDeleting 'banana':", hash_table.delete("banana"))  # Should return True
print("Deleting 'orange':", hash_table.delete("orange"))    # Should return False
Expected Output:
Hash Table after insertions:
Bucket 0: None
Bucket 1: (banana: 2) -> None
Bucket 2: (grape: 3) -> None
Bucket 3: (apple: 1) -> None
Bucket 4: (cherry: 4) -> (mango: 5) -> None
Explanation:
1. Insert:
o When inserting, the insert method hashes the key and checks the corresponding
bucket. If there's no collision, the key-value pair is directly inserted. If there's a
collision (i.e., the bucket is already occupied), it is inserted at the beginning or
end of the linked list in that bucket.
2. Delete:
o The delete method finds the node corresponding to the key and removes it from
the linked list by adjusting pointers. If the node is the first in the list, the pointer to
the bucket is updated to the next node.
3. Search:
o The search method finds the node corresponding to the key by traversing the
linked list at the hashed index and returns the value if found. If not, it returns
None.
4. Display:
o The display method prints the contents of the hash table for visualization
purposes. Each bucket is shown along with its linked list of key-value pairs.
Complexity:
With separate chaining, insert, delete, and search each take O(1 + α) time on average, where α = n/m is the load factor (n keys spread over m buckets); in the worst case, when every key lands in the same bucket, the operations degrade to O(n).
This approach using linked lists for chaining is a simple and effective way to handle hash collisions in a hash table.
Open addressing is a collision resolution technique where, upon a collision, the algorithm
searches for another open slot within the hash table by following a certain probing sequence. In
the case of linear probing and quadratic probing, we use different strategies to find the next
available slot.
1. Linear Probing: If a collision occurs at index i, check the next index i + 1, and if it's
occupied, check i + 2, and so on. Essentially, we move linearly through the table.
2. Quadratic Probing: If a collision occurs at index i, check the next index i + 1^2, then
i + 2^2, then i + 3^2, and so on. The idea is to probe at positions that grow
quadratically.
We will implement the hash table with both linear and quadratic probing methods and compare
their collision resolution effectiveness.
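The class skeleton (constructor, probe sequence, and insert) is not shown in the fragments below; a minimal sketch consistent with them is given here (the class name, the probing parameter, and the _hash/_probe helpers are assumptions; self.deleted is the tombstone sentinel referenced by search and delete):

class OpenAddressingHashTable:
    def __init__(self, size=7, probing="linear"):
        self.size = size
        self.table = [None] * size
        self.deleted = object()          # tombstone marking a removed slot
        self.probing = probing           # "linear" or "quadratic"

    def _hash(self, key):
        return sum(ord(c) for c in str(key)) % self.size

    def _probe(self, index, i):
        # i-th probe: index + i for linear probing, index + i**2 for quadratic probing
        offset = i if self.probing == "linear" else i * i
        return (index + offset) % self.size

    def insert(self, key, value):
        index = self._hash(key)
        for i in range(self.size):
            idx = self._probe(index, i)
            if self.table[idx] is None or self.table[idx] is self.deleted:
                self.table[idx] = (key, value)
                return True
        return False                     # table is full (or probe sequence exhausted)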
    # search and delete use the _hash and _probe helpers from the sketch above
    def search(self, key):
        index = self._hash(key)
        for i in range(self.size):
            idx = self._probe(index, i)
            if self.table[idx] is None:
                return None              # Key not found
            elif self.table[idx] is not self.deleted and self.table[idx][0] == key:
                return self.table[idx][1]  # Return the value
        return None

    def delete(self, key):
        index = self._hash(key)
        for i in range(self.size):
            idx = self._probe(index, i)
            if self.table[idx] is None:
                return False             # Key not found
            elif self.table[idx] is not self.deleted and self.table[idx][0] == key:
                self.table[idx] = self.deleted  # Mark the slot as deleted
                return True
        return False
    def display(self):
        """Display the current state of the hash table."""
        for i, slot in enumerate(self.table):
            if slot is None:
                print(f"Bucket {i}: Empty")
            elif slot is self.deleted:
                print(f"Bucket {i}: Deleted")
            else:
                print(f"Bucket {i}: {slot}")
Step 3: Testing the Hash Table with Linear and Quadratic Probing
Let's create a hash table and perform insertions, deletions, and searches using both linear
probing and quadratic probing. We'll also compare the effectiveness by displaying the hash
table after each operation.
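A test along these lines exercises both strategies (key/value pairs are taken from the sample output; exact bucket positions depend on the hash function and table size, so they may not match the listing below exactly):

for probing in ("linear", "quadratic"):
    print(f"\nInserting with {probing.capitalize()} Probing:")
    table = OpenAddressingHashTable(size=7, probing=probing)
    for key, value in [("apple", 1), ("banana", 2), ("grape", 3), ("cherry", 4), ("mango", 5)]:
        table.insert(key, value)
    table.display()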
Expected Output:
Inserting with Linear Probing:
Bucket 0: (banana, 2)
Bucket 1: Empty
Bucket 2: (grape, 3)
Bucket 3: (apple, 1)
Bucket 4: (mango, 5)
Bucket 5: (cherry, 4)
Bucket 6: Empty
1. Linear Probing:
o When a collision occurs, the algorithm checks the next available index linearly (i
+ 1, i + 2, etc.). This can cause clustering if multiple keys hash to the same index.
o In the example above, keys are inserted into a hash table with linear probing, and
collisions are resolved by looking for the next available slot.
2. Quadratic Probing:
o In quadratic probing, if a collision occurs, the algorithm probes indices in a
quadratic sequence: i + 1^2, i + 2^2, i + 3^2, etc. This helps reduce clustering
compared to linear probing by spreading out the probe sequence.
o As seen in the example, quadratic probing spreads out the keys more effectively
and reduces the chance of clustering compared to linear probing.
Linear Probing: The table becomes prone to primary clustering (keys clustering
together), especially when there are several consecutive collisions. This can lead to
performance degradation.
Quadratic Probing: Quadratic probing helps reduce primary clustering by probing
slots that are further apart. However, it may still suffer from secondary clustering (keys
that hash to the same index might still collide), though it's less severe than with linear
probing.
Conclusion:
Linear Probing is simpler but can suffer from primary clustering, making it less efficient under high collision scenarios.
Quadratic Probing spreads the probe sequence out, reducing primary clustering and handling heavy collisions more gracefully, though the table size must be chosen carefully (e.g., a prime) so that the probe sequence can reach enough free slots.
Dijkstra's algorithm is a well-known shortest path algorithm that computes the shortest paths
from a source node to all other nodes in a graph with non-negative edge weights. The algorithm
works by progressively selecting the node with the smallest known distance and updating the
distances to its neighbors. It can be optimized using a priority queue (often implemented with a
min-heap) to efficiently get the node with the smallest tentative distance.
We will represent the graph as an adjacency list, implement the dijkstra method with a min-heap priority queue (heapq), and reconstruct the shortest path using a dictionary of previous pointers.
import heapq

class Graph:
    def __init__(self):
        self.graph = {}  # Adjacency list: node -> list of (neighbor, weight) tuples

    def add_edge(self, u, v, weight):
        self.graph.setdefault(u, []).append((v, weight))
        self.graph.setdefault(v, []).append((u, weight))  # For undirected graph
The heart of the dijkstra method is the loop below; the full method is sketched after this fragment.

        while pq:
            # Extract the node with the smallest distance from the priority queue
            current_distance, current_node = heapq.heappop(pq)
            # Skip processing if the current node's distance is outdated
            if current_distance > distances[current_node]:
                continue
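The full dijkstra method and the path reconstruction were not reproduced; a sketch consistent with the explanation and the example output is (the get_shortest_path signature is an assumption):

    def dijkstra(self, start):
        # Distances start at infinity, except for the source node
        distances = {node: float('inf') for node in self.graph}
        distances[start] = 0
        previous = {node: None for node in self.graph}
        pq = [(0, start)]                          # min-heap of (distance, node)
        while pq:
            current_distance, current_node = heapq.heappop(pq)
            if current_distance > distances[current_node]:
                continue                           # stale queue entry
            for neighbor, weight in self.graph[current_node]:
                distance = current_distance + weight
                if distance < distances[neighbor]:
                    distances[neighbor] = distance
                    previous[neighbor] = current_node
                    heapq.heappush(pq, (distance, neighbor))
        return distances, previous

    def get_shortest_path(self, previous, start, end):
        # Walk backwards from end to start along the previous pointers
        path, node = [], end
        while node is not None:
            path.append(node)
            node = previous[node]
        path.reverse()
        return path if path and path[0] == start else []   # empty if end is unreachable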
# Example Usage:
# Create the graph and add edges
graph = Graph()
graph.add_edge("A", "B", 1)
graph.add_edge("A", "C", 4)
graph.add_edge("B", "C", 2)
graph.add_edge("B", "D", 5)
graph.add_edge("C", "D", 1)
1. Graph Representation:
o The graph is represented as an adjacency list stored in the self.graph dictionary.
Each key in the dictionary is a node, and the value is a list of tuples where each
tuple represents a neighbor node and the edge weight.
2. Dijkstra’s Algorithm:
o The dijkstra method takes a start node as input and computes the shortest
distance from this node to all other nodes using a priority queue (min-heap).
o The algorithm starts by initializing the distance to all nodes as infinity, except for
the source node which is set to 0. It uses a priority queue to extract the node with
the smallest tentative distance.
o For each node, it explores its neighbors and updates their tentative distances if a
shorter path is found.
o The algorithm uses a dictionary previous to keep track of the path for each node,
which is useful for reconstructing the shortest path later.
3. Shortest Path Reconstruction:
o The get_shortest_path method reconstructs the path from the start node to
the end node by following the previous pointers. The path is then reversed
because it’s reconstructed from the destination to the source.
Example Output:
Shortest distances from A: {'A': 0, 'B': 1, 'C': 3, 'D': 4}
Shortest path from A to D: ['A', 'B', 'C', 'D']
Distance: 4
Explanation of Output:
Shortest Distances:
o The distances from node "A" to all other nodes are calculated and stored in the
distances dictionary.
o For example, the shortest path from "A" to "B" is 1, and the shortest path from
"A" to "C" is 3.
Shortest Path:
o The shortest path from "A" to "D" is ['A', 'B', 'C', 'D'], and the total
distance is 4.
The priority queue (heapq) stores tuples of (distance, node), where distance is the
tentative shortest distance of the node, and node is the node itself.
heapq.heappop(pq) extracts the node with the smallest distance efficiently.
heapq.heappush(pq, (distance, neighbor)) adds neighbors to the queue with their
updated distances.
Time Complexity:
With a binary-heap priority queue, Dijkstra's algorithm runs in O((V + E) log V) time, where V is the number of vertices and E is the number of edges.
Conclusion:
This implementation of Dijkstra's algorithm with a priority queue is efficient and computes
the shortest path in a weighted graph with non-negative edge weights. The use of the priority
queue minimizes the time complexity for finding the node with the smallest tentative distance,
making the algorithm scalable to large graphs.
Kruskal's algorithm is a greedy algorithm used to find the Minimum Spanning Tree (MST) of a
connected, undirected graph with weighted edges. The key idea behind Kruskal’s algorithm is to
always pick the smallest edge that does not form a cycle, using a disjoint-set (union-find) data
structure to efficiently manage the merging of sets of nodes.
The union-find or disjoint-set data structure supports two main operations efficiently: find(x), which returns the representative (root) of the set containing x, and union(x, y), which merges the sets containing x and y.
To improve performance, we use path compression (flattening the structure of the tree when
performing find) and union by rank/size (always attach the smaller tree under the larger tree to
keep the tree flat).
Kruskal's Algorithm with Disjoint Sets
Here is the implementation of Kruskal's algorithm using the disjoint-set data structure.
Python Code:
class DisjointSet:
    def __init__(self, n):
        self.parent = list(range(n))  # Parent pointer for each node
        self.rank = [0] * n           # Rank of each node for union by rank

    def find(self, x):
        # Path compression: point x directly at the root of its set
        if self.parent[x] != x:
            self.parent[x] = self.find(self.parent[x])
        return self.parent[x]

    def union(self, x, y):
        root_x, root_y = self.find(x), self.find(y)
        if root_x != root_y:
            # Union by rank: attach the smaller tree under the larger tree
            if self.rank[root_x] > self.rank[root_y]:
                self.parent[root_y] = root_x
            elif self.rank[root_x] < self.rank[root_y]:
                self.parent[root_x] = root_y
            else:
                self.parent[root_y] = root_x
                self.rank[root_x] += 1
class KruskalMST:
    def __init__(self, vertices):
        self.V = vertices   # Number of vertices
        self.edges = []     # Edges stored as (weight, u, v) so sort() orders by weight

    def add_edge(self, u, v, weight):
        self.edges.append((weight, u, v))

    def kruskal(self):
        # Step 1: Sort edges by weight
        self.edges.sort()
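The remainder of the kruskal method (steps 2-4 in the explanation below) was not reproduced; it could be completed as follows, continuing the method above (returning the MST edge list and total weight is an assumption based on the example output):

        # Step 2: one disjoint set per vertex
        ds = DisjointSet(self.V)
        mst_edges, total_weight = [], 0
        # Step 3: take the cheapest remaining edge that does not form a cycle
        for weight, u, v in self.edges:
            if ds.find(u) != ds.find(v):
                ds.union(u, v)
                mst_edges.append((u, v, weight))
                total_weight += weight
                if len(mst_edges) == self.V - 1:   # MST is complete
                    break
        # Step 4: return the MST edges and their total weight
        return mst_edges, total_weight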
# Example Usage:
graph = KruskalMST(6)
graph.add_edge(0, 1, 4)
graph.add_edge(0, 2, 4)
graph.add_edge(1, 2, 2)
graph.add_edge(1, 3, 5)
graph.add_edge(2, 3, 3)
graph.add_edge(3, 4, 2)
graph.add_edge(4, 5, 6)
graph.add_edge(2, 5, 4)
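The MST can then be computed and printed as follows (assuming the kruskal sketch above):

mst_edges, total_weight = graph.kruskal()
print("Minimum Spanning Tree (MST):")
for u, v, weight in mst_edges:
    print(f"Edge: {u} - {v}, Weight: {weight}")
print("Total weight of MST:", total_weight)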
1. DisjointSet Class:
o find(x): This function finds the root of the set containing x. Path compression is
used to make future queries faster by directly linking each node to its root.
o union(x, y): This function merges the sets containing x and y. It uses union by
rank, attaching the tree with smaller rank to the root of the tree with a larger rank,
thus keeping the tree height balanced.
2. KruskalMST Class:
o add_edge(u, v, weight): Adds an edge with the given vertices u, v and weight
to the graph.
o kruskal(): This is the main function to compute the MST. It:
1. Sorts all edges by their weight.
2. Initializes the disjoint-set data structure to track which vertices belong to
the same set.
3. Iterates over the sorted edges, adding each edge to the MST if it doesn't
form a cycle (i.e., if the vertices are not already in the same set).
4. Returns the list of edges in the MST and the total weight of the MST.
Example Graph: the 6-vertex graph built in the example usage above (vertices 0-5 with the eight weighted edges listed).
Example Output:
Minimum Spanning Tree (MST):
Edge: 1 - 2, Weight: 2
Edge: 3 - 4, Weight: 2
Edge: 2 - 3, Weight: 3
Edge: 0 - 1, Weight: 4
Edge: 2 - 5, Weight: 4
Total weight of MST: 15
1. Step 1: The edges are sorted in non-decreasing order of weight.
2. Step 2: The disjoint-set is initialized, with each vertex as its own set.
3. Step 3: The edges are processed one by one, adding them to the MST if they do not form a cycle (i.e., if the two vertices of the edge are in different sets). The union operation is used to merge the sets.
4. The algorithm stops when V - 1 edges are added to the MST.
Time Complexity:
Sorting the edges takes O(E log E), where E is the number of edges.
For each edge, the find and union operations take O(α(V)) amortized time, where α(V) is the inverse Ackermann function, which is nearly constant for practical values of V.
Overall, the time complexity of Kruskal's algorithm is dominated by the sorting step: O(E log E).
Conclusion:
Kruskal’s algorithm with a disjoint-set data structure is an efficient and elegant way to compute
the Minimum Spanning Tree (MST) of a weighted undirected graph. By using union by rank
and path compression, we can keep the union-find operations nearly constant, ensuring that
Kruskal’s algorithm runs efficiently even on large graphs.
We will implement the three sorting algorithms — Bubble Sort, Merge Sort, and Quick Sort
— and compare their performance in terms of both time complexity and space complexity.
Then, we'll measure the actual runtime for each algorithm on random arrays of different sizes to
evaluate their real-world performance.
1. Bubble Sort:
Bubble Sort is a simple sorting algorithm where each pass through the array compares adjacent
elements and swaps them if they are in the wrong order. This process repeats until no more
swaps are needed.
Time Complexity: O(n^2) in the average and worst case; O(n) in the best case when the array is already sorted and an early-exit check is used. Space Complexity: O(1), since the sort is in place.
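Bubble Sort Implementation (a sketch; the swapped early-exit flag is an optimization, not required by the description above):

def bubble_sort(arr):
    n = len(arr)
    for i in range(n - 1):
        swapped = False
        # After pass i, the largest i elements are already in their final positions
        for j in range(n - 1 - i):
            if arr[j] > arr[j + 1]:
                arr[j], arr[j + 1] = arr[j + 1], arr[j]
                swapped = True
        if not swapped:          # no swaps in a full pass: the array is sorted
            break
    return arr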
2. Merge Sort:
Merge Sort is a divide-and-conquer algorithm that splits the array into halves, recursively sorts
each half, and then merges the sorted halves.
Time Complexity:
Best, average, and worst case: O(n log n) (due to the divide and merge process).
Space Complexity: O(n) (because it requires additional space to merge arrays).
Merge Sort Implementation:
def merge_sort(arr):
    if len(arr) <= 1:
        return arr
    # ... split, recursively sort, and merge (see the full sketch below) ...
    return result
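The elided splitting and merging steps could be filled in like this (the merge helper and the variable names mid, left, right are assumptions):

def merge_sort(arr):
    if len(arr) <= 1:
        return arr
    mid = len(arr) // 2
    left = merge_sort(arr[:mid])      # sort each half recursively
    right = merge_sort(arr[mid:])
    return merge(left, right)         # merge the two sorted halves

def merge(left, right):
    result = []
    i = j = 0
    # Repeatedly take the smaller front element from the two sorted halves
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            result.append(left[i])
            i += 1
        else:
            result.append(right[j])
            j += 1
    result.extend(left[i:])           # append whatever remains
    result.extend(right[j:])
    return result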
3. Quick Sort:
Quick Sort is another divide-and-conquer algorithm that selects a "pivot" element and partitions
the array around the pivot. The pivot element is placed in its correct position, and the sub-arrays
are sorted recursively.
Time Complexity:
Best and average case: O(n log n) (when the pivot divides the array evenly).
Worst case: O(n^2) (when the pivot is always the smallest or largest element, causing unbalanced partitions).
Space Complexity: O(log n) (due to recursive function calls).
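Quick Sort Implementation (a sketch using an in-place Lomuto partition with the last element as pivot; the pivot choice and the helper name partition are assumptions):

def quick_sort(arr, low=0, high=None):
    if high is None:
        high = len(arr) - 1
    if low < high:
        p = partition(arr, low, high)     # pivot ends up at its final position p
        quick_sort(arr, low, p - 1)
        quick_sort(arr, p + 1, high)
    return arr

def partition(arr, low, high):
    pivot = arr[high]                     # Lomuto partition: last element as pivot
    i = low - 1
    for j in range(low, high):
        if arr[j] <= pivot:
            i += 1
            arr[i], arr[j] = arr[j], arr[i]
    arr[i + 1], arr[high] = arr[high], arr[i + 1]
    return i + 1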
Time Complexity:
Bubble Sort: O(n^2) in the worst case, which makes it inefficient for large datasets.
Merge Sort: O(n log n) in all cases (best, average, and worst).
Quick Sort: O(n log n) on average, but O(n^2) in the worst case if the pivot selection is poor.
Space Complexity:
Bubble Sort: O(1) (in-place). Merge Sort: O(n) (auxiliary arrays for merging). Quick Sort: O(log n) on average (recursion stack).
Performance Measurement
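The measurement harness itself was not included; one way to time the three implementations on random arrays is sketched below (the array sizes and the use of time.perf_counter are choices made here, not taken from the original):

import random
import time

def time_sort(sort_fn, arr):
    data = arr.copy()                     # each algorithm sorts its own copy
    start = time.perf_counter()
    sort_fn(data)
    return time.perf_counter() - start

for size in (100, 1000, 5000, 10000):
    arr = [random.randint(0, 100000) for _ in range(size)]
    print(f"n = {size}:")
    for name, fn in (("Bubble Sort", bubble_sort),
                     ("Merge Sort", merge_sort),
                     ("Quick Sort", quick_sort)):
        print(f"  {name}: {time_sort(fn, arr):.4f} s")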
Expected Output:
For small sizes (like 100), all algorithms should run quickly. For larger sizes (like 5000 or 10000), the gap between Bubble Sort and the other two becomes obvious: Bubble Sort's runtime grows roughly quadratically with n, while Merge Sort and Quick Sort grow roughly in proportion to n log n.
Summary of Results:
Bubble Sort:
o Time Complexity: O(n^2), making it inefficient for larger datasets.
o Space Complexity: O(1), as it is an in-place algorithm.
o Performance: Becomes impractical for large datasets (e.g., for n = 10000, the time taken is very high).
Merge Sort:
o Time Complexity: O(n log n) in all cases.
o Space Complexity: O(n); requires extra space for merging.
o Performance: Performs well even on larger datasets (e.g., n = 10000).
Quick Sort:
o Time Complexity: O(n log n) on average, O(n^2) in the worst case (though it can be improved with randomized pivot selection).
o Space Complexity: O(log n) (due to the recursive stack).
o Performance: Generally fast and more efficient than Merge Sort for large datasets. However, its worst-case performance can be a concern unless the pivot is chosen wisely.
Conclusion:
Bubble Sort is only useful for small datasets or when simplicity is preferred.
Merge Sort and Quick Sort are both efficient and preferred for large datasets, with
Quick Sort usually performing better in practice due to lower overhead, despite its
worst-case complexity.
For practical usage on large datasets, Quick Sort is often the go-to algorithm, but in
cases where stability or guaranteed worst-case performance is required, Merge Sort is a
good alternative.
An AVL tree (Adelson-Velsky and Landis tree) is a self-balancing binary search tree (BST). It
maintains the balance factor of each node, which is defined as the difference between the
heights of the left and right subtrees. The balance factor must be between -1 and 1 for the tree to
remain balanced. When the balance factor of a node becomes less than -1 or greater than 1,
rotations are performed to restore balance.
Operations:
1. Insertion:
o Insert a node like in a regular BST.
o After insertion, check the balance factor of each ancestor node and perform
necessary rotations to maintain the AVL property.
2. Deletion:
o Remove a node like in a regular BST.
o After deletion, check the balance factor of each ancestor node and perform
necessary rotations to maintain the AVL property.
Balancing Rotations:
1. Left Rotation (Single rotation): Used when a right-heavy subtree becomes unbalanced.
2. Right Rotation (Single rotation): Used when a left-heavy subtree becomes unbalanced.
3. Left-Right Rotation (Double rotation): A left rotation on the left child followed by a right rotation on the node, used when the right subtree of the left child is too heavy (LR imbalance).
4. Right-Left Rotation (Double rotation): A right rotation on the right child followed by a left rotation on the node, used when the left subtree of the right child is too heavy (RL imbalance).
class AVLTree:
    def __init__(self):
        self.root = None

    def left_rotate(self, z):
        y = z.right
        T2 = y.left
        # Perform rotation
        y.left = z
        z.right = T2
        # Update heights
        z.height = max(self.get_height(z.left), self.get_height(z.right)) + 1
        y.height = max(self.get_height(y.left), self.get_height(y.right)) + 1
        return y  # new subtree root

    def right_rotate(self, y):
        x = y.left
        T2 = x.right
        # Perform rotation
        x.right = y
        y.left = T2
        # Update heights
        y.height = max(self.get_height(y.left), self.get_height(y.right)) + 1
        x.height = max(self.get_height(x.left), self.get_height(x.right)) + 1
        return x  # new subtree root
Both insert and delete end with the same rebalancing step: after updating the node's height, get the balance factor to check whether the node became unbalanced, apply the appropriate rotation, and return the (possibly new) subtree root:

        balance = self.get_balance(node)
        # ... rotation cases (see the insert sketch below) ...
        return node
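The node class, the height and balance helpers, and the insert method with its four rebalancing cases were not reproduced; an insert-only sketch consistent with the fragments above is given here (deletion follows the same pattern; the names AVLNode and pre_order are assumptions):

class AVLNode:
    def __init__(self, value):
        self.value = value
        self.left = None
        self.right = None
        self.height = 1                   # a new leaf has height 1

and, inside the AVLTree class defined above:

    def get_height(self, node):
        return node.height if node else 0

    def get_balance(self, node):
        return self.get_height(node.left) - self.get_height(node.right) if node else 0

    def insert(self, node, value):
        # 1. Standard BST insertion
        if node is None:
            return AVLNode(value)
        if value < node.value:
            node.left = self.insert(node.left, value)
        else:
            node.right = self.insert(node.right, value)
        # 2. Update the height of this ancestor node
        node.height = 1 + max(self.get_height(node.left), self.get_height(node.right))
        # 3. Rebalance if the balance factor left the range [-1, 1]
        balance = self.get_balance(node)
        if balance > 1 and value < node.left.value:       # Left-Left: right rotation
            return self.right_rotate(node)
        if balance < -1 and value > node.right.value:     # Right-Right: left rotation
            return self.left_rotate(node)
        if balance > 1 and value > node.left.value:       # Left-Right: double rotation
            node.left = self.left_rotate(node.left)
            return self.right_rotate(node)
        if balance < -1 and value < node.right.value:     # Right-Left: double rotation
            node.right = self.right_rotate(node.right)
            return self.left_rotate(node)
        return node

    def pre_order(self, node):
        if node:
            print(node.value, end=" ")
            self.pre_order(node.left)
            self.pre_order(node.right)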
# Example Usage:
avl_tree = AVLTree()
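The insertions that produce the trace below could be driven like this (assuming the insert and pre_order sketches above; the loop structure is an assumption, and the rotation messages in the trace would come from print statements inside the rebalancing cases of insert, which are omitted in the sketch):

for value in [20, 15, 25, 10, 5, 1, 30]:
    print(f"Inserting {value}:")
    avl_tree.root = avl_tree.insert(avl_tree.root, value)
    print("Pre-order traversal:", end=" ")
    avl_tree.pre_order(avl_tree.root)
    print()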
Explanation:
Insert Method:
o We perform regular BST insertion first.
o After insertion, we update the height of the nodes and check the balance factor.
o If the balance factor is outside the range [-1, 1], we perform the appropriate
rotation to balance the tree.
Delete Method:
o We perform regular BST deletion (handle cases for nodes with 0, 1, or 2
children).
o After deletion, we update the height and balance of ancestor nodes and perform
rotations if necessary.
Rotations:
o The tree uses single or double rotations to maintain balance. Depending on the
balance factor of the node, we perform:
Right Rotation (LL imbalance),
Left Rotation (RR imbalance),
Left-Right Rotation (LR imbalance),
Right-Left Rotation (RL imbalance).
Example Output:
Inserting 20:
Pre-order traversal: 20
Inserting 15:
Pre-order traversal: 20 15
Inserting 25:
Pre-order traversal: 20 15 25
Inserting 10:
Pre-order traversal: 20 15 10 25
Inserting 5:
Left-Left Rotation at node 15
Pre-order traversal: 20 10 5 15 25
Inserting 1:
Left-Left Rotation at node 20
Pre-order traversal: 10 5 1 20 15 25
Inserting 30:
Pre-order traversal: 10 5 1 20 15 25 30
Steps:
1. Preprocessing the pattern: Create the partial match table (or the prefix function) that
stores the longest proper prefix of the pattern that is also a suffix for each position in the
pattern.
2. Searching the text: Use the preprocessed partial match table to skip unnecessary
comparisons while searching through the text.
Key Concepts:
Prefix Function: For each position in the pattern, we compute the length of the longest
prefix that is also a suffix up to that position. This helps in skipping unnecessary
comparisons during the search phase.
Matching Process: When a mismatch occurs, instead of starting over, the pattern is
shifted based on the information in the partial match table, so the algorithm skips
redundant checks.
KMP Algorithm
Here’s the implementation of the KMP algorithm to find all occurrences of a pattern in a given
text:
def compute_prefix_function(pattern):
    m = len(pattern)
    lps = [0] * m    # lps[i] = length of the longest proper prefix of pattern[:i+1] that is also a suffix
    length = 0
    i = 1
    while i < m:
        if pattern[i] == pattern[length]:
            length += 1
            lps[i] = length
            i += 1
        else:
            if length != 0:
                length = lps[length - 1]  # fall back to the previous valid prefix length
            else:
                lps[i] = 0
                i += 1
    return lps
def KMPSearch(text, pattern):
    n, m = len(text), len(pattern)
    occurrences = []
    # Preprocess the pattern to get the LPS (longest prefix suffix) array
    lps = compute_prefix_function(pattern)
    i = j = 0    # i indexes the text, j indexes the pattern
    while i < n:
        if text[i] == pattern[j]:
            i += 1
            j += 1
            if j == m:
                # Found a match, append the starting index of the match (i - j)
                occurrences.append(i - j)
                j = lps[j - 1]   # use the LPS array to skip unnecessary comparisons
        else:
            if j != 0:
                j = lps[j - 1]   # shift the pattern using the LPS array
            else:
                i += 1
    return occurrences
# Example usage:
text = "ABABDABACDABABCABAB"
pattern = "ABAB"
matches = KMPSearch(text, pattern)
print("Pattern found at indices:", matches)
Explanation:
1. compute_prefix_function:
This function computes the Longest Prefix Suffix (LPS) array for the given pattern.
The LPS array helps us avoid unnecessary comparisons when a mismatch occurs by
allowing us to shift the pattern efficiently.
2. KMPSearch:
This function searches for all occurrences of the pattern in the text.
It uses the precomputed LPS array to skip characters in the pattern and the text, speeding
up the search.
When a match is found, it appends the starting index of the match to the occurrences
list.
Example Walkthrough:
Consider the example with the text "ABABDABACDABABCABAB" and the pattern "ABAB".
1. The pattern "ABAB" is preprocessed with compute_prefix_function.
2. This yields the LPS array [0, 0, 1, 2].
o The value at each index in the LPS array represents the length of the longest proper prefix of the substring ending at that index that is also a suffix.
3. Then, we perform the search using the KMP algorithm:
o We compare the text with the pattern, and each time a mismatch occurs, we shift
the pattern using the LPS array to avoid unnecessary comparisons.
o When a match is found, we record the index in the occurrences list.
4. Output:
Pattern found at indices: [0, 10, 15]
This means the pattern "ABAB" occurs at indices 0, 10, and 15 in the text.
Time Complexity:
Preprocessing the pattern (LPS array) takes O(m), where m is the length of the pattern.
Searching the text takes O(n), where n is the length of the text.
Space Complexity:
O(m) for the LPS array.
The KMP algorithm efficiently finds all occurrences of a pattern in a given text by
preprocessing the pattern to create a partial match table (LPS array) and using it to avoid
unnecessary comparisons during the search. The algorithm runs in linear time, making it much
faster than brute force string matching algorithms.
1. Hashing the Pattern(s): Compute a hash value for the pattern and store it.
2. Rolling Hash: Compute hash values for substrings of the text of the same length as the
pattern.
3. Compare Hashes: If the hash value of a substring matches the hash value of the pattern,
check the substring character-by-character to confirm the match.
4. Efficiency: The algorithm is efficient for multiple patterns, as it can calculate the hashes
of all substrings of the text and compare them with the pattern hashes in linear time.
Rolling Hash:
We use a rolling hash to compute the hash for a substring efficiently. As we slide the
window across the text, we can compute the hash for the next substring using the hash of
the previous one by removing the effect of the first character and adding the effect of the
new character.
The hash function typically uses a base (e.g., 256 for ASCII) and a large prime number
as a modulus.
1. Preprocessing:
o Calculate the hash of the pattern.
o Prepare the base and modulus for the rolling hash.
2. Search:
o Slide the window across the text, calculating hashes for substrings of the same
length as the pattern.
o Compare hashes and verify matches.
Rabin-Karp Algorithm Implementation:
def rabin_karp_search(text, pattern, base=256, modulus=101):
    """
    Implements the Rabin-Karp algorithm for string matching.
    Finds all occurrences of the pattern in the given text.
    """
    n = len(text)
    m = len(pattern)
    if m == 0 or m > n:
        return []
    matches = []
    # Calculate hash of the pattern and the first window of the text
    pattern_hash = 0
    window_hash = 0
    h = 1
    # h = base^(m-1) % modulus, the weight of the leading character in a window
    for _ in range(m - 1):
        h = (h * base) % modulus
    # Compute the hash for the pattern and the first window of the text
    for i in range(m):
        pattern_hash = (base * pattern_hash + ord(pattern[i])) % modulus
        window_hash = (base * window_hash + ord(text[i])) % modulus
    # Slide the window over the text one character at a time
    for i in range(n - m + 1):
        # On a hash match, confirm character-by-character to rule out collisions
        if pattern_hash == window_hash and text[i:i + m] == pattern:
            matches.append(i)
        if i < n - m:
            # Rolling hash: remove text[i], add text[i + m]
            window_hash = (base * (window_hash - ord(text[i]) * h) + ord(text[i + m])) % modulus
    return matches
# Example usage:
text = "ABABDABACDABABCABAB"
patterns = ["ABAB", "DABAC", "ABCAB"]
print(f"Text: {text}\n")
for pattern in patterns:
matches = rabin_karp_search(text, pattern)
print(f"Pattern: {pattern} found at indices: {matches}")
Explanation of Code:
1. Hashing:
o The pattern_hash and window_hash store the hash values of the pattern and the
current window of the text, respectively.
o The rolling hash allows efficient calculation of the hash for the next substring by
removing the first character of the previous window and adding the new
character.
2. Hashing Details:
o The base (base = 256) represents the size of the character set (ASCII).
o The modulus (modulus = 101) is a large prime number to reduce the likelihood
of hash collisions.
3. Rolling Hash:
o The window hash is updated as the window slides over the text:
window_hash = (base * (window_hash - ord(old_char) * h) + ord(new_char)) % modulus, where h = base^(m - 1) % modulus.
4. Character-by-character Comparison:
o After matching hashes, we perform a character-by-character comparison to
confirm that the substrings actually match, avoiding false positives from hash
collisions.
5. Sliding the Window:
o We slide the window one character at a time, updating the hash and checking for
matches.
Efficiency:
Time Complexity:
o Preprocessing: O(m), where m is the length of the pattern.
o Search: O(n) on average, where n is the length of the text; in the worst case (many hash collisions), the verification step pushes this toward O(n·m).
o Thus, the overall average time complexity is O(n + m) for a single pattern search.
Space Complexity: O(m) for storing the pattern and hash values.
Test with Multiple Patterns and Large Text:
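The large test string itself was not included; a driver of the following shape runs every pattern against whatever text is supplied (dna_text is a placeholder for the elided test string, so the printed indices depend on it):

dna_text = "..."   # placeholder for the elided large DNA-like test string
dna_patterns = ["GATC", "AGCT", "CGAG", "TACG", "CTAG"]
for pattern in dna_patterns:
    print(f"Pattern '{pattern}' found at indices: {rabin_karp_search(dna_text, pattern)}")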
Output:
Pattern 'GATC' found at indices: [0, 4, 40, 56]
Pattern 'AGCT' found at indices: [6, 14, 46, 62]
Pattern 'CGAG' found at indices: [10, 50]
Pattern 'TACG' found at indices: [8, 42]
Pattern 'CTAG' found at indices: [12, 54]
Analysis:
The Rabin-Karp algorithm efficiently finds the occurrences of each pattern by using
hash values.
The rolling hash allows the algorithm to move through the text in linear time without
rehashing the entire substring for every window, which is a significant improvement over
brute-force approaches.
Even with large texts and multiple patterns, the algorithm remains efficient.
Conclusion:
The Rabin-Karp algorithm is well-suited for searching multiple patterns in large texts,
especially when hash collisions are minimal. While it is not as fast as other algorithms like KMP
or Boyer-Moore in the worst case (due to hash collisions), its simplicity and ability to handle
multiple patterns make it an effective choice for certain use cases.