Multi View Deep Learning

The document outlines a multi-view deep learning approach using YOLO models, emphasizing enhancements like weighted averaging for predictions, Non-Maximum Suppression (NMS), and feature fusion. It includes code snippets for loading models, processing video streams, and combining predictions from multiple views. The main function demonstrates how to implement these techniques in a practical application for object detection and tracking.

Uploaded by

securesentinel007

The multi-view deep learning approach with YOLO models can be extended with more sophisticated methods for training, combining predictions, and utilizing the strengths of each model.

Modifications and enhancements:

1. Weighted Averaging for Combining Predictions:

Instead of simple averaging, we can use weighted averaging where the weights are based on the confidence scores of the predictions.

2. Non-Maximum Suppression (NMS):

Applying NMS can help reduce overlapping bounding boxes from different views.

3. Feature Fusion:

Instead of just combining the final predictions, we can also fuse intermediate features from different models before making the final prediction.

4. Training with a Combined Dataset:

Train each model not only on its specific view but also on some data from other views to improve generalization.

5. Post-processing with Ensemble Techniques:

Use ensemble techniques like stacking to learn how to best combine the predictions from different models.
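
As a minimal sketch of item 1, assuming the same object has already been matched across views, the box coordinates can be averaged with weights proportional to each view's confidence score. The function name `fuse_boxes_by_confidence` and the sample values are hypothetical and not part of the original code.

```python
def fuse_boxes_by_confidence(boxes, scores):
    """Average [x1, y1, x2, y2] boxes, weighting each box by its confidence."""
    total = sum(scores)
    if total == 0:
        raise ValueError("at least one score must be positive")
    # Normalise the scores so the weights sum to 1
    weights = [s / total for s in scores]
    fused_box = [
        sum(w * box[k] for w, box in zip(weights, boxes))
        for k in range(4)
    ]
    # Confidence of the fused box: confidence-weighted mean of the scores
    fused_score = sum(w * s for w, s in zip(weights, scores))
    return fused_box, fused_score


# Hypothetical detections of one object seen from two views
boxes = [[10.0, 10.0, 50.0, 50.0], [20.0, 20.0, 60.0, 60.0]]
scores = [0.75, 0.25]
fused_box, fused_score = fuse_boxes_by_confidence(boxes, scores)
print(fused_box, fused_score)
```

With this scheme the fused box sits closer to the more confident view, whereas fixed per-view weights treat every detection from a view the same way.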
CODE

# yolo.py
from ultralytics import YOLO

# Load multiple models for different views
model_view1 = YOLO("yolov8n.yaml")  # model for view 1
model_view2 = YOLO("yolov8l.yaml")  # model for view 2
model_view3 = YOLO("yolov8x.yaml")  # model for view 3

# Train the models.
# Note: all three calls use the same dataset config ("config1.yaml");
# pass a view-specific config to each call if the views have separate datasets.
model_view1.train(data="config1.yaml", epochs=1)  # train the model for view 1
model_view2.train(data="config1.yaml", epochs=1)  # train the model for view 2
model_view3.train(data="config1.yaml", epochs=1)  # train the model for view 3
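
The combined-dataset idea (item 4) could be sketched as follows: each view keeps all of its own images and adds a small random sample from the other views. This is only an illustration; the helper `build_mixed_split`, the `other_fraction` parameter, and the file names are assumptions, not part of the original scripts.

```python
import random


def build_mixed_split(own_images, other_images, other_fraction=0.2, seed=0):
    """Return the view's own images plus a random sample from other views.

    other_fraction is the sample size relative to the number of own images.
    """
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    n_extra = min(len(other_images), int(len(own_images) * other_fraction))
    extra = rng.sample(other_images, n_extra)
    return list(own_images) + extra


# Hypothetical image lists for three views
view1 = [f"view1/img_{i:03d}.jpg" for i in range(10)]
view2 = [f"view2/img_{i:03d}.jpg" for i in range(10)]
view3 = [f"view3/img_{i:03d}.jpg" for i in range(10)]

mixed = build_mixed_split(view1, view2 + view3, other_fraction=0.2)
print(len(mixed))
```

The resulting list could then be written to a file and referenced from the dataset config used for that view, assuming the config format in use accepts a list of image paths.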

# Detect.py
import cv2
import numpy as np
from ultralytics import YOLO
from scipy.optimize import linear_sum_assignment


def load_models(model_paths):
    """
    Load YOLO models from the provided paths.

    Args:
        model_paths (list): List of paths to YOLO model weights.

    Returns:
        list: List of loaded YOLO models.
    """
    return [YOLO(model_path) for model_path in model_paths]


def open_cameras(camera_urls):
    """
    Open video capture streams for the provided camera URLs.

    Args:
        camera_urls (list): List of IP webcam URLs.

    Returns:
        list: List of video capture objects.
    """
    return [cv2.VideoCapture(url) for url in camera_urls]

def compute_iou(box1, boxes):
    """
    Compute Intersection over Union (IoU) between a single box and
    multiple boxes. Boxes are given as [x1, y1, x2, y2].

    Args:
        box1 (array): Coordinates of the first box.
        boxes (array): Coordinates of the other boxes, shape (N, 4).

    Returns:
        array: IoU values for the box compared to the other boxes.
    """
    # Corners of the intersection rectangle
    x1 = np.maximum(box1[0], boxes[:, 0])
    y1 = np.maximum(box1[1], boxes[:, 1])
    x2 = np.minimum(box1[2], boxes[:, 2])
    y2 = np.minimum(box1[3], boxes[:, 3])

    inter_area = np.maximum(0, x2 - x1) * np.maximum(0, y2 - y1)

    box1_area = (box1[2] - box1[0]) * (box1[3] - box1[1])
    boxes_area = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])

    # Guard against division by zero for degenerate (zero-area) boxes
    union_area = np.maximum(box1_area + boxes_area - inter_area, 1e-9)
    iou = inter_area / union_area

    return iou

def nms(predictions, iou_threshold=0.5):
    """
    Perform Non-Maximum Suppression (NMS) on the predictions.

    Args:
        predictions (list): List of prediction dictionaries with
            'boxes' and 'scores'.
        iou_threshold (float): IoU threshold for NMS.

    Returns:
        list: List of filtered predictions after NMS.
    """
    boxes = np.array([pred['boxes'] for pred in predictions])
    scores = np.array([pred['scores'] for pred in predictions])

    # Process boxes from highest to lowest score
    indices = np.argsort(scores)[::-1]
    keep_boxes = []

    while len(indices) > 0:
        current_index = indices[0]
        keep_boxes.append(current_index)
        if len(indices) == 1:
            break

        current_box = boxes[current_index]
        remaining_boxes = boxes[indices[1:]]
        ious = compute_iou(current_box, remaining_boxes)
        # Drop every remaining box that overlaps the kept box too much
        indices = indices[1:][ious <= iou_threshold]

    return [predictions[i] for i in keep_boxes]

def combine_predictions(predictions_list, weights):
    """
    Combine predictions from multiple views using weighted averaging.

    Args:
        predictions_list (list): List of predictions from multiple
            views; each entry is a list of dictionaries with 'boxes',
            'scores' and 'labels'.
        weights (list): List of weights for averaging predictions,
            one per view.

    Returns:
        list: List of combined predictions after averaging and NMS.
    """
    if len(weights) != len(predictions_list):
        raise ValueError("Provide exactly one weight per view")

    combined_result = []
    # zip pairs the i-th detection of every view, so detections are assumed
    # to be listed in the same order in each view; unpaired ones are dropped
    for preds in zip(*predictions_list):
        combined_boxes = np.average([pred['boxes'] for pred in preds],
                                    axis=0, weights=weights)
        combined_scores = np.average([pred['scores'] for pred in preds],
                                     axis=0, weights=weights)
        # Take the label from the most confident view
        combined_labels = max(preds, key=lambda p: p['scores'])['labels']

        combined_result.append({
            'boxes': combined_boxes,
            'scores': combined_scores,
            'labels': combined_labels
        })
    return nms(combined_result)

def track_objects(detections, prev_detections, iou_threshold=0.3):
    """
    Track objects across frames using detected boxes and IoU matching.

    Args:
        detections (list): Current frame detections.
        prev_detections (list): Previous frame detections.
        iou_threshold (float): IoU threshold for matching detections.

    Returns:
        list: List of matched object indices between current and
            previous detections.
    """
    if len(detections) == 0 or len(prev_detections) == 0:
        return []

    iou_matrix = np.zeros((len(detections), len(prev_detections)),
                          dtype=np.float32)
    for i, det in enumerate(detections):
        for j, prev_det in enumerate(prev_detections):
            # compute_iou returns a one-element array here; take the scalar
            iou_matrix[i, j] = compute_iou(
                det['boxes'], np.array([prev_det['boxes']]))[0]

    # Hungarian assignment; negate the IoU so that it is maximised
    row_ind, col_ind = linear_sum_assignment(-iou_matrix)

    matches = []
    for i, j in zip(row_ind, col_ind):
        if iou_matrix[i, j] >= iou_threshold:
            matches.append((i, j))

    return matches

def main():
    # Get IP webcam URLs from the user
    ip_webcam_urls = input(
        "Enter the IP webcam URLs separated by commas "
        "(e.g., http://<IP_ADDRESS1>:<PORT>/video,http://<IP_ADDRESS2>:<PORT>/video): "
    ).split(',')

    # Paths to the YOLO models for each view
    model_paths = [
        r"C:\Users\SRIKANTH\PycharmProjects\yolov8\runs\detect\train2\weights\best.pt",
        r"C:\Users\SRIKANTH\PycharmProjects\yolov8\runs\detect\train3\weights\best.pt",
        # Add paths to other models as needed
    ]

    # Load YOLO models
    models = load_models(model_paths)

    # Open connections to the IP webcams
    caps = open_cameras(ip_webcam_urls)

    # Check if all video streams opened successfully
    if not all(cap.isOpened() for cap in caps):
        print("Error: Could not open one or more video streams")
        return
    print("Successfully opened video streams")

    # Store previous detections for tracking
    prev_detections = []

    while True:
        frames = []
        for cap in caps:
            ret, frame = cap.read()
            if not ret:
                print("Error: Failed to capture image from one of the streams")
                break
            frames.append(frame)

        # Stop if any stream failed to deliver a frame
        if len(frames) != len(caps):
            break

        # Perform YOLO detection on each frame and convert each Results
        # object into the list of {'boxes', 'scores', 'labels'} dictionaries
        # that combine_predictions() and nms() expect
        predictions = []
        for model, frame in zip(models, frames):
            result = model(frame, save=False)[0]
            predictions.append([
                {'boxes': box, 'scores': float(score), 'labels': int(label)}
                for box, score, label in zip(
                    result.boxes.xyxy.cpu().numpy(),
                    result.boxes.conf.cpu().numpy(),
                    result.boxes.cls.cpu().numpy(),
                )
            ])

        # Combine predictions from all views using weighted averaging.
        # One weight per view is required, so the list is trimmed to the
        # number of views (np.average normalises the weights itself).
        weights = [0.3, 0.4, 0.3][:len(predictions)]  # Adjust weights as needed
        combined_predictions = combine_predictions(predictions, weights)

        # Track objects across frames
        matches = track_objects(combined_predictions, prev_detections)
        prev_detections = combined_predictions

        # Plot the combined results on the frames. Drawing the averaged
        # boxes on every view assumes the views share a coordinate frame.
        for view_index, frame in enumerate(frames):
            for pred in combined_predictions:
                x1, y1, x2, y2 = (int(v) for v in pred['boxes'])
                cv2.rectangle(frame, (x1, y1), (x2, y2), (0, 255, 0), 2)
            cv2.imshow(f'YOLO Detection (view {view_index + 1})', frame)

        # Exit the loop if 'q' is pressed
        if cv2.waitKey(1) & 0xFF == ord('q'):
            break

    # Release the video capture objects and close display windows
    for cap in caps:
        cap.release()
    cv2.destroyAllWindows()


if __name__ == "__main__":
    main()
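
track_objects returns (current index, previous index) pairs, but the listing does not do anything further with them. One possible way to turn those pairs into persistent track IDs is sketched below. The function `assign_track_ids` and its arguments are hypothetical additions for illustration only.

```python
def assign_track_ids(matches, prev_ids, num_detections, next_id):
    """Carry track IDs over from the previous frame.

    matches: list of (current_index, previous_index) pairs.
    prev_ids: track ID of each previous-frame detection.
    Returns the IDs for the current detections and the next unused ID.
    """
    ids = [None] * num_detections
    # Matched detections inherit the ID of their previous-frame partner
    for cur, prev in matches:
        ids[cur] = prev_ids[prev]
    # Unmatched detections are treated as new objects
    for i in range(num_detections):
        if ids[i] is None:
            ids[i] = next_id
            next_id += 1
    return ids, next_id


# Hypothetical example: two known tracks, three detections in the new frame
prev_ids = [0, 1]
matches = [(0, 1), (2, 0)]
ids, next_id = assign_track_ids(matches, prev_ids, 3, next_id=2)
print(ids, next_id)
```

In the main loop, `prev_ids` would be stored alongside `prev_detections` and updated after every frame.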
