0% found this document useful (0 votes)

13 views14 pages

3D Convolutional Autoencoder

Uploaded by

Boch

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views14 pages

3D Convolutional Autoencoder

Uploaded by

Boch

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

from google.

colab import drive

drive.mount('/content/drive')

Drive already mounted at /content/drive; to attempt to forcibly

remount, call drive.mount("/content/drive", force_remount=True).

gz_file_path ='/content/drive/MyDrive/UCSD_Anomaly_Dataset.tar.gz'

import tarfile

# Extract the .tar.gz file

with tarfile.open(gz_file_path, 'r:gz') as tar:
tar.extractall('/content/UCSD_Anomaly_Dataset')

!pip install tensorflow

!pip install opencv-python

Requirement already satisfied: tensorflow in

/usr/local/lib/python3.10/dist-packages (2.17.0)
Requirement already satisfied: absl-py>=1.0.0 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (1.4.0)
Requirement already satisfied: astunparse>=1.6.0 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (1.6.3)
Requirement already satisfied: flatbuffers>=24.3.25 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (24.3.25)
Requirement already satisfied: gast!=0.5.0,!=0.5.1,!=0.5.2,>=0.2.1
in /usr/local/lib/python3.10/dist-packages (from tensorflow) (0.6.0)
Requirement already satisfied: google-pasta>=0.1.1 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (0.2.0)
Requirement already satisfied: h5py>=3.10.0 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (3.12.1)
Requirement already satisfied: libclang>=13.0.0 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (18.1.1)
Requirement already satisfied: ml-dtypes<0.5.0,>=0.3.1 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (0.4.1)
Requirement already satisfied: opt-einsum>=2.3.2 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (3.4.0)
Requirement already satisfied: packaging in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (24.1)
Requirement already satisfied: protobuf!=4.21.0,!=4.21.1,!=4.21.2,!
=4.21.3,!=4.21.4,!=4.21.5,<5.0.0dev,>=3.20.3 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (3.20.3)
Requirement already satisfied: requests<3,>=2.21.0 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (2.32.3)
Requirement already satisfied: setuptools in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (75.1.0)
Requirement already satisfied: six>=1.12.0 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (1.16.0)
Requirement already satisfied: termcolor>=1.1.0 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (2.5.0)
Requirement already satisfied: typing-extensions>=3.6.6 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (4.12.2)
Requirement already satisfied: wrapt>=1.11.0 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (1.16.0)
Requirement already satisfied: grpcio<2.0,>=1.24.3 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (1.64.1)
Requirement already satisfied: tensorboard<2.18,>=2.17 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (2.17.0)
Requirement already satisfied: keras>=3.2.0 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (3.4.1)
Requirement already satisfied: tensorflow-io-gcs-filesystem>=0.23.1 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (0.37.1)
Requirement already satisfied: numpy<2.0.0,>=1.23.5 in
/usr/local/lib/python3.10/dist-packages (from tensorflow) (1.26.4)
Requirement already satisfied: wheel<1.0,>=0.23.0 in
/usr/local/lib/python3.10/dist-packages (from astunparse>=1.6.0-
>tensorflow) (0.44.0)
Requirement already satisfied: rich in /usr/local/lib/python3.10/dist-
packages (from keras>=3.2.0->tensorflow) (13.9.3)
Requirement already satisfied: namex in
/usr/local/lib/python3.10/dist-packages (from keras>=3.2.0-
>tensorflow) (0.0.8)
Requirement already satisfied: optree in
/usr/local/lib/python3.10/dist-packages (from keras>=3.2.0-
>tensorflow) (0.13.0)
Requirement already satisfied: charset-normalizer<4,>=2 in
/usr/local/lib/python3.10/dist-packages (from requests<3,>=2.21.0-
>tensorflow) (3.4.0)
Requirement already satisfied: idna<4,>=2.5 in
/usr/local/lib/python3.10/dist-packages (from requests<3,>=2.21.0-
>tensorflow) (3.10)
Requirement already satisfied: urllib3<3,>=1.21.1 in
/usr/local/lib/python3.10/dist-packages (from requests<3,>=2.21.0-
>tensorflow) (2.2.3)
Requirement already satisfied: certifi>=2017.4.17 in
/usr/local/lib/python3.10/dist-packages (from requests<3,>=2.21.0-
>tensorflow) (2024.8.30)
Requirement already satisfied: markdown>=2.6.8 in
/usr/local/lib/python3.10/dist-packages (from tensorboard<2.18,>=2.17-
>tensorflow) (3.7)
Requirement already satisfied: tensorboard-data-server<0.8.0,>=0.7.0
in /usr/local/lib/python3.10/dist-packages (from
tensorboard<2.18,>=2.17->tensorflow) (0.7.2)
Requirement already satisfied: werkzeug>=1.0.1 in
/usr/local/lib/python3.10/dist-packages (from tensorboard<2.18,>=2.17-
>tensorflow) (3.0.6)
Requirement already satisfied: MarkupSafe>=2.1.1 in
/usr/local/lib/python3.10/dist-packages (from werkzeug>=1.0.1-
>tensorboard<2.18,>=2.17->tensorflow) (3.0.2)
Requirement already satisfied: markdown-it-py>=2.2.0 in
/usr/local/lib/python3.10/dist-packages (from rich->keras>=3.2.0-
>tensorflow) (3.0.0)
Requirement already satisfied: pygments<3.0.0,>=2.13.0 in
/usr/local/lib/python3.10/dist-packages (from rich->keras>=3.2.0-
>tensorflow) (2.18.0)
Requirement already satisfied: mdurl~=0.1 in
/usr/local/lib/python3.10/dist-packages (from markdown-it-py>=2.2.0-
>rich->keras>=3.2.0->tensorflow) (0.1.2)
Requirement already satisfied: opencv-python in
/usr/local/lib/python3.10/dist-packages (4.10.0.84)
Requirement already satisfied: numpy>=1.21.2 in
/usr/local/lib/python3.10/dist-packages (from opencv-python) (1.26.4)

#!pip install matplotlib==3.4.3

import cv2
import numpy as np
import os
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Conv2D, MaxPooling2D,
UpSampling2D, Input
from tensorflow.keras.optimizers import Adam
import matplotlib.pyplot as plt
from sklearn.metrics import mean_squared_error

from PIL import Image

def load_and_preprocess_frames(directory_path, frame_height=160,

frame_width=160):
all_videos = []

for video_folder in sorted(os.listdir(directory_path)):

video_path = os.path.join(directory_path, video_folder)

if os.path.isdir(video_path):
frames = []

for filename in sorted(os.listdir(video_path)):

frame_path = os.path.join(video_path, filename)

if filename.lower().endswith('.tif'):
# Open the image, resize, and convert to grayscale
if needed
with Image.open(frame_path) as img:
img = img.resize((frame_width, frame_height))
# Resize to (128, 128)
frame = np.array(img) # Convert to numpy
array
frame = np.expand_dims(frame, axis=-1) # Add
channel dimension for grayscale
frame = frame / 255.0 # Normalize to [0, 1]
frames.append(frame)

if frames:
all_videos.append(np.array(frames))

return all_videos

# Example usage
train_directory_ped1 =
'/content/UCSD_Anomaly_Dataset/UCSD_Anomaly_Dataset.v1p2/UCSDped1/
Train'
train_directory_ped2 =
'/content/UCSD_Anomaly_Dataset/UCSD_Anomaly_Dataset.v1p2/UCSDped2/
Train'

train_videos_ped1 = load_and_preprocess_frames(train_directory_ped1)
train_videos_ped2 = load_and_preprocess_frames(train_directory_ped2)
train_videos = train_videos_ped1 + train_videos_ped2

# Display a sample of frames to verify the loading process

def display_sample_frames(videos, num_frames=5):
for i, video_frames in enumerate(videos[:1]): # Display frames
from the first video only
plt.figure(figsize=(15, 5))
for j in range(min(num_frames, len(video_frames))):
plt.subplot(1, num_frames, j + 1)
plt.imshow(video_frames[j])
plt.axis('off')
plt.title(f"Video {i + 1} - Frame {j + 1}")
plt.show()

display_sample_frames(train_videos)
import numpy as np

def split_into_sequences(video_frames, sequence_length=16):

"""
Splits the frames of a video into sequences of a specified length.

Parameters:
- video_frames: np.array, the frames of a single video (e.g.,
shape (num_frames, height, width, channels))
- sequence_length: int, the number of frames per sequence

Returns:
- sequences: np.array, shape (num_sequences, sequence_length,
height, width, channels)
"""
num_frames = len(video_frames)
sequences = []

# Slide over frames to create sequences

for i in range(0, num_frames - sequence_length + 1,
sequence_length):
sequence = video_frames[i:i + sequence_length]
sequences.append(sequence)

return np.array(sequences)

# Define directories for both training datasets

train_directory_ped1 =
'/content/UCSD_Anomaly_Dataset/UCSD_Anomaly_Dataset.v1p2/UCSDped1/
Train'
train_directory_ped2 =
'/content/UCSD_Anomaly_Dataset/UCSD_Anomaly_Dataset.v1p2/UCSDped2/
Train'

# Load both datasets

train_videos_ped1 = load_and_preprocess_frames(train_directory_ped1)
train_videos_ped2 = load_and_preprocess_frames(train_directory_ped2)

# Combine videos from both datasets

train_videos = train_videos_ped1 + train_videos_ped2

# Concatenate all sequences from all videos in both datasets

train_3d_data = np.concatenate([split_into_sequences(video,
sequence_length=16) for video in train_videos], axis=0)
print("Prepared training data shape:", train_3d_data.shape)

Prepared training data shape: (562, 16, 160, 160, 1)

from tensorflow.keras.models import Model

from tensorflow.keras.layers import Conv3D, MaxPooling3D,
UpSampling3D, Input, Activation
from tensorflow.keras.optimizers import Adam

# 3D CNN Autoencoder Model

def build_3d_cnn_autoencoder(input_shape=(16, 160, 160, 1)):
input_layer = Input(shape=input_shape)

# Encoder
x = Conv3D(32, (3, 3, 3), padding='same')(input_layer)
x = Activation('relu')(x)
x = MaxPooling3D((2, 2, 2), padding='same')(x)

x = Conv3D(32, (3, 3, 3), padding='same')(x)

x = Activation('relu')(x)
x = MaxPooling3D((2, 2, 2), padding='same')(x)

# Decoder
x = Conv3D(32, (3, 3, 3), padding='same')(x)
x = Activation('relu')(x)
x = UpSampling3D((2, 2, 2))(x)

x = Conv3D(32, (3, 3, 3), padding='same')(x)

x = Activation('relu')(x)
x = UpSampling3D((2, 2, 2))(x)

decoded = Conv3D(1, (3, 3, 3), activation='sigmoid',

padding='same')(x) # Final layer with sigmoid for [0, 1] range

# Autoencoder Model
autoencoder = Model(input_layer, decoded)
autoencoder.compile(optimizer=Adam(learning_rate=0.001),
loss='mse')

return autoencoder

# Example usage
input_shape = (16, 160, 160, 1) # 16 consecutive frames, 160x160
resolution, 1 channel (grayscale)
cnn_3d_autoencoder = build_3d_cnn_autoencoder(input_shape=input_shape)

# Encoder
# x = Conv3D(32, (3, 3, 3), activation='relu', padding='same')
(input_layer)
# x = MaxPooling3D((2, 2, 2), padding='same')(x)
# x = Conv3D(64, (3, 3, 3), activation='relu', padding='same')(x)
# x = MaxPooling3D((2, 2, 2), padding='same')(x)
# encoded = Conv3D(277, (3, 3, 3), activation='relu',
padding='same')(x)

# # Decoder
# x = UpSampling3D((2, 2, 2))(encoded)
# x = Conv3D(64, (3, 3, 3), activation='relu', padding='same')(x)
# x = UpSampling3D((2, 2, 2))(x)
# decoded = Conv3D(1, (3, 3, 3), activation='sigmoid',
padding='same')(x) # Single channel for grayscale output

# autoencoder = Model(input_layer, decoded)

# autoencoder.compile(optimizer=Adam(), loss='mse')
# return autoencoder

cnn_3d_autoencoder.compile(optimizer='adam', loss='mse')

# Assuming train_3d_data is your full dataset

train_size = int(0.8 * len(train_3d_data))
train_data = train_3d_data[:train_size]
val_data = train_3d_data[train_size:]

from tensorflow.keras.callbacks import EarlyStopping, ModelCheckpoint

# Set up early stopping and model checkpoint to monitor validation

loss
early_stopping = EarlyStopping(monitor='val_loss', patience=3,
verbose=1)

cnn_3d_autoencoder.fit(
train_data, train_data,
epochs=10,
batch_size=2,
shuffle=True,
validation_data=(val_data, val_data),
callbacks=[early_stopping]
)

Epoch 1/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 27s 84ms/step - loss: 0.0158 - val_loss:
0.0013
Epoch 2/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 29s 58ms/step - loss: 0.0023 - val_loss:
9.8407e-04
Epoch 3/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 13s 58ms/step - loss: 0.0018 - val_loss:
8.5347e-04
Epoch 4/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 21s 59ms/step - loss: 0.0017 - val_loss:
7.8221e-04
Epoch 5/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 14s 61ms/step - loss: 0.0015 - val_loss:
7.4683e-04
Epoch 6/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 20s 60ms/step - loss: 0.0014 - val_loss:
7.0392e-04
Epoch 7/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 21s 61ms/step - loss: 0.0014 - val_loss:
7.2750e-04
Epoch 8/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 20s 61ms/step - loss: 0.0013 - val_loss:
6.5005e-04
Epoch 9/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 21s 61ms/step - loss: 0.0012 - val_loss:
6.2748e-04
Epoch 10/10
225/225 ━━━━━━━━━━━━━━━━━━━━ 14s 62ms/step - loss: 0.0012 - val_loss:
6.3976e-04

<keras.src.callbacks.history.History at 0x79e7ed803910>

import matplotlib.pyplot as plt

from PIL import Image
import numpy as np
import os

def load_and_preprocess_frames(directory_path, frame_height=160,

frame_width=160):
all_videos = []

for video_folder in sorted(os.listdir(directory_path)):

video_path = os.path.join(directory_path, video_folder)

if os.path.isdir(video_path):
frames = []

for filename in sorted(os.listdir(video_path)):

frame_path = os.path.join(video_path, filename)

if filename.lower().endswith('.tif'):
try:
# Attempt to open, resize, and normalize the
image
with Image.open(frame_path) as img:
img = img.resize((frame_width,
frame_height)) # Resize to (128, 128)
frame = np.array(img) # Convert to numpy
array
frame = np.expand_dims(frame, axis=-1) #
Add channel dimension for grayscale
frame = frame / 255.0 # Normalize to [0,
1]
frames.append(frame)
except Exception as e:
# Print an error message for any file that
fails to load
print(f"Error loading file {frame_path}: {e}")

if frames:
all_videos.append(np.array(frames))

return all_videos

# Example usage for loading the test set

test_directory_ped1 =
'/content/UCSD_Anomaly_Dataset/UCSD_Anomaly_Dataset.v1p2/UCSDped1/
Test'
test_directory_ped2 =
'/content/UCSD_Anomaly_Dataset/UCSD_Anomaly_Dataset.v1p2/UCSDped2/
Test'

test_videos_ped1 = load_and_preprocess_frames(test_directory_ped1)
test_videos_ped2 = load_and_preprocess_frames(test_directory_ped2)

test_videos = test_videos_ped1 + test_videos_ped2

Error loading file

/content/UCSD_Anomaly_Dataset/UCSD_Anomaly_Dataset.v1p2/UCSDped1/Test/
Test017/142.tif: -2

test_3d_data = np.concatenate([split_into_sequences(video,
sequence_length=16) for video in test_videos], axis=0)
print("Prepared test data shape:", test_3d_data.shape)

Prepared test data shape: (554, 16, 160, 160, 1)

reconstructed_test_data = cnn_3d_autoencoder.predict(test_3d_data)

18/18 ━━━━━━━━━━━━━━━━━━━━ 11s 326ms/step

from sklearn.metrics import mean_squared_error

# Calculate reconstruction error for each sequence

reconstruction_errors = [
mean_squared_error(original.flatten(), reconstructed.flatten())
for original, reconstructed in zip(test_3d_data,
reconstructed_test_data)
]

import numpy as np

threshold = np.percentile(reconstruction_errors, 30
)
print("Anomaly threshold:", threshold)

Anomaly threshold: 0.0011392208955404764

anomalies = [error > threshold for error in reconstruction_errors]

import matplotlib.pyplot as plt

plt.plot(reconstruction_errors, label='Reconstruction Error')

plt.axhline(y=threshold, color='r', linestyle='--', label='Anomaly
Threshold')
plt.xlabel("Sequence")
plt.ylabel("Reconstruction Error")
plt.title("Reconstruction Error on Test Data")
plt.legend()
plt.show()

# Define ground truth for UCSDped1

ground_truth_frames_ped1 = [
list(range(60, 153)),
list(range(50, 176)),
list(range(91, 201)),
list(range(31, 169)),
list(range(5, 91)) + list(range(140, 201)),
list(range(1, 101)) + list(range(110, 201)),
list(range(1, 176)),
list(range(1, 95)),
list(range(1, 49)),
list(range(1, 141)),
list(range(70, 166)),
list(range(130, 201)),
list(range(1, 157)),
list(range(1, 201)),
list(range(138, 201)),
list(range(123, 201)),
list(range(1, 48)),
list(range(54, 121)),
list(range(64, 139)),
list(range(45, 176)),
list(range(31, 201)),
list(range(16, 108)),
list(range(8, 166)),
list(range(50, 172)),
list(range(40, 136)),
list(range(77, 145)),
list(range(10, 123)),
list(range(105, 201)),
list(range(1, 16)) + list(range(45, 114)),
list(range(175, 201)),
list(range(1, 181)),
list(range(1, 53)) + list(range(65, 116)),
list(range(5, 166)),
list(range(1, 122)),
list(range(86, 201)),
list(range(15, 109))
]

# Define ground truth for UCSDped2

ground_truth_frames_ped2 = [
list(range(61, 180)),
list(range(95, 180)),
list(range(1, 146)),
list(range(31, 180)),
list(range(1, 129)),
list(range(1, 162)),
list(range(46, 180)),
list(range(1, 180)),
list(range(1, 120)),
list(range(1, 150)),
list(range(1, 180)),
list(range(88, 180))
]

# Combine both ground truth annotations

ground_truth_frames = ground_truth_frames_ped1 +
ground_truth_frames_ped2

# Convert reconstruction errors into binary predictions based on the

threshold
binary_predictions = [1 if error > threshold else 0 for error in
reconstruction_errors]

# Group predictions by video

sequence_length = 16
num_sequences_per_video = len(test_videos[0]) // sequence_length #
Assuming each video has the same number of frames

# Organize binary predictions by video

model_predictions = [
binary_predictions[i * num_sequences_per_video : (i + 1) *
num_sequences_per_video]
for i in range(len(test_videos))
]

from sklearn.metrics import precision_score, recall_score, f1_score

# Initialize lists to collect all ground truth labels and model

predictions
all_gt_labels = []
all_model_labels = []

# Iterate over each test video

for i, gt_frames in enumerate(ground_truth_frames):
# Get binary predictions for the current video, expanded to match
frame-level granularity
video_predictions = []
for seq_pred in model_predictions[i]:
video_predictions.extend([seq_pred] * sequence_length) #
Repeat each sequence prediction across its frames

# Generate ground truth labels for each frame in the video

gt_labels = [1 if frame in gt_frames else 0 for frame in
range(len(video_predictions))]

# Collect the ground truth and predictions for overall evaluation

all_gt_labels.extend(gt_labels)
all_model_labels.extend(video_predictions[:len(gt_labels)]) #
Ensure predictions align with gt labels length

# Calculate and display evaluation metrics

precision = precision_score(all_gt_labels, all_model_labels)
recall = recall_score(all_gt_labels, all_model_labels)
f1 = f1_score(all_gt_labels, all_model_labels)

print(f"Precision: {precision:.4f}")
print(f"Recall: {recall:.4f}")
print(f"F1 Score: {f1:.4f}")

Precision: 0.6432
Recall: 0.7471
F1 Score: 0.6912

from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay

import matplotlib.pyplot as plt

# Assuming `all_gt_labels` are the ground truth labels and

`all_model_labels` are your final binary predictions
# Calculate the confusion matrix
cm = confusion_matrix(all_gt_labels, all_model_labels)

# Plot the confusion matrix

disp = ConfusionMatrixDisplay(confusion_matrix=cm,
display_labels=["Normal", "Anomaly"])
disp.plot(cmap="Blues")
plt.title("Confusion Matrix for Anomaly Detection")
plt.show()

Recall (0.7471): This moderately high recall indicates that the model successfully detects most
of the actual anomalies, although it may miss a few. A recall of 0.7471 means that the model is
generally effective in identifying anomalous sequences but may occasionally let some anomalies
go undetected.

Precision (0.6432): With a precision of 0.6432, the model is reasonably selective in identifying
anomalies. However, it does flag some normal sequences as anomalies, which suggests there
are still false positives. This precision level indicates a good balance where the model avoids
being overly sensitive, but it could still be improved if false alarms are a concern.

F1 Score (0.6912): The F1 score of 0.6912 reflects a solid balance between recall and precision.
This score indicates that the model is fairly good at both capturing actual anomalies and
avoiding false positives, making it a well-rounded choice for general anomaly detection.

Delta Module 1 Sample Papers Key PDF
100% (3)
Delta Module 1 Sample Papers Key PDF
19 pages
Instruction Manual For Flap-Gate Barriers Selection and Axioma
No ratings yet
Instruction Manual For Flap-Gate Barriers Selection and Axioma
47 pages
LSTM Autoencoder
No ratings yet
LSTM Autoencoder
8 pages
Video Api Endpoint N
No ratings yet
Video Api Endpoint N
7 pages
CCTV Anomaly Detection Guide
No ratings yet
CCTV Anomaly Detection Guide
39 pages
A 1
No ratings yet
A 1
9 pages
Csc413 Project Semantic Segmentation
No ratings yet
Csc413 Project Semantic Segmentation
84 pages
Kijai ComfyUI VEnhancer
No ratings yet
Kijai ComfyUI VEnhancer
76 pages
Sample Code-Structure For Anomaly Detection
No ratings yet
Sample Code-Structure For Anomaly Detection
8 pages
stable_diffusion_report_updated
No ratings yet
stable_diffusion_report_updated
19 pages
Handwriting Recognition
No ratings yet
Handwriting Recognition
31 pages
Task VIII Quantum Vision Transformer
No ratings yet
Task VIII Quantum Vision Transformer
1 page
7.copy of Text To Image Generation With LLM With Hugging Face - Ipynb
No ratings yet
7.copy of Text To Image Generation With LLM With Hugging Face - Ipynb
1,156 pages
Cctvmodel
No ratings yet
Cctvmodel
4 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
4 pages
Finalised Question 2
No ratings yet
Finalised Question 2
24 pages
IPCV
No ratings yet
IPCV
26 pages
Performance Testing
No ratings yet
Performance Testing
15 pages
Bit 22034
No ratings yet
Bit 22034
18 pages
Next With Continuos Run
No ratings yet
Next With Continuos Run
4 pages
21BCP167 Ai 9
No ratings yet
21BCP167 Ai 9
10 pages
TF Mannual
No ratings yet
TF Mannual
19 pages
Facedetection
No ratings yet
Facedetection
16 pages
tensor flow programs
No ratings yet
tensor flow programs
30 pages
GenAI - Lab-File - Darab Khan 22SCSE1480055
No ratings yet
GenAI - Lab-File - Darab Khan 22SCSE1480055
31 pages
DETECTCAMERA
No ratings yet
DETECTCAMERA
3 pages
Improved - FCC - Cat - Dog - Ipynb - Colab
No ratings yet
Improved - FCC - Cat - Dog - Ipynb - Colab
12 pages
Dcgan
No ratings yet
Dcgan
9 pages
Dinushasan Courseproject04: Sign in
No ratings yet
Dinushasan Courseproject04: Sign in
19 pages
Image Classification Code
No ratings yet
Image Classification Code
4 pages
Tensorflow and Keras Apis: 0.1 Computer Vision: Neural Networks and Deep Learning
No ratings yet
Tensorflow and Keras Apis: 0.1 Computer Vision: Neural Networks and Deep Learning
32 pages
Start
No ratings yet
Start
3 pages
Project Guidelines - AIML
No ratings yet
Project Guidelines - AIML
30 pages
Requirements Dev
No ratings yet
Requirements Dev
7 pages
C3W2 - Assignment - Ipynb - Colaboratory
No ratings yet
C3W2 - Assignment - Ipynb - Colaboratory
39 pages
Wild Fire CNN Accuracy 95
No ratings yet
Wild Fire CNN Accuracy 95
15 pages
Exp 10 Sentiment Analysis BERT
No ratings yet
Exp 10 Sentiment Analysis BERT
5 pages
Image Caption2
No ratings yet
Image Caption2
9 pages
Trash Detection
No ratings yet
Trash Detection
17 pages
Yolo Detect
No ratings yet
Yolo Detect
5 pages
TMA01 Question 1 (45 Marks)
No ratings yet
TMA01 Question 1 (45 Marks)
31 pages
Cse519 hw3
No ratings yet
Cse519 hw3
50 pages
Lab 4-Image Segmentation Using U-Net
No ratings yet
Lab 4-Image Segmentation Using U-Net
9 pages
AI Functions
No ratings yet
AI Functions
1 page
Assignment 2.3.1 Transfer Learning
No ratings yet
Assignment 2.3.1 Transfer Learning
7 pages
Final Question1 With Results
No ratings yet
Final Question1 With Results
21 pages
Capture D'écran . 2023-05-15 À 16.45.57
No ratings yet
Capture D'écran . 2023-05-15 À 16.45.57
1 page
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
Inference
No ratings yet
Inference
8 pages
TensorFlow深度学习项目实战: Chinese Edition
From Everand
TensorFlow深度学习项目实战: Chinese Edition
Posts & Telecom Press
No ratings yet
Finalised Question 1
No ratings yet
Finalised Question 1
40 pages
Object Detection Webcam
No ratings yet
Object Detection Webcam
3 pages
Assignment3 AL
No ratings yet
Assignment3 AL
23 pages
Object Detection Webcam
No ratings yet
Object Detection Webcam
3 pages
Yolo V8
No ratings yet
Yolo V8
16 pages
HRMS Project Report
No ratings yet
HRMS Project Report
21 pages
Train - Model
No ratings yet
Train - Model
2 pages
Wa0029.
No ratings yet
Wa0029.
11 pages
Tensorflow Object Detection API Tutorial Readthedocs Io en Latest
No ratings yet
Tensorflow Object Detection API Tutorial Readthedocs Io en Latest
65 pages
6G1_1753692346
No ratings yet
6G1_1753692346
19 pages
Srafvana
No ratings yet
Srafvana
6 pages
Deforum Stable Diffusion - Ipynb
No ratings yet
Deforum Stable Diffusion - Ipynb
12 pages
A Practical Applications of Virtual PLC Using LabVIEW Software
No ratings yet
A Practical Applications of Virtual PLC Using LabVIEW Software
6 pages
Dell Technologies Cloud Implementation
No ratings yet
Dell Technologies Cloud Implementation
26 pages
20 Repair of Fire Damaged Structures PDF
No ratings yet
20 Repair of Fire Damaged Structures PDF
1 page
Viscoelasticity 01 Intro
No ratings yet
Viscoelasticity 01 Intro
4 pages
B7 - Control Relays - EN
No ratings yet
B7 - Control Relays - EN
28 pages
Syn MC Phasor Diagram-Part 1
No ratings yet
Syn MC Phasor Diagram-Part 1
67 pages
Karyl Resume
No ratings yet
Karyl Resume
3 pages
Office of The President MEMORANDUM No. 23, S. 2020 TO: CFCST Employees Date Issued: May 2, 2020
No ratings yet
Office of The President MEMORANDUM No. 23, S. 2020 TO: CFCST Employees Date Issued: May 2, 2020
1 page
Chapter 4 5 Isometric and Orthographic Sketching
No ratings yet
Chapter 4 5 Isometric and Orthographic Sketching
23 pages
Rauc Iom 14 - 06012007
No ratings yet
Rauc Iom 14 - 06012007
76 pages
AMSD
No ratings yet
AMSD
6 pages
OP5 OPP Order Form-INS-OHS-10-2022-00026-V8-EW1 V1
No ratings yet
OP5 OPP Order Form-INS-OHS-10-2022-00026-V8-EW1 V1
2 pages
A Tiger in The Zoo
No ratings yet
A Tiger in The Zoo
3 pages
Ge 107 - M-1 STS
No ratings yet
Ge 107 - M-1 STS
3 pages
Pont Fog Quatra
No ratings yet
Pont Fog Quatra
60 pages
Las Grade 8 P.E.
No ratings yet
Las Grade 8 P.E.
8 pages
Dlp-Conditional Statements
100% (1)
Dlp-Conditional Statements
7 pages
Notice of Awardssss 22 23 Updated
No ratings yet
Notice of Awardssss 22 23 Updated
82 pages
Avner Greif
No ratings yet
Avner Greif
3 pages
s7200 System Manual Pinout
100% (1)
s7200 System Manual Pinout
1 page
Courier 6 SL
100% (1)
Courier 6 SL
12 pages
Rubric S
No ratings yet
Rubric S
2 pages
Amazon Logistics Provider
No ratings yet
Amazon Logistics Provider
3 pages
Jurgen Habermas's Theory of The Public Sphere and Its Transformation in The 21st Century
100% (1)
Jurgen Habermas's Theory of The Public Sphere and Its Transformation in The 21st Century
157 pages
Urban Issues
100% (1)
Urban Issues
147 pages
Experimental Research
No ratings yet
Experimental Research
22 pages
Agile Software Development Scrum
100% (2)
Agile Software Development Scrum
50 pages
Bullet Security Cameras
No ratings yet
Bullet Security Cameras
10 pages

3D Convolutional Autoencoder

Uploaded by

3D Convolutional Autoencoder

Uploaded by

from google.

colab import drive

Drive already mounted at /content/drive; to attempt to forcibly

# Extract the .tar.gz file

!pip install tensorflow

Requirement already satisfied: tensorflow in

#!pip install matplotlib==3.4.3

from PIL import Image

def load_and_preprocess_frames(directory_path, frame_height=160,

for video_folder in sorted(os.listdir(directory_path)):

for filename in sorted(os.listdir(video_path)):

# Display a sample of frames to verify the loading process

def split_into_sequences(video_frames, sequence_length=16):

# Slide over frames to create sequences

# Define directories for both training datasets

# Load both datasets

# Combine videos from both datasets

# Concatenate all sequences from all videos in both datasets

Prepared training data shape: (562, 16, 160, 160, 1)

from tensorflow.keras.models import Model

# 3D CNN Autoencoder Model

x = Conv3D(32, (3, 3, 3), padding='same')(x)

x = Conv3D(32, (3, 3, 3), padding='same')(x)

decoded = Conv3D(1, (3, 3, 3), activation='sigmoid',

# autoencoder = Model(input_layer, decoded)

# Assuming train_3d_data is your full dataset

from tensorflow.keras.callbacks import EarlyStopping, ModelCheckpoint

# Set up early stopping and model checkpoint to monitor validation

import matplotlib.pyplot as plt

def load_and_preprocess_frames(directory_path, frame_height=160,

for video_folder in sorted(os.listdir(directory_path)):

for filename in sorted(os.listdir(video_path)):

# Example usage for loading the test set

test_videos = test_videos_ped1 + test_videos_ped2

Error loading file

Prepared test data shape: (554, 16, 160, 160, 1)

18/18 ━━━━━━━━━━━━━━━━━━━━ 11s 326ms/step

from sklearn.metrics import mean_squared_error

# Calculate reconstruction error for each sequence

Anomaly threshold: 0.0011392208955404764

import matplotlib.pyplot as plt

plt.plot(reconstruction_errors, label='Reconstruction Error')

# Define ground truth for UCSDped1

# Define ground truth for UCSDped2

# Combine both ground truth annotations

# Convert reconstruction errors into binary predictions based on the

# Group predictions by video

# Organize binary predictions by video

from sklearn.metrics import precision_score, recall_score, f1_score

# Initialize lists to collect all ground truth labels and model

# Iterate over each test video

# Generate ground truth labels for each frame in the video

# Collect the ground truth and predictions for overall evaluation

# Calculate and display evaluation metrics

from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay

# Assuming `all_gt_labels` are the ground truth labels and

# Plot the confusion matrix

You might also like