
Image segmentation with a U-Net-like architecture

Author: fchollet
Date created: 2019/03/20
Last modified: 2020/04/20
Description: Image segmentation model trained from scratch on the Oxford Pets dataset.

ⓘ This example uses Keras 3

View in Colab • GitHub source
Download the data
# Download from the VGG site...
!wget https://www.robots.ox.ac.uk/~vgg/data/pets/data/images.tar.gz
!wget https://www.robots.ox.ac.uk/~vgg/data/pets/data/annotations.tar.gz

# ...or from the Oxford thor host:
!curl -O https://thor.robots.ox.ac.uk/datasets/pets/images.tar.gz
!curl -O https://thor.robots.ox.ac.uk/datasets/pets/annotations.tar.gz

!tar -xf images.tar.gz
!tar -xf annotations.tar.gz
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100  755M  100  755M    0     0  21.3M      0  0:00:35  0:00:35 --:--:-- 22.2M
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 18.2M  100 18.2M    0     0  7977k      0  0:00:02  0:00:02 --:--:-- 7974k
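If the downloads succeeded, the archives unpack into images/ and annotations/ in the working directory. As a quick sanity check (a sketch, not part of the original example):

import os

# Count the extracted files; the dataset ships 7,390 images with trimap masks.
print(len([f for f in os.listdir("images") if f.endswith(".jpg")]))
print(len(os.listdir("annotations/trimaps")))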

Prepare paths of input images and target segmentation masks

import os

input_dir = "images/"
target_dir = "annotations/trimaps/"
img_size = (160, 160)
num_classes = 3
batch_size = 32

input_img_paths = sorted(
    [
        os.path.join(input_dir, fname)
        for fname in os.listdir(input_dir)
        if fname.endswith(".jpg")
    ]
)
target_img_paths = sorted(
    [
        os.path.join(target_dir, fname)
        for fname in os.listdir(target_dir)
        if fname.endswith(".png") and not fname.startswith(".")
    ]
)

print("Number of samples:", len(input_img_paths))

for input_path, target_path in zip(input_img_paths[:10], target_img_paths[:10]):
    print(input_path, "|", target_path)

Number of samples: 7390
images/Abyssinian_1.jpg | annotations/trimaps/Abyssinian_1.png
images/Abyssinian_10.jpg | annotations/trimaps/Abyssinian_10.png
images/Abyssinian_100.jpg | annotations/trimaps/Abyssinian_100.png
images/Abyssinian_101.jpg | annotations/trimaps/Abyssinian_101.png
images/Abyssinian_102.jpg | annotations/trimaps/Abyssinian_102.png
images/Abyssinian_103.jpg | annotations/trimaps/Abyssinian_103.png
images/Abyssinian_104.jpg | annotations/trimaps/Abyssinian_104.png
images/Abyssinian_105.jpg | annotations/trimaps/Abyssinian_105.png
images/Abyssinian_106.jpg | annotations/trimaps/Abyssinian_106.png
images/Abyssinian_107.jpg | annotations/trimaps/Abyssinian_107.png

What does one input image and corresponding segmentation mask look like?

from IPython.display import Image, display
from keras.utils import load_img
from PIL import ImageOps

# Display input image #9
display(Image(filename=input_img_paths[9]))

# Display auto-contrast version of corresponding target (per-pixel categories)
img = ImageOps.autocontrast(load_img(target_img_paths[9]))
display(img)
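The mask is an Oxford-IIIT trimap: every pixel is labeled 1 (pet), 2 (background) or 3 (undefined border), which is why autocontrast is needed to make it visible. To confirm the raw label values, a small sketch (not part of the original example):

import numpy as np
from keras.utils import img_to_array

# Inspect the distinct pixel values of one trimap mask at its native size.
mask = img_to_array(load_img(target_img_paths[9], color_mode="grayscale"))
print(np.unique(mask))  # expected: [1. 2. 3.]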

Prepare dataset to load & vectorize batches of data


import keras
import numpy as np
from tensorflow import data as tf_data
from tensorflow import image as tf_image
from tensorflow import io as tf_io

def get_dataset(
    batch_size,
    img_size,
    input_img_paths,
    target_img_paths,
    max_dataset_len=None,
):
    """Returns a TF Dataset."""

    def load_img_masks(input_img_path, target_img_path):
        input_img = tf_io.read_file(input_img_path)
        input_img = tf_io.decode_png(input_img, channels=3)
        input_img = tf_image.resize(input_img, img_size)
        input_img = tf_image.convert_image_dtype(input_img, "float32")

        target_img = tf_io.read_file(target_img_path)
        target_img = tf_io.decode_png(target_img, channels=1)
        target_img = tf_image.resize(target_img, img_size, method="nearest")
        target_img = tf_image.convert_image_dtype(target_img, "uint8")

        # Ground truth labels are 1, 2, 3. Subtract one to make them 0, 1, 2:
        target_img -= 1
        return input_img, target_img

    # For faster debugging, limit the size of data
    if max_dataset_len:
        input_img_paths = input_img_paths[:max_dataset_len]
        target_img_paths = target_img_paths[:max_dataset_len]
    dataset = tf_data.Dataset.from_tensor_slices((input_img_paths, target_img_paths))
    dataset = dataset.map(load_img_masks, num_parallel_calls=tf_data.AUTOTUNE)
    return dataset.batch(batch_size)
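To verify what get_dataset yields, here is a short sketch (not in the original example) that pulls a single batch and prints its shapes and dtypes:

# Build a small dataset and inspect one batch.
preview = get_dataset(
    batch_size, img_size, input_img_paths, target_img_paths, max_dataset_len=64
)
for images, masks in preview.take(1):
    print(images.shape, images.dtype)  # (32, 160, 160, 3) float32
    print(masks.shape, masks.dtype)    # (32, 160, 160, 1) uint8, labels in {0, 1, 2}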


Prepare U-Net Xception-style model


from keras import layers

def get_model(img_size, num_classes):
    inputs = keras.Input(shape=img_size + (3,))

    ### [First half of the network: downsampling inputs] ###

    # Entry block
    x = layers.Conv2D(32, 3, strides=2, padding="same")(inputs)
    x = layers.BatchNormalization()(x)
    x = layers.Activation("relu")(x)

    previous_block_activation = x  # Set aside residual

    # Blocks 1, 2, 3 are identical apart from the feature depth.
    for filters in [64, 128, 256]:
        x = layers.Activation("relu")(x)
        x = layers.SeparableConv2D(filters, 3, padding="same")(x)
        x = layers.BatchNormalization()(x)

        x = layers.Activation("relu")(x)
        x = layers.SeparableConv2D(filters, 3, padding="same")(x)
        x = layers.BatchNormalization()(x)

        x = layers.MaxPooling2D(3, strides=2, padding="same")(x)

        # Project residual
        residual = layers.Conv2D(filters, 1, strides=2, padding="same")(
            previous_block_activation
        )
        x = layers.add([x, residual])  # Add back residual
        previous_block_activation = x  # Set aside next residual

    ### [Second half of the network: upsampling inputs] ###

    for filters in [256, 128, 64, 32]:
        x = layers.Activation("relu")(x)
        x = layers.Conv2DTranspose(filters, 3, padding="same")(x)
        x = layers.BatchNormalization()(x)

        x = layers.Activation("relu")(x)
        x = layers.Conv2DTranspose(filters, 3, padding="same")(x)
        x = layers.BatchNormalization()(x)

        x = layers.UpSampling2D(2)(x)

        # Project residual
        residual = layers.UpSampling2D(2)(previous_block_activation)
        residual = layers.Conv2D(filters, 1, padding="same")(residual)
        x = layers.add([x, residual])  # Add back residual
        previous_block_activation = x  # Set aside next residual

    # Add a per-pixel classification layer
    outputs = layers.Conv2D(num_classes, 3, activation="softmax", padding="same")(x)

    # Define the model
    model = keras.Model(inputs, outputs)
    return model


# Build model
model = get_model(img_size, num_classes)
model.summary()

Model: "functional_1"

┏━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━┓
┃ Layer (type) ┃ Output Shape ┃ Param # ┃ Connected to ┃
┡━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━┩
│ input_layer │ (None, 160, 160, │ 0 │ - │
│ (InputLayer) │ 3) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d (Conv2D) │ (None, 80, 80, │ 896 │ input_layer[0][0] │
│ │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalization │ (None, 80, 80, │ 128 │ conv2d[0][0] │
│ (BatchNormalizatio… │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation │ (None, 80, 80, │ 0 │ batch_normalization… │
│ (Activation) │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_1 │ (None, 80, 80, │ 0 │ activation[0][0] │
│ (Activation) │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ separable_conv2d │ (None, 80, 80, │ 2,400 │ activation_1[0][0] │
│ (SeparableConv2D) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 80, 80, │ 256 │ separable_conv2d[0]… │
│ (BatchNormalizatio… │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_2 │ (None, 80, 80, │ 0 │ batch_normalization… │
│ (Activation) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ separable_conv2d_1 │ (None, 80, 80, │ 4,736 │ activation_2[0][0] │
│ (SeparableConv2D) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 80, 80, │ 256 │ separable_conv2d_1[… │
│ (BatchNormalizatio… │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ max_pooling2d │ (None, 40, 40, │ 0 │ batch_normalization… │
│ (MaxPooling2D) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_1 (Conv2D) │ (None, 40, 40, │ 2,112 │ activation[0][0] │
│ │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ add (Add) │ (None, 40, 40, │ 0 │ max_pooling2d[0][0], │
│ │ 64) │ │ conv2d_1[0][0] │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_3 │ (None, 40, 40, │ 0 │ add[0][0] │
│ (Activation) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ separable_conv2d_2 │ (None, 40, 40, │ 8,896 │ activation_3[0][0] │
│ (SeparableConv2D) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 40, 40, │ 512 │ separable_conv2d_2[… │
│ (BatchNormalizatio… │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_4 │ (None, 40, 40, │ 0 │ batch_normalization… │
│ (Activation) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ separable_conv2d_3 │ (None, 40, 40, │ 17,664 │ activation_4[0][0] │
│ (SeparableConv2D) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 40, 40, │ 512 │ separable_conv2d_3[… │
│ (BatchNormalizatio… │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ max_pooling2d_1 │ (None, 20, 20, │ 0 │ batch_normalization… │
│ (MaxPooling2D) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_2 (Conv2D) │ (None, 20, 20, │ 8,320 │ add[0][0] │
│ │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ add_1 (Add) │ (None, 20, 20, │ 0 │ max_pooling2d_1[0][… │
│ │ 128) │ │ conv2d_2[0][0] │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_5 │ (None, 20, 20, │ 0 │ add_1[0][0] │
│ (Activation) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ separable_conv2d_4 │ (None, 20, 20, │ 34,176 │ activation_5[0][0] │
│ (SeparableConv2D) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 20, 20, │ 1,024 │ separable_conv2d_4[… │
│ (BatchNormalizatio… │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_6 │ (None, 20, 20, │ 0 │ batch_normalization… │
│ (Activation) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ separable_conv2d_5 │ (None, 20, 20, │ 68,096 │ activation_6[0][0] │
│ (SeparableConv2D) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 20, 20, │ 1,024 │ separable_conv2d_5[… │
│ (BatchNormalizatio… │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ max_pooling2d_2 │ (None, 10, 10, │ 0 │ batch_normalization… │
│ (MaxPooling2D) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_3 (Conv2D) │ (None, 10, 10, │ 33,024 │ add_1[0][0] │
│ │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ add_2 (Add) │ (None, 10, 10, │ 0 │ max_pooling2d_2[0][… │
│ │ 256) │ │ conv2d_3[0][0] │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_7 │ (None, 10, 10, │ 0 │ add_2[0][0] │
│ (Activation) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_transpose │ (None, 10, 10, │ 590,080 │ activation_7[0][0] │
│ (Conv2DTranspose) │ 256) │ │ │

├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 10, 10, │ 1,024 │ conv2d_transpose[0]… │
│ (BatchNormalizatio… │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_8 │ (None, 10, 10, │ 0 │ batch_normalization… │
│ (Activation) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_transpose_1 │ (None, 10, 10, │ 590,080 │ activation_8[0][0] │
│ (Conv2DTranspose) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 10, 10, │ 1,024 │ conv2d_transpose_1[… │
│ (BatchNormalizatio… │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ up_sampling2d_1 │ (None, 20, 20, │ 0 │ add_2[0][0] │
│ (UpSampling2D) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ up_sampling2d │ (None, 20, 20, │ 0 │ batch_normalization… │
│ (UpSampling2D) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_4 (Conv2D) │ (None, 20, 20, │ 65,792 │ up_sampling2d_1[0][… │
│ │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ add_3 (Add) │ (None, 20, 20, │ 0 │ up_sampling2d[0][0], │
│ │ 256) │ │ conv2d_4[0][0] │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_9 │ (None, 20, 20, │ 0 │ add_3[0][0] │
│ (Activation) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_transpose_2 │ (None, 20, 20, │ 295,040 │ activation_9[0][0] │
│ (Conv2DTranspose) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 20, 20, │ 512 │ conv2d_transpose_2[… │
│ (BatchNormalizatio… │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_10 │ (None, 20, 20, │ 0 │ batch_normalization… │
│ (Activation) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_transpose_3 │ (None, 20, 20, │ 147,584 │ activation_10[0][0] │
│ (Conv2DTranspose) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 20, 20, │ 512 │ conv2d_transpose_3[… │
│ (BatchNormalizatio… │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ up_sampling2d_3 │ (None, 40, 40, │ 0 │ add_3[0][0] │
│ (UpSampling2D) │ 256) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ up_sampling2d_2 │ (None, 40, 40, │ 0 │ batch_normalization… │
│ (UpSampling2D) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_5 (Conv2D) │ (None, 40, 40, │ 32,896 │ up_sampling2d_3[0][… │
│ │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ add_4 (Add) │ (None, 40, 40, │ 0 │ up_sampling2d_2[0][… │
│ │ 128) │ │ conv2d_5[0][0] │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_11 │ (None, 40, 40, │ 0 │ add_4[0][0] │
│ (Activation) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_transpose_4 │ (None, 40, 40, │ 73,792 │ activation_11[0][0] │
│ (Conv2DTranspose) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 40, 40, │ 256 │ conv2d_transpose_4[… │
│ (BatchNormalizatio… │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_12 │ (None, 40, 40, │ 0 │ batch_normalization… │
│ (Activation) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_transpose_5 │ (None, 40, 40, │ 36,928 │ activation_12[0][0] │
│ (Conv2DTranspose) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 40, 40, │ 256 │ conv2d_transpose_5[… │
│ (BatchNormalizatio… │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ up_sampling2d_5 │ (None, 80, 80, │ 0 │ add_4[0][0] │
│ (UpSampling2D) │ 128) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ up_sampling2d_4 │ (None, 80, 80, │ 0 │ batch_normalization… │
│ (UpSampling2D) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_6 (Conv2D) │ (None, 80, 80, │ 8,256 │ up_sampling2d_5[0][… │
│ │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ add_5 (Add) │ (None, 80, 80, │ 0 │ up_sampling2d_4[0][… │
│ │ 64) │ │ conv2d_6[0][0] │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_13 │ (None, 80, 80, │ 0 │ add_5[0][0] │
│ (Activation) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_transpose_6 │ (None, 80, 80, │ 18,464 │ activation_13[0][0] │
│ (Conv2DTranspose) │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 80, 80, │ 128 │ conv2d_transpose_6[… │
│ (BatchNormalizatio… │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ activation_14 │ (None, 80, 80, │ 0 │ batch_normalization… │
│ (Activation) │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_transpose_7 │ (None, 80, 80, │ 9,248 │ activation_14[0][0] │
│ (Conv2DTranspose) │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ batch_normalizatio… │ (None, 80, 80, │ 128 │ conv2d_transpose_7[… │

│ (BatchNormalizatio… │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ up_sampling2d_7 │ (None, 160, 160, │ 0 │ add_5[0][0] │
│ (UpSampling2D) │ 64) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ up_sampling2d_6 │ (None, 160, 160, │ 0 │ batch_normalization… │
│ (UpSampling2D) │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_7 (Conv2D) │ (None, 160, 160, │ 2,080 │ up_sampling2d_7[0][… │
│ │ 32) │ │ │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ add_6 (Add) │ (None, 160, 160, │ 0 │ up_sampling2d_6[0][… │
│ │ 32) │ │ conv2d_7[0][0] │
├─────────────────────┼───────────────────┼─────────┼──────────────────────┤
│ conv2d_8 (Conv2D) │ (None, 160, 160, │ 867 │ add_6[0][0] │
│ │ 3) │ │ │
└─────────────────────┴───────────────────┴─────────┴──────────────────────┘

Total params: 2,058,979 (7.85 MB)

Trainable params: 2,055,203 (7.84 MB)

Non-trainable params: 3,776 (14.75 KB)
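The decoder mirrors the encoder with Conv2DTranspose/UpSampling2D blocks and 1x1-projected residual connections rather than the concatenation skips of the original U-Net, hence "U-Net-like". A one-line shape check (a sketch, not in the original):

# One softmax distribution over num_classes per output pixel.
print(model.output_shape)  # (None, 160, 160, 3)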

Set aside a validation split


import random

# Split our img paths into a training and a validation set
val_samples = 1000
random.Random(1337).shuffle(input_img_paths)
random.Random(1337).shuffle(target_img_paths)
train_input_img_paths = input_img_paths[:-val_samples]
train_target_img_paths = target_img_paths[:-val_samples]
val_input_img_paths = input_img_paths[-val_samples:]
val_target_img_paths = target_img_paths[-val_samples:]

# Instantiate dataset for each split
# Limit the dataset to `max_dataset_len` files for faster epoch training time.
# Remove the `max_dataset_len` arg when running with the full dataset.
train_dataset = get_dataset(
    batch_size,
    img_size,
    train_input_img_paths,
    train_target_img_paths,
    max_dataset_len=1000,
)
valid_dataset = get_dataset(
    batch_size, img_size, val_input_img_paths, val_target_img_paths
)
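Shuffling both path lists with the same seed (1337) keeps every image aligned with its mask. A quick assertion sketch (not part of the original example):

# Each image file should share its base name with its trimap mask.
for img_path, mask_path in zip(train_input_img_paths, train_target_img_paths):
    img_name = os.path.splitext(os.path.basename(img_path))[0]
    mask_name = os.path.splitext(os.path.basename(mask_path))[0]
    assert img_name == mask_name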

Train the model


# Configure the model for training.
# We use the "sparse" version of categorical_crossentropy
# because our target data is integers.
model.compile(
    optimizer=keras.optimizers.Adam(1e-4), loss="sparse_categorical_crossentropy"
)

callbacks = [
    keras.callbacks.ModelCheckpoint("oxford_segmentation.keras", save_best_only=True)
]
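"Sparse" means the targets stay as integer class indices of shape (160, 160, 1) rather than one-hot vectors; both losses compute the same cross-entropy. A minimal illustration (a sketch, not in the original example):

# Integer targets with the sparse loss...
y_true = np.array([0, 2])
y_pred = np.array([[0.9, 0.05, 0.05], [0.1, 0.2, 0.7]])
print(keras.losses.sparse_categorical_crossentropy(y_true, y_pred))

# ...give the same values as one-hot targets with the dense loss.
y_true_onehot = np.array([[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]])
print(keras.losses.categorical_crossentropy(y_true_onehot, y_pred))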

# Train the model, doing validation at the end of each epoch.
epochs = 50
model.fit(
    train_dataset,
    epochs=epochs,
    validation_data=valid_dataset,
    callbacks=callbacks,
    verbose=2,
)


Epoch 1/50

WARNING: All log messages before absl::InitializeLog() is called are written to STDERR
I0000 00:00:1700414690.172044 2226172 device_compiler.h:187] Compiled cluster using XLA! This line is logged at most once for the lifetime of the process.
Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 62s - 2s/step - loss: 1.6363 - val_loss: 2.2226


Epoch 2/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 94ms/step - loss: 0.9223 - val_loss: 1.8273


Epoch 3/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 82ms/step - loss: 0.7894 - val_loss: 2.0044


Epoch 4/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 83ms/step - loss: 0.7174 - val_loss: 2.3480


Epoch 5/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 82ms/step - loss: 0.6695 - val_loss: 2.7528


Epoch 6/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 83ms/step - loss: 0.6325 - val_loss: 3.1453


Epoch 7/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 84ms/step - loss: 0.6012 - val_loss: 3.5611


Epoch 8/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 87ms/step - loss: 0.5730 - val_loss: 4.0003


Epoch 9/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 85ms/step - loss: 0.5466 - val_loss: 4.4798


Epoch 10/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 86ms/step - loss: 0.5210 - val_loss: 5.0245


Epoch 11/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 87ms/step - loss: 0.4958 - val_loss: 5.5950


Epoch 12/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 87ms/step - loss: 0.4706 - val_loss: 6.1534


Epoch 13/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 85ms/step - loss: 0.4453 - val_loss: 6.6107


Epoch 14/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 83ms/step - loss: 0.4202 - val_loss: 6.8010


Epoch 15/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 84ms/step - loss: 0.3956 - val_loss: 6.6751



Epoch 16/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 83ms/step - loss: 0.3721 - val_loss: 6.0800


Epoch 17/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 84ms/step - loss: 0.3506 - val_loss: 5.1820


Epoch 18/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 82ms/step - loss: 0.3329 - val_loss: 4.0350


Epoch 19/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 4s - 114ms/step - loss: 0.3216 - val_loss: 3.0513


Epoch 20/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 94ms/step - loss: 0.3595 - val_loss: 2.2567


Epoch 21/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 100ms/step - loss: 0.4417 - val_loss: 1.5873


Epoch 22/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 101ms/step - loss: 0.3531 - val_loss: 1.5798


Epoch 23/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 96ms/step - loss: 0.3353 - val_loss: 1.5525


Epoch 24/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 95ms/step - loss: 0.3392 - val_loss: 1.4625


Epoch 25/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 95ms/step - loss: 0.3596 - val_loss: 0.8867


Epoch 26/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 94ms/step - loss: 0.3528 - val_loss: 0.8021


Epoch 27/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 92ms/step - loss: 0.3237 - val_loss: 0.7986


Epoch 28/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 89ms/step - loss: 0.3198 - val_loss: 0.8533


Epoch 29/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 84ms/step - loss: 0.3272 - val_loss: 1.0588


Epoch 30/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 88ms/step - loss: 0.3164 - val_loss: 1.1889


Epoch 31/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9


32/32 - 3s - 85ms/step - loss: 0.2987 - val_loss: 0.9518


Epoch 32/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 87ms/step - loss: 0.2749 - val_loss: 0.9011


Epoch 33/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 84ms/step - loss: 0.2595 - val_loss: 0.8872


Epoch 34/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 87ms/step - loss: 0.2552 - val_loss: 1.0221


Epoch 35/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 82ms/step - loss: 0.2628 - val_loss: 1.1553


Epoch 36/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 85ms/step - loss: 0.2788 - val_loss: 2.1549


Epoch 37/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 94ms/step - loss: 0.2870 - val_loss: 1.6282


Epoch 38/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 89ms/step - loss: 0.2702 - val_loss: 1.3201


Epoch 39/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 91ms/step - loss: 0.2569 - val_loss: 1.2364


Epoch 40/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 106ms/step - loss: 0.2523 - val_loss: 1.3673


Epoch 41/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 86ms/step - loss: 0.2570 - val_loss: 1.3999


Epoch 42/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 87ms/step - loss: 0.2680 - val_loss: 0.9976


Epoch 43/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 83ms/step - loss: 0.2558 - val_loss: 1.0209


Epoch 44/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 85ms/step - loss: 0.2403 - val_loss: 1.3271


Epoch 45/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 83ms/step - loss: 0.2414 - val_loss: 1.1993


Epoch 46/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 84ms/step - loss: 0.2516 - val_loss: 1.0532


Epoch 47/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9



32/32 - 3s - 83ms/step - loss: 0.2695 - val_loss: 1.1183


Epoch 48/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 87ms/step - loss: 0.2555 - val_loss: 1.0432


Epoch 49/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 82ms/step - loss: 0.2290 - val_loss: 0.9444


Epoch 50/50

Corrupt JPEG data: 240 extraneous bytes before marker 0xd9

32/32 - 3s - 83ms/step - loss: 0.1994 - val_loss: 1.2182

<keras.src.callbacks.history.History at 0x7fe01842dab0>
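Note that val_loss bottomed out around epoch 27 (0.7986) while the final epoch sits at 1.2182; since ModelCheckpoint(save_best_only=True) kept only the best weights, you would typically reload them before predicting. A sketch (not in the original example):

# Restore the checkpoint with the lowest validation loss.
model = keras.models.load_model("oxford_segmentation.keras")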

Visualize predictions

# Generate predictions for all images in the validation set
val_dataset = get_dataset(
    batch_size, img_size, val_input_img_paths, val_target_img_paths
)
val_preds = model.predict(val_dataset)


def display_mask(i):
    """Quick utility to display a model's prediction."""
    mask = np.argmax(val_preds[i], axis=-1)
    mask = np.expand_dims(mask, axis=-1)
    img = ImageOps.autocontrast(keras.utils.array_to_img(mask))
    display(img)


# Display results for validation image #10
i = 10

# Display input image
display(Image(filename=val_input_img_paths[i]))

# Display ground-truth target mask
img = ImageOps.autocontrast(load_img(val_target_img_paths[i]))
display(img)

# Display mask predicted by our model
display_mask(i)  # Note that the model only sees inputs at 160x160.

32/32 ━━━━━━━━━━━━━━━━━━━━ 5s 100ms/step
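For a quantitative read on the masks, you can score the predictions with Keras's built-in MeanIoU metric; the sketch below (not part of the original example) assumes val_preds and val_dataset from above:

# Convert per-pixel probabilities to labels and accumulate mean IoU batch by batch.
miou = keras.metrics.MeanIoU(num_classes=num_classes)
pred_labels = np.argmax(val_preds, axis=-1)  # (1000, 160, 160), labels 0/1/2

offset = 0
for _, masks in val_dataset:
    n = masks.shape[0]
    miou.update_state(masks[..., 0], pred_labels[offset : offset + n])
    offset += n

print("Mean IoU:", float(miou.result()))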
