
Tensorflow and Keras APIs

May 9, 2020

0.1 Computer Vision: Neural Networks and Deep Learning

By: NAMA NYAM Guy Anthony


Image classification with the TensorFlow and Keras APIs. We show examples of image
pre-processing techniques with the OpenCV library and PCA. We present a neural
network architecture suited to image processing (the CNN) and the concepts of Deep
Learning.
We implement two approaches:
1. Transfer learning: re-using the weights of a pre-trained model
2. Building a model from scratch

To enlarge our dataset, we use the data augmentation technique via the Keras ImageDataGenerator.
We can also improve the models by adjusting their layers and tuning hyper-parameters.

Dataset: Stanford Dogs

[0]: import os
import random
import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

import cv2
import shutil
import pickle
import datetime
from xml.etree import cElementTree as ElementTree
import tensorflow as tf
from tensorflow.keras.layers import concatenate
from tensorflow.keras.layers import Conv2D, SeparableConv2D
from tensorflow.keras.layers import Flatten, Dense, Activation
from tensorflow.keras.layers import BatchNormalization, Dropout
from tensorflow.keras.layers import MaxPooling2D, AveragePooling2D
from tensorflow.keras.layers import GlobalMaxPooling2D, GlobalAveragePooling2D
from tensorflow.python.client import device_lib
from tensorflow.keras.preprocessing.image import ImageDataGenerator

from tensorflow.keras.preprocessing.image import load_img, img_to_array
from tensorflow.keras.callbacks import TensorBoard
from tensorflow.keras.callbacks import ReduceLROnPlateau, EarlyStopping

Environment
[0]: tf.keras.__version__

[0]: '2.3.0-tf'

[0]: tf.__version__

[0]: '2.2.0-rc3'

[0]: device_lib.list_local_devices()

[0]: [name: "/device:CPU:0"


device_type: "CPU"
memory_limit: 268435456
locality {
}
incarnation: 6791675207572200710, name: "/device:XLA_CPU:0"
device_type: "XLA_CPU"
memory_limit: 17179869184
locality {
}
incarnation: 6167036836661017396
physical_device_desc: "device: XLA_CPU device", name: "/device:XLA_GPU:0"
device_type: "XLA_GPU"
memory_limit: 17179869184
locality {
}
incarnation: 9163714199696093503
physical_device_desc: "device: XLA_GPU device", name: "/device:GPU:0"
device_type: "GPU"
memory_limit: 15701463552
locality {
bus_id: 1
links {
}
}
incarnation: 16104315102507413925
physical_device_desc: "device: 0, name: Tesla P100-PCIE-16GB, pci bus id:
0000:00:04.0, compute capability: 6.0"]

Code to extract information from the XML annotations

[0]: class XmlListConfig(list):
    def __init__(self, aList):
        for element in aList:
            if element:
                # treat like dict
                if len(element) == 1 or element[0].tag != element[1].tag:
                    self.append(XmlDictConfig(element))
                # treat like list
                elif element[0].tag == element[1].tag:
                    self.append(XmlListConfig(element))
            elif element.text:
                text = element.text.strip()
                if text:
                    self.append(text)

class XmlDictConfig(dict):

    def __init__(self, parent_element):
        if parent_element.items():
            self.update(dict(parent_element.items()))
        for element in parent_element:
            if element:
                if len(element) == 1 or element[0].tag != element[1].tag:
                    aDict = XmlDictConfig(element)
                else:
                    aDict = {element[0].tag: XmlListConfig(element)}
                if element.items():
                    aDict.update(dict(element.items()))
                self.update({element.tag: aDict})
            elif element.items():
                self.update({element.tag: dict(element.items())})
            else:
                self.update({element.tag: element.text})
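
As a quick sanity check, the parser can be exercised on a single annotation file (the path below is hypothetical; Stanford Dogs annotation files carry no file extension):

[ ]: xml_string = open('Annotation/n02110185-Siberian_husky/n02110185_5973').read()
root_xml = ElementTree.XML(xml_string)
annotation = XmlDictConfig(root_xml)
print(annotation['object']['name'])                               # e.g. 'Siberian_husky'
print(annotation['size']['width'], annotation['size']['height'])  # e.g. 500 375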

Organizing the data on my Drive (one-time setup)

[0]: # Root data folder


root_data = 'drive/My Drive/Colab Notebooks/P6/data/'

[0]: # Images folder


root_images = 'drive/My Drive/Colab Notebooks/P6/data/Images'

[0]: # Annotation xml folder


root_annotation = 'drive/My Drive/Colab Notebooks/P6/data/annotation/Annotation'

[0]: dir_train = 'drive/My Drive/Colab Notebooks/P6/data/train'

[0]: dir_test = 'drive/My Drive/Colab Notebooks/P6/data/test'

[0]: dir_model = 'drive/My Drive/Colab Notebooks/P6/data/model/'

Functions to save and load objects


[0]: def save_obj(obj, name):
    with open(root_data + 'obj/' + name + '.pkl', 'wb') as f:
        pickle.dump(obj, f, pickle.HIGHEST_PROTOCOL)

def load_obj(name):
    with open(root_data + 'obj/' + name + '.pkl', 'rb') as f:
        return pickle.load(f)

1 Prepare Data

Extract information about the dog images: filename, breed, height, width

[0]: directory_annotation_names = [x[0] for x in os.walk(root_annotation)][1:]

[0]: filenames = []
categories = []
widths = []
heights = []
count = 0
for dir_an_name in directory_annotation_names:
    directory = dir_an_name.split('/')[-1]
    filenames_dir = os.listdir(dir_an_name)
    path = root_images + "/" + directory
    # NOTE: only the first annotation file of each breed folder is parsed, so its
    # width/height are applied to every image of that breed (visible in the
    # DataFrame below, where sizes repeat within a breed)
    xml_string = open(root_annotation + "/" + directory +
                      "/" + filenames_dir[0], "r").read()
    root_xml = ElementTree.XML(xml_string)
    xmldict = XmlDictConfig(root_xml)
    for filename_dir in filenames_dir:
        if os.path.isfile(path + "/" + filename_dir + ".jpg"):
            filenames.append(filename_dir + ".jpg")
            categories.append(xmldict['object']['name'])
            widths.append(xmldict['size']['width'])
            heights.append(xmldict['size']['height'])
        else:
            count += 1

[0]: data = pd.DataFrame({
    'filename': filenames,
    'category': categories,
    'widths': widths,
    'heights': heights
})

[0]: # Save data object


save_obj(data, 'data')

View the DataFrame and the distributions


[0]: data = load_obj('data')

[0]: data

[0]: filename category widths heights


0 n02110185_5973.jpg Siberian_husky 500 375
1 n02110185_2614.jpg Siberian_husky 500 375
2 n02110185_8327.jpg Siberian_husky 500 375
3 n02110185_4133.jpg Siberian_husky 500 375
4 n02110185_14650.jpg Siberian_husky 500 375
… … … … …
20575 n02093647_120.jpg Bedlington_terrier 237 360
20576 n02093647_2585.jpg Bedlington_terrier 237 360
20577 n02093647_2068.jpg Bedlington_terrier 237 360
20578 n02093647_3219.jpg Bedlington_terrier 237 360
20579 n02093647_2349.jpg Bedlington_terrier 237 360

[20580 rows x 4 columns]

[0]: plt.figure(figsize=(20, 6))


data['category'].value_counts().plot.bar()

[0]: <matplotlib.axes._subplots.AxesSubplot at 0x7f6b9710ff60>

[0]: plt.figure(figsize=(20, 6))
sns.distplot(data['heights'], hist=False, label="distribution heights")
sns.distplot(data['widths'], hist=False, label="distribution widths")

[0]: <matplotlib.axes._subplots.AxesSubplot at 0x7f6ad0d0f7b8>

Split train and test data


# keep 137 images per breed so every split has the same breed distribution
df = data.groupby('category').head(137)
df.shape[0]

[0]: 16440

[0]: validate_df = df.groupby('category').head(35)


validate_df.shape[0]

[0]: 4200

[0]: train_df = df.loc[df.index.difference(validate_df.index)]


train_df.shape[0]

[0]: 12240

[0]: test_df = data.loc[data.index.difference(df.index)]


test_df.shape[0]

[0]: 4140

Create the train and test image folders on the Drive

[0]: # Delete the train and test folders if they exist

if os.path.isdir(dir_train):
    shutil.rmtree(dir_train)
if os.path.isdir(dir_test):
    shutil.rmtree(dir_test)

[0]: # Create train and test directories

os.mkdir(dir_train)
os.mkdir(dir_test)

[0]: # Copy the files (df and test_df must carry a 'directory' column; see the sketch below)

file_error_train = []
for i, row in df.iterrows():
    path = root_images + "/" + row['directory'] + "/" + row['filename']
    if os.path.isfile(path):
        shutil.copy(path, dir_train)
    else:
        file_error_train.append(row['filename'])
print(file_error_train)

file_error_test = []
for i, row in test_df.iterrows():
    path = root_images + "/" + row['directory'] + "/" + row['filename']
    if os.path.isfile(path):
        shutil.copy(path, dir_test)
    else:
        file_error_test.append(row['filename'])
print(file_error_test)
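
Note that the copy loop above needs a 'directory' column that the DataFrame built earlier never defines. A sketch of one way to derive it, assuming the Stanford Dogs folder convention '<wnid>-<breed>', where the WordNet id is the part of the filename before the underscore:

[ ]: for frame in (df, test_df):
    frame['directory'] = frame['filename'].str.split('_').str[0] + '-' + frame['category']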

View a sample image


[0]: sample = random.choice(train_df['filename'].values.tolist())

[0]: img = load_img(dir_train + "/" + sample, target_size=(224, 224))

[0]: plt.imshow(img)

[0]: <matplotlib.image.AxesImage at 0x7f5996ec9668>

2 Preprocessing

Define constants
[0]: IMAGE_SIZE = (299, 299)

Image preprocessing: using the ImageDataGenerator class, which generates batches of
tensor image data with real-time data augmentation.

[0]: def zca_whitening(x):
    # ZCA whitening: x_zca = U diag(1/sqrt(S + eps)) U^T x, with U, S taken
    # from the SVD of the covariance matrix of x
    x = x - x.mean(axis=0)
    sigma = x.dot(x.T) / float(x.shape[0])
    U, S, Vh = np.linalg.svd(sigma)
    epsilon = 1e-5
    xPCAWhite = np.diag(1.0 / np.sqrt(S + epsilon)).dot(U.T).dot(x)
    xZCAWhite = U.dot(xPCAWhite)
    return xZCAWhite

[0]: def whitening(img):
    # apply ZCA whitening channel by channel (only the first channel here)
    channels = cv2.split(img)
    channels[0] = zca_whitening(channels[0])
    #channels[1] = zca_whitening(channels[1])
    #channels[2] = zca_whitening(channels[2])
    img = cv2.merge(channels)
    return img

[0]: plt.imshow(whitening(img_to_array(img)))

Clipping input data to the valid range for imshow with RGB data ([0..1] for
floats or [0..255] for integers).

[0]: <matplotlib.image.AxesImage at 0x7f59961a4320>

[0]: def equalHist(img, adaptive=True):
    # equalize the luma channel (Y of YCrCb); CLAHE when adaptive=True
    img = img.astype(np.uint8)
    ycrcb = cv2.cvtColor(img, cv2.COLOR_BGR2YCR_CB)
    channels = cv2.split(ycrcb)
    if adaptive:
        clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))
        channels[0] = clahe.apply(channels[0])
    else:
        channels[0] = cv2.equalizeHist(channels[0])
    ycrcb = cv2.merge(channels)
    img = cv2.cvtColor(ycrcb, cv2.COLOR_YCR_CB2BGR)
    return img

[0]: plt.imshow(equalHist(img_to_array(img)))

[0]: <matplotlib.image.AxesImage at 0x7f599610e6d8>

[0]: def preprocessing_image(img):
    #img = whitening(img)
    img = equalHist(img)
    img = tf.keras.applications.inception_resnet_v2.preprocess_input(img)
    return img
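
A quick check of what this function produces; the InceptionResNetV2 preprocessor maps pixel values to [-1, 1] (a sketch, reusing the sample filename drawn above):

[ ]: arr = img_to_array(load_img(dir_train + "/" + sample, target_size=IMAGE_SIZE))
out = preprocessing_image(arr)
print(out.shape, out.min(), out.max())   # expected: (299, 299, 3) and roughly -1.0, 1.0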

[0]: # use the InceptionResNetV2 pre-processing function combined with equalHist
train_datagen = ImageDataGenerator(preprocessing_function=preprocessing_image)

[0]: # NOTE: this rescales to [0, 1], while the training generator maps to [-1, 1]
# via the InceptionResNetV2 preprocessor; ideally the two should match
validation_datagen = ImageDataGenerator(rescale=1./255)

[0]: train_generator = train_datagen.flow_from_dataframe(dataframe=train_df,
                                                    directory=dir_train,
                                                    x_col='filename',
                                                    y_col='category',
                                                    target_size=IMAGE_SIZE,
                                                    class_mode='categorical',
                                                    batch_size=32
                                                    )

Found 12240 validated image filenames belonging to 120 classes.

[0]: validation_generator = validation_datagen.\
flow_from_dataframe(dataframe=validate_df,
directory=dir_train,
x_col="filename",
y_col="category",
target_size=IMAGE_SIZE,
class_mode="categorical",
batch_size=32
)

Found 4200 validated image filenames belonging to 120 classes.


See the effects of preprocessing

[0]: example_df = train_df.sample(n=1).reset_index(drop=True)

example_generator = train_datagen.flow_from_dataframe(dataframe=example_df,
                                                      directory=dir_train,
                                                      x_col='filename',
                                                      y_col='category',
                                                      target_size=IMAGE_SIZE,
                                                      class_mode='categorical',
                                                      batch_size=32
                                                      )

Found 1 validated image filenames belonging to 1 classes.

[0]: img = load_img(dir_train + "/" + example_df.loc[0, 'filename'])

[0]: X_batch, Y_batch = next(example_generator)

[0]: plt.subplot(1, 2, 1)
plt.imshow(img, aspect="auto")
plt.title('Original image')
plt.subplot(1, 2, 2)
plt.imshow(X_batch[0], aspect="auto")
plt.title('preprocessing')
plt.tight_layout()
plt.show()

Clipping input data to the valid range for imshow with RGB data ([0..1] for
floats or [0..255] for integers).

3 Callbacks : ReduceLROnPlateau, EarlyStopping, TensorBoard

[0]: # Reduce the learning rate after 3 epochs without improvement

learning_rate_reduction = ReduceLROnPlateau(monitor='val_accuracy', patience=3,
                                            verbose=1, factor=0.5,
                                            min_lr=0.000001)

[0]: # Stop training when the validation loss has not improved for 5 epochs
earlystop = EarlyStopping(monitor='val_loss', patience=5)

[0]: log_dir = "logs_transfer/fit/" + datetime.datetime.now().strftime("%Y%m%d-%H%M%S")

# Load the TensorBoard notebook extension
%load_ext tensorboard

%tensorboard --logdir $log_dir

tensorboard_callback = TensorBoard(log_dir=log_dir, histogram_freq=1)

<IPython.core.display.Javascript object>

Optimizer: RMSprop. See the [Keras optimizers](https://keras.io/optimizers/), including Adam.
The optimizer uses the gradients computed during back-propagation to update the weights.

[0]: optimiseur = tf.keras.optimizers.RMSprop(learning_rate=0.01)
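
Should one want to compare with Adam (discussed at the link above), a drop-in alternative would be:

[ ]: # hypothetical alternative, using Keras' default Adam learning rate
optimiseur = tf.keras.optimizers.Adam(learning_rate=0.001)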

4 Transfer Learning

Building a CNN on top of a pre-trained InceptionResNetV2

[0]: # Load the InceptionResNetV2 model with its trained ImageNet weights for the
# feature-representation layers
inceptionResNetV2_model = tf.keras.applications.InceptionResNetV2(weights='imagenet',
                                                                  include_top=False)

Downloading data from https://storage.googleapis.com/tensorflow/keras-applications/inception_resnet_v2/inception_resnet_v2_weights_tf_dim_ordering_tf_kernels_notop.h5
219062272/219055592 [==============================] - 2s 0us/step

[0]: x = inceptionResNetV2_model.output

[0]: x = GlobalAveragePooling2D()(x)

[0]: predictions = Dense(120, activation='softmax', name='predictions')(x)

[0]: my_inceptionResNetV2 = tf.keras.models.Model(inputs=inceptionResNetV2_model.input,
                                             outputs=predictions)

[0]: # We're only training the new classifier

for layer in inceptionResNetV2_model.layers:
    layer.trainable = False
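
A sketch of a sanity check that the freeze worked: with the base frozen, only the 120-way Dense head should count as trainable.

[ ]: from tensorflow.keras import backend as K
trainable = int(np.sum([K.count_params(w) for w in my_inceptionResNetV2.trainable_weights]))
total = int(np.sum([K.count_params(w) for w in my_inceptionResNetV2.weights]))
print(f"trainable: {trainable:,} / total: {total:,}")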

Compile and fit model


[0]: my_inceptionResNetV2.compile(optimizer=optimiseur, loss='categorical_crossentropy',
                             metrics=['accuracy'])

[0]: history = my_inceptionResNetV2.fit(train_generator,
                                   epochs=30,
                                   steps_per_epoch=train_df.shape[0]//32,
                                   validation_data=validation_generator,
                                   validation_steps=validate_df.shape[0]//32,
                                   callbacks=[learning_rate_reduction,
                                              earlystop, tensorboard_callback]
                                   )
Epoch 1/30
382/382 [==============================] - 8900s 23s/step - loss: 1.2136 -
accuracy: 0.7956 - val_loss: 0.9366 - val_accuracy: 0.8638 - lr: 0.0100
Epoch 2/30
382/382 [==============================] - 146s 383ms/step - loss: 0.8820 -
accuracy: 0.8670 - val_loss: 1.1164 - val_accuracy: 0.8593 - lr: 0.0100
Epoch 3/30
382/382 [==============================] - 147s 384ms/step - loss: 0.7621 -
accuracy: 0.8842 - val_loss: 1.0991 - val_accuracy: 0.8745 - lr: 0.0100
Epoch 4/30
382/382 [==============================] - 146s 383ms/step - loss: 0.7119 -
accuracy: 0.8915 - val_loss: 1.1188 - val_accuracy: 0.8721 - lr: 0.0100
Epoch 5/30
382/382 [==============================] - 146s 382ms/step - loss: 0.6227 -
accuracy: 0.9019 - val_loss: 1.2700 - val_accuracy: 0.8733 - lr: 0.0100
Epoch 6/30
382/382 [==============================] - 146s 381ms/step - loss: 0.6019 -
accuracy: 0.9060 - val_loss: 1.2158 - val_accuracy: 0.8748 - lr: 0.0100

[0]: # Save model
tf.keras.models.save_model(my_inceptionResNetV2, dir_model + "my_inceptionResNetV2.h5")

[0]: # Save class indices


save_obj(train_generator.class_indices, 'class_indices')
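
A minimal inference sketch using the two artifacts saved above, assuming both files exist on the Drive (the test image name is taken from the test set shown further below):

[ ]: model = tf.keras.models.load_model(dir_model + "my_inceptionResNetV2.h5")
class_indices = load_obj('class_indices')
idx_to_breed = {v: k for k, v in class_indices.items()}
img = load_img(dir_test + "/n02110185_6438.jpg", target_size=IMAGE_SIZE)
x = preprocessing_image(img_to_array(img))
probs = model.predict(np.expand_dims(x, axis=0))[0]
print(idx_to_breed[int(np.argmax(probs))])   # predicted breed name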

[0]: fig = plt.figure(figsize=(12, 6))


ax = fig.add_subplot(111)
ax.plot(history.history['accuracy'], color='b', label="Training accuracy")
ax.plot(history.history['val_accuracy'], color='r',label="Validation accuracy")
ax.set_xticks(np.arange(1, 30, 1))
legend = plt.legend(loc='best', shadow=True)
plt.tight_layout()
plt.show()

[0]: fig = plt.figure(figsize=(12, 6))
ax = fig.add_subplot(111, autoscale_on=True)
ax.plot(history.history['loss'], color='b', label="Training loss")
ax.plot(history.history['val_loss'], color='r', label="validation loss")
ax.set_xticks(np.arange(1, 30, 1))
legend = plt.legend(loc='best', shadow=True)
plt.tight_layout()
plt.show()

Evaluation

[0]: test_datagen = ImageDataGenerator(rescale=1./255)

[0]: test_generator = test_datagen.flow_from_dataframe(dataframe=test_df,


directory=dir_test,
x_col="filename",
y_col="category",
target_size=IMAGE_SIZE,
class_mode="categorical",
batch_size=32
)

Found 4140 validated image filenames belonging to 120 classes.

[0]: loss_and_metrics = my_inceptionResNetV2.evaluate(test_generator, batch_size=32,
                                                 steps=test_df.shape[0]//32)

129/129 [==============================] - 2379s 18s/step - loss: 0.9754 -
accuracy: 0.8881

[0]: loss_and_metrics

[0]: [0.975404679775238, 0.8880813717842102]

5 Extract misclassifications

[0]: collections = np.array([img_to_array(load_img(dir_test + "/" + row['filename'],
                                              target_size=(299, 299)))
                        for i, row in test_df.iterrows()])

[0]: collections = np.array([preprocessing_image(collection) for collection in collections])

[0]: predict = my_inceptionResNetV2.predict(collections)

[0]: test_df['predicted'] = np.argmax(predict, axis=1) # position of max probability

[0]: test_df

[0]: filename category widths heights predicted


137 n02110185_6438.jpg Siberian_husky 500 375 24
138 n02110185_7413.jpg Siberian_husky 500 375 115
139 n02110185_5871.jpg Siberian_husky 500 375 24
140 n02110185_8360.jpg Siberian_husky 500 375 24
141 n02110185_1598.jpg Siberian_husky 500 375 24
… … … … … …

20575 n02093647_120.jpg Bedlington_terrier 237 360 6
20576 n02093647_2585.jpg Bedlington_terrier 237 360 6
20577 n02093647_2068.jpg Bedlington_terrier 237 360 6
20578 n02093647_3219.jpg Bedlington_terrier 237 360 6
20579 n02093647_2349.jpg Bedlington_terrier 237 360 6

[4140 rows x 5 columns]

[0]: # Map each predicted index back to its category name
label_map = dict((v, k) for k, v in test_generator.class_indices.items())
test_df['predicted'] = test_df['predicted'].replace(label_map)

[0]: test_df

[0]: filename category … heights predicted


137 n02110185_6438.jpg Siberian_husky … 375 Eskimo_dog
138 n02110185_7413.jpg Siberian_husky … 375 toy_poodle
139 n02110185_5871.jpg Siberian_husky … 375 Eskimo_dog
140 n02110185_8360.jpg Siberian_husky … 375 Eskimo_dog
141 n02110185_1598.jpg Siberian_husky … 375 Eskimo_dog
… … … … … …
20575 n02093647_120.jpg Bedlington_terrier … 360 Bedlington_terrier
20576 n02093647_2585.jpg Bedlington_terrier … 360 Bedlington_terrier
20577 n02093647_2068.jpg Bedlington_terrier … 360 Bedlington_terrier
20578 n02093647_3219.jpg Bedlington_terrier … 360 Bedlington_terrier
20579 n02093647_2349.jpg Bedlington_terrier … 360 Bedlington_terrier

[4140 rows x 5 columns]

[0]: # Misclassifications: about 11.7% of the 4140 test images, broadly
# consistent with the 88.8% test accuracy measured above
error_df = test_df[test_df['category'] != test_df['predicted']]
error_df.shape[0]

[0]: 485

[0]: plt.figure(figsize=(24, 6))


error_df['category'].value_counts().plot.bar()

[0]: <matplotlib.axes._subplots.AxesSubplot at 0x7f8989fa7438>

[0]: error_df[['category', 'predicted']].groupby('category').describe()

[0]: predicted
count unique top freq
category
Afghan_hound 1 1 Newfoundland 1
Airedale 2 2 Irish_terrier 1
American_Staffordshire_terrier 14 4 Staffordshire_bullterrier 11
Appenzeller 2 2 EntleBucher 1
Australian_terrier 8 3 silky_terrier 4
… … … … …
toy_poodle 1 1 miniature_poodle 1
toy_terrier 4 3 basenji 2
vizsla 2 2 Weimaraner 1
whippet 3 2 Italian_greyhound 2
wire-haired_fox_terrier 1 1 Lakeland_terrier 1

[97 rows x 4 columns]

[0]: errors_synopsis = pd.DataFrame(columns=['category', 'list_errors'])

for category in set(error_df['category'].values):
    list_errors = "| "
    for i, row in error_df.iterrows():
        if row['category'] == category:
            list_errors += row['predicted'] + " | "
    errors_synopsis = errors_synopsis.append({'category': category,
                                              'list_errors': list_errors},
                                             ignore_index=True)

[0]: display(errors_synopsis)

category list_errors
0 bull_mastiff | Labrador_retriever | Brabancon_griffon | cho...

1 Norwich_terrier | Scottish_deerhound | West_Highland_white_ter...
2 Tibetan_terrier | otterhound | soft-coated_wheaten_terrier | L...
3 Cardigan | Pembroke | Pembroke | Pembroke |
4 Irish_wolfhound | Scottish_deerhound | Scottish_deerhound | Sc...
.. ... ...
92 Siberian_husky | Eskimo_dog | toy_poodle | Eskimo_dog | Eskim...
93 Eskimo_dog | malamute | Siberian_husky |
94 Japanese_spaniel | Shih-Tzu | Blenheim_spaniel |
95 golden_retriever | Great_Pyrenees | cocker_spaniel | Great_Pyre...
96 Lhasa | Shih-Tzu | Tibetan_mastiff | Shih-Tzu | Shih...

[97 rows x 2 columns]

6 Data augmentation to improve performance

[0]: # Augmentation parameters chosen empirically after inspecting the misclassifications

train_datagen_aug = ImageDataGenerator(rotation_range=30,
                                       width_shift_range=0.1,
                                       height_shift_range=0.1,
                                       shear_range=0.01,
                                       zoom_range=0.1,
                                       horizontal_flip=True,
                                       brightness_range=[0.5, 1.5],
                                       preprocessing_function=preprocessing_image)

[0]: train_generator_aug = train_datagen_aug.\
    flow_from_dataframe(dataframe=train_df,
                        directory=dir_train,
                        x_col='filename',
                        y_col='category',
                        target_size=IMAGE_SIZE,
                        class_mode='categorical',
                        batch_size=32
                        )

Found 12240 validated image filenames belonging to 120 classes.


See how the generator works

[0]: example_df = train_df.sample(n=1).reset_index(drop=True)

example_generator = train_datagen_aug.\
    flow_from_dataframe(dataframe=example_df,
                        directory=dir_train,
                        x_col='filename',
                        y_col='category',
                        target_size=IMAGE_SIZE,
                        class_mode='categorical',
                        batch_size=32
                        )

Found 1 validated image filenames belonging to 1 classes.

[0]: img = load_img(dir_train + "/" + example_df.loc[0, 'filename'])

plt.figure(figsize=(16, 3))
plt.subplot(1, 6, 1)
plt.imshow(img, aspect="auto")
plt.title('Original image')
for i in range(1, 6):
    plt.subplot(1, 6, i+1)
    for X_batch, Y_batch in example_generator:
        image = X_batch[0]
        plt.imshow(image, aspect="auto")
        break
plt.tight_layout()
#plt.suptitle('Original image on the left, augmented versions on the right')
plt.show()

Clipping input data to the valid range for imshow with RGB data ([0..1] for
floats or [0..255] for integers).
Clipping input data to the valid range for imshow with RGB data ([0..1] for
floats or [0..255] for integers).
Clipping input data to the valid range for imshow with RGB data ([0..1] for
floats or [0..255] for integers).
Clipping input data to the valid range for imshow with RGB data ([0..1] for
floats or [0..255] for integers).
Clipping input data to the valid range for imshow with RGB data ([0..1] for
floats or [0..255] for integers).

See if there is a gain in performance

[0]: history_aug = my_inceptionResNetV2.\
    fit(train_generator_aug,
        epochs=30,
        steps_per_epoch=train_df.shape[0]//32,
        validation_data=validation_generator,
        validation_steps=validate_df.shape[0]//32,
        callbacks=[learning_rate_reduction, earlystop]
        )

Epoch 1/30
382/382 [==============================] - 406s 1s/step - loss: 1.4635 -
accuracy: 0.8377 - val_loss: 1.2315 - val_accuracy: 0.8700 - lr: 0.0100
Epoch 2/30
382/382 [==============================] - 399s 1s/step - loss: 1.4446 -
accuracy: 0.8372 - val_loss: 1.1172 - val_accuracy: 0.8771 - lr: 0.0100
Epoch 3/30
382/382 [==============================] - 396s 1s/step - loss: 1.3201 -
accuracy: 0.8495 - val_loss: 1.2262 - val_accuracy: 0.8717 - lr: 0.0100
Epoch 4/30
382/382 [==============================] - 393s 1s/step - loss: 1.2951 -
accuracy: 0.8462 - val_loss: 1.2033 - val_accuracy: 0.8700 - lr: 0.0100
Epoch 5/30
382/382 [==============================] - ETA: 0s - loss: 1.2972 - accuracy:
0.8508
Epoch 00005: ReduceLROnPlateau reducing learning rate to 0.004999999888241291.
382/382 [==============================] - 394s 1s/step - loss: 1.2972 -
accuracy: 0.8508 - val_loss: 1.1955 - val_accuracy: 0.8702 - lr: 0.0100
Epoch 6/30
382/382 [==============================] - 397s 1s/step - loss: 0.8653 -
accuracy: 0.8822 - val_loss: 0.9745 - val_accuracy: 0.8876 - lr: 0.0050
Epoch 7/30
382/382 [==============================] - 406s 1s/step - loss: 0.7608 -
accuracy: 0.8858 - val_loss: 0.9190 - val_accuracy: 0.8850 - lr: 0.0050
Epoch 8/30
382/382 [==============================] - 408s 1s/step - loss: 0.7355 -
accuracy: 0.8842 - val_loss: 0.9485 - val_accuracy: 0.8831 - lr: 0.0050
Epoch 9/30
382/382 [==============================] - ETA: 0s - loss: 0.6819 - accuracy:
0.8912
Epoch 00009: ReduceLROnPlateau reducing learning rate to 0.0024999999441206455.
382/382 [==============================] - 409s 1s/step - loss: 0.6819 -
accuracy: 0.8912 - val_loss: 0.9270 - val_accuracy: 0.8848 - lr: 0.0050
Epoch 10/30
382/382 [==============================] - 407s 1s/step - loss: 0.5759 -
accuracy: 0.9051 - val_loss: 0.8837 - val_accuracy: 0.8886 - lr: 0.0025
Epoch 11/30
382/382 [==============================] - 408s 1s/step - loss: 0.5444 -
accuracy: 0.9053 - val_loss: 0.8894 - val_accuracy: 0.8915 - lr: 0.0025
Epoch 12/30

382/382 [==============================] - 407s 1s/step - loss: 0.5221 -
accuracy: 0.9072 - val_loss: 0.8669 - val_accuracy: 0.8896 - lr: 0.0025
Epoch 13/30
382/382 [==============================] - 405s 1s/step - loss: 0.4721 -
accuracy: 0.9120 - val_loss: 0.8999 - val_accuracy: 0.8862 - lr: 0.0025
Epoch 14/30
382/382 [==============================] - ETA: 0s - loss: 0.5005 - accuracy:
0.9092
Epoch 00014: ReduceLROnPlateau reducing learning rate to 0.0012499999720603228.
382/382 [==============================] - 407s 1s/step - loss: 0.5005 -
accuracy: 0.9092 - val_loss: 0.9126 - val_accuracy: 0.8802 - lr: 0.0025
Epoch 15/30
382/382 [==============================] - 412s 1s/step - loss: 0.4344 -
accuracy: 0.9134 - val_loss: 0.8271 - val_accuracy: 0.8915 - lr: 0.0012
Epoch 16/30
382/382 [==============================] - 413s 1s/step - loss: 0.4216 -
accuracy: 0.9168 - val_loss: 0.8398 - val_accuracy: 0.8898 - lr: 0.0012
Epoch 17/30
382/382 [==============================] - 413s 1s/step - loss: 0.4210 -
accuracy: 0.9184 - val_loss: 0.8195 - val_accuracy: 0.8934 - lr: 0.0012
Epoch 18/30
382/382 [==============================] - 415s 1s/step - loss: 0.4030 -
accuracy: 0.9196 - val_loss: 0.8186 - val_accuracy: 0.8905 - lr: 0.0012
Epoch 19/30
382/382 [==============================] - 415s 1s/step - loss: 0.3890 -
accuracy: 0.9246 - val_loss: 0.8350 - val_accuracy: 0.8900 - lr: 0.0012
Epoch 20/30
382/382 [==============================] - ETA: 0s - loss: 0.4068 - accuracy:
0.9171
Epoch 00020: ReduceLROnPlateau reducing learning rate to 0.0006249999860301614.
382/382 [==============================] - 414s 1s/step - loss: 0.4068 -
accuracy: 0.9171 - val_loss: 0.8211 - val_accuracy: 0.8922 - lr: 0.0012
Epoch 21/30
382/382 [==============================] - 415s 1s/step - loss: 0.3534 -
accuracy: 0.9270 - val_loss: 0.7882 - val_accuracy: 0.8938 - lr: 6.2500e-04
Epoch 22/30
382/382 [==============================] - 411s 1s/step - loss: 0.3533 -
accuracy: 0.9250 - val_loss: 0.8018 - val_accuracy: 0.8936 - lr: 6.2500e-04
Epoch 23/30
382/382 [==============================] - 409s 1s/step - loss: 0.3751 -
accuracy: 0.9231 - val_loss: 0.8093 - val_accuracy: 0.8941 - lr: 6.2500e-04
Epoch 24/30
382/382 [==============================] - 408s 1s/step - loss: 0.3989 -
accuracy: 0.9219 - val_loss: 0.7877 - val_accuracy: 0.8934 - lr: 6.2500e-04
Epoch 25/30
382/382 [==============================] - 409s 1s/step - loss: 0.3875 -
accuracy: 0.9226 - val_loss: 0.8017 - val_accuracy: 0.8922 - lr: 6.2500e-04
Epoch 26/30

382/382 [==============================] - ETA: 0s - loss: 0.3833 - accuracy:
0.9235
Epoch 00026: ReduceLROnPlateau reducing learning rate to 0.0003124999930150807.
382/382 [==============================] - 416s 1s/step - loss: 0.3833 -
accuracy: 0.9235 - val_loss: 0.8019 - val_accuracy: 0.8922 - lr: 6.2500e-04
Epoch 27/30
382/382 [==============================] - 422s 1s/step - loss: 0.3431 -
accuracy: 0.9256 - val_loss: 0.7896 - val_accuracy: 0.8948 - lr: 3.1250e-04
Epoch 28/30
382/382 [==============================] - 421s 1s/step - loss: 0.3444 -
accuracy: 0.9291 - val_loss: 0.7907 - val_accuracy: 0.8931 - lr: 3.1250e-04
Epoch 29/30
382/382 [==============================] - 423s 1s/step - loss: 0.3542 -
accuracy: 0.9258 - val_loss: 0.7938 - val_accuracy: 0.8924 - lr: 3.1250e-04
Evaluation
[0]: fig = plt.figure(figsize=(12, 6))
ax = fig.add_subplot(111)
ax.plot(history_aug.history['accuracy'], color='b',
label="Training accuracy")
ax.plot(history_aug.history['val_accuracy'], color='r',
label="Validation accuracy")
ax.set_xticks(np.arange(1, 29, 1))
legend = plt.legend(loc='best', shadow=True)
plt.tight_layout()
plt.show()

[0]: fig = plt.figure(figsize=(12, 6))
ax = fig.add_subplot(111, autoscale_on=True)
ax.plot(history_aug.history['loss'], color='b', label="Training loss")
ax.plot(history_aug.history['val_loss'], color='r', label="Validation loss")
ax.set_xticks(np.arange(1, 30, 1))
legend = plt.legend(loc='best', shadow=True)
plt.tight_layout()
plt.show()

[0]: loss_and_metrics_aug = my_inceptionResNetV2.evaluate(test_generator, batch_size=32,
                                                     steps=test_df.shape[0]//32)

129/129 [==============================] - 30s 229ms/step - loss: 0.6266 -
accuracy: 0.9135

[0]: loss_and_metrics_aug

[0]: [0.62655109167099, 0.913517415523529]

7 Building a model from scratch

Preprocessing function

[0]: def preprocessing_from_scratch(img):
    # histogram equalization only; note there is no rescaling here, so the
    # network receives raw 0-255 pixel values
    return equalHist(img)
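
A quick check (sketch): unlike the transfer-learning pipeline, this function returns raw 0-255 values; since the validation and test generators below rescale to [0, 1], the two pipelines see very different input ranges.

[ ]: out = preprocessing_from_scratch(img_to_array(img))
print(out.min(), out.max())   # expected: roughly 0 and 255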

Based on the VGG16 architecture, with fewer layers, smaller dimensions, and a modified input image size.

[0]: # histogram-equalization preprocessing only (no model-specific rescaling)

vgg16_train_datagen = ImageDataGenerator(rotation_range=20,
                                         width_shift_range=0.1,
                                         height_shift_range=0.1,
                                         shear_range=0.01,
                                         zoom_range=0.1,
                                         horizontal_flip=True,
                                         brightness_range=[0.5, 1.5],
                                         preprocessing_function=preprocessing_from_scratch)

[0]: # NOTE: validation data is rescaled to [0, 1] while the training data above stays
# in [0, 255]; this mismatch is a likely cause of the poor results below
vgg16_validation_datagen = ImageDataGenerator(rescale=1./255)

[0]: vgg16_train_generator = vgg16_train_datagen.\
    flow_from_dataframe(dataframe=train_df,
                        directory=dir_train,
                        x_col='filename',
                        y_col='category',
                        target_size=(224, 224),
                        class_mode='categorical',
                        batch_size=32
                        )

Found 12240 validated image filenames belonging to 120 classes.

[0]: vgg16_validation_generator = vgg16_validation_datagen.\
    flow_from_dataframe(dataframe=validate_df,
                        directory=dir_train,
                        x_col='filename',
                        y_col='category',
                        target_size=(224, 224),
                        class_mode='categorical',
                        batch_size=32
                        )

Found 4200 validated image filenames belonging to 120 classes.

[0]: my_VGG16 = tf.keras.models.Sequential()

[0]: # Block 1
my_VGG16.add(Conv2D(64, (3, 3), input_shape=(224, 224, 3), padding='same',
                    activation='relu'))
my_VGG16.add(Conv2D(64, (3, 3), padding='same', activation='relu'))
my_VGG16.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))

[0]: # Block2
my_VGG16.add(Conv2D(128, (3, 3), padding='same', activation='relu'))

my_VGG16.add(Conv2D(128, (3, 3), padding='same', activation='relu'))

my_VGG16.add(MaxPooling2D(pool_size=(2,2), strides=(2,2)))

[0]: # Block3
my_VGG16.add(Conv2D(256, (3, 3), padding='same', activation='relu'))

my_VGG16.add(MaxPooling2D(pool_size=(2,2), strides=(2,2)))

[0]: # Block4
my_VGG16.add(Conv2D(512, (3, 3), padding='same', activation='relu'))

my_VGG16.add(Conv2D(512, (3, 3), padding='same', activation='relu'))

my_VGG16.add(MaxPooling2D(pool_size=(2,2), strides=(2,2)))

[0]: # Block5
my_VGG16.add(Conv2D(512, (3, 3), padding='same', activation='relu'))

my_VGG16.add(MaxPooling2D(pool_size=(2,2), strides=(2,2)))

[0]: # Fully-connected classifier


my_VGG16.add(Flatten())
my_VGG16.add(Dense(2048, activation='relu'))
my_VGG16.add(Dense(2048, activation='relu'))
my_VGG16.add(Dense(120, activation='softmax'))
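
A model summary makes it easy to confirm how much smaller this network is than the original VGG16 (roughly 138M parameters):

[ ]: my_VGG16.summary()   # layer shapes and parameter count of the reduced architecture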

[0]: my_VGG16.compile(optimizer=tf.keras.optimizers.RMSprop(learning_rate=0.001),
loss='categorical_crossentropy', metrics=['accuracy'])

[0]: history_vgg16 = my_VGG16.fit(vgg16_train_generator,


epochs=30,
steps_per_epoch=train_df.shape[0]//32,
validation_data=vgg16_validation_generator,
validation_steps=validate_df.shape[0]//32,
callbacks=[learning_rate_reduction, earlystop]
)

Epoch 1/30
382/382 [==============================] - 273s 713ms/step - loss: 81.5608 -
accuracy: 0.0074 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 0.0010
Epoch 2/30

382/382 [==============================] - 271s 708ms/step - loss: 4.8113 -
accuracy: 0.0064 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 0.0010
Epoch 3/30
382/382 [==============================] - 272s 712ms/step - loss: 4.8540 -
accuracy: 0.0072 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 0.0010
Epoch 4/30
382/382 [==============================] - ETA: 0s - loss: 4.8599 - accuracy:
0.0070
Epoch 00004: ReduceLROnPlateau reducing learning rate to 0.0005000000237487257.
382/382 [==============================] - 273s 716ms/step - loss: 4.8599 -
accuracy: 0.0070 - val_loss: 4.7877 - val_accuracy: 0.0083 - lr: 0.0010
Epoch 5/30
382/382 [==============================] - 273s 715ms/step - loss: 4.7883 -
accuracy: 0.0084 - val_loss: 4.7878 - val_accuracy: 0.0083 - lr: 5.0000e-04
Epoch 6/30
382/382 [==============================] - 272s 711ms/step - loss: 4.6768 -
accuracy: 0.0162 - val_loss: 4.7853 - val_accuracy: 0.0093 - lr: 5.0000e-04
Epoch 7/30
382/382 [==============================] - 271s 710ms/step - loss: 4.5277 -
accuracy: 0.0313 - val_loss: 4.7874 - val_accuracy: 0.0074 - lr: 5.0000e-04
Epoch 8/30
382/382 [==============================] - 274s 716ms/step - loss: 4.3475 -
accuracy: 0.0424 - val_loss: 5.0457 - val_accuracy: 0.0095 - lr: 5.0000e-04
Epoch 9/30
382/382 [==============================] - 274s 718ms/step - loss: 4.2178 -
accuracy: 0.0598 - val_loss: 4.7801 - val_accuracy: 0.0122 - lr: 5.0000e-04
Epoch 10/30
382/382 [==============================] - 274s 718ms/step - loss: 4.0927 -
accuracy: 0.0702 - val_loss: 4.8033 - val_accuracy: 0.0115 - lr: 5.0000e-04
Epoch 11/30
382/382 [==============================] - 274s 718ms/step - loss: 3.9952 -
accuracy: 0.0866 - val_loss: 5.4929 - val_accuracy: 0.0055 - lr: 5.0000e-04
Epoch 12/30
382/382 [==============================] - ETA: 0s - loss: 3.8918 - accuracy:
0.1011
Epoch 00012: ReduceLROnPlateau reducing learning rate to 0.0002500000118743628.
382/382 [==============================] - 274s 719ms/step - loss: 3.8918 -
accuracy: 0.1011 - val_loss: 5.3590 - val_accuracy: 0.0086 - lr: 5.0000e-04
Epoch 13/30
382/382 [==============================] - 274s 717ms/step - loss: 3.5799 -
accuracy: 0.1453 - val_loss: 5.9678 - val_accuracy: 0.0086 - lr: 2.5000e-04
Epoch 14/30
382/382 [==============================] - 274s 718ms/step - loss: 3.4416 -
accuracy: 0.1651 - val_loss: 5.9237 - val_accuracy: 0.0081 - lr: 2.5000e-04
Evaluation
[0]: vgg16_test_datagen = ImageDataGenerator(rescale=1./255)

[0]: vgg16_test_generator = vgg16_test_datagen.\
flow_from_dataframe(dataframe=test_df,
directory=dir_test,
x_col='filename',
y_col='category',
target_size=(224, 224),
class_mode='categorical',
batch_size=32
)

Found 4140 validated image filenames belonging to 120 classes.

[0]: loss_and_metrics_v = my_VGG16.evaluate(vgg16_test_generator, batch_size=32,
                                        steps=test_df.shape[0]//32)

129/129 [==============================] - 25s 194ms/step - loss: 5.8860 -
accuracy: 0.0046

[0]: loss_and_metrics_v

[0]: [5.885979652404785, 0.004602713044732809]
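
Note that 0.46% test accuracy is below the random-guess baseline of 1/120 ≈ 0.83%; the preprocessing mismatch between the training and evaluation generators noted above is a likely contributor.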

Based on the Xception model


[0]: Xception_train_datagen = ImageDataGenerator(rotation_range=20,
                                            width_shift_range=0.1,
                                            height_shift_range=0.1,
                                            shear_range=0.01,
                                            zoom_range=0.1,
                                            horizontal_flip=True,
                                            brightness_range=[0.5, 1.5],
                                            preprocessing_function=preprocessing_from_scratch)

[0]: Xception_train_generator = Xception_train_datagen.\
    flow_from_dataframe(dataframe=train_df,
                        directory=dir_train,
                        x_col='filename',
                        y_col='category',
                        target_size=(299, 299),
                        class_mode='categorical',
                        batch_size=32
                        )

Found 12240 validated image filenames belonging to 120 classes.

[0]: # Entry flow
main_input = tf.keras.Input(shape=(299, 299, 3), name='main_input')

x = Conv2D(32, (3, 3), strides=(2, 2), activation='relu',
           padding="same")(main_input)
x = Conv2D(64, (3, 3), activation='relu', padding="same")(x)

# 1x1 convolution shortcut ('tower'), strided to match the downsampled main path
tower_1 = Conv2D(1, (1, 1), strides=(2, 2))(x)

x = SeparableConv2D(128, (3, 3), padding='same')(x)
x = Activation('relu')(x)
x = SeparableConv2D(128, (3, 3), padding='same')(x)
x = MaxPooling2D(pool_size=(3, 3), strides=(2, 2), padding="same")(x)
x = concatenate([x, tower_1], axis=-1)

tower_2 = Conv2D(1, (1, 1), strides=(2, 2))(tower_1)

x = Activation('relu')(x)
x = SeparableConv2D(256, (3, 3), padding="same")(x)
x = Activation('relu')(x)
x = SeparableConv2D(256, (3, 3), padding="same")(x)
x = MaxPooling2D(pool_size=(3, 3), strides=(2, 2), padding="same")(x)
x = concatenate([x, tower_2], axis=-1)

tower_3 = Conv2D(1, (1, 1), strides=(2, 2))(tower_2)

x = Activation('relu')(x)
x = SeparableConv2D(768, (3, 3), padding="same")(x)
x = Activation('relu')(x)
x = SeparableConv2D(768, (3, 3), padding="same")(x)
x = MaxPooling2D(pool_size=(3, 3), strides=(2, 2), padding="same")(x)
x = concatenate([x, tower_3], axis=-1)

[0]: # Middle flow: repeated 4 times
for i in range(4):
    y = Activation('relu')(x)
    y = SeparableConv2D(728, (3, 3), padding="same")(y)
    y = Activation('relu')(y)
    y = SeparableConv2D(728, (3, 3), padding="same")(y)
    y = Activation('relu')(y)
    y = SeparableConv2D(728, (3, 3), padding="same")(y)
    x = concatenate([x, y])

[0]: # Exit flow
tower_4 = Conv2D(1, (1, 1), strides=(2, 2))(x)

x = Activation('relu')(x)
x = SeparableConv2D(728, (3, 3), padding="same")(x)
x = Activation('relu')(x)
x = SeparableConv2D(1024, (3, 3), padding="same")(x)
x = MaxPooling2D(pool_size=(3, 3), strides=(2, 2), padding="same")(x)
x = concatenate([x, tower_4])

x = SeparableConv2D(1536, (3, 3), activation='relu', padding="same")(x)
x = SeparableConv2D(2048, (3, 3), activation='relu', padding="same")(x)
x = GlobalAveragePooling2D()(x)

x = Flatten()(x)
x = Dense(2048, activation='relu')(x)
x = Dense(1024, activation='relu')(x)
x = Dense(120, activation='softmax')(x)

[0]: my_Xception = tf.keras.Model(inputs=main_input, outputs=x)
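
A sketch for inspecting the graph (requires the optional pydot and graphviz packages): rendering the model to an image helps verify that the 1x1 'tower' shortcuts and concatenations are wired as intended.

[ ]: tf.keras.utils.plot_model(my_Xception, to_file='my_Xception.png', show_shapes=True)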

[0]: my_Xception.compile(optimizer=tf.keras.optimizers.RMSprop(learning_rate=0.001),
loss='categorical_crossentropy', metrics=['accuracy'])

[0]: history_Xception = my_Xception.\
    fit(Xception_train_generator,
        epochs=30,
        steps_per_epoch=train_df.shape[0]//32,
        validation_data=validation_generator,
        validation_steps=validate_df.shape[0]//32,
        callbacks=[learning_rate_reduction, earlystop]
        )

Epoch 1/30
382/382 [==============================] - 429s 1s/step - loss: 4.7893 -

accuracy: 0.0069 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 0.0010
Epoch 2/30
382/382 [==============================] - 416s 1s/step - loss: 4.7889 -
accuracy: 0.0056 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 0.0010
Epoch 3/30
382/382 [==============================] - 415s 1s/step - loss: 4.7885 -
accuracy: 0.0065 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 0.0010
Epoch 4/30
382/382 [==============================] - ETA: 0s - loss: 4.7885 - accuracy:
0.0079
Epoch 00004: ReduceLROnPlateau reducing learning rate to 0.0005000000237487257.
382/382 [==============================] - 414s 1s/step - loss: 4.7885 -
accuracy: 0.0079 - val_loss: 4.7875 - val_accuracy: 0.0081 - lr: 0.0010
Epoch 5/30
382/382 [==============================] - 417s 1s/step - loss: 4.7880 -
accuracy: 0.0072 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 5.0000e-04
Epoch 6/30
382/382 [==============================] - 415s 1s/step - loss: 4.7880 -
accuracy: 0.0073 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 5.0000e-04
Epoch 7/30
382/382 [==============================] - ETA: 0s - loss: 4.7880 - accuracy:
0.0072
Epoch 00007: ReduceLROnPlateau reducing learning rate to 0.0002500000118743628.
382/382 [==============================] - 417s 1s/step - loss: 4.7880 -
accuracy: 0.0072 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 5.0000e-04
Epoch 8/30
382/382 [==============================] - 418s 1s/step - loss: 4.7878 -
accuracy: 0.0079 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 2.5000e-04
Epoch 9/30
382/382 [==============================] - 419s 1s/step - loss: 4.7878 -
accuracy: 0.0071 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 2.5000e-04
Epoch 10/30
382/382 [==============================] - ETA: 0s - loss: 4.7878 - accuracy:
0.0072
Epoch 00010: ReduceLROnPlateau reducing learning rate to 0.0001250000059371814.
382/382 [==============================] - 421s 1s/step - loss: 4.7878 -
accuracy: 0.0072 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 2.5000e-04
Epoch 11/30
382/382 [==============================] - 423s 1s/step - loss: 4.7876 -
accuracy: 0.0077 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 1.2500e-04
Epoch 12/30
382/382 [==============================] - 424s 1s/step - loss: 4.7876 -
accuracy: 0.0066 - val_loss: 4.7875 - val_accuracy: 0.0083 - lr: 1.2500e-04
Evaluation
[0]: loss_and_metrics_x = my_Xception.evaluate(test_generator, batch_size=32,
steps=test_df.shape[0]//32)

129/129 [==============================] - 28s 218ms/step - loss: 4.7874 -
accuracy: 0.0041

[0]: # Benchmark: loss = 4.78749 -- https://www.kaggle.com/c/dog-breed-identification/leaderboard
# Note: 4.7875 = ln(120), the cross-entropy of a uniform guess over 120 classes,
# so the from-scratch Xception has effectively collapsed to a uniform prediction

loss_and_metrics_x

[0]: [4.787434101104736, 0.004118217155337334]
