Autoencoder Report 1
2023
Autoencoder
Mattu University,
Department of ECE
MULUKEN TESFAYE
MATTU UNIVERSITY
INDIVIDUAL ASSIGNMENT
COURSE TITLE: MACHINE LEARNING FOR COMMUNICATION SYSTEM
METTU, ETHIOPIA
Table of Contents
Abstract
Introduction
Basic Autoencoder System
Components of Autoencoder
Difference Between Encoding and Decoding
Architecture of Autoencoder
Types of Autoencoders
Application of Autoencoders
Implementation of Autoencoder
Explanation of Big Data Architecture
References
Abstract
Autoencoders, one of the most promising and successful architectures in deep learning (DL), have been widely used in wireless communications. However, the fast-increasing size of the neural networks (NNs) used in autoencoders leads to high storage requirements and heavy computational overhead, which poses a challenge to the practical deployment of autoencoders in real communication systems. An autoencoder is a type of artificial neural network used to discover efficient data codings in an unsupervised manner. Its goal is to learn a representation for a set of data, especially for dimensionality reduction. Autoencoders have the distinctive feature that the input is expected to equal the output, realized as a feedforward network: the autoencoder turns the input into compressed data to form a low-dimensional code and then reconstructs the input from that code to form the desired output. The compressed code of the input is also called the latent space representation. In short, the main aim is to minimize the distortion between the input and its reconstruction [1].
Keywords
Wireless communication
Deep learning (DL)
Neural networks (NNs)
Introduction
A traditional autoencoder is an unsupervised neural network that learns how to efficiently
compress data, which is also called encoding. The autoencoder also learns how to reconstruct
the data from the compressed representation such that the difference between the original
data and the reconstructed data is minimal.
Traditional wireless communication systems are designed to provide reliable data transfer
over a channel that impairs the transmitted signals. These systems have multiple components
such as channel coding, modulation, equalization, synchronization, etc. Each component is
optimized independently based on mathematical models that are simplified to arrive at closed
form expressions. On the contrary, an autoencoder jointly optimizes the transmitter and the
receiver as a whole. This joint optimization has the potential of providing a better
performance than the traditional systems [2],[3].
Traditional autoencoders are usually used to compress images, in other words remove
redundancies in an image and reduce its dimension. A wireless communication system on the
other hand uses channel coding and modulation techniques to add redundancy to the
information bits. With this added redundancy, the system can recover the information bits
that are impaired by the wireless channel. So, a wireless autoencoder actually adds
redundancy and tries to minimize the number of errors in the received information for a given
channel while learning to apply both channel coding and modulation in an unsupervised way.
Basic Autoencoder System
A wireless autoencoder system consists of an encoder (transmitter), a channel, and a decoder (receiver). The encoder accepts a message s drawn from a set of M possible messages, where M = 2^k, and maps it to n real numbers to create x = f(s) ∈ ℝ^n. The last layer of the encoder imposes constraints on x to further restrict the encoded symbols; typical constraints, implemented using a normalization layer, are an energy constraint (||x||^2 = n) or an average power constraint (E[|x_i|^2] = 1 for all i).
Define the communication rate of this system as R = k/n [bits/channel use], where (n,k) means that the system sends one of M = 2^k messages using n channel uses. The channel impairs the encoded (i.e. transmitted) symbols to generate y ∈ ℝ^n. The decoder (i.e. receiver) produces an estimate, ŝ, of the transmitted message, s.
The input message is defined as a one-hot vector 1_s ∈ ℝ^M, i.e. a vector whose elements are all zeros except the s-th one. The channel is additive white Gaussian noise (AWGN) that adds noise to achieve a given energy per data bit to noise power spectral density ratio, Eb/No.
The autoencoder maps k data bits into n channel uses, which results in an effective coding rate of R = k/n data bits per channel use. Then, 2 channel uses are mapped into one channel symbol, i.e. 2 channel uses per symbol; this channel-uses-per-symbol value is mapped to the BitsPerSymbol parameter of the AWGN channel.
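As an illustration of this setup, the following is a minimal Keras sketch of an (n,k) = (7,4) wireless autoencoder trained end to end over an AWGN channel. The layer sizes, the Eb/No value, the noise model, and the use of an energy-normalization layer are assumptions made for the sketch, not details taken from the text above.

import numpy as np
import tensorflow as tf
from tensorflow.keras import layers, Model

k, n = 4, 7                                     # (n,k): one of M = 2^k messages over n channel uses
M = 2 ** k
EbNo_dB = 7.0                                   # assumed training SNR
noise_std = np.sqrt(1.0 / (2 * (k / n) * 10 ** (EbNo_dB / 10)))

# Encoder: one-hot message -> n real-valued channel symbols with an energy constraint
inp = layers.Input(shape=(M,))
x = layers.Dense(M, activation='relu')(inp)
x = layers.Dense(n, activation='linear')(x)
x = layers.Lambda(lambda v: (n ** 0.5) * tf.math.l2_normalize(v, axis=1))(x)  # ||x||^2 = n

# AWGN channel (noise is applied while training)
y = layers.GaussianNoise(noise_std)(x)

# Decoder: estimate which of the M messages was sent
d = layers.Dense(M, activation='relu')(y)
out = layers.Dense(M, activation='softmax')(d)

autoencoder = Model(inp, out)
autoencoder.compile(optimizer='adam', loss='categorical_crossentropy')

# Train on randomly drawn one-hot messages (the message is both input and label)
msgs = np.eye(M)[np.random.randint(0, M, 10000)]
autoencoder.fit(msgs, msgs, epochs=10, batch_size=256, verbose=0)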
Components of Autoencoder
There are three main components in an autoencoder: the encoder, the decoder, and the code. The encoder and decoder are fully connected feedforward networks, while the code is a single layer whose dimensionality is chosen by the designer. To develop an autoencoder, you have to set a hyperparameter: the number of nodes in the code layer. The decoder network mirrors the encoder, and the decoder produces the desired output using only the code layer.
The encoder and decoder should have matching dimensions. The important parameters when setting up an autoencoder are the code size, the number of layers, and the number of nodes in each layer.
Code size is defined by the total number of nodes in the middle layer; a smaller middle layer gives stronger compression. The autoencoder can be as deep or as shallow as you wish, but the number of nodes should match between the encoder and decoder, i.e. the decoder and encoder layers must be symmetric.
In a stacked autoencoder, you have one hidden layer in both the encoder and decoder. The example dataset consists of handwritten digit images of size 28*28. You can develop an autoencoder with 128 nodes in the hidden layer and a code size of 32. To add more layers, use model.add:
model.add(Dense(16,activation='relu'))
model.add(Dense(8, activation='relu'))
Alternatively, layers can be defined in the functional style:
layer_1 = Dense(16, activation='relu')(input)
The output of this layer is then passed as the input to the next layer; each Dense layer is callable in this way. The decoder is built the same way, but its final layer uses a sigmoid activation to obtain outputs between 0 and 1, since the input values also lie in the 0 to 1 range.
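Putting these pieces together, a minimal sketch of the stacked autoencoder described above (28*28 images flattened to 784 values, a 128-node hidden layer, and a code size of 32) could look as follows; the choice of optimizer and loss is an assumption for the sketch.

from tensorflow.keras import layers, Model

input_img = layers.Input(shape=(784,))                              # flattened 28*28 image
hidden_enc = layers.Dense(128, activation='relu')(input_img)
code = layers.Dense(32, activation='relu')(hidden_enc)              # code size = 32
hidden_dec = layers.Dense(128, activation='relu')(code)             # decoder mirrors the encoder
output_img = layers.Dense(784, activation='sigmoid')(hidden_dec)    # sigmoid keeps outputs in [0, 1]

autoencoder = Model(input_img, output_img)
autoencoder.compile(optimizer='adam', loss='binary_crossentropy')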
Difference Between Encoding and Decoding
Encoding means the creation of a message (something you want to communicate to another person). Decoding, on the other hand, is what the listener or audience does with the encoded message: it means interpreting the meaning of the message, i.e. understanding what has just been said.
All communication begins with the sender. The first step the sender is faced with involves
the encoding process. In order to convey meaning, the sender must begin encoding, which means
translating information into a message in the form of symbols that represent ideas or concepts.
The decoding of a message is how an audience member is able to understand and interpret the message.
Architecture of Autoencoder
In this stacked architecture, the code layer has a smaller dimension than the input information; such a network is called an undercomplete autoencoder.
1. Denoising Autoencoders
In this method you cannot simply copy the input signal to the output to get a perfect result, because the input signal contains noise that must be removed to recover the underlying data; an autoencoder trained this way is called a denoising autoencoder. Starting from the original images, some noise is added to create the noisy input signal. You can then train the autoencoder to produce a noise-free output as follows:
autoencoder.fit(x_train_noisy, x_train)
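One way to create the noisy training set used in the call above is sketched below; the Gaussian noise and the noise factor of 0.5 are assumptions for the sketch, and x_train is assumed to contain pixel values already scaled to [0, 1].

import numpy as np

noise_factor = 0.5
x_train_noisy = x_train + noise_factor * np.random.normal(size=x_train.shape)
x_train_noisy = np.clip(x_train_noisy, 0.0, 1.0)   # keep pixel values in [0, 1]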
A convolutional autoencoder can be used to handle complex signals such as images and generally gives better results than the fully connected approach.
2. Sparse Autoencoders
Another effective method is regularization, applied here in the form of a sparsity constraint. To keep only part of the nodes in a layer active, extra terms are added to the loss function; this pushes the autoencoder to represent each input with a small combination of nodes and makes the encoder find unique structures in the given data. It also scales well to large datasets, because only a fraction of the nodes is activated at a time.
In such a model the final loss can be as small as 0.01, largely because of the regularization term. In this sparse model, many of the code values come close to the expected result, while keeping fairly low variance.
Regularized autoencoders have useful properties such as robustness to missing inputs, sparse representations, and small derivatives of the representation. Rather than relying on a minimal code size and a shallow encoder and decoder, they use the regularized loss itself to keep the encoding useful: the model can have high capacity and still discover structure in the inputs, because it is trained to do more than simply copy the input to the output.
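As a sketch of how such a regularization term can be added in Keras, an L1 activity regularizer on the code layer penalizes large activations and drives most of them towards zero; the layer sizes and the penalty weight of 1e-5 are assumed values.

from tensorflow.keras import layers, regularizers, Model

inputs = layers.Input(shape=(784,))
# L1 penalty on the code activations: only a few nodes stay active per input
code = layers.Dense(32, activation='relu',
                    activity_regularizer=regularizers.l1(1e-5))(inputs)
outputs = layers.Dense(784, activation='sigmoid')(code)

sparse_ae = Model(inputs, outputs)
sparse_ae.compile(optimizer='adam', loss='binary_crossentropy')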
3. Variational Autoencoder
It is used in more complex cases: it learns the probability distribution that models the input data. The variational autoencoder uses a sampling step to produce its output, and otherwise follows the same overall architecture as regularized autoencoders.
Hence, autoencoders are used to learn from real-world data and images involved in binary and multiclass classification. They offer a simple approach to dimensionality reduction, are closely related to the Restricted Boltzmann machine and play a vital role alongside it, and are also used in the biochemical industry to uncover previously hidden structure and identify intelligent behavior patterns. Many components in machine learning have a self-organizing character; the autoencoder is one of the approaches of this kind that has been successful in artificial intelligence.
When it comes to managing heavy data and performing complex operations on that massive data, there is a need to use big data tools and techniques. Using big data tools and techniques effectively means making use of the various software and procedures that belong to the big data ecosystem. There is no generic solution that fits every use case, so a solution has to be crafted to match the business requirements of a particular company. There is therefore a need for different big data architectures, since it is the combination of various technologies that achieves the required use case. By establishing a fixed architecture, it can be ensured that a viable solution is provided for the use case at hand.
Types of Autoencoders
There are, basically, 7 types of autoencoders:
Denoising autoencoder
Sparse Autoencoder
Deep Autoencoder
Contractive Autoencoder
Undercomplete Autoencoder
Convolutional Autoencoder
Variational Autoencoder
1) Denoising Autoencoder
Denoising autoencoders create a corrupted copy of the input by introducing some noise. This prevents the autoencoder from simply copying the input to the output without learning features of the data. These autoencoders take a partially corrupted input while training to recover the original undistorted input. The model learns a vector field that maps the input data towards a lower-dimensional manifold describing the natural data, cancelling out the added noise.
Advantages
It was introduced to achieve a good representation: one that can be obtained robustly from a corrupted input and that is useful for recovering the corresponding clean input.
Corruption of the input can be done randomly by setting some of the input values to zero; the remaining values are copied unchanged into the noisy input.
Training minimizes the loss between the reconstructed output and the original, uncorrupted input.
Drawbacks
This model isn't able to develop a mapping which memorizes the training data
because our input and target output are no longer the same.
2) Sparse Autoencoder
Sparse autoencoders can have more hidden nodes than input nodes, yet they can still discover important features of the data. In a typical visualization of a sparse autoencoder, the opacity of a node corresponds to its level of activation. A sparsity constraint is introduced on the hidden layer to prevent the output layer from simply copying the input data. Sparsity may be obtained by additional terms in the loss function during the training process, either by comparing the probability distribution of the hidden unit activations with some low desired value, or by manually zeroing all but the strongest hidden unit activations. Some of the most powerful AIs in the 2010s involved sparse autoencoders stacked inside deep neural networks.
Advantages
Sparse autoencoders have a sparsity penalty, a value close to zero but not exactly
zero. Sparsity penalty is applied on the hidden layer in addition to the reconstruction
error. This prevents overfitting.
They take the highest activation values in the hidden layer and zero out the rest of the hidden nodes. This prevents the autoencoder from using all of the hidden nodes at a time and forces only a reduced number of hidden nodes to be used.
Drawbacks
For it to work, it is essential that the individual nodes of a trained model which activate are data dependent, and that different inputs result in activations of different nodes through the network.
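The KL-divergence form of the sparsity penalty mentioned above can be sketched as a custom activity regularizer that pushes the average activation of each hidden unit towards a small target value rho; rho, beta, and the layer sizes are assumed values for the sketch.

import tensorflow as tf
from tensorflow.keras import layers, Model

class KLSparsity(tf.keras.regularizers.Regularizer):
    """Penalize hidden units whose mean activation drifts away from rho."""
    def __init__(self, rho=0.05, beta=1e-3):
        self.rho, self.beta = rho, beta

    def __call__(self, activations):
        rho_hat = tf.clip_by_value(tf.reduce_mean(activations, axis=0), 1e-7, 1 - 1e-7)
        kl = (self.rho * tf.math.log(self.rho / rho_hat)
              + (1 - self.rho) * tf.math.log((1 - self.rho) / (1 - rho_hat)))
        return self.beta * tf.reduce_sum(kl)

inputs = layers.Input(shape=(784,))
code = layers.Dense(1024, activation='sigmoid',        # more hidden nodes than inputs
                    activity_regularizer=KLSparsity())(inputs)
outputs = layers.Dense(784, activation='sigmoid')(code)

sparse_ae = Model(inputs, outputs)
sparse_ae.compile(optimizer='adam', loss='binary_crossentropy')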
3) Deep Autoencoder
Deep autoencoders consist of two identical deep belief networks, one network for encoding and another for decoding. Typically deep autoencoders have 4 to 5 layers for encoding and the next 4 to 5 layers for decoding. We use unsupervised layer-by-layer pre-training for this model. The layers are Restricted Boltzmann Machines, which are the building blocks of deep belief networks. Processing the benchmark dataset MNIST, a deep autoencoder would use binary transformations after each RBM. Deep autoencoders are useful in topic modeling, i.e. statistically modeling abstract topics that are distributed across a collection of documents. They are also capable of compressing images into 30-dimensional vectors.
Advantages
Deep autoencoders can be used for other types of datasets with real-valued data, on
which you would use Gaussian rectified transformations for the RBMs instead.
Drawbacks
There is a chance of overfitting, since there are more parameters than input data.
Training may be a nuisance, since at the stage of the decoder's backpropagation the learning rate should be lowered or slowed depending on whether binary or continuous data is being handled.
4) Contractive Autoencoder
A contractive autoencoder adds an explicit regularization term to the reconstruction loss that penalizes the sensitivity of the code to small changes of the input (the Frobenius norm of the Jacobian of the encoder activations with respect to the input), so the learned representation is robust to slight variations of the training examples.
Advantages
This model learns an encoding in which similar inputs have similar encodings. Hence,
we're forcing the model to learn how to contract a neighborhood of inputs into a
smaller neighborhood of outputs.
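A sketch of this idea is given below, using the closed-form Jacobian penalty for a single sigmoid Dense encoder layer; the layer sizes and the penalty weight lam are assumed values, and inputs are assumed to be floats in [0, 1]. The model can then be compiled with an optimizer and trained with model.fit(x_train, batch_size=..., epochs=...), since the target is the input itself.

import tensorflow as tf
from tensorflow.keras import layers, Model

class ContractiveAE(Model):
    def __init__(self, input_dim=784, code_dim=64, lam=1e-4):
        super().__init__()
        self.lam = lam
        self.encode = layers.Dense(code_dim, activation='sigmoid')
        self.decode = layers.Dense(input_dim, activation='sigmoid')

    def call(self, x):
        return self.decode(self.encode(x))

    def train_step(self, x):
        with tf.GradientTape() as tape:
            h = self.encode(x)
            x_hat = self.decode(h)
            recon = tf.reduce_mean(tf.keras.losses.binary_crossentropy(x, x_hat))
            # For a sigmoid Dense encoder, dh_j/dx_i = h_j * (1 - h_j) * W_ij, so the
            # squared Frobenius norm of the Jacobian has a simple closed form:
            w_sq = tf.reduce_sum(tf.square(self.encode.kernel), axis=0)       # (code_dim,)
            jacobian_norm = tf.reduce_sum(tf.square(h * (1 - h)) * w_sq, axis=1)
            loss = recon + self.lam * tf.reduce_mean(jacobian_norm)
        grads = tape.gradient(loss, self.trainable_variables)
        self.optimizer.apply_gradients(zip(grads, self.trainable_variables))
        return {"loss": loss}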
5) Undercomplete Autoencoder
The objective of an undercomplete autoencoder is to capture the most important features present in the data. Undercomplete autoencoders have a smaller dimension for the hidden layer compared to the input layer. This helps to obtain important features from the data. It minimizes the loss function by penalizing the reconstruction g(f(x)) for being different from the input x.
Advantages
The small hidden layer itself forces the network to keep only the most salient features of the training data, so no separate regularization term is needed for the compression to be meaningful.
Drawbacks
Using an overparameterized model due to lack of sufficient training data can create
overfitting.
6) Convolutional Autoencoder
Autoencoders in their traditional formulation do not take into account the fact that a signal can be seen as a sum of other signals. Convolutional autoencoders use the convolution operator to exploit this observation. They learn to encode the input as a set of simple signals and then try to reconstruct the input from them, modifying the geometry or the reflectance of the image. They are the state-of-the-art tools for unsupervised learning of convolutional filters. Once these filters have been learned, they can be applied to any input in order to extract features. These features can then be used for any task that requires a compact representation of the input, such as classification.
Advantages
Due to their convolutional nature, they scale well to realistic-sized high dimensional
images.
Drawbacks
The reconstruction of the input image is often blurry and of lower quality due to
compression during which information is lost.
7) Variational Autoencoder
Variational autoencoders use a variational approach for latent representation learning, which results in an additional loss component and a specific training estimator called the Stochastic Gradient Variational Bayes estimator. The model assumes that the data is generated by a directed graphical model p_θ(x|z) and that the encoder learns an approximation q_Ф(z|x) to the posterior distribution, where Ф and θ denote the parameters of the encoder (recognition model) and decoder (generative model) respectively. The probability distribution of the latent vector of a variational autoencoder typically matches that of the training data much more closely than that of a standard autoencoder.
Advantages
It gives significant control over how we want to model our latent distribution unlike
the other models.
After training you can just sample from the distribution followed by decoding and
generating new data.
Drawbacks
When training the model, there is a need to calculate the relationship of each
parameter in the network with respect to the final output loss using a technique known
as backpropagation. Hence, the sampling process requires some extra attention.
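The sampling issue mentioned in the drawback above is usually handled with the reparameterization trick, so that gradients can flow through the sampling step. A compact sketch, with assumed layer sizes and a 2-dimensional latent space, is given below.

import tensorflow as tf
from tensorflow.keras import layers, Model

latent_dim = 2

inputs = layers.Input(shape=(784,))
h = layers.Dense(256, activation='relu')(inputs)
z_mean = layers.Dense(latent_dim)(h)
z_log_var = layers.Dense(latent_dim)(h)

def sample(args):
    mean, log_var = args
    eps = tf.random.normal(shape=tf.shape(mean))
    return mean + tf.exp(0.5 * log_var) * eps      # reparameterization trick

z = layers.Lambda(sample)([z_mean, z_log_var])
h_dec = layers.Dense(256, activation='relu')(z)
outputs = layers.Dense(784, activation='sigmoid')(h_dec)

vae = Model(inputs, outputs)

# Loss = reconstruction error + KL divergence between q(z|x) and the prior N(0, I)
recon = 784 * tf.reduce_mean(tf.keras.losses.binary_crossentropy(inputs, outputs))
kl = -0.5 * tf.reduce_mean(tf.reduce_sum(1 + z_log_var - tf.square(z_mean) - tf.exp(z_log_var), axis=1))
vae.add_loss(recon + kl)
vae.compile(optimizer='adam')

After training, new data can be generated by sampling z from N(0, I) and passing it through the decoder layers.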
Application of Autoencoders
So far we have seen a variety of autoencoders, and each of them is good at a specific task. Let's look at some of the tasks they can perform:
1. File Compression: The primary use of autoencoders is to reduce the dimensionality of input data, commonly referred to as file compression. Autoencoders work with all kinds of data, such as images, video, and audio; this helps in sharing and viewing data faster than we could with the original file size.
2. Image De-noising: Autoencoders are also used for noise removal (image de-noising). What makes them a good choice for de-noising is that they do not require any human interaction; once trained on a given kind of data, they can reproduce that data with less noise than the original image.
3. Image Transformation: Autoencoders are also used for image transformations, which are typically classified under GAN (Generative Adversarial Network) models. Using these we can transform black-and-white images to colored ones and vice versa, and we can up-sample and down-sample the input data, etc.
4. Data Compression
Although autoencoders are designed for data compression, they are hardly used for this purpose in practical situations. The reasons are:
Lossy compression: The output of the autoencoder is not exactly the same as the
input, it is a close but degraded representation. For lossless compression, they are not
the way to go.
Data-specific: Autoencoders are only able to meaningfully compress data similar to
what they have been trained on. Since they learn features specific for the given
training data, they are different from a standard data compression algorithm like jpeg
or gzip. Hence, we can’t expect an autoencoder trained on handwritten digits to
compress landscape photos.
Since we have more efficient and simpler algorithms like JPEG, LZMA, and LZSS (used in WinRAR in tandem with Huffman coding), autoencoders are not generally used for compression, although they have seen use for image denoising and dimensionality reduction in recent years.
5. Machine translation
Autoencoders have been applied to machine translation, which is usually referred to as neural
machine translation (NMT). Unlike traditional autoencoders, the output does not match the
input - it is in another language. In NMT, texts are treated as sequences to be encoded into the
learning procedure, while on the decoder side sequences in the target language(s) are
generated. Language-specific autoencoders incorporate further linguistic features into the
learning procedure, such as Chinese decomposition features. Machine translation is rarely done with autoencoders anymore; transformer networks are used instead.
6. Anomaly detection
Another application for autoencoders is anomaly detection. By learning to replicate the most
salient features in the training data under some of the constraints described previously, the
model is encouraged to learn to precisely reproduce the most frequently observed
characteristics. When facing anomalies, the model should worsen its reconstruction
performance. In most cases, only data with normal instances are used to train the
autoencoder; in others, the frequency of anomalies is small compared to the observation set
so that its contribution to the learned representation could be ignored. After training, the
autoencoder will accurately reconstruct "normal" data, while failing to do so with unfamiliar
anomalous data. Reconstruction error (the error between the original data and its low-dimensional reconstruction) is used as an anomaly score to detect anomalies.
Recent literature has however shown that certain autoencoding models can,
counterintuitively, be very good at reconstructing anomalous examples and consequently not
able to reliably perform anomaly detection.
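In code, the reconstruction-error score described above can be sketched as follows; here autoencoder is assumed to be a model already trained on normal data only, x_normal and x_test are assumed NumPy arrays of flattened samples, and the 95th-percentile threshold is an assumed choice.

import numpy as np

# Score = mean squared reconstruction error per sample
recon_normal = autoencoder.predict(x_normal)
errors_normal = np.mean(np.square(x_normal - recon_normal), axis=1)
threshold = np.percentile(errors_normal, 95)     # tolerate the worst 5% of normal data

recon_test = autoencoder.predict(x_test)
errors_test = np.mean(np.square(x_test - recon_test), axis=1)
is_anomaly = errors_test > threshold             # flag samples the model reconstructs poorly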
7. Information retrieval
Information retrieval benefits particularly from dimensionality reduction in that search can
become more efficient in certain kinds of low dimensional spaces. Autoencoders were indeed
applied to semantic hashing, proposed by Salakhutdinov and Hinton in 2007. By training the
algorithm to produce a low-dimensional binary code, all database entries could be stored in
a hash table mapping binary code vectors to entries. This table would then support
information retrieval by returning all entries with the same binary code as the query, or
slightly less similar entries by flipping some bits from the query encoding.
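A sketch of the semantic hashing idea is shown below: the code produced by a trained encoder is binarized and used as a key into a hash table, so retrieval returns all entries sharing the query's binary code. Here encoder, database, and query are assumed to exist (a trained encoder model, a 2-D NumPy array of entries, and a single 1-D query vector).

import numpy as np

codes = encoder.predict(database)                      # low-dimensional codes for all entries
binary_codes = (codes > 0.5).astype(np.uint8)          # binarize the codes

hash_table = {}
for idx, code in enumerate(binary_codes):
    hash_table.setdefault(code.tobytes(), []).append(idx)

# Retrieval: all entries whose binary code matches the query's code
query_code = (encoder.predict(query[None, :]) > 0.5).astype(np.uint8)
matches = hash_table.get(query_code[0].tobytes(), [])

Slightly less similar entries could then be found by flipping some bits of the query code, as described above.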
Implementation of Autoencoder
In this section, we explore the concept of Image denoising which is one of the applications of
autoencoders. After getting images of handwritten digits from the MNIST dataset, we add
noise to the images and then try to reconstruct the original image out of the distorted image.
In this tutorial, we use convolutional autoencoders to reconstruct the image as they work
better with images. Also, we use Python programming language along with Keras and
TensorFlow to code this up.
### Downloading and preprocessing the dataset and adding some noise to it
import keras
import numpy as np
import matplotlib.pyplot as plt
from keras.layers import Input, Conv2D, MaxPooling2D, UpSampling2D
from keras.models import Model
The next step is to add noise to our dataset. For this purpose, we use the NumPy library to
generate random numbers with a mean of 0.5 and a standard deviation of 0.5 in the shape of
our input data. Also, to make sure the pixel values stay between 0 and 1, we use NumPy's clip function:
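A minimal sketch of this preparation step, using the imports above, might look like the following; loading MNIST through keras.datasets and the noise parameters (mean 0.5, standard deviation 0.5) follow the description in the text, while the variable names are chosen to match the training code further below.

from keras.datasets import mnist

(trainX, _), (testX, _) = mnist.load_data()
trainX = np.expand_dims(trainX.astype("float32") / 255.0, axis=-1)   # scale to [0, 1], shape (N, 28, 28, 1)
testX = np.expand_dims(testX.astype("float32") / 255.0, axis=-1)

trainXNoisy = np.clip(trainX + np.random.normal(loc=0.5, scale=0.5, size=trainX.shape), 0, 1)
testXNoisy = np.clip(testX + np.random.normal(loc=0.5, scale=0.5, size=testX.shape), 0, 1)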
Now let us visualize the distorted dataset and compare it with our original dataset. Here we display five images before and after adding noise to them:
# Display the first five original images
plt.figure(figsize=(10, 10))
for i in range(5):
    plt.subplot(1, 5, i + 1)
    plt.imshow(trainX[i].reshape(28, 28), cmap='gray')
    plt.xticks([])
    plt.yticks([])
    plt.grid(False)
plt.tight_layout()
plt.show()

# Display the same five images after adding noise
plt.figure(figsize=(10, 10))
for i in range(5):
    plt.subplot(1, 5, i + 1)
    plt.imshow(trainXNoisy[i].reshape(28, 28), cmap='gray')
    plt.xticks([])
    plt.yticks([])
    plt.grid(False)
plt.tight_layout()
plt.show()
As we can see above, the images are partially distorted after adding noise to them and we can
hardly recognize the digits. Next, we define the structure of our autoencoder, fit the distorted
images, and pass the original images as labels.
# A minimal reconstruction of the convolutional autoencoder definition (intermediate layers were missing from the source)
input_img = Input(shape=(28, 28, 1))  # adapt this if using `channels_first` image data format
x = Conv2D(32, (3, 3), activation='relu', padding='same')(input_img)   # encoder
x = MaxPooling2D((2, 2), padding='same')(x)                            # 28x28 -> 14x14
x = Conv2D(32, (3, 3), activation='relu', padding='same')(x)           # decoder
x = UpSampling2D((2, 2))(x)                                            # 14x14 -> 28x28
decoded = Conv2D(1, (3, 3), activation='sigmoid', padding='same')(x)
autoencoder = Model(input_img, decoded)
autoencoder.compile(optimizer='adadelta', loss='binary_crossentropy')
H = autoencoder.fit(
trainXNoisy, trainX,
validation_data=(testXNoisy, testX),
epochs=20,
batch_size=32)
N = np.arange(0, 20)
plt.style.use("ggplot")
plt.figure()
plt.plot(N, H.history["loss"], label="train_loss")
plt.plot(N, H.history["val_loss"], label="val_loss")
plt.xlabel("Epoch #")
plt.ylabel("Loss/Accuracy")
plt.legend(loc="lower left")
plt.show()
Output: Here is a plot which shows loss at each epoch for both training and validation sets
pred = autoencoder.predict(testXNoisy)

# Original test images
plt.figure(figsize=(10, 10))
for i in range(5):
    plt.subplot(1, 5, i + 1)
    plt.imshow(testX[i].reshape(28, 28), cmap='gray')
plt.show()

# Noisy test images
plt.figure(figsize=(10, 10))
for i in range(5):
    plt.subplot(1, 5, i + 1)
    plt.imshow(testXNoisy[i].reshape(28, 28), cmap='gray')
plt.show()

# Denoised (reconstructed) images
plt.figure(figsize=(10, 10))
for i in range(5):
    plt.subplot(1, 5, i + 1)
    plt.imshow(pred[i].reshape(28, 28), cmap='gray')
plt.show()
As we can see above, the model is able to successfully denoise the images and generate the
pictures that are pretty much identical to the original images.
Explanation of Big Data Architecture
This architecture is designed to handle the ingestion, processing, and analysis of data that is too large or complex for traditional database management systems.
Different organizations have different thresholds for what counts as big data; for some it is a few hundred gigabytes, while for others even several terabytes are not a large enough threshold value.
At the same time, if you look at commodity systems and commodity storage, the cost of storage has reduced significantly. There is a huge variety of data that demands different ways of being handled.
Some of it is batch-related data that arrives at a particular time, so the jobs must be scheduled in a similar fashion, while other data belongs to the streaming class, for which a real-time streaming pipeline has to be built to cater to all the requirements. All these challenges are addressed by big data architecture.
Big data systems involve more than one workload type, and they are broadly classified as follows:
1. Batch processing of big data sources at rest.
2. Real-time processing of big data in motion.
3. Interactive exploration of big data tools and technologies.
4. Machine learning and predictive analysis.
1. Data Sources
The data sources involve all the golden sources from which the data extraction pipeline is built, so this can be said to be the starting point of the big data pipeline.
Examples:
(i) Datastores of applications, such as relational databases.
(ii) Files produced by a number of applications that are mainly part of static file systems, such as web server files generating logs.
2. Data Storage
This includes the data that is managed for batch operations and is stored in distributed file stores capable of holding large volumes of big files in different formats. This is called the data lake. This is generally where Hadoop storage such as HDFS, or Microsoft Azure, AWS, and GCP storage, is provided, along with blob containers.
3. Batch Processing
All the data is segregated into different categories or chunks, and long-running jobs are used to filter, aggregate, and prepare the data in a processed state for analysis. These jobs usually read from the sources, process the data, and write the output to new files. Batch processing is done in various ways: by making use of Hive jobs or U-SQL based jobs, or by making use of Sqoop or Pig along with custom map-reduce jobs, which are generally written in Java, Scala, or another language such as Python.
4. Real-Time Message Ingestion
This includes, in contrast with batch processing, all those real-time streaming systems that cater to data being generated sequentially and in a fixed pattern. This is often a simple data mart or store responsible for all the incoming messages, which are dropped inside the folder used for data processing. The majority of solutions, however, require a message-based ingestion store that acts as a message buffer and also supports scale-based processing, provides comparatively reliable delivery, and offers other message queuing semantics. The options include Apache Kafka, Apache Flume, Event Hubs from Azure, etc.
5. Stream Processing
There is a slight difference between the real-time message ingestion and stream processing.
The former collects the ingested data first and then uses it through a publish-subscribe kind of tool. Stream processing, on the other hand, handles all the streaming data occurring in windows or streams and then writes the data to the output sink. This includes Apache Spark, Apache Flink, Storm, etc.
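As an illustration of the stream-processing stage, a minimal PySpark Structured Streaming sketch is shown below: it reads messages from a Kafka topic, aggregates them over a one-minute window, and writes the result to an output sink. The topic name, server address, and console output format are placeholders, and running it requires the Spark Kafka connector package.

from pyspark.sql import SparkSession
from pyspark.sql.functions import window, col

spark = SparkSession.builder.appName("stream-processing-sketch").getOrCreate()

# Read a stream of messages from a Kafka topic (the message buffer from step 4)
events = (spark.readStream
          .format("kafka")
          .option("kafka.bootstrap.servers", "localhost:9092")
          .option("subscribe", "events")
          .load())

# Count messages in one-minute windows based on the Kafka timestamp column
counts = (events
          .groupBy(window(col("timestamp"), "1 minute"))
          .count())

# Write the windowed counts to an output sink (the console, for the sketch)
query = (counts.writeStream
         .outputMode("complete")
         .format("console")
         .start())
query.awaitTermination()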
6. Analytics-Based Datastore
This is the data store used for analytical purposes: the already processed data is queried and analyzed using analytics tools that can correspond to BI solutions. The data can also be presented with the help of a NoSQL data warehouse technology like HBase, or through interactive use of a Hive database, which can provide metadata abstraction over the data store. Tools include Hive, Spark SQL, HBase, etc.
7. Reporting and Analysis Tools
Insights have to be generated on the processed data, and that is effectively done by the reporting and analysis tools, which make use of their embedded technology and solutions to generate useful graphs, analyses, and insights helpful to the businesses. Tools include Cognos, Hyperion, etc.
8. Orchestration
Big data based solutions consist of data-related operations that are repetitive in nature and are encapsulated in workflows that can transform the source data, move data across sources and sinks, load the data into stores, and push it into analytical units. Examples include Sqoop, Oozie, Azure Data Factory, etc.
References
[1] T. J. O'Shea and J. Hoydis, "An Introduction to Deep Learning for the Physical Layer," IEEE Trans. Cognitive Commun. Netw., vol. 3, no. 4, pp. 563–575, Dec. 2017.
[2] T. O’Shea and J. Hoydis, "An Introduction to Deep Learning for the Physical
Layer," in IEEE Transactions on Cognitive Communications and Networking, vol. 3,
no. 4, pp. 563-575, Dec. 2017, doi: 10.1109/TCCN.2017.2758370.
[3] S. Dörner, S. Cammerer, J. Hoydis and S. t. Brink, "Deep Learning Based
Communication Over the Air," in IEEE Journal of Selected Topics in Signal
Processing, vol. 12, no. 1, pp. 132-143, Feb. 2018, doi:
10.1109/JSTSP.2017.2784180.
[4] B. Karanov, M. Chagnon, F. Thouin, T. A. Eriksson, H. Bülow, D. Lavery, P. Bayvel, and L. Schmalen, "End-to-End Deep Learning of Optical Fiber Communications," J. Lightw. Technol., vol. 36, no. 20, pp. 4843–4855, Oct. 2018.
[5] Z. Zhu, J. Zhang, R. Chen, and H. Yu, “Autoencoder-Based Transceiver Design
for OWC Systems in Log-Normal Fading Channel,” IEEE Photon. J., vol. 11, no. 5,
pp. 1–12, Oct. 2019.