A Modified Image Processing Method For Deblurring Based On GAN Networks
A Modified Image Processing Method For Deblurring Based On GAN Networks
A Modified Image Processing Method For Deblurring Based On GAN Networks
1
Air Force Harbin Flight Academy, 111000, Liaoyang, China.
2
AllSim Technology Inc, No. 658 Lugu Avenue, Yuelu District, 410013, Changsha, China.
Abstract—In computer vision literature, it is really a challenging The blur can be typically modeled as the convolution of a
issue about removing the images blur resulted from camera point spread function with a hypothetical sharp input image,
shake. As the existing image deblurring methods do not apply to that is, the sharp image is convoluted with the blur kernel, and
the image degraded by partial motion blur, and the existing noise is added at the same time to obtain a blurred image. And
partial blur detection approaches only used low -level blur image deblurring refers to the inverse processing of blurred
features to measure the blurry degree of an image, the blur images, also known as image deconvolution. In real life, the
regions extracted via these methods usually have blur kernel is often unknown and it is quite hard to estimate
misclassification. To solve the imaging deblurring problem, we non-uniform kernel, which brings a lot of obstacles and
propose an image deblurring method based on Generative
challenges for researchers. Hence, there is an urgent need to
Adversarial Network (GAN) architecture using dual path
solve the problem in image blurring.
connection. In comparison with the traditional image
deblurring algorithm, this model can avoid the dependence on
Fortunately, because of the support of computer hardware
apriori-knowledge of blurred image. Experimental results show technology, the emergence of big data, and the prevalence of
that the proposed method significantly outperforms other state- machine learning algorithms, image deblurring technology
of-art algorithms on image deblurring. has made breakthrough progress in both theoretical and
practical applications. The deblurring algorithms proposed by
Keywords- image processing; imaging deblurring; artificial many scholars have been applied to daily life, including
intelligence; dual path connection; convolutional neural network medical imaging [1], traffic monitoring, aerospace [2], target
tracking [3], object recognition [4] and other fields. As science
I. INTRODUCTION and technology advance, image information will be applied to
a wider range of areas, as well as the construction of national
As virtual technology and mobile Internet advance, image economy and defense security. In the future, both the
plays an important role in the process of gaining and passing subjective visual effects and the objective evaluation criteria
information for people. However, there are too many of deblurring will be more stringent.
unforeseeable elements in real life which could bring about To make use of computing performance, in this paper, a
image blur, such as atmospheric turbulence effect, diffraction generative adversarial network (GAN) based on dual path
of optical system in camera equipment, aberration of optical network architecture was proposed to resolve this difficult
system, relative motion between imaging equipment and problem. The experiment got better results, compared with the
object. Degrade images always cannot meet the needs of most advanced blind deblurring algorithms, including two
human beings and even cause non-negligible impact on traditional mathematical methods and a deep-learning
subsequent image processing including image segmentation, method.
image feature extraction, and object tracking. The main contents of this paper are as follows: section II
Recent years, the study of image deblurring has gradually discusses the related work of image restoration. In section III,
become an important research branch in the field of digital a new method was introduced to tackle the problem of image
image processing. With regard to the research of deblurring, restoration for blur images. Section IV gives the experimental
most experts and scholars put forward certain assumptions on results and makes a simple analysis. Finally, section V draws
the basis of linear blur to process degrade images. Because the the conclusion.
proposed assumptions are different, the performances of each
deblur method are different. In actual situations, the type of
blur is not necessarily linear, nor does it necessarily conform
to the assumptions of the algorithm, which results in an
unsatisfactory deblurring effect in practical applications.
Authorized licensed use limited to: ULAKBIM UASL - DOGU AKDENIZ UNIV. Downloaded on July 09,2021 at 21:00:24 UTC from IEEE Xplore. Restrictions apply.
2019 5th International Conference on Big Data and Information Analytics
A. Image Deblurring
Image deblurring or deconvolution is a important issue in
computer vision. According to whether the blur kernel is
known, there are two cases: blind deconvolution or non- perception quality, however, there are some problems in the
blind deconvolution. In most cases, we don’t know what type training of general version, such as gradient disappearance,
of blur is. Generally, deconvolution contains two parts: The mode collapse and so on.
first step is to estimate the PSF. The second step is to recovery
the original image by using non-blind deconvolution of C. Conditional Generative Adversarial Networks
estimated PSF, and the usual solution used here is The conditional generative adversarial networks (CGAN)
regularization. is a further development of the basic GAN [10]. Here y is
Prior information of original image and blur kernel are the added as conditional information to the generator and
key to solve ill-posed problem of image deconvolution. As the discriminator, and y can represent any information, such as
main part of Bayesian method, it has the function of classification information or other data. As shown in Figure 1,
estimating results accurately and speeding up the iteration CGAN achieves its function by sending extra information y as
process. In practical terms, according to the content of part of the input layer to the generator and discriminator. In
regularization methods, the regularizers whose prior the generator, a common hidden layer feature is formed by
distributions are converted into corresponding forms are combining conditional information y with prior input noise
carried out according to their random properties. Constraints p(z). The adversarial training architecture shows quite
are generated in regularization optimization problems due to adaptable features in the composition of hidden layer. Like
these regularizers. The total variability (TV) norm is the most GAN, the objective function of conditional GAN is as
famous and common used image regularizer[6]. The edges follows:
and details of the images can be preserved in minimizing total
variation. Hyper-Laplacian [5], Wavelet-based analysis minmaxV(D, G) = [log(D(x | y))] − [log(1− D(x | y))] (2)
methods [7], or even compound regularizers [8] are also G D x ~r x ~g
30
Authorized licensed use limited to: ULAKBIM UASL - DOGU AKDENIZ UNIV. Downloaded on July 09,2021 at 21:00:24 UTC from IEEE Xplore. Restrictions apply.
2019 5th International Conference on Big Data and Information Analytics
31
Authorized licensed use limited to: ULAKBIM UASL - DOGU AKDENIZ UNIV. Downloaded on July 09,2021 at 21:00:24 UTC from IEEE Xplore. Restrictions apply.
2019 5th International Conference on Big Data and Information Analytics
smooth and lack details, and using L2 loss solely often fail to
B. Loss Functions get a real sparse model. Therefore, we add both to the =net
1) Content Loss loss function given in Eqn. 4 in order to remove these
Generally, the L1 or L2 loss is applied for image limitations by simultaneously leveraging L1 loss and
deblurring as the primary objective function between restored perceptual loss.
image and ground truth. However, simply using L1 loss or L2 Perceptual loss is a simple L2 loss, but based on the
loss will cause some problems. Because L1 loss is sparse and difference of the generated and target image CNN feature
the gradient is not smooth at zero point, this means only using maps. It is defined as following:
L1 loss in deep neural network model will make the image too
subset amounts to 1430 pictures. For each resized image, a
1
Wi , j H i , j trajectory of motion was generated. The trajectory was
$ percep (VGG / i , j ) =
Wi , j H i , j
¦ ¦ (φ
x =1 y =1
i, j ( I S ) x , y −φi , j (GθG ( I B )) x , y ) 2 (4) estimated by modeling a particle’s motion affected by inertial,
impulsive and Gaussian perturbations. The PSFs were used as
blur kernels. Each image from the dataset was convolved with
Where, Wi,j , Hi,j are the width and height of the (i , j)th its corresponding group of four blur kernels, thus producing a
ReLU layer of VGG-19 network [15] and¶i,j is the feature dataset of 5720 images composed of blurred images and
map obtained by the j-th convolution (after activation) before corresponding ground truth images. Gaussian and Poisson
the i-th maxpooling layerwithin the VGG19 network, noise were subsequently added to them. The train-test-
pretrained on ImageNet, IB is a blurred image, IS is a sharp validation split for this project was 80-10-10, meaning there
latent image. were 4576 images in the training set, and 572 images in both
2) Conditional Adversarial Loss test and validation sets.
In our GAN architecture, two image pairs are sent to the
B. Training Details
discriminator. The blurred image as input and the
corresponding images created by generator as output form For software facilities, our experiment runs on a deep
one pair, while the blurred image as input and the ground learning framework called "Pytorch". For hardware facilities,
truth image form another pair. If the conditional distribution all operations are based on a desktop computer with a Nvidia
of the input image is the same as that of the potential image GTX 1080 Ti GPU, 16GB RAM and i7 processor. We use the
modeled by the generator, the result will make the generated adaptive learning method, and each time we start training
image consistent with its output to a high degree at a given based on the weight obtained from the previous training
input. This is in line with our basic needs, 'G' can keep the instead of restarting. The initial value of learning rate was set
dependence of output on blurred input to adapt to various to 0.0002 in both generator and discriminator. After the first
quantities and kinds of shake blur, while avoiding it swinging 120 epoch, the learning rate will be linearly reduced to zero in
too far in the process of cheating discriminator is exactly the next cycle. The Adam optimizer with stochastic gradient
what we want. Therefore, conditional GAN can be descent (SGD) is used to train the loss function. In order to fit
considered as the "relevance regularizer" in our network. GPU RAM, we selected the optimal batch size for network
Specifically, the minimized-conditional loss function can be processing. It was found that when a batch size = 2, the
given by the following formula: experimental results were the best, so it was chosen as the
batch size for each training.
$GAN = − [log Dθ (Gθ ( I B ) | I B )] (5) C. Results and Analysis
I ∈I B D G
32
Authorized licensed use limited to: ULAKBIM UASL - DOGU AKDENIZ UNIV. Downloaded on July 09,2021 at 21:00:24 UTC from IEEE Xplore. Restrictions apply.
2019 5th International Conference on Big Data and Information Analytics
Sroubek et al. Sun et al. Nah et al. Ours framework based on dual path connection. The experimental
PSNR 18.36 24.82 27.57 27.98 results show that our model has excellent performance in the
SSIM 0.553 0.764 0.832 0.847 field of image deblurring, compared with the state-of-the-art.
However, images of pure white background may appear
V. CONCLUSION AND FUTURE WORK during the deblurring process for few images due to some
Our method exploits the property of conditional unknown reasons. In the future work, we need to do some
adversarial network, which is a kernel-free blind motion modification in next iteration of the project, and make
deblurring learning approach compared with traditional improvements by exploring and developing more efficient
mathematical methods, proposes a novel conditional GAN loss functions.
Figure 4. Comparison of deblurred images by our model and other algorithms. From left to right: blurred photo, result of Sroubek et al. [16] ,result of Nah
et al. [18], result of our algorithm.
33
Authorized licensed use limited to: ULAKBIM UASL - DOGU AKDENIZ UNIV. Downloaded on July 09,2021 at 21:00:24 UTC from IEEE Xplore. Restrictions apply.
2019 5th International Conference on Big Data and Information Analytics
34
Authorized licensed use limited to: ULAKBIM UASL - DOGU AKDENIZ UNIV. Downloaded on July 09,2021 at 21:00:24 UTC from IEEE Xplore. Restrictions apply.