
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/ACCESS.2018.2835659, IEEE Access

Date of publication xxxx 00, 0000, date of current version xxxx 00, 0000.
Digital Object Identifier xx.xxxx/ACCESS.2018.DOI

Stitching for Multi-View Videos With Large Parallax Based on Adaptive Pixel Warping
KYU-YUL LEE¹ and JAE-YOUNG SIM¹, (Member, IEEE)
¹School of Electrical and Computer Engineering, Ulsan National Institute of Science and Technology, Ulsan 44919, South Korea
Corresponding author: Jae-Young Sim (e-mail: jysim@unist.ac.kr).
This work was supported in part by the National Research Foundation of Korea (NRF) within the Ministry of Science and ICT (MSIT) under Grant 2017R1A2B4011970 and within the Ministry of Education under Grant 2016R1D1A1A09919618, and in part by an Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No. 20170006670021001, Information-Coordination Technique Enabling Augmented Reality with Mobile Objects).

ABSTRACT Conventional stitching techniques for images and videos are based on smooth warping models, and therefore they often fail to work on multi-view images and videos with large parallax captured by cameras with wide baselines. In this paper, we propose a novel video stitching algorithm for such challenging multi-view videos. We reliably estimate the parameters of the ground plane homography, the fundamental matrix, and the vertical vanishing points, using both appearance-based and activity-based feature matches validated by geometric constraints. We alleviate the parallax artifacts in stitching by adaptively warping the off-plane pixels into geometrically accurate matching positions through their ground plane pixels based on the epipolar geometry. We also exploit the inter-view and inter-frame correspondence matching information together to estimate the ground plane pixels reliably, which are then refined by energy minimization. Experimental results show that the proposed algorithm provides geometrically accurate stitching results for multi-view videos with large parallax and outperforms state-of-the-art stitching methods both qualitatively and quantitatively.

INDEX TERMS Multi-view videos, video stitching, image stitching, large parallax, adaptive pixel warping,
epipolar geometry.

I. INTRODUCTION

Multi-view videos are widely used in many applications such as surveillance [1]–[3], sports [4]–[6], virtual training [7], and video conferencing [8], [9]. One of the essential techniques for multi-view applications is stitching, which combines multiple images, captured from different viewing positions and directions, to generate a single image with a wider field of view [10]. Image stitching has been actively studied in the literature [11]–[21], and related commercial products have also been developed, e.g., Adobe Photoshop Photomerge™ and Microsoft Image Composite Editor. Moreover, many current mobile devices with cameras are able to synthesize a panorama image by stitching multiple images captured at different time instances. Also, around view monitoring is one of the core applications of autonomous vehicles, which employs bird's eye views of stitched multiple images captured by front, side, and rear view cameras [22].

Traditional image stitching methods assume that a pair of images are taken from camera locations very close to each other and that the captured scene structures are roughly planar. Based on these assumptions, stitched images are obtained by performing three major steps: feature matching, image alignment, and image composition. First, feature points are detected from the different images and matched together by using feature descriptors, e.g., SIFT [23]. In the alignment step, a global image warping model such as a homography is estimated by using the obtained feature matches, and the multiple images are aligned to a common image domain accordingly. Finally, the pixel values in the stitched image are determined by average blending or seam cutting methods [10].
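As a concrete illustration of this three-step pipeline, the following minimal sketch matches SIFT features, estimates a single global homography with RANSAC, and composes the result by average blending. It is a baseline for the small-baseline, roughly planar setting described above, not the algorithm proposed in this paper; the input file names are hypothetical.

    import cv2
    import numpy as np

    # Classical three-step stitching baseline: feature matching, global
    # homography alignment, and average-blending composition.
    target = cv2.imread("target.png")        # hypothetical input frames
    reference = cv2.imread("reference.png")

    # Step 1: feature matching with SIFT descriptors and a ratio test.
    sift = cv2.SIFT_create()
    kp_t, des_t = sift.detectAndCompute(target, None)
    kp_r, des_r = sift.detectAndCompute(reference, None)
    pairs = cv2.BFMatcher(cv2.NORM_L2).knnMatch(des_t, des_r, k=2)
    matches = [m for m, n in pairs if m.distance < 0.7 * n.distance]

    # Step 2: alignment by a single global homography estimated with RANSAC.
    src = np.float32([kp_t[m.queryIdx].pt for m in matches]).reshape(-1, 1, 2)
    dst = np.float32([kp_r[m.trainIdx].pt for m in matches]).reshape(-1, 1, 2)
    H, _ = cv2.findHomography(src, dst, cv2.RANSAC, 3.0)

    # Step 3: composition by warping the target onto the reference domain
    # and average-blending the overlapping pixels.
    h, w = reference.shape[:2]
    warped = cv2.warpPerspective(target, H, (w, h))
    overlap = (warped.sum(axis=2) > 0) & (reference.sum(axis=2) > 0)
    stitched = np.where(overlap[..., None],
                        warped.astype(np.float32) / 2 + reference / 2,
                        np.maximum(warped, reference)).astype(np.uint8)
    cv2.imwrite("stitched.png", stitched)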


However, when multi-view cameras capture non-planar scene structures at camera positions relatively far from one another, the resulting multi-view images exhibit a parallax phenomenon where the relative locations of scene contents vary across different views. In such cases, the traditional stitching methods suffer from parallax artifacts. Therefore, advanced image stitching methods [11]–[21] have been studied which alleviate some amount of parallax artifact by designing locally adaptive transformations for flexible warping, employing similarity transformations to reduce perspective distortion, and/or hiding the misalignment in the composition stage based on seam-cutting methods.

Recently, in many practical applications such as surveillance and sports, static multiple cameras are placed at viewing positions very far from one another, with wide baselines. Also, captured 3D real-world scenes often include multiple foreground objects moving over a wide range of scene depths. For example, walking pedestrians are captured by static multiple cameras installed at arbitrary locations [24]–[26], and multiple players in sports games are captured by static cameras with wide baselines [27]. On these challenging multi-view images, even the aforementioned advanced image stitching techniques have limitations in combining the diverse scene structures accurately, causing ghosting artifacts in the stitching results, due to two main reasons. First, abrupt depth discontinuities among multiple foreground objects and the background are hard to treat accurately with the existing warping schemes. Second, appearance-based feature descriptors may provide large numbers of outlier matches due to severe parallax.

Compared to the image stitching research, relatively little effort has been made to develop multi-view video stitching techniques. Video stitching has been regarded as an extension of image stitching where the multiple frames from different views at a certain time instance are stitched together by using existing image stitching techniques [28]. Also, a temporal cost term has simply been added to the cost function for image stitching [29]. Therefore, stitching for challenging multi-view videos with large parallax still has the aforementioned problems of image stitching.

In this paper, we propose a geometrically accurate stitching algorithm for multi-view videos with large parallax (MVLP) which are captured by stationary cameras with wide baselines. We consider surveillance and sports applications where multiple people are moving on the ground plane at arbitrary distances from the cameras. We develop a parallax-adaptive pixel warping model, where the ground plane pixels are warped by homography, but the pixels off the plane, i.e., the pixels on the foreground objects and the distant background region, are warped through their ground plane pixels based on the epipolar geometry. We also estimate the optimal ground plane pixels by employing the reliable spatial and temporal feature matches based on an energy minimization framework. Experimental results show that the proposed algorithm stitches multi-view videos successfully without severe parallax artifacts, and yields significantly better performance than the existing state-of-the-art image stitching techniques qualitatively and quantitatively.

A preliminary result of this work was presented in [30]. The major differences between [30] and this paper are as follows.
• We propose a more generalized video stitching framework which aligns the foreground objects and the background, respectively, while our previous algorithm [30] was applied to the foreground objects only.
• We improve the warping performance by estimating optimal ground plane pixels, while our previous work [30] estimates a projective depth using the lowest pixel in each object.
• We perform more extensive experiments using 12 video sequences and provide comparative experimental results between the conventional methods and the proposed algorithm qualitatively and quantitatively.

The rest of this paper is organized as follows. Section II describes the related work on image and video stitching and static multi-camera based tracking. Section III proposes the basic concept of the proposed parallax-adaptive pixel warping model. Section IV and Section V explain the algorithms of parameter estimation and ground plane pixel estimation, respectively. Section VI presents the experimental results. Finally, Section VII concludes the paper.

II. RELATED WORK
A. IMAGE AND VIDEO STITCHING
Homography is a traditional image warping model which describes the projective relationship between two image planes based on the planar scene assumption [10], [31]. In general, an optimal homography is estimated by feature matching between two images. Homography can register multiple images associated with small camera baselines successfully; however, it fails to work on images with large camera baselines where a captured scene is composed of multiple planar structures.

To overcome this limitation, advanced image stitching methods employ spatially-varying warps which adaptively align the spatial deviation between two images caused by parallax. Gao et al. estimated dual homographies to align the ground plane and the distant background plane, respectively, by clustering the feature points according to their positions [11]. Lin et al. initialized a global affine transformation which is then iteratively refined to minimize a cost function defined by matched features [12]. Zaragoza et al. partitioned an input image into multiple cells, and estimated a homography for each cell by weighting the feature matches according to the relative distances to the feature points [13]. Zhang et al. proposed a mesh-based alignment technique to mitigate the shrinking problem of wide-baseline panorama synthesis, which designs a scale-preserving cost function using the perimeter of polygons created from feature points [14]. The spatially-varying warps reduce the parallax artifact of image stitching by a certain amount; however, they cannot reflect abrupt depth changes in a captured scene completely, since the neighboring cells are processed with smoothness constraints. Moreover, the spatially-varying warps were inherently designed to deform images assuming small baselines [32], and thus the warped images look unnatural when the relative orders of control points are changed across multiple images due to large parallax [33].
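To make the spatially-varying idea concrete, the sketch below evaluates a location-dependent homography in the spirit of the per-cell weighting of [13]: the feature matches are re-weighted by a Gaussian of their distance to a cell center before solving the direct linear transform. This is a schematic re-implementation with assumed parameter values (sigma, min_w), not the authors' released code.

    import numpy as np

    def local_homography(src, dst, center, sigma=50.0, min_w=0.01):
        """Location-adaptive homography in the spirit of the per-cell
        weighting of [13]. src and dst are (N, 2) arrays of matched
        feature positions; center is the cell center at which the local
        warp is evaluated. Matches near the center dominate the fit."""
        w = np.exp(-np.sum((src - center) ** 2, axis=1) / (2.0 * sigma ** 2))
        w = np.maximum(w, min_w)  # floor keeps the system well-conditioned
        rows = []
        for (x, y), (u, v), wi in zip(src, dst, w):
            rows.append(wi * np.array([-x, -y, -1, 0, 0, 0, u * x, u * y, u]))
            rows.append(wi * np.array([0, 0, 0, -x, -y, -1, v * x, v * y, v]))
        A = np.stack(rows)
        _, _, Vt = np.linalg.svd(A)   # weighted DLT: null vector of A
        return Vt[-1].reshape(3, 3)

Evaluating local_homography at every cell center yields a mesh of smoothly varying warps; the smoothness across neighboring cells is exactly what prevents such warps from following abrupt depth discontinuities.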

FIGURE 1: Stitching images with large parallax. (a) A target image and (b) a reference image. The resulting stitched images by using (c) a
homography based warping scheme, (d) APAP [13], and (e) the proposed parallax-adaptive stitching, respectively.

The stitched images usually exhibit perspective distortions in the non-overlapping regions among multiple images, where no valid feature matches are obtained. To alleviate the perspective distortions, shape-preserving warps were proposed which extrapolate the warping models to the non-overlapping regions using similarity transformation and/or homography linearization [15]–[18]. Chang et al. applied a homography to the overlapping region of the images and similarity transformations to the non-overlapping regions, respectively [15]. Lin et al. proposed a homography linearization method to combine homography and similarity transformations smoothly [16]. Chen et al. improved the shape-preserving warp by accurately estimating the scale and rotation of the similarity transformation [17]. Li et al. proposed quasi-homography warps which linearly extrapolate the horizontal component of the homography [18]. The shape-preserving warps provide visually plausible stitching results, but do not always produce geometrically correct results.

Attempts have also been made to align only a certain region of the input images and hide the artifacts of mismatched regions by applying seam-based composition methods. Gao et al. obtained multiple homographies by taking the groups of inlier feature matches in order, and selected the best homography that yields a minimum seam cost [19]. Zhang et al. clustered closely located feature points together and found an optimal local homography associated with a minimum seam cutting error to align a local image region [20]. They also applied content-preserving warping (CPW) [34] to further refine the local alignment. Lin et al. generated multiple local homographies using a superpixel-based grouping scheme, and further refined each homography to select the best one by using energy minimization [21]. They also designed an energy function to encourage the warp to undergo a similarity transformation and to preserve structures like curves and lines after warping. Note that these techniques register only one local region and thus inevitably cause geometrically inaccurate stitching results.

On the other hand, the previous video stitching algorithms simply apply the existing image stitching techniques to stitch the video frames at each time instance, respectively [28]. Also, they extend the image stitching techniques straightforwardly to video stitching for the purposes of improving the computation speed or reducing the flickering artifacts. El-Saban et al. computed SIFT descriptors for selected frames only and tracked the feature points to reduce the computational complexity of video stitching [28]. Jiang et al. extended the local alignment and image composition of CPW to video stitching by applying the seam cutting scheme in the spatiotemporal domain [29].

B. STATIC MULTI-CAMERA BASED TRACKING
Multi-camera based people tracking techniques detect walking pedestrians on a ground plane from multiple videos, which are captured by different static cameras set toward a common ground plane and positioned with relatively wide baselines. Specifically, moving foreground objects are first detected by background subtraction methods, and then the elongated shapes of the detected people are represented by principal axes [24], which are used for people tracking in addition to the ground plane homography. To localize each person for robust tracking, Khan et al. computed multiple homographies associated with planes parallel to the ground plane using vanishing points [25]. In addition to the homography and vanishing points, the fundamental matrix was also used to reliably find correspondence matching for the top points of people [26].

III. PARALLAX-ADAPTIVE PIXEL WARPING MODEL
In many practical applications of multi-view videos such as surveillance and sports, static multiple cameras are located with wide baselines toward a target real-world scene, which yields severely different camera parameters, e.g., rotation, translation, and zoom factor. Also, in a typical video sequence, the background is composed of a ground plane and optionally a far distant region orthogonal to the ground plane, and moreover, people moving on the ground plane at different distances from the cameras are captured as multiple foreground objects. Figs. 1(a) and (b) show two frames of the 'Soccer' sequence captured by two cameras with severely different positions and viewing directions from each other, where large parallax is observed, especially in the vicinity of the foreground objects. For example, the players denoted by red boxes in Fig. 1(a) appear in a different order in Fig. 1(b). In addition, the players denoted by yellow boxes appear only in the view of Fig. 1(a) and not in Fig. 1(b).

Such large parallax makes multi-view video stitching quite a challenging problem, and the conventional stitching techniques often fail to provide faithful results. Fig. 1(c) shows the image stitched by warping a target frame in Fig. 1(a) to a reference frame in Fig. 1(b) according to the homography. Since the homography-based warping assumes a planar scene structure, only the ground plane is accurately aligned, and the foreground objects and the distant background region yield large parallax artifacts.

FIGURE 2: Epipolar geometry.

FIGURE 3: Parallax-adaptive pixel warping.


Also, Fig. 1(d) shows the stitching result of APAP [13], which is one of the state-of-the-art image stitching techniques. APAP adaptively warps images using a mesh grid structure to reduce parallax artifacts; however, it still exhibits inaccurate alignment of the multiple foreground objects due to depth discontinuity, and furthermore, it causes perspective distortions in the non-overlapping area between the two images.

The parallax between two views can be explained based on the epipolar geometry, as shown in Fig. 2. A homography is a planar mapping from one image domain to another image domain. Suppose that a 3D real-world point X_1 is located on a plane π and projected to the pixels p_1 and q_1 in the image planes I and J, respectively. Then the relation between p_1 and q_1 is described by

q_1 = H_π p_1,    (1)

where H_π is the homography associated with the plane π. However, for the pixels p_2 and q_2 projected from a 3D point X_2, which is not on π, the relation (1) does not hold, i.e., q_2 ≠ H_π p_2, and therefore, a single homography H_π maps p_2 to a wrong pixel q̃_2 = H_π p_2, which causes a parallax artifact. On the other hand, we can describe the geometric relationship between any pair of corresponding pixels by the epipolar constraint. For example, for a given pixel p_2 ∈ I, the corresponding pixel q_2 ∈ J should be located on the epipolar line l_2 computed as

l_2 = F p_2,    (2)

where F is the fundamental matrix.

In this work, we propose an adaptive pixel warping model for parallax-free stitching of MVLP which employs faithful correspondence matching among multi-view videos based on the epipolar constraint. We first define on-plane pixels, which are projected from the ground plane in the real-world scene, and off-plane pixels, which belong to the foreground objects and the far distant background region. We generalize the concept of the epipolar constraint, used for matching the top points of people in multi-camera based tracking [26], to find reliable correspondence matching of off-plane pixels. As shown in Fig. 3, for a given off-plane pixel p in a target image I, we first estimate the ground plane pixel (GPP) g_p of p along the object direction L_p = p × v_I determined by the vertical vanishing point v_I. Since g_p is an on-plane pixel, it can be warped to the corresponding GPP g_q in the reference image J by using the homography matrix H evaluated on the ground plane:

g_q = H g_p.    (3)

The unknown pixel q corresponding to p can then be estimated as the cross point between the object direction line L_q = g_q × v_J, passing through g_q and the vertical vanishing point v_J, and the epipolar line l_p = F p specified by the fundamental matrix:

q = L_q × l_p.    (4)

Fig. 1(e) shows the resulting image stitched by using the proposed warping model, where we see that the multiple foreground objects and the background are aligned correctly, while the parallax artifacts that occur in the conventional methods, as shown in Figs. 1(c) and (d), are alleviated effectively. The proposed algorithm can also warp the foreground objects and the background in the non-overlapped areas naturally.

Consequently, to perform the proposed parallax-adaptive pixel warping, we need to estimate the parameters of the homography matrix H of the ground plane, the fundamental matrix F, and the vertical vanishing points v_I and v_J. We explain the details of the parameter estimation in Section IV. We also need to estimate an optimal GPP g_p for a given query pixel p. Note that [26] employs only a single query pixel at the top of a foreground object and roughly estimates the GPP by using the average height of objects. In this work, we estimate optimal GPPs more accurately by using the spatial and temporal feature matches based on an energy minimization framework, which is explained in Section V.

IV. PARAMETER ESTIMATION
For two given input MVLP, we first estimate the parameters of the homography matrix, the fundamental matrix, and the vertical vanishing points. Note that these parameters are fixed over all the frames, since we assume that the multi-view videos are captured by static cameras.
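The warping rule of (3) and (4) amounts to a few cross products in homogeneous coordinates. A minimal NumPy sketch, assuming H, F, and v_J have already been estimated as described in this section and that the GPP g_p of the query pixel is known:

    import numpy as np

    def hom(p):
        """Lift a 2D pixel to homogeneous coordinates."""
        return np.array([p[0], p[1], 1.0])

    def dehom(x):
        """Back to 2D pixel coordinates."""
        return x[:2] / x[2]

    def warp_off_plane_pixel(p, g_p, H, F, v_J):
        """Warp an off-plane pixel p of image I through its GPP g_p.
        H: ground plane homography from I to J (eq. 3).
        F: fundamental matrix mapping pixels of I to epipolar lines in J.
        v_J: vertical vanishing point of J as a homogeneous 3-vector."""
        g_q = H @ hom(g_p)                 # eq. (3): warp the GPP
        L_q = np.cross(g_q, v_J)           # object direction line in J
        l_p = F @ hom(p)                   # eq. (2): epipolar line of p
        return dehom(np.cross(L_q, l_p))   # eq. (4): line intersection

For an on-plane pixel, g_p = p and the result coincides with the plain homography warp H p, so the same routine covers both pixel types.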

FIGURE 4: (a) An input video sequence and (b) its background image.

FIGURE 5: Ground plane pixel estimation. p and q are given as corresponding to each other. L′_p and L′_q denote the homography-transformed lines of L_p and L_q into the other views, respectively.

A. GROUND PLANE HOMOGRAPHY


We estimate the homography associated with the ground plane using inter-view correspondence matching. In general, initial matching between two views is performed by using feature descriptors such as SIFT [23] or ASIFT [35], and then the spurious matches are removed by outlier removal schemes such as RANSAC [36]. However, the conventional appearance-based techniques may not provide reliable matching results on MVLP, especially on multiple foreground objects at different scene depths, since the neighboring pixels of a feature point in one image yield severely different values from those of the corresponding feature point in another image [37], [38]. Therefore, in this work, we estimate the homography more reliably by employing the appearance features as well as the activity information of the moving foreground objects.

Fig. 4(a) shows an input color video sequence I = {I^(k) : k = 1, 2, ..., K}, where I^(k) denotes the k-th frame and K is the total number of frames. We find B_ground, the set of feature matches on the ground plane between I and J, using the activity-based correspondence matching technique [38]. Then we compute an initial homography H_init from B_ground using RANSAC. We also obtain a background image I_BG, as shown in Fig. 4(b), by applying median filtering to all the frames in I. Then we use SIFT to find a set of feature matches B between the two background images I_BG and J_BG obtained from the two video sequences I and J, respectively. Note that B includes the matches on the ground plane and the matches in the distant background region together. Hence we first extract the matches on the ground plane only from B by selecting the inlier matches of H_init. Then we refine H_init to obtain a final homography H by using B_ground and the selected ground plane matches in B, based on RANSAC.

B. FUNDAMENTAL MATRIX
To estimate the fundamental matrix between two views, we find inter-view feature matching on the foreground objects as well. Note that, while the correspondence matching for the background is performed once over a whole video sequence, that for the foreground objects is performed at each time instance, respectively. In practice, we use SIFT to find the inter-view feature matches between I^(k) and J^(k), and obtain the set F^(k)_spatial by selecting the matches lying on the foreground regions only by using background subtraction [39]. While B_ground includes a small number of outlier matches thanks to the reliable performance of activity-based matching, F^(k)_spatial and B include relatively large numbers of spurious matches, since appearance-based matching is vulnerable to severe parallax. Therefore, we further refine the matches in F^(k)_spatial and B using geometric constraints.

FIGURE 6: Refinement of feature matching on (a) foreground objects and (b) background. Correct and spurious matches are denoted by the yellow and red lines, respectively.

As shown in Fig. 5, when a pair of corresponding off-plane pixels p ∈ I and q ∈ J are given, their GPPs g_p and g_q are corresponding on-plane pixels to each other and should be located on the object direction lines L_p and L_q, respectively. Hence we can estimate g_p and g_q as [24]

g_p = L_p × L′_q,
g_q = L_q × L′_p,    (5)

where L′_p and L′_q are the warped lines of L_p and L_q into the other views, respectively, by the ground plane homography H. Based on this property, we induce two geometric constraints to validate the obtained correspondence matches. First, g_p should be located at a position on L_p equal to or below p, such that (g_p − p) · v_I ≥ 0. Similarly, we have (g_q − q) · v_J ≥ 0. Second, g_p should be close to the lowest possible pixel p_low along L_p in a connected object area. In practice, we employ a tolerance range for g_p such that |(g_p − p_low) · v_I/‖v_I‖| is less than 40% of the height of the foreground object. This also applies to g_q and q.

We remove the false matches from F^(k)_spatial which violate the first and/or second constraints, to yield a refined set F̃^(k)_spatial. For B, we test only the first constraint and apply multi-structure guided sampling (MULTI-GS) [40] to obtain a refined set B̃.
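A sketch of this validation step: given a candidate match (p, q), the GPPs are computed via (5) by transferring the object direction lines with H, and the two constraints are then tested. The quantities lowest_pixel and height over the connected foreground component are hypothetical inputs, standing in for queries on the background subtraction mask.

    import numpy as np

    hom = lambda p: np.array([p[0], p[1], 1.0])   # lift to homogeneous
    dehom = lambda x: x[:2] / x[2]                # back to 2D pixels

    def gpps_from_match(p, q, H, v_I, v_J):
        """Estimate g_p and g_q of a matched off-plane pair via (5).
        Points map I -> J by H, so lines transfer J -> I by H^T and
        I -> J by H^{-T}; v_I, v_J are homogeneous 3-vectors."""
        L_p = np.cross(hom(p), v_I)           # object direction line in I
        L_q = np.cross(hom(q), v_J)           # object direction line in J
        Lq_in_I = H.T @ L_q                   # L'_q, transferred into I
        Lp_in_J = np.linalg.inv(H).T @ L_p    # L'_p, transferred into J
        g_p = dehom(np.cross(L_p, Lq_in_I))
        g_q = dehom(np.cross(L_q, Lp_in_J))
        return g_p, g_q

    def satisfies_constraints(p, g_p, v_I_2d, lowest_pixel, height):
        """Both geometric constraints of this subsection, per view.
        v_I_2d is the dehomogenized vanishing point, as in the text;
        lowest_pixel and height are hypothetical per-object quantities."""
        # First: the GPP must lie at or below the query pixel.
        if np.dot(g_p - np.asarray(p, float), v_I_2d) < 0:
            return False
        # Second: the GPP must be near the lowest pixel of the object
        # along the direction line (tolerance: 40% of object height).
        u = v_I_2d / np.linalg.norm(v_I_2d)
        return abs(np.dot(g_p - lowest_pixel, u)) < 0.4 * height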

Fig. 6 shows that the proposed matching refinement for MVLP removes most of the spurious matches successfully, both on the foreground objects and on the background. Finally, we estimate the fundamental matrix F by applying RANSAC to the appearance-based feature matches of the F̃^(k)_spatial's and B̃ as well as the activity-based matches of B_ground together. Note that, due to computational complexity, we empirically collect 1000 feature matches from the F̃^(k)_spatial's associated with randomly selected frames.
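A sketch of this step using OpenCV's RANSAC-based estimator, with the refined match sets pooled into (N, 2) arrays; the reprojection threshold and confidence are assumed values, while the 1000-match cap follows the text.

    import cv2
    import numpy as np

    def estimate_F(spatial_I, spatial_J, bg_I, bg_J, gnd_I, gnd_J,
                   max_spatial=1000, seed=0):
        """Fundamental matrix from the refined appearance-based matches
        (foreground matches pooled over frames and background matches)
        together with the activity-based ground plane matches."""
        rng = np.random.default_rng(seed)
        if len(spatial_I) > max_spatial:   # cap for computational cost
            idx = rng.choice(len(spatial_I), max_spatial, replace=False)
            spatial_I, spatial_J = spatial_I[idx], spatial_J[idx]
        pts_I = np.vstack([spatial_I, bg_I, gnd_I]).astype(np.float32)
        pts_J = np.vstack([spatial_J, bg_J, gnd_J]).astype(np.float32)
        F, inliers = cv2.findFundamentalMat(
            pts_I, pts_J, cv2.FM_RANSAC,
            ransacReprojThreshold=1.0, confidence=0.999)
        return F, inliers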

C. VERTICAL VANISHING POINTS


Vanishing points are the points where parallel lines converge [31]. In multi-view video sequences, people are assumed to be standing along the direction orthogonal to the ground plane, and therefore, we define a vertical vanishing point as a converging point of lines in a scene that are parallel and orthogonal to the ground plane. In practice, we estimate the vertical vanishing points by using [41]. Instead of complex people tracking, we simply select 10,000 major axis lines of people from randomly selected frames, where the lines satisfy the condition that the ratio of the length of the minor axis to the length of the major axis is below 0.3. Then, as shown in Fig. 3, the object direction L_p can be computed at each off-plane pixel p as the line passing through p and v,

L_p = p × v,    (6)

where v is the vertical vanishing point. Note that the object direction L_p is used to estimate the GPP g_p based on the constraint that g_p should be located on L_p.

V. GROUND PLANE PIXEL ESTIMATION
We estimate optimal GPPs for given query pixels in a target frame to find their warped pixels in a reference frame. Note that the proposed pixel warping model is applicable not only to off-plane pixels but also to on-plane pixels, such that g_p = p for a pixel p on the ground plane. We perform the GPP estimation for the foreground objects and the background, respectively, where the inter-view and inter-frame feature matches are used together for the foreground objects, while only the inter-view feature matches are used for the background. The estimated GPP positions are also optimized based on an energy minimization framework.

A. GROUND PLANE PIXEL AND GROUND VALUE
Multiple off-plane pixels on a same object direction line share a same GPP, since the corresponding real-world points are assumed to be located on a same vertical line perpendicular to the ground plane. For example, as shown in Fig. 7(a), the pixels r_1, r_2 and r_3 on L_r have the GPP g_r, while the pixels s_1, s_2 and s_3 on L_s have the GPP g_s. However, off-plane pixels lying on different object direction lines have different GPPs.

FIGURE 7: Relation between ground plane pixels and ground values. (a) A target frame and (b) its ground value map.

We define a ground value δ_p for the pixel p according to its GPP g_p, as shown in Fig. 7(a):

δ_p = ((v_I − p) / ‖v_I − p‖) · g_p.    (7)

Note that the ground values of off-plane pixels are almost invariant within a same foreground object or a same distant background region. We exploit this property to estimate the GPPs by estimating their ground values instead, since g_p and δ_p are put in one-to-one correspondence with each other for a given p via (7).

B. SPATIOTEMPORAL ESTIMATION FOR FOREGROUND OBJECTS
Let us first define Φ^(k)_spatial as the set of feature pixels in F̃^(k)_spatial detected from a target image I^(k). For a given feature pixel p^(k) ∈ Φ^(k)_spatial associated with an inter-view match, denoted by a yellow line in Fig. 8, a GPP g_p^(k) is found by (5). We call this procedure of GPP estimation using inter-view feature matches spatial matching based estimation (SME). We perform SME using F̃^(k)_spatial for each k-th frame, respectively.

However, some foreground objects may not provide sufficient numbers of inter-view matches, or may have no inter-view match at all, due to large parallax between the two views and/or relatively small areas in an image. Hence we additionally employ the temporal information from the previous frame to predict GPPs. Specifically, we use SIFT to obtain the set of inter-frame feature matches F̃^(k)_temporal associated with the foreground objects between a current frame I^(k) and its previous frame I^(k−1), which are denoted by the blue lines in Fig. 8. In general, F̃^(k)_temporal has a much larger number of reliable matches than F̃^(k)_spatial, since the adjacent frames in a same view exhibit similar scene contents to each other, while the frames from different views exhibit severely different appearance due to large parallax.
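Since (7) puts g_p and δ_p in one-to-one correspondence for a given p, the conversion is needed in both directions when ground values, rather than raw GPPs, are propagated between pixels and frames, as in the estimation procedures below. A minimal sketch in 2D pixel coordinates, with v_I the dehomogenized vertical vanishing point:

    import numpy as np

    def ground_value(p, g_p, v_I):
        """delta_p of pixel p given its GPP g_p, eq. (7)."""
        u = (v_I - p) / np.linalg.norm(v_I - p)  # unit direction to v_I
        return float(np.dot(u, g_p))

    def gpp_from_ground_value(p, delta_p, v_I):
        """Invert (7): recover g_p on the object direction line L_p.
        Writing g_p = p + t * u gives delta_p = u . p + t, so
        t = delta_p - u . p."""
        u = (v_I - p) / np.linalg.norm(v_I - p)
        return p + (delta_p - np.dot(u, p)) * u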

FIGURE 8: Inter-view feature matches (yellow lines) and inter-frame feature matches (blue lines).

Note that some pixels may be detected as spatial features and temporal features simultaneously; such pixels belong to both F̃^(k)_spatial and F̃^(k)_temporal.

Let us define Φ^(k)_temporal as the set of inter-frame feature pixels in F̃^(k)_temporal detected from a target image I^(k). For each pixel p^(k) ∈ Φ^(k)_temporal − Φ^(k)_spatial, we find its temporal corresponding pixel p^(k−1). In addition, we also collect the inter-view feature pixels from Φ^(k)_spatial which are located in the same foreground object as p^(k). Then, by (3) and (4), we compute a candidate pixel q̂^(k) in the reference image J^(k) corresponding to p^(k) by finding a candidate GPP ĝ_p^(k). Note that we estimate the optimal GPP by estimating the ground value via (7) instead. In practice, we take as ground values of p^(k) the ground value of p^(k−1) and the ground values of the additionally collected inter-view feature pixels, respectively, since the ground values are the same within a same foreground object while the GPPs are changeable. Then we check whether each of the candidate positions q̂^(k) lies on a foreground object region in J^(k) or not, and we discard the associated GPP ĝ_p^(k) when q̂^(k) lies outside of the foreground areas within J^(k). Finally, we evaluate the SIFT descriptors for the surviving candidate positions q̂^(k), and select the GPP of p^(k) associated with the best matching candidate position. We call this procedure temporal matching based estimation (TME).

When TME returns no available solution, we estimate the GPP by taking the ground value of the lowest possible pixel in a foreground object. We call this procedure region based estimation (RE). RE yields relatively lower accuracy of GPP estimation than SME, due to the lack of inter-view matching information; however, it can perform reasonable warping of the foreground objects lying on the non-overlapping region, which appear only in I^(k) but not in J^(k).

C. SPATIAL ESTIMATION FOR BACKGROUND
We also estimate the GPPs for the background. We assume that the background is composed of the ground plane and optionally a far distant region. To adaptively warp the background image, we first decide whether the captured scene includes a distant background region or not. To be specific, we use the inter-view feature matching on the background. From B̃, we extract the set of matches which are outliers of the ground plane homography obtained in Section IV-A. If the number of outlier matches is less than 5% of the total number of matches in B̃, we decide that the background scene includes only the ground plane without a distant region, and then we simply estimate the GPPs as g_p = p for all the background pixels p.

Otherwise, the background includes a distant region where we perform GPP estimation. We first compute the GPPs for the extracted outlier matches in B̃ by SME, and fit a line passing through the obtained GPPs using linear regression. This line is regarded as a boundary that roughly separates the distant background region from the ground plane. For the pixels p located below the boundary line, we simply estimate the GPPs as g_p = p. For the feature pixels in B̃ located above the boundary line, we estimate the GPPs by SME.

D. GROUND VALUE OPTIMIZATION
For seamless warping of the foreground objects and the distant background region, we further refine the positions of the initial GPPs for the off-plane feature pixels obtained in Section V-B and Section V-C. Specifically, we formulate an energy function E_FG to refine the associated initial ground values for the feature pixels of the foreground objects in Φ^(k) = Φ^(k)_spatial ∪ Φ^(k)_temporal:

E_FG(F^(k)) = E_FG,data(F^(k)) + α E_FG,ss(F^(k)) + β E_ts(F^(k)),    (8)

where F^(k) denotes the set of optimal ground values δ_p^(k) for all feature pixels p^(k) in Φ^(k). We set the weighting parameters as α = 0.5 and β = 0.5 experimentally. E_FG,data is the data cost designed as

E_FG,data(F^(k)) = Σ_{p^(k) ∈ Φ^(k)} (δ_p^(k) − δ̄_p^(k))²,    (9)

where δ̄_p^(k) denotes the initial ground value of p^(k). The initial ground values may be inaccurate due to errors in feature matching and/or background subtraction. Hence we employ the spatial smoothness cost given by

E_FG,ss(F^(k)) = Σ_{p_i^(k) ∈ Φ^(k)} Σ_{p_j^(k) ∈ N_i^(k)} w(p_i^(k), p_j^(k)) · (δ_{p_i}^(k) − δ_{p_j}^(k))²,    (10)

where N_i^(k) denotes the set of pixels spatially neighboring p_i^(k). Two pixels p_i^(k) and p_j^(k) are regarded as spatial neighbors to each other when they are located in a same foreground object region and satisfy the compatibility constraint: the warped pixel of p_i^(k) ∈ I^(k) using the initial GPP of p_j^(k) is located on a foreground object region in J^(k), and, at the same time, the warped pixel of p_j^(k) ∈ I^(k) using the initial GPP of p_i^(k) is also located on the same foreground object in J^(k).

In this work, we select at most the four nearest neighboring pixels to p_i^(k) to define N_i^(k). The spatial weight is given by

w(p_i, p_j) = exp(−‖p_i − p_j‖/τ),    (11)

where we set τ = 100 empirically. Moreover, to mitigate the flickering artifacts in the resulting stitched video sequence, the temporal smoothness cost is defined as

E_ts(F^(k)) = Σ_{p^(k) ∈ Φ^(k)_temporal} (δ_p^(k) − δ*_p^(k−1))²,    (12)

where δ*_p^(k−1) is the optimal ground value of the inter-frame corresponding pixel p^(k−1) in the previous frame I^(k−1). Note that we do not use the temporal cost function at the first frame.

Let Ψ represent the set of the feature pixels in B̃ located above the boundary line in the background image of a target view. We also formulate an energy function E_BG for Ψ as

E_BG(B) = E_BG,data(B) + γ E_BG,ss(B),    (13)

where B denotes the set of optimal ground values δ_p for all feature pixels p in Ψ. The weighting parameter γ is set to 1 empirically. The data term is given by

E_BG,data(B) = Σ_{p ∈ Ψ} (δ_p − δ̄_p)²,    (14)

where δ̄_p denotes the ground value of p ∈ Ψ initially obtained by SME. The spatial smoothness cost is given by

E_BG,ss(B) = Σ_{p_i ∈ Ψ} Σ_{p_j ∈ N_i} w(p_i, p_j) · (δ_{p_i} − δ_{p_j})²,    (15)

where N_i is the set of the four feature points in Ψ nearest to p_i.

We refine the ground values for all the off-plane feature pixels in the foreground objects by minimizing the energy function in (8) using a linear solver. Then the remaining non-feature pixels in the foreground objects are assigned ground values by applying nearest-neighbor interpolation to the available optimal ground values computed at the feature pixels. We also find the set of optimal ground values at the off-plane feature pixels in the distant background region by minimizing the energy function in (13), which are then interpolated to determine the ground values at all the background pixels above the boundary line. In practice, we apply linear interpolation within the convex hull of the feature pixels and nearest-neighbor interpolation outside of the convex hull. Fig. 7(b) shows the resulting ground value map of the target image frame in Fig. 7(a). Note that the off-plane pixels belonging to a same foreground object region or a distant background region have almost the same ground values, even though their GPPs are different. On the contrary, the on-plane pixels on the ground plane have different ground values according to their relative positions along the direction toward the vertical vanishing point.
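Both (8) and (13) are quadratic in the unknown ground values, so the minimization reduces to a single sparse linear least-squares solve. A minimal sketch for the foreground energy, assuming the initial values, the compatible neighbor pairs with their weights from (11), and the inter-frame links for (12) have been gathered beforehand:

    import numpy as np
    from scipy.sparse import lil_matrix
    from scipy.sparse.linalg import lsqr

    def optimize_ground_values(delta_init, pairs, weights, temporal_prev,
                               alpha=0.5, beta=0.5):
        """Minimize (8) as stacked least squares.
        delta_init:    (N,) initial ground values (data term, eq. 9).
        pairs:         list of compatible neighbor index pairs (i, j).
        weights:       w(p_i, p_j) of eq. (11) for each pair (eq. 10).
        temporal_prev: dict i -> optimal ground value of the inter-frame
                       corresponding pixel (eq. 12); empty at frame 1."""
        n = len(delta_init)
        A = lil_matrix((n + len(pairs) + len(temporal_prev), n))
        b = np.zeros(A.shape[0])
        for i in range(n):                       # data term rows
            A[i, i] = 1.0
            b[i] = delta_init[i]
        row = n
        for (i, j), w in zip(pairs, weights):    # spatial smoothness rows
            s = np.sqrt(alpha * w)
            A[row, i], A[row, j] = s, -s
            row += 1
        for i, d_prev in temporal_prev.items():  # temporal smoothness rows
            A[row, i] = np.sqrt(beta)
            b[row] = np.sqrt(beta) * d_prev
            row += 1
        return lsqr(A.tocsr(), b)[0]

The background energy (13) is handled identically, with the pairs taken from the four nearest neighbors in Ψ and without the temporal rows.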
TABLE 1: Specification of test video sequences.

Sequence     Resolution   Distant      Time    Parallax
                          Background   (min)   Angle (°)
Fountain     640×360      x            51      1.9
Tennis       640×360      o            34      12.3
Lawn         640×360      x            37      12.7
Badminton    640×360      o            52      18.2
Square       640×360      x            35      18.3
Office       640×360      o            30      18.5
Trail        640×360      o            51      18.7
Stadium      640×360      o            35      24.4
Soccer       320×240      o            55      28.0
Street       640×360      o            29      30.5
School       640×360      o            36      31.9
Garden       640×360      o            24      32.0

VI. EXPERIMENTAL RESULTS
We evaluate the performance of the proposed algorithm using 12 test video sequences, as shown in Fig. 12. Each test video sequence is composed of two videos captured at 30 frames per second by two synchronized cameras with unknown camera parameters. A captured scene includes multiple moving people on a ground plane at various scene depths. Table 1 presents the specification of the test sequences. We approximate the parallax angle by first taking, for each manually obtained ground truth matching pixel pair (p, q), the sum of the angle between L_p and L′_q and the angle between L_q and L′_p shown in Fig. 5 divided by 2, and then averaging over all the pairs. In general, a larger parallax angle results when the two videos are captured with a wider camera baseline and the captured scene is closer to the cameras. We warp each pixel in a target image frame to a reference frame based on the proposed parallax-adaptive pixel warping model. The hole pixels in the warped target frame are interpolated by using the valid warped pixels. To evaluate whether the alignment is geometrically accurate or not, we simply use the average blending scheme to combine the warped target frame and the reference frame.
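The parallax angles reported in Table 1 follow directly from the line construction of Fig. 5. A sketch over the manually obtained ground truth pairs, reusing the line-transfer convention from Section IV; the acute-angle convention for line_angle is an assumption:

    import numpy as np

    hom = lambda p: np.array([p[0], p[1], 1.0])

    def line_angle(l1, l2):
        """Acute angle in degrees between two homogeneous 2D lines."""
        n1 = l1[:2] / np.linalg.norm(l1[:2])
        n2 = l2[:2] / np.linalg.norm(l2[:2])
        return np.degrees(np.arccos(np.clip(abs(n1 @ n2), 0.0, 1.0)))

    def parallax_angle(gt_pairs, H, v_I, v_J):
        """Average of (angle(L_p, L'_q) + angle(L_q, L'_p)) / 2 over all
        ground truth matching pixel pairs (p, q), as described above."""
        angles = []
        for p, q in gt_pairs:
            L_p = np.cross(hom(p), v_I)
            L_q = np.cross(hom(q), v_J)
            angles.append(0.5 * (line_angle(L_p, H.T @ L_q)
                                 + line_angle(L_q, np.linalg.inv(H).T @ L_p)))
        return float(np.mean(angles))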
A. FOREGROUND OBJECT ALIGNMENT
The performance of video stitching highly depends on the accuracy of correspondence matching between different views. In particular, accurate inter-view matches on the foreground object regions are required to adaptively alleviate the parallax artifacts caused by the different scene depths of multiple objects. Therefore, we first evaluate the alignment performance of multiple foreground objects according to the various GPP estimation methods.

Fig. 9 compares the stitching results on selected frames from three test sequences of MVLP, using the GPPs estimated by four different methods: RE, SME+RE, SME+TME+RE without optimization, and SME+TME+RE with optimization. Figs. 9(a) and (b) show target frames and reference frames, respectively, where we mark the obtained inter-view feature pixels in F̃^(k)_spatial by crosses.

FIGURE 9: Stitching results of multiple foreground objects using the proposed ground plane pixel estimation methods. (a) Target frames and
(b) reference frames. The stitched images by using (c) RE, (d) SME+RE, (e) SME+TME+RE without optimization, and (f) SME+TME+RE with
optimization, respectively. From top to bottom, “Lawn,” “Street,” and “Garden” sequences.

In the 'Lawn' sequence, the foreground objects occupy relatively small image areas since the cameras are located far from the captured scene, and thus they yield few inter-view feature matches. RE shows an artifact on the person in red, since the associated GPPs are selected on the person in white, who is connected to the person in red in the target frame by the blob analysis. The matching accuracy on the person in red is improved by using SME+RE, but the artifact on the legs is still observed. The selected frames in the 'Street' sequence are quite a challenging case, since the two people occlude each other. SME+RE improves the results of RE using the inter-view matching information, but it still causes misalignment on the right person. However, SME+TME+RE provides accurate results of foreground object alignment on the two sequences by using the spatiotemporal information together. The 'Garden' sequence includes the false matches marked by yellow crosses in Figs. 9(a) and (b). Hence, SME+RE and even SME+TME+RE without optimization suffer from the misalignment artifact of foreground objects; however, this artifact is alleviated in SME+TME+RE with optimization.

We also quantitatively measure the matching errors of the foreground objects using the ground truth correspondence matches. We select regularly distributed query pixels on the foreground objects in a target frame, and obtain initial matching pixels in a reference frame by using the dense feature descriptor DAISY [42], which are then refined manually. We find ground truth matches on 100 selected pairs of frames for each sequence, and on average, we obtain about 20 matches on the foreground objects for each pair of frames. Fig. 10 compares the root mean squared errors (RMSEs) of the foreground matching averaged over the 12 test sequences, where the RMSEs of RE, SME+RE, and SME+TME+RE without and with optimization are 5.45, 4.38, 3.77, and 3.34 pixels, respectively.

FIGURE 10: Comparison of the average error of correspondence matching for the foreground objects using different ground plane pixel estimation methods. The matching error measures the RMSE between the resulting matches and the ground truth matches averaged over 12 test sequences.

B. VIDEO STITCHING
Fig. 11 shows the video stitching results of the proposed algorithm on six test sequences of MVLP. We select frames at five different time instances in each sequence which include various challenging scene contents. In Fig. 11, all the sequences except 'Square' are detected to include distant background regions in addition to the ground planes. We see that the ground planes and the distant background regions are well aligned simultaneously, since the on-plane pixels and the off-plane background pixels are warped adaptively. Note that the ground planes in the 'Office' and 'Soccer' sequences have little texture, which often occurs in surveillance and sports scenes, but the proposed algorithm still finds correct homographies for these ground planes by using the appearance and activity based feature matches together.

We also observe that the multiple foreground objects are accurately aligned without ghosting artifacts in most frames. For example, in the 'Tennis' sequence, the two people on the right side are moving in different directions from each other, and thus they are detected as a single object at some time instances due to overlap. The proposed algorithm provides accurate warping of these foreground objects by estimating optimal GPPs reliably using the spatiotemporal feature matches. In the 'Square' sequence, the left person moves on the overlapped area between the target and reference views at the 29571th and 29663th frames; however, it disappears from the reference frames at the 29804th and 29857th frames.

FIGURE 11: Video stitching results of the proposed algorithm. For each sequence, pairs of target and reference frames (left) and the stitched
images (right) are shown.


The proposed algorithm warps this object naturally on the non-overlapped area in the stitched images. In the 'Trail' sequence, the foreground object approaches the camera, yielding severely changing scene depths, but the proposed algorithm aligns this object correctly at various scales. On the other hand, the proposed algorithm yields artifacts in some exceptional situations. In the 'Badminton' sequence, the person marked with a red circle is jumping and never touches the ground plane at the 27767th frame. In such a case, no valid inter-view feature matches are obtained on this region due to the geometric constraint in Section IV-B, and thus RE yields a misalignment artifact. In the 'Office' sequence, we see some artifacts near the right person since a moving car behind the cameras is reflected on the background windows. The 'Soccer' is quite a challenging sequence which includes various fast moving players, where multiple people occlude one another at the 3800th and 5038th frames. In such cases, SIFT provides insufficient correct inter-view matches, or even no correct match at all, resulting in the stitching artifacts indicated by red circles.

C. COMPARISON WITH CONVENTIONAL METHODS
We compare the performance of the proposed algorithm with that of four conventional methods including the state-of-the-art image stitching techniques: Homography, CPW [34], SPHP [15] and APAP [13]. Note that CPW is used as an alignment model for the stitching methods [20], [29]. SPHP is a shape-preserving warping method which can be compared to evaluate the naturalness of warping on the non-overlapping regions. APAP is one of the most flexible warping methods, which directly estimates multiple homographies for local image regions. However, we do not compare the seam-based techniques [19]–[21], since they just hide the misalignment artifacts using seam-cutting based composition. We apply the compared image stitching techniques to the frames at each time instance, respectively. We implement Homography and CPW. The parameters for the warp in CPW are set as in [29]. We obtain the stitching results of SPHP and APAP using the source codes provided on the authors' webpages [43], [44]. In our experiment, MULTI-GS [40] used in [13] yields a better outlier removal performance than RANSAC, and thus we also apply MULTI-GS to remove the outlier matches of SIFT in Homography, CPW, and SPHP as well.

Fig. 12 compares the stitching results on selected frames of the 12 test video sequences. All the conventional methods as well as the proposed algorithm achieve good stitching results on the 'Fountain' sequence, which yields the smallest parallax angle of 1.9°. However, for the other sequences of MVLP, the conventional methods fail to align the multiple foreground objects and the background simultaneously. For example, in the 'Square' and 'Office' sequences, the feet of multiple people are well aligned on the ground planes, but the mismatch artifact gets worse toward the heads, since the ground plane warping is dominant in the conventional methods. On the other hand, in the 'Stadium,' 'Soccer,' and 'Garden' sequences, a same person appears twice at different locations without any overlap on the stitched domain, since the conventional methods extract dominant features from the distant background regions, causing misalignment artifacts on the ground planes and the foreground objects. Specifically, Homography warps all the pixels in a target frame by a global transformation derived from a dominant planar scene structure, and thus it mismatches either the ground plane or a distant background region. CPW adaptively refines the initial homography according to feature matches, and reduces the parallax artifacts on the foreground objects compared with Homography, as shown in the 'Tennis,' 'Office' and 'Street' sequences. SPHP adopts the similarity transformation to reduce the perspective distortion of the non-overlapping area, and thus it aligns the foreground objects on the non-overlapping areas well in the 'Square' sequence, as marked with a red circle. However, at the same time, SPHP distorts the line structure on the ground plane into curves, as marked with green ellipses in the 'Lawn' and 'Square' sequences. APAP estimates locally adaptive warps and reduces the spatial deviation of a same foreground object in the stitched domain compared with CPW, as shown in the 'School' sequence; however, APAP results in unnatural distortions in the 'Badminton,' 'Trail,' and 'School' sequences, as marked with green ellipses.

On the contrary, in all the frames, the proposed algorithm alleviates the parallax artifacts of video stitching successfully by adaptively aligning the multiple foreground objects and the background simultaneously. It also performs geometrically accurate warping on the non-overlapping areas as well, as shown in the 'Badminton,' 'Square,' and 'Soccer' sequences. Moreover, the proposed algorithm correctly determines the existence of distant background regions in all 12 test sequences. Thus both the ground plane and the distant background region are correctly aligned, as shown in the 'Badminton,' 'Office,' and 'School' sequences. In the 'Soccer' sequence, even though some ghost artifacts are observed due to a significant amount of occlusion, as marked by a red circle, the proposed algorithm aligns most people accurately, while the compared methods fail to work on this challenging case. Also, the umpire chair and the net in the 'Tennis' sequence and the net and the light lamp in the 'Badminton' sequence are static objects over a whole video sequence which are not detected as moving foreground objects, and therefore the proposed algorithm cannot align them correctly. However, all the compared methods also fail to align these objects, as marked with yellow ellipses. More comparative results of video stitching are provided in the supplementary video.

We also quantitatively compare the performance of the proposed algorithm with that of the conventional methods using manually obtained ground truth correspondence matches on the foreground objects and the background together. We use the same ground truth matches on the foreground objects as explained in Sec. VI-A. We generate ground truth matches on the background only once for each sequence using the background image.
FIGURE 12: Comparison of video stitching results of the proposed algorithm and the four existing methods: Homography, CPW [34], SPHP [15],
and APAP [13]. From top to bottom, “Fountain,” “Tennis,” “Lawn,” “Badminton,” “Square,” “Office,” “Trail,” “Stadium,” “Soccer,” “Street,” “School,”
and “Garden” sequences.

We first consider multiple large planar areas in the background, and compute an optimal homography for each planar area using manually obtained feature matches. Then we select regularly distributed query pixels on the background image of the target view, and find their ground truth matching pixels by warping the query pixels with the multiple homographies selectively. For the query pixels on small and/or non-planar areas, we obtain the ground truth matching pixels manually. The resulting ground truth matches on the background image are added to each of the 100 frames selected for finding the ground truth matches on the foreground objects, where we exclude the background query pixels occluded by the foreground objects. Consequently, we have on average 724 ground truth matches on the background for each of the 100 selected frames over the 12 test sequences.
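As an illustration of this ground truth generation, the sketch below warps each background query pixel with the homography of its manually assigned planar area. The function and variable names are hypothetical, and the manual plane assignment and occlusion filtering are assumed to be done beforehand.

```python
# Illustrative sketch of background ground-truth generation (not the
# authors' code): each query pixel is mapped by the homography of the
# planar area it was manually assigned to.
import numpy as np

def warp_queries(queries, plane_labels, homographies):
    """queries: (N, 2) pixel coordinates in the target background image.
    plane_labels: (N,) index of the planar area each query lies on.
    homographies: list of 3x3 arrays, one optimal H per planar area."""
    gt = np.empty_like(queries, dtype=np.float64)
    for i, (x, y) in enumerate(queries):
        H = homographies[plane_labels[i]]
        p = H @ np.array([x, y, 1.0])      # homogeneous mapping
        gt[i] = p[:2] / p[2]               # perspective division
    return gt  # ground-truth matching pixels in the reference view
```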
TABLE 2: Comparison of Execution Times of the Conventional Methods and the Proposed Algorithm. The unit is seconds per frame. PP: Preprocessing. PE: Parameter Estimation. ST: Stitching.

Sequence     Homography    CPW    SPHP   APAP      Proposed
                                                 PP     PE     ST
Fountain        0.58       13.7    5.20   4.83   0.25   0.24   9.10
Tennis          0.62       15.0    4.17   2.90   0.28   0.07   26.3
Lawn            0.54       14.8    3.91   2.75   0.27   0.14   8.20
Badminton       0.53       16.5    4.03   2.57   0.31   0.18   39.0
Square          0.59       18.1    3.91   2.55   0.29   0.17   10.0
Office          0.55       17.7    4.41   3.02   0.28   0.10   38.0
Trail           0.65       18.7    3.93   2.77   0.30   0.20   34.1
Stadium         0.69       17.2    4.20   2.87   0.30   0.08   44.8
Soccer          0.25       9.10    3.04   4.28   0.21   0.03   19.3
Street          0.57       12.2    4.19   3.32   0.30   0.09   66.0
School          0.61       16.2    4.20   3.20   0.29   0.10   39.3
Garden          0.71       15.8    4.96   4.28   0.28   0.07   72.1
Average         0.57       15.4    4.18   3.28   0.28   0.12   33.8

FIGURE 13: Quantitative comparison of the stitching performance of the proposed algorithm with that of the conventional methods. The matching error measures the average RMSE between the warped pixels and the ground truth corresponding pixels.

Fig. 13 presents the RMSE between the ground truth corresponding pixels and the warped pixels on the overlapping regions of the target and reference frames. We see that the conventional methods tend to yield large RMSEs on the test sequences with large parallax angles. For example, the RMSEs of all the stitching methods are below 2 pixels on the "Fountain" sequence, which exhibits the smallest parallax angle of 1.9°. However, on the challenging sequences of MVLP, such as "Soccer" and "School," the conventional methods yield significantly larger RMSEs than on the other sequences. On the other hand, the proposed algorithm achieves smaller RMSEs than the conventional methods on all the test sequences, and yields a much smaller average error of 5.64 pixels, while Homography, CPW, SPHP, and APAP result in average errors of 35.37, 34.91, 32.05, and 34.86 pixels, respectively.
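For reference, the matching error of a single frame reduces to the following computation, assuming each method reports the warped position of every ground truth query pixel:

```python
# Minimal sketch of the matching-error metric: the RMSE between the
# warped query pixels and their ground-truth correspondences.
import numpy as np

def matching_error(warped, gt):
    """warped, gt: (N, 2) arrays of pixel coordinates in the reference
    view for the N ground-truth matches of one frame."""
    d2 = np.sum((warped - gt) ** 2, axis=1)   # squared distances
    return np.sqrt(np.mean(d2))               # RMSE in pixels
```

The per-sequence errors in Fig. 13 are then obtained by averaging this per-frame RMSE over the selected frames.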
D. EXECUTION TIME COMPARISON
Table 2 compares the execution times of the conventional methods and the proposed algorithm, measured on a PC with a 3.4 GHz AMD Ryzen 7 1700X CPU and 32 GB RAM. Note that this may not be a fair comparison, since the optimization level of the implementations differs across the compared methods. The execution times of the conventional methods and of the stitching (ST) step of the proposed algorithm are averaged over the 100 selected frames of each sequence, and those of the preprocessing (PP) and the parameter estimation (PE) steps of the proposed algorithm are averaged over the entire frames of each sequence. Homography is the fastest method, taking 0.57 seconds per frame on average. CPW, SPHP, and APAP require relatively longer execution times, since these methods use a different warping model for each cell or mesh grid in an image. Note that CPW is a non-parametric warping scheme and takes the longest execution time among the four conventional methods, 15.4 seconds per frame. The proposed algorithm is divided into three steps to evaluate the execution times. PP includes the background subtraction and the activity extraction for the activity-based correspondence matching [38]. PE includes the homography estimation with the activity-based correspondence matching computation, the fundamental matrix estimation, and the estimation of the vertical vanishing points. ST includes the SIFT matching computation, the ground pixel estimation, the warping (sketched below), and the blending. Note that PP and PE are performed only once over the entire frames of each video sequence, and thus yield relatively short execution times per frame. However, ST consumes the major portion of the execution time of the proposed algorithm, 33.8 seconds per frame on average, mainly to compute the hole pixels in the warped target frame from the valid warped pixels. Note that the "Fountain," "Lawn," and "Square" sequences exhibit relatively short ST times, since they do not contain distant background regions.
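As a rough illustration of the per-pixel warp inside ST, the sketch below shows one way an off-plane pixel can be transferred through its ground plane pixel (GPP) using the parameters estimated in PE. This is our simplified reading based on the quantities involved (the ground plane homography, the fundamental matrix, and the vertical vanishing points), not a verbatim transcription of the proposed warp.

```python
# Hedged sketch of warping one off-plane pixel x through its GPP g:
# the warped GPP and the reference view's vertical vanishing point
# define the object's vertical line, and the epipolar line of x pins
# down the position along it. This is our reading of the adaptive
# pixel warp, not the authors' exact formulation.
import numpy as np

def warp_off_plane_pixel(x, g, H, F, v_ref):
    """x, g: (3,) homogeneous pixels in the target view (g is the GPP of x).
    H: 3x3 ground plane homography, F: 3x3 fundamental matrix,
    v_ref: (3,) vertical vanishing point of the reference view."""
    g_ref = H @ g                       # GPP transferred by the plane homography
    vertical = np.cross(g_ref, v_ref)   # vertical line through the warped GPP
    epiline = F @ x                     # epipolar line of x in the reference view
    x_ref = np.cross(vertical, epiline) # line intersection = warped position
    return x_ref / x_ref[2]             # normalize homogeneous coordinates
```

In this reading, the warped GPP fixes the vertical line on which the warped pixel must lie, and the epipolar constraint determines its height along that line.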
VII. CONCLUSIONS
We proposed a novel video stitching algorithm to achieve geometrically accurate alignment of MVLP. We warped the multiple foreground objects, the distant background, and the ground plane adaptively based on the epipolar geometry, where an off-plane pixel in a target view is warped to a reference view through its GPP. We also estimated the optimal GPPs for the foreground objects by using the spatiotemporal feature matches, and for the background by using the spatial feature matches, respectively. The initially obtained GPPs are refined by energy minimization. Experimental results demonstrated that the proposed algorithm aligns various MVLP accurately, and yields significantly better parallax artifact reduction, both qualitatively and quantitatively, than the state-of-the-art image stitching techniques. Our future research topics include the warping of static objects with large parallax and the parallax-free stitching of MVLP captured by moving cameras.

REFERENCES
[1] W. Liu, M. Zhang, Z. Luo, and Y. Cai, "An ensemble deep learning method for vehicle type classification on visual traffic surveillance sensors," IEEE Access, vol. 5, pp. 24417–24425, 2017.

[2] R. Panda and A. K. Roy-Chowdhury, "Multi-view surveillance video summarization via joint embedding and sparse optimization," IEEE Trans. Multimedia, vol. 19, no. 9, pp. 2010–2021, May 2017.
[3] M. Wang, B. Cheng, and C. Yuen, "Joint coding-transmission optimization for a video surveillance system with multiple cameras," IEEE Trans. Multimedia, Sep. 2017.
[4] K. Bilal, A. Erbad, and M. Hefeeda, "Crowdsourced multi-view live video streaming using cloud computing," IEEE Access, vol. 5, pp. 12635–12647, 2017.
[5] S. A. Pettersen, D. Johansen, H. Johansen, V. Berg-Johansen, V. R. Gaddam, A. Mortensen, R. Langseth, C. Griwodz, H. K. Stensland, and P. Halvorsen, "Soccer video and player position dataset," in Proc. ACM Multimedia Syst., 2014, pp. 18–23.
[6] Q. Yao, H. Sankoh, K. Nonaka, and S. Naito, "Automatic camera self-calibration for immersive navigation of free viewpoint sports video," in Proc. Int'l Conf. Multimedia Signal Process., Sep. 2016, pp. 1–6.
[7] B. Kwon, J. Kim, K. Lee, Y. K. Lee, S. Park, and S. Lee, "Implementation of a virtual training simulator based on 360° multi-view human action recognition," IEEE Access, vol. 5, pp. 12496–12511, 2017.
[8] B. Macchiavello, C. Dorea, E. M. Hung, G. Cheung, and W. T. Tan, "Loss-resilient coding of texture and depth for free-viewpoint video conferencing," IEEE Trans. Multimedia, vol. 16, no. 3, pp. 711–725, Apr. 2014.
[9] L. Toni, G. Cheung, and P. Frossard, "In-network view synthesis for interactive multiview video systems," IEEE Trans. Multimedia, vol. 18, no. 5, pp. 852–864, May 2016.
[10] R. Szeliski, "Image alignment and stitching: A tutorial," Foundations and Trends in Computer Graphics and Vision, vol. 2, no. 1, pp. 1–104, 2006.
[11] J. Gao, S. J. Kim, and M. S. Brown, "Constructing image panoramas using dual-homography warping," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2011.
[12] W.-Y. Lin, S. Liu, Y. Matsushita, T.-T. Ng, and L.-F. Cheong, "Smoothly varying affine stitching," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2011.
[13] J. Zaragoza, T.-J. Chin, Q.-H. Tran, M. S. Brown, and D. Suter, "As-projective-as-possible image stitching with moving DLT," IEEE Trans. Pattern Anal. Mach. Intell., vol. 36, no. 7, pp. 1285–1298, Jul. 2014.
[14] G. Zhang, Y. He, W. Chen, J. Jia, and H. Bao, "Multi-viewpoint panorama construction with wide-baseline images," IEEE Trans. Image Process., vol. 25, no. 7, pp. 3099–3111, Jul. 2016.
[15] C.-H. Chang, Y. Sato, and Y.-Y. Chuang, "Shape-preserving half-projective warps for image stitching," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2014.
[16] C.-C. Lin, S. U. Pankanti, K. N. Ramamurthy, and A. Y. Aravkin, "Adaptive as-natural-as-possible image stitching," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2015.
[17] Y.-S. Chen and Y.-Y. Chuang, "Natural image stitching with the global similarity prior," in Proc. Eur. Conf. Comput. Vis., 2016.
[18] N. Li, Y. Xu, and C. Wang, "Quasi-homography warps in image stitching," IEEE Trans. Multimedia, vol. PP, no. 99, pp. 1–1, 2017.
[19] J. Gao, Y. Li, T.-J. Chin, and M. S. Brown, "Seam-driven image stitching," in Proc. Eurographics, 2013.
[20] F. Zhang and F. Liu, "Parallax-tolerant image stitching," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2014.
[21] K. Lin, N. Jiang, L.-F. Cheong, M. Do, and J. Lu, "Seagull: Seam-guided local alignment for parallax-tolerant image stitching," in Proc. Eur. Conf. Comput. Vis., 2016.
[22] M. Yu and G. Ma, "360 surround view system with parking guidance," SAE Int'l J. Commercial Vehicles, vol. 7, no. 2014-01-0157, pp. 19–24, 2014.
[23] D. G. Lowe, "Distinctive image features from scale-invariant keypoints," Int'l J. Comput. Vis., vol. 60, no. 2, pp. 91–110, 2004.
[24] W. Hu, M. Hu, X. Zhou, T. Tan, J. Lou, and S. Maybank, "Principal axis-based correspondence between multiple cameras for people tracking," IEEE Trans. Pattern Anal. Mach. Intell., vol. 28, no. 4, pp. 663–671, Apr. 2006.
[25] S. M. Khan and M. Shah, "Tracking multiple occluding people by localizing on multiple scene planes," IEEE Trans. Pattern Anal. Mach. Intell., vol. 31, no. 3, pp. 505–519, Mar. 2009.
[26] A. Yildiz and Y. S. Akgul, "A fast method for tracking people with multiple cameras," in Proc. Eur. Conf. Comput. Vis. Workshops, 2010.
[27] M. Takahashi, K. Ikeya, M. Kano, H. Ookubo, and T. Mishina, "Robust volleyball tracking system using multi-view cameras," in Proc. Int'l Conf. Pattern Recognit., Dec. 2016, pp. 2740–2745.
[28] M. El-Saban, M. Izz, and A. Kaheel, "Fast stitching of videos captured from freely moving devices by exploiting temporal redundancy," in Proc. IEEE Int'l Conf. Image Process., 2010.
[29] W. Jiang and J. Gu, "Video stitching with spatial-temporal content-preserving warping," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. Workshops, 2015.
[30] K.-Y. Lee and J.-Y. Sim, "Robust video stitching using adaptive pixel transfer," in Proc. IEEE Int'l Conf. Image Process., 2015.
[31] R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision. Cambridge University Press, 2003.
[32] T. Igarashi, T. Moscovich, and J. F. Hughes, "As-rigid-as-possible shape manipulation," ACM Trans. Graphics, vol. 24, no. 3, pp. 1134–1141, 2005.
[33] S. Schaefer, T. McPhail, and J. Warren, "Image deformation using moving least squares," ACM Trans. Graphics, vol. 25, no. 3, pp. 533–540, 2006.
[34] F. Liu, M. Gleicher, H. Jin, and A. Agarwala, "Content-preserving warps for 3D video stabilization," ACM Trans. Graphics, vol. 28, no. 3, p. 44, 2009.
[35] J.-M. Morel and G. Yu, "ASIFT: A new framework for fully affine invariant image comparison," SIAM J. Imaging Sciences, vol. 2, no. 2, pp. 438–469, 2009.
[36] M. A. Fischler and R. C. Bolles, "Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography," Comm. ACM, vol. 24, no. 6, pp. 381–395, 1981.
[37] E. Ermis, P. Clarot, P. Jodoin, and V. Saligrama, "Activity based matching in distributed camera networks," IEEE Trans. Image Process., vol. 19, no. 10, pp. 2595–2613, Oct. 2010.
[38] S.-Y. Lee, J.-Y. Sim, C.-S. Kim, and S.-U. Lee, "Correspondence matching of multi-view video sequences using mutual information based similarity measure," IEEE Trans. Multimedia, vol. 15, no. 8, pp. 1719–1731, Dec. 2013.
[39] J. M. McHugh, J. Konrad, V. Saligrama, and P.-M. Jodoin, "Foreground-adaptive background subtraction," IEEE Signal Process. Lett., vol. 16, no. 5, pp. 390–393, May 2009.
[40] T.-J. Chin, J. Yu, and D. Suter, "Accelerated hypothesis generation for multistructure data via preference analysis," IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 4, pp. 625–638, Apr. 2012.
[41] F. Lv, T. Zhao, and R. Nevatia, "Camera calibration from video of a walking human," IEEE Trans. Pattern Anal. Mach. Intell., vol. 28, no. 9, pp. 1513–1518, Sep. 2006.
[42] E. Tola, V. Lepetit, and P. Fua, "DAISY: An efficient dense descriptor applied to wide-baseline stereo," IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 5, pp. 815–830, May 2010.
[43] [Online]. Available: https://www.cmlab.csie.ntu.edu.tw/~frank/
[44] [Online]. Available: http://cs.adelaide.edu.au/~tjchin/apap/

KYU-YUL LEE received the B.S. degree in electrical and computer engineering from Ulsan National Institute of Science and Technology, Ulsan, Korea, in 2013, where he is currently pursuing the Ph.D. degree in electrical and computer engineering. His research interests include correspondence matching, video stitching, and deep learning.

JAE-YOUNG SIM (S'02–M'06) received the B.S. degree in electrical engineering and the M.S. and Ph.D. degrees in electrical engineering and computer science from Seoul National University, Seoul, Korea, in 1999, 2001, and 2005, respectively. From 2005 to 2009, he was a Research Staff Member, Samsung Advanced Institute of Technology, Samsung Electronics Company, Ltd. In 2009, he joined the School of Electrical and Computer Engineering, Ulsan National Institute of Science and Technology, Ulsan, Korea, where he is now an Associate Professor. His research interests include image, video, and 3D visual processing, computer vision, and multimedia data compression.
