A Hybrid Transformation Technique For Advanced Video Coding: M. Ezhilarasan, P. Thambidurai
A Hybrid Transformation Technique For Advanced Video Coding: M. Ezhilarasan, P. Thambidurai
A Hybrid Transformation Technique For Advanced Video Coding: M. Ezhilarasan, P. Thambidurai
ABSTRACT
A Video encoder performs video data compression by having combination of three
main modules such as Motion estimation and compensation, Transformation, and
Entropy encoding. Among these three modules, transformation is the module of
removing the spatial redundancy that exists in the spatial domain of video
sequence. Discrete Cosine Transformation (DCT) is the defacto transformation
method in existing image and video coding standards. Even though the DCT has
very good energy preserving and decorrelation properties, it suffers from blocking
artifacts. To overcome this problem, a hybridization method has been incorporated
in transformation module of video encoder. This paper presents an hybridization in
the transformation module by incorporating DCT as transformation technique for
inter frames and a combination of wavelet filters for intra frames of video
sequence. This proposal is also applied in the existing H.264/AVC standard.
Extensive experiments have been conducted with various standard CIF and QCIF
video sequences. The results show that the proposed hybrid transformation
technique outperforms the existing technique used in the H.264/AVC considerably.
Keywords: Data Compression, DCT, DWT, Video Coding, Transformation.
INTRODUCTION
Volume 3 Number 3
Page 89
www.ubicc.org
2.1
Basics of Transformation
F(u,v) =
f(i,j)
=
2C(u)C(v)
MN
M 1N 1
2C(u)C(v)
u 0 v 0
MN
(2i 1)u
cos
2M
cos
(2i 1)u
2M
cos
cos
(2 j 1)v
2N
f (i, j )
(2 j 1)v
F (u , v )
2N
(1)
(2)
Volume 3 Number 3
M 1N 1
i 0 j 0
Page 90
C(x)
2
2
=1
if x = 0
otherwise
www.ubicc.org
Volume 3 Number 3
Page 91
(t )
n)
(3)
n Z
n)
(4)
n Z
Quantization
A Quantizer [10][11] simply reduces the number
of bits needed to store the transformed coefficients
by reducing the precision of those values. Since this
is a many to one mapping, it is a lossy process and is
the main source of compression in an encoder.
Quantization can be performed on each individual
coefficient, which is referred as scalar quantization.
Quantization can also be performed on a group of
coefficients together, and which is referred as vector
quantization.
Uniform quantization is a process of partitioning
the domain of input values into equally spaced
intervals, except outer intervals. The end points of
partition intervals are called the quantizer decision
boundaries. The output or reconstruction value
corresponding to each interval is taken to be the
midpoint of the interval. The length of each interval
is referred to as the step size (fixed in the case of
uniform quantization), denoted by the symbol .
The step size is given by
2 X max
M
(6)
Motion Estimation
Motion estimation (ME) [12] is a process to
estimate the pixels of the current frame from
reference
frame(s).
Block
matching
motion
estimation or block matching algorithm (BMA),
which is temporal redundancy removal technique
www.ubicc.org
MAD(i, j)
k, y l)
p( x k
i, y l
j) |
(7)
where M x N is the size of the macro block, c(.,.) and
p(.,.) denote the pixel intensity in the current frame
and previously processed frames respectively, (k,l) is
the coordinates of the upper left corner of the current
block, and (x,y) represents the displacement in pixel
which is relative to the position of current block.
After checking each location in the search area, the
motion vector is then determined as the (x,y) at
which the MAD has the minimum value. In this wok,
an exhaustive full search has been applied for motion
compensated prediction technique.
2.4
Entropy Encoding
Based on scientist Claude E. Shannon [8], the
entropy 11 of an information source with alphabet S =
{s1, s2, , s3} is defined as
11
H (S )
pi log 2
pi
i 1
(8)
Where pi is the probability of symbol si in S. The
term log2 1 indicates the amount of information
pi
Volume 3 Number 3
Page 92
www.ubicc.org
Fn
Reorder
Entropy
encoder
EXPERIMENTAL
DISCUSSION
RESULTS
AND
ME
DWT
Inter
MC
Fn- 1
Choose
intra
prediction
NAL
Intra
prediction
Intra
IDWT
+
+
Filter
Fn
IT
Q1
0.602949018236359
1.115087052456994
0.266864118442872
0.091271763114249
0.026748757410809
1 0 log
(2n
10
1) 2
M SE
(9)
dB
1.115087052456994
0.602949018236379
0.591271763114247
Proposed
combination
1st level
Decomposition
P1
Haar
Haar
0.016864118442874
P2
P3
Haar
Daub
Daub
Haar
0.026748757410809
P4
Daub
Daub
Volume 3 Number 3
Page 93
2nd level
Decomposition
www.ubicc.org
Sequence PSNR
Bus
Stefan
Flower
Garden
Suzie
Mobile
Y
U
V
Y
U
V
Y
U
V
Y
U
V
Y
U
V
Existing
(dB)
35.77
35.83
36.04
36.38
35.00
36.90
36.00
36.51
34.93
37.62
43.76
43.32
33.95
35.13
34.92
P1
(dB)
35.03
35.81
36.03
35.69
35.00
36.90
35.72
36.49
34.92
37.57
43.71
43.35
33.92
35.12
34.96
P2
(dB)
35.88
35.83
36.04
36.50
35.01
36.91
36.13
36.47
34.93
37.66
43.72
43.43
34.10
35.10
34.91
P3
(dB)
35.88
35.82
36.03
36.50
35.00
36.91
36.13
36.50
34.94
37.68
43.75
43.39
34.10
35.08
34.91
P4
(dB)
35.89
35.82
36.03
36.50
35.00
36.91
36.14
36.50
34.93
37.68
43.74
43.39
34.10
35.08
34.91
CONCLUSION
Volume 3 Number 3
Page 94
REFERENCES
[1]
Zixiang
Xiong,
Kannan
Ramachandran,
Michael T. Orchard and Ya-Qin Zhang: A
Comparative study of DCT and Wavelet-Based
Image Coding, IEEE Transactions on Circuits
and Systems for Video Technology, Vol. 9,
No. 5, pp. 692-695 (1999).
[2] N. Ahmed, T. Natarajan and K. R. Rao:
Discrete Cosine Transform, IEEE Transactions
on Computers, pp. 90-93 (1974).
[3]
Ingrid Daubechies: Ten lectures on wavelets,
Capital city Press, Pennsylvania, pp. 53-105
(1992).
[4] Marc Antonini, Michel Barlaud, Pierre Mathieu
and Ingrid Daubechies: Image coding using
wavelet transform, IEEE Transactions on
Image Processing, Vol. 1, No. 2, pp. 205-220
(1992).
[5] Gary J. Sullivan, Pankaj Topiwala and Ajay
Luthra: The H.264/AVC AVC Standard Overview and Introduction to the Fidelity
Range Extensions, SPIE Conference on
Applications of Digital Image Processing
XXVII (2004).
[6] Iain E. G. Richardson: H.264 and MPEG-4
Video Compression, John Wiley & Sons (2003).
[7] ftp://ftp.imtc.org/jvt-experts/reference_software. [8]
C. E. Shannon: A Mathematical theory of
Communication, Bell System Technical Journal,
Vol. 27, pp. 623-656 (1948).
[9] Kelth Jack: Video Demystified, Penram
International Publishing Pvt. Ltd., Mumbai,
pp. 234-236 (2001).
IEEE
[10] Allen
Gersho:
Quantization,
Communications Society Magazine, pp. 16-29
(1977).
[11] Peng H. Ang, Peter A. Ruetz and David Auld:
Video compression makes big gains, IEEE
Spectrum (1991).
[12] Frederic Dufaux, Fabrice Moscheni:Motion
www.ubicc.org
Volume 3 Number 3
Page 95
www.ubicc.org