Abstract
Video coding has been widely explored by academia and industry in recent years, mainly due to the great popularization of video applications and multimedia-capable devices. The Motion Estimation (ME) process receives special attention since it is one of the most complex steps in video coding. The Test Zone Search (TZS) is the main algorithm employed for integer ME in recent video codecs, such as those based on the High Efficiency Video Coding (HEVC), and has been used in the standardization process of the future Versatile Video Coding (VVC) standard. However, even though it is designed as a fast ME algorithm, the computational effort required by TZS is still very high, compromising the encoding process in multimedia-capable devices that operate on limited energy or computational resources. This work presents the Bypass Zone Search (BZS) algorithm, a learning-based solution for fast ME that improves TZS, aiming at a better tradeoff between compression efficiency and computational cost. First, a set of analyses on TZS is presented, which allowed the design of two strategies to reduce the ME computational cost. The first one, named as Learning-based Bypass Motion Estimation (LBME), consists of a machine learning-based approach that predicts whether the best motion vector has already been found and bypasses the remaining ME steps. The second strategy, named as Astroid Raster Pattern (ARP), is a novel search pattern developed for the most complex TZS step, the Raster Search. By combining the two proposed strategies in BZS, the ME processing time is reduced by 60.98% (Random Access) and 63.05% (Low Delay) in comparison to TZS. The overall HEVC encoding time is reduced by 14.32% (Random Access) and 17.64% (Low Delay), with a negligible loss of 0.0837% (Random Access) and 0.04% (Low Delay) in BD-rate.







Similar content being viewed by others
References
Afonso V, Maich H, Audibert L, Zatt B, Porto M, Agostini L, Susin A (2016) Hardware implementation for the HEVC fractional motion estimation targeting real-time and low-energy. J Integr Circuits Syst 11(2):106
Agarwal PK, Sharir M, Comput ACM (1998) Efficient algorithms for geometric optimization. Surv 30(4):412–458. https://doi.org/10.1145/299917.299918
Belghith F, Kibeya H, Loukil H, Ayed MAB, Masmoudi N (2016) A new fast motion estimation algorithm using fast mode decision for highefficiency video coding standard. J Real-Time Image Proc 11(4):675
Bjontegaard G (2001) Calculation of average PSNR differences between RD-curves. VCEG-M33
Bross B, Chen J, Liu S, Wang Y (2020) Versatile Video Coding (Draft 8). JVET-Q2001
CISCO (2018) Cisco visual networking index: Global mobile data traffic forecast update 2017–2022. Cisco Visual Networking Index (VNI)
Cheung C-H, Po L-M (2005) Novel cross-diamond-hexagonal search algorithms for fast block motion estimation. IEEE Trans Multimedia 7(1):16. https://doi.org/10.1109/TMM.2004.840609
Correa G, Assuncao P, Agostini L, da Silva Cruz LA (2012) Performance and computational complexity assessment of high-efficiency video encoders. IEEE Trans Circuits Syst Video Technol 22(12):1899. https://doi.org/10.1109/TCSVT.2012.2223411
Fan R, Zhang Y, Li B (2017) Motion classification-based fast motion estimation for high-efficiency video coding. IEEE Trans Multimedia 19 (5):893. https://doi.org/10.1109/TMM.2016.2642786
Gonçalves P., Correa G, Porto M, Zatt B, Agostini L (2017) Multiple early termination scheme for TZ search algorithm based on data mining and decision trees. In: 2017 IEEE 19th international workshop on multimedia signal processing (MMSP). https://doi.org/10.1109/MMSP.2017.8122245, pp 1–6
Goncalves P, Moraes C, Porto M, Correa G (2019) Complexity-aware TZS algorithm for mobile video encoders. J Integr Circuits Syst 14(3):1
Goncalves P, Porto M, Zatt B, Agostini L, Correa G (2018) Octagonal-axis raster pattern for improved Test Zone Search motion estimation. In: 2018 IEEE international conference on acoustics. speech and signal processing (ICASSP). https://doi.org/10.1109/ICASSP.2018.8462580, pp 1763–1767
Hu N, Yang E (2014) Fast motion estimation based on confidence interval. IEEE Trans Circuits Syst Video Technol 24(8):1310. https://doi.org/10.1109/TCSVT.2014.2306035
ISO/IEC-JCT1/SC29/WG11 (2020) High Efficiency Video Coding (HEVC) Test Model 16 (HM16) Encoder Description. JVET-q2001
ITU Telecommunication Standardization (2003) ITU-T Recommendation H.264
ITU-T (2008) Subjective video quality assessment methods for multimedia applications. p 910
Jia L, Tsui C, Au OC, Jia K (2019) A new rate-complexity-distortion model for fast motion estimation algorithm in HEVC. IEEE Transactions on Multimedia 21(4):835. https://doi.org/10.1109/TMM.2018.2866762
Jing X, Chau Lap-Pui (2004) An efficient three-step search algorithm for block motion estimation. IEEE Transactions on Multimedia 6(3):435. https://doi.org/10.1109/TMM.2004.827517
Karegowda A, Manjunath A (2010) Comparative study of attribute selection using gain ratio and correlation based feature selection. International Journal of Information Technology and Knowledge Management 2
Li N, Zhang Y, Zhu L, Luo W, Kwong S (2019) Reinforcement learning based coding unit early termination algorithm for high efficiency video coding. J Vis Commun Image Represent 60:276. https://doi.org/10.1016/j.jvcir.2019.02.021. http://www.sciencedirect.com/science/article/pii/S1047320319300677
Linck I, Gomez AT, Alaghband G (2018) Test Zonal Search based on Region Label (TZSR) for Motion Estimation in HEVC. In: 2018 IEEE 20th international workshop on multimedia signal processing (MMSP), pp 1–6
Liu Z, Lin TL, Chou CC (2016) Efficient prediction of CU depth and CU mode for fast HEVC encoding using statistical analysis. J Vis Commun Image Represent 38:474. https://doi.org/10.1016/j.jvcir.2016.03.025
Luo F, Wang S, Wang S, Zhang X, Ma S, Gao W (2019) GPU-based hierarchical motion estimation for high efficiency video coding. IEEE Transactions on Multimedia 21(4):851. https://doi.org/10.1109/TMM.2018.2867260
Nalluri P, Alves LN, Navarro A (2015) Complexity reduction methods for fast motion estimation in HEVC. Signal Process Image Commun 39:280. https://doi.org/10.1016/j.image.2015.09.015
Pakdaman F, Hashemi M, Ghanbari M (2020) A low complexity and computationally scalable fast motion estimation algorithm for HEVC. Multimedia Tools and Applications 79. https://doi.org/10.1007/s11042-019-08593-y
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825
Quinlan JR (2004) Data mining tools see5 and c5. 0. http://www.rulequest.com/see5-info.html
Quinlan JR (2014) C4. 5: programs for machine learning. Elsevier, New York
R Core Team (2019) R foundation for statistical computing. R: A Language and Environment for Statistical Computing. http://www.R-project.org/
Sharman K (2018) Common test conditions for HM video coding experiments. JCT-VC Document AC1100
Sullivan GJ, Ohm J, Han W, Wiegand T (2012) . IEEE Trans Circuits Syst Video Technol 22(12):1649. https://doi.org/10.1109/TCSVT.2012.2221191
U.V. Group (2019) Ultra Video Group. Tampere University of Technology. http://ultravideo.cs.tut.fi/
Valizadeh S, Nasiopoulos P, Ward R (2021) Improving compression efficiency of HEVC using perceptual coding. Multimed Tools Appl 80:1
Vanne J, Viitanen M, Hämäläinen TD (2014) Efficient mode decision schemes for HEVC inter prediction. IEEE Trans Circuits Syst Video Technol 24 (9):1579. https://doi.org/10.1109/TCSVT.2014.2308453
Video Technology Research Group (ViTech) GitHub Repository. https://github.com/vitech-ufpel/LBME
Wang Y, Fan X, Xiong R, Zhao D, Gao W (2021) Neural network-based enhancement to inter prediction for video coding. IEEE Trans Circuits Syst Video Technol 1–1
Zhang Y, Zhang C, Fan R, Ma S, Chen Z, Kuo CJ (2019) Recent advances on hevc inter-frame coding: from optimization to implementation and beyond. IEEE Trans Circuits Syst Video Technol 1–1
Zhu C, Lin X, Chau L-P (2002) Hexagon-based search pattern for fast block motion estimation. IEEE Trans Circuits Syst Video Technol 12(5):349. https://doi.org/10.1109/TCSVT.2002.1003474
Zhu S, Ma K-K (2000) A new diamond search algorithm for fast block-matching motion estimation. IEEE Trans Image Process 9(2):287. https://doi.org/10.1109/83.821744
Acknowledgements
This work was supported by the Brazilian Agency for Scientific and Technological Development (CNPq, Brazil). This research was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) – Finance code 001, the Conselho Nacional de Desenvolvimento Científico e Tecnológico - Brasil (CNPq), and the Fundação de Amparo à Pesquisa do Rio Grande do Sul - Brasil (FAPERGS).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Goncalves, P., Correa, G., Agostini, L. et al. Learning-based bypass zone search algorithm for fast motion estimation. Multimed Tools Appl 82, 3535–3560 (2023). https://doi.org/10.1007/s11042-022-13094-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13094-6