Abstract
Creating 3D video content from existing 2D video has been stimulated by recent growth in 3DTV technologies. Depth cues from motion, focus, gradient, or texture shading are typically computed to create 3D world perception. More selective attention might be introduced using manual or automated methods for entertainment or educational purposes. In this paper, we propose an adaptive conversion framework that combines depth and visual saliency cues. A user study was designed and subjective quality scores on test videos were obtained using a tailored single stimulus continuous quality scale (SSCQS) method. The resulting mean opinion scores show that our method is favored by human observers in comparison to other state-of-the-art conversion methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Borji, A., Cheng, M., Jiang, H., Li, J.: Salient object detection: A survey (2014). CoRR abs/1411.5878. http://arxiv.org/abs/1411.5878
Borji, A., Cheng, M., Jiang, H., Li, J.: Salient object detection: A benchmark. CoRR abs/1501.02741 (2015). http://arxiv.org/abs/1501.02741
Ciptadi, A., Hermans, T., Rehg, J.M.: An in depth view of saliency. In: British Machine Vision Conference (BMVC), September 2013
Duncan, K., Sarkar, S.: Saliency in images and video: a brief survey. Computer Vision IET 6(6), 514–523 (2012)
Fang, Y., Wang, J., Narwaria, M., Le Callet, P., Lin, W.: Saliency detection for stereoscopic images. IEEE Transactions on Image Processing 23(6), 2625–2636 (2014)
Goldmann, L., De Simone, F., Ebrahimi, T.: A comprehensive database and subjective evaluation methodology for quality of experience in stereoscopic video (2010). http://dx.doi.org/10.1117/12.839438
ITU-R: Bt.1438 subjective assessment of stereoscopic television pictures
ITU-R: Bt.500- 11 methodology for the subjective assessment of the quality of television pictures
Karasulu, B., Korukoglu, S.: Performance Evaluation Software - Moving Object Detection and Tracking in Videos. Springer Briefs in Computer Science. Springer (2013). http://dx.doi.org/10.1007/978-1-4614-6534-8
Karsch, K., Liu, C., Kang, S.B.: Depth extraction from video using non-parametric sampling. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 775–788. Springer, Heidelberg (2012)
Krishna, M., Ravishankar, M., Babu, D.: Automatic detection and tracking of moving objects in complex environments for video surveillance applications. In: 2011 3rd International Conference on Electronics Computer Technology (ICECT), vol. 1, pp. 234–238, April 2011
Lang, C., Nguyen, T.V., Katti, H., Yadati, K., Kankanhalli, M., Yan, S.: Depth matters: influence of depth cues on visual saliency. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part II. LNCS, vol. 7573, pp. 101–115. Springer, Heidelberg (2012). http://dx.doi.org/10.1007/978-3-642-33709-3_8
Liu, C., Yuen, J., Torralba, A.: SIFT flow: Dense correspondence across scenes and its applications. IEEE Transactions on Pattern Analysis and Machine Intelligence 33(5), 978–994 (2011)
Ramenahalli, S., Niebur, E.: Computing 3D saliency from a 2D image. In: 2013 47th Annual Conference on Information Sciences and Systems (CISS), pp. 1–5, March 2013
Ruksana, S., Kumar, V.: A novel saliency detection framework using low level multi scale image features based on wavelet transformation technique. International Journal of Soft Computing and Engineering (IJSCE) III(1), 8–18 (2014)
Szeliski, R.: Computer Vision: Algorithms and Applications, 1st edn. Springer-Verlag New York Inc., New York (2010)
Wang, J., Fang, Y., Narwaria, M., Lin, W., Le Callet, P.: Stereoscopic image retargeting based on 3D saliency detection. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 669–673, May 2014
Wu, Y., Lim, J., Yang, M.H.: Online object tracking: a benchmark. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2411–2418, June 2013
Zhang, D., Javed, O., Shah, M.: Video object segmentation through spatially accurate and temporally dense extraction of primary object regions. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 628–635, June 2013
Zhang, L., Vazquez, C., Knorr, S.: 3D-TV content creation: Automatic 2D-to-3D video conversion. IEEE Transactions on Broadcasting 57(2), 372–383 (2011)
Zhang, Z., Wang, R., Zhou, C., Wang, Y., Gao, W.: A compact stereoscopic video representation for 3D video generation and coding. Data Compression Conference (DCC) 2012, 189–198 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Taher, H., Rushdi, M., Islam, M., Badawi, A. (2015). Adaptive Saliency-Weighted 2D-to-3D Video Conversion. In: Azzopardi, G., Petkov, N. (eds) Computer Analysis of Images and Patterns. CAIP 2015. Lecture Notes in Computer Science(), vol 9257. Springer, Cham. https://doi.org/10.1007/978-3-319-23117-4_63
Download citation
DOI: https://doi.org/10.1007/978-3-319-23117-4_63
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23116-7
Online ISBN: 978-3-319-23117-4
eBook Packages: Computer ScienceComputer Science (R0)