Abstract
In realizing video retrieval system, the crucial point is how to provide an effective access method of video contents. This paper focuses on Japanese cooking instruction utterances and describes a method of analyzing structure of them, which leads to a summary of video. We detect a hierarchical structure of video contents by using linguistic and visual information. We found that the integration of visual information can improve the detection of task units better than using linguistic information alone.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Shibata, T., Kawahara, D., Okamoto, M., Kurohashi, S., Nishida, T.: Structural analysis of instruction utterances. In: Palade, V., Howlett, R.J., Jain, L. (eds.) KES 2003. LNCS, vol. 2773, pp. 1054–1061. Springer, Heidelberg (2003)
Kawahara, D., Kurohashi, S.: Zero pronoun resolution based on automatically constructed case frames and structural preference of antecedents. In: Proceedings of The 1st International Joint Conference on Natural Language Processing, pp. 334–341 (2004)
Izuno, H., Nakamura, Y., Ohta, Y.: Quevico: A framework for video-based interactive media. In: Working Notes WS-5 International Workshop on Intelligent Media Technology for Communicative Reality, PRICAI-2002 (Seventh Pacific Rim International Conference on Artificial Intelligence), August 2002, pp. 6–11 (2002)
Kurohashi, S., Nagao, M.: Automatic detection of discourse structure by checking surface information in sentences. In: Proceedings of 15th COLING, vol. 2, pp. 1123–1127 (1994)
Lienhart, R.: Comparison of automatic shot boundary detection algorithms. In: Proceedings of SPIE Conf. on Storage and Retrieval for Image & Video Databases VII, vol. 3656, pp. 290–301 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shibata, T., Tachiki, M., Kawahara, D., Okamoto, M., Kurohashi, S., Nishida, T. (2004). Structural Analysis of Instruction Utterances Using Linguistic and Visual Information. In: Negoita, M.G., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2004. Lecture Notes in Computer Science(), vol 3213. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30132-5_57
Download citation
DOI: https://doi.org/10.1007/978-3-540-30132-5_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23318-3
Online ISBN: 978-3-540-30132-5
eBook Packages: Springer Book Archive