Abstract
Entity Disambiguation (ED) is a fundamental task in Natural Language Processing (NLP). The term Entity is used to mean either a Named Entity or an Abstract Concept. Although there have been many works on the ED task for English and some for Vietnamese, this is the first time this paper tackles the general ED task for Vietnamese that deal with both named entities and abstract concepts. In this paper, we propose a method for linking named entities and abstract concepts in Vietnamese documents to the corresponding articles in the Vietnamese Wikipedia. In particular, it first has to recognize Vietnamese entity mentions, i.e., phrases that represent named entities or abstract concepts. Experimental evaluation is also presented to demonstrate the performance of the proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Nguyen, H.T., Cao, T.H., Nguyen, T.T., Vo-Thi, T.-L.: Heuristics and Statistics-based Wikification. In: Anthony, P., Ishizuka, M., Lukose, D. (eds.) PRICAI 2012. LNCS, vol. 7458, pp. 879–882. Springer, Heidelberg (2012)
Mihalcea, R., Csomai, A.: Wikify!: Linking Documents to Encyclopedic Knowledge. In: Proc. of the 16th ACM International Conference on Information and Knowledge Management, pp. 233–242 (2007)
Milne, D., Witten, I.H.: Learning to Link with Wikipedia. In: Proc. of the 17th ACM International Conference on Information and Knowledge Management, pp. 509–518 (2008)
Nguyen, H.T., Cao, T.H.: A Knowledge-based Method to Resolve Name Ambiguity in Vietnamese Texts. In: Addendum Contributions of the 5th International Conference on Research, Innovation and Vision for the Future, Studia Informatica Universalis, pp. 83–88 (2007)
Ji, H., Grishman, R., Dang, H.T.: An Overview of the TAC 2011 Knowledge Base Population Track. In: Proc. of Text Analysis Conference (2011)
Ji, H., Grishman, R.: Knowledge Base Population Successful Approaches and Challenge. In: Proc. of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 1148–1158 (2011)
Zhang, W., Su, J., Tan, C.L., Wang, W.: Entity Linking Leveraging Automatically Genrated Annotation. In: Proc. of 23rd International Conference on Computational Linguistics, pp. 1290–1298 (2010)
Han, X., Sun, L., Zhao, J.: Collective Entity Linking in Web Text: A Graph Based Method. In: Proc. of the 34th Annual ACM Special Interest Group on Information Retrieval Conference, pp. 765–774 (2011)
Pham, T.X.T., Tran, T.Q., Dinh, D., Collier, N.: Named Entity Recognition in Vietnamese Using Classifier Voting. ACM Transactions on Asian Language Information Processing 6(4) (2007)
Dinh, D.: Natural Language Processing. VNU-Ho Chi Minh Publisher (2006) (in Vietnamese)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Truong, L.M., Cao, T.H., Dinh, D. (2014). Towards Vietnamese Entity Disambiguation. In: Huynh, V., Denoeux, T., Tran, D., Le, A., Pham, S. (eds) Knowledge and Systems Engineering. Advances in Intelligent Systems and Computing, vol 245. Springer, Cham. https://doi.org/10.1007/978-3-319-02821-7_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-02821-7_26
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02820-0
Online ISBN: 978-3-319-02821-7
eBook Packages: EngineeringEngineering (R0)