Abstract
In order to create a structured database describing researchers, home pages can be used as an information source. As the first step of this task, home pages are searched and identified with the usage of the classifier. Then, the information extraction process is performed to enrich researchers profiles, e.g., extract phone and e-mail. We proposed the algorithm for extracting phone numbers, fax numbers and e-mails based on generalised sequential patterns. Extracted information is stored in the structured database and can be searched by users.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hazan, R.: Identyfikacja stron domowych ludzi nauki i wydobywanie z nich informacji. Bachelorâs Thesis, Warsaw University of Technology (2012)
Yao, L., Tang, J., Li, J.Z.: A unified approach to researcher profiling. In: Web Intelligence, pp. 359â366. IEEE Computer Society (2007)
Srikant, R., Agrawal, R.: Mining Sequential Patterns: Generalizations and Performance Improvements. In: Apers, P.M.G., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 1â17. Springer, Heidelberg (1996)
Synat system ontology, http://wizzar.ii.pw.edu.pl/passim-ontology/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Hazan, R., Andruszkiewicz, P. (2013). Home Pages Identification and Information Extraction in Researcher Profiling. In: Bembenik, R., Skonieczny, L., Rybinski, H., Kryszkiewicz, M., Niezgodka, M. (eds) Intelligent Tools for Building a Scientific Information Platform. Studies in Computational Intelligence, vol 467. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35647-6_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-35647-6_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35646-9
Online ISBN: 978-3-642-35647-6
eBook Packages: EngineeringEngineering (R0)