Abstract
The construction of a lexicon is invariably one of the most labour-intensive and expensive parts in the construction of NLP applications. For this reason, techniques have been developed for the acquisition, structuring, maintenance and reusability of lexical resources. Some of the more important techniques will be presented in the lecture. The presentation will be based on Frank Van Eynde & Dafydd Gibbon (eds.), Lexicon Development for Speech and Language Processing.
Slides (.ppt)
Lexical resources
Distribution agencies
The Dutch Human Language Technology Agency TST
The European Language Resources Association ELRA
The Linguistic Data Consortium LDC
References
Brian Boguraev & Ted Briscoe (eds.), Computational lexicography for natural language processing, Longman, 1989.
Ted Briscoe, Valeria de Paiva & Ann Copestake (eds.), Inheritance, Defaults and the Lexicon, Cambridge University Press, 1993.
Christiane Fellbaum (ed.), WordNet. An electronic lexical database. MIT Press, 1998.
John Sinclair, M. Hoelter & C. Peters (eds.), The languages of definition: the formalization of dictionary definitions for natural language processing. Studies in Machine Translation and Natural Language Processing. Volume 7. Office for Official Publications of the European Communities. Luxembourg, 1995, 210 pages.
Frank Van Eynde & Dafydd Gibbon (eds.), Lexicon development for speech and language processing. Kluwer Academic Publishers, 2000, xii + 298 pages.