H02D0A Language Engineering Applications

Computational lexicography

Frank Van Eynde
Centre for Computational Linguistics (KU Leuven)

Abstract

Building a lexicon is invariably one of the most labour-intensive and expensive parts of developing an NLP application. For this reason, techniques have been developed for the acquisition, structuring, maintenance and reuse of lexical resources. The lecture presents some of the more important of these techniques. The presentation is based on Frank Van Eynde & Dafydd Gibbon (eds.), Lexicon Development for Speech and Language Processing.

Slides (.ppt)

Lexical resources

WordNet

FrameNet

VerbNet

Distribution agencies

The Dutch Human Language Technology Agency (TST)

The European Language Resources Association (ELRA)

The Linguistic Data Consortium (LDC)

References

Brian Boguraev & Ted Briscoe (eds.), Computational Lexicography for Natural Language Processing, Longman, 1989.

Ted Briscoe, Valeria de Paiva & Ann Copestake (eds.), Inheritance, Defaults and the Lexicon, Cambridge University Press, 1993.

Christiane Fellbaum (ed.), WordNet: An Electronic Lexical Database, MIT Press, 1998.

John Sinclair, M. Hoelter & C. Peters (eds.), The Languages of Definition: The Formalization of Dictionary Definitions for Natural Language Processing, Studies in Machine Translation and Natural Language Processing, Volume 7, Office for Official Publications of the European Communities, Luxembourg, 1995, 210 pages.

Frank Van Eynde & Dafydd Gibbon (eds.), Lexicon Development for Speech and Language Processing, Kluwer Academic Publishers, 2000, xii + 298 pages.