Word Sense Disambiguation using Optimised Combinations of Knowledge Sources

Mark Stevenson; Yorick Wilks

arxiv: cmp-lg/9806014 · v1 · submitted 1998-06-22 · cmp-lg · cs.CL

Word Sense Disambiguation using Optimised Combinations of Knowledge Sources

Yorick Wilks , Mark Stevenson This is my paper

classification cmp-lg cs.CL

keywords disambiguationsenseknowledgesourceswordcombiningcorpusdescribe

0 comments

read the original abstract

Word sense disambiguation algorithms, with few exceptions, have made use of only one lexical knowledge source. We describe a system which performs unrestricted word sense disambiguation (on all content words in free text) by combining different knowledge sources: semantic preferences, dictionary definitions and subject/domain codes along with part-of-speech tags. The usefulness of these sources is optimised by means of a learning algorithm. We also describe the creation of a new sense tagged corpus by combining existing resources. Tested accuracy of our approach on this corpus exceeds 92%, demonstrating the viability of all-word disambiguation rather than restricting oneself to a small sample.

This paper has not been read by Pith yet.

Word Sense Disambiguation using Optimised Combinations of Knowledge Sources

discussion (0)