
arxiv: cmp-lg/9806014 · v1 · submitted 1998-06-22 · cmp-lg · cs.CL

Word Sense Disambiguation using Optimised Combinations of Knowledge Sources

classification cmp-lg cs.CL
keywords disambiguation, sense, knowledge, sources, word, combining, corpus, describe

Word sense disambiguation algorithms, with few exceptions, have made use of only one lexical knowledge source. We describe a system which performs unrestricted word sense disambiguation (on all content words in free text) by combining different knowledge sources: semantic preferences, dictionary definitions and subject/domain codes along with part-of-speech tags. The usefulness of these sources is optimised by means of a learning algorithm. We also describe the creation of a new sense tagged corpus by combining existing resources. Tested accuracy of our approach on this corpus exceeds 92%, demonstrating the viability of all-word disambiguation rather than restricting oneself to a small sample.
