arxiv: cmp-lg/9411001 · v1 · submitted 1994-11-01 · cmp-lg · cs.CL

Sublanguage Terms: Dictionaries, Usage, and Automatic Classification

classification cmp-lg cs.CL
keywords sublanguage, terms, abstracts, degree, dictionaries, distinctiveness, accuracy, automatic

The use of terms from natural and social scientific titles and abstracts is studied from the perspective of sublanguages and their specialized dictionaries. Different notions of sublanguage distinctiveness are explored. Objective methods for separating hard and soft sciences are suggested based on measures of sublanguage use, dictionary characteristics, and sublanguage distinctiveness. Abstracts were automatically classified with a high degree of accuracy by using a formula that considers the degree of uniqueness of terms in each sublanguage. This may prove useful for text filtering or information retrieval systems.
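The abstract does not give the classification formula, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a term's "degree of uniqueness" for a sublanguage can be approximated by the share of that term's training occurrences that fall in the sublanguage, and that an abstract is assigned to the sublanguage with the highest summed share. The function names `term_uniqueness` and `classify`, and the input layout, are hypothetical.

```python
from collections import Counter, defaultdict


def term_uniqueness(training):
    """Estimate how strongly each term is tied to each sublanguage.

    `training` maps a sublanguage label to a list of tokenized abstracts.
    ASSUMPTION: uniqueness = fraction of a term's occurrences found in that
    sublanguage (1.0 means the term appears in no other sublanguage).
    """
    counts = defaultdict(Counter)
    for label, docs in training.items():
        for tokens in docs:
            for term in tokens:
                counts[term][label] += 1

    weights = {}
    for term, per_label in counts.items():
        total = sum(per_label.values())
        weights[term] = {label: n / total for label, n in per_label.items()}
    return weights


def classify(tokens, weights):
    """Return the sublanguage with the highest summed uniqueness score.

    Terms never seen in training contribute nothing; returns None when
    no term of the abstract is known.
    """
    scores = Counter()
    for term in tokens:
        for label, weight in weights.get(term, {}).items():
            scores[label] += weight
    if not scores:
        return None
    return max(scores, key=scores.get)
```

In this sketch a term shared evenly by two sublanguages contributes equally to both and so does not affect the decision, while a term seen in only one sublanguage contributes its full weight there. Raw counts are not normalized for corpus size, so unbalanced training sets would need an extra normalization step.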
