Valence Induction with a Head-Lexicalized PCFG
classification
cmp-lg
cs.CL
keywords
distributionsaccurateacquiredalgorithmcomparisoncontextcorpusdictionary
read the original abstract
This paper presents an experiment in learning valences (subcategorization frames) from a 50 million word text corpus, based on a lexicalized probabilistic context free grammar. Distributions are estimated using a modified EM algorithm. We evaluate the acquired lexicon both by comparison with a dictionary and by entropy measures. Results show that our model produces highly accurate frame distributions.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.