pith. sign in

arxiv: cmp-lg/9805001 · v1 · submitted 1998-05-05 · cmp-lg · cs.CL

Valence Induction with a Head-Lexicalized PCFG

classification cmp-lg cs.CL
keywords distributionsaccurateacquiredalgorithmcomparisoncontextcorpusdictionary
0
0 comments X
read the original abstract

This paper presents an experiment in learning valences (subcategorization frames) from a 50 million word text corpus, based on a lexicalized probabilistic context free grammar. Distributions are estimated using a modified EM algorithm. We evaluate the acquired lexicon both by comparison with a dictionary and by entropy measures. Results show that our model produces highly accurate frame distributions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.