
arxiv: cs/0003074 · v1 · submitted 2000-03-23 · 💻 cs.CL

A Finite State and Data-Oriented Method for Grapheme to Phoneme Conversion

classification 💻 cs.CL
keywords accuracy, phoneme, conversion, graphemes, method, system, words, achieves

A finite-state method, based on leftmost longest-match replacement, is presented for segmenting words into graphemes and for converting graphemes into phonemes. A small set of hand-crafted conversion rules for Dutch achieves a phoneme accuracy of over 93%. The accuracy of the system is further improved by using transformation-based learning. The best system (using a large set of rule templates and a 'lazy' variant of Brill's algorithm), trained on only 40K words, reaches a phoneme accuracy of 99%.
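
The leftmost longest-match idea behind the segmentation step can be illustrated with a minimal sketch. This is not the paper's finite-state implementation: it uses a plain Python loop instead of a transducer, and the grapheme inventory `GRAPHEMES` and the function name `segment` are hypothetical, chosen only for illustration.

```python
# Hypothetical, deliberately tiny inventory of multi-letter Dutch graphemes.
# The rule set used in the paper is not reproduced here.
GRAPHEMES = {"sch", "ch", "ng", "oe", "ie", "ij", "aa", "ee", "oo", "uu"}


def segment(word, graphemes=GRAPHEMES):
    """Split a word into graphemes, scanning left to right and always
    taking the longest inventory entry that matches at the current position.
    Letters not covered by a multi-letter entry become single-letter graphemes."""
    max_len = max(len(g) for g in graphemes)
    result = []
    i = 0
    while i < len(word):
        # Try the longest candidate first, down to length 2.
        for n in range(min(max_len, len(word) - i), 1, -1):
            candidate = word[i:i + n]
            if candidate in graphemes:
                result.append(candidate)
                i += n
                break
        else:
            # No multi-letter grapheme starts here: emit one letter.
            result.append(word[i])
            i += 1
    return result


print(segment("school"))  # ['sch', 'oo', 'l']
print(segment("boek"))    # ['b', 'oe', 'k']
```

In "school", the longest match at position 0 is "sch" rather than "s" followed by "ch", which is the behaviour that leftmost longest-match is meant to guarantee. Grapheme-to-phoneme conversion would then operate on these segments rather than on individual letters.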
