Morpho-syntactic post-processing of N-best lists for improved French automatic speech recognition - Université de Rennes Accéder directement au contenu
Article Dans Une Revue Computer Speech and Language Année : 2010

Morpho-syntactic post-processing of N-best lists for improved French automatic speech recognition

Stéphane Huet
Pascale Sébillot

Résumé

Many automatic speech recognition (ASR) systems rely on the sole pronunciation dictionaries and language models to take into account information about language. Implicitly, morphology and syntax are to a certain extent embedded in the language models but the richness of such linguistic knowledge is not exploited. This paper studies the use of morpho-syntactic (MS) information in a post-processing stage of an ASR system, by reordering N-best lists. Each sentence hypothesis is first part-of-speech tagged. A morpho-syntactic score is computed over the tag sequence with a long-span language model and combined to the acoustic and word-level language model scores.This new sentence-level score is finally used to rescore N-best lists by reranking or consensus. Experiments on a French broadcast news task show that morpho-syntactic knowledge improves the word error rate and confidence measures. In particular, it was observed that the errors corrected are not only agreement errors and errors on short grammatical words but also other errors on lexical words where the hypothesized lemma was modified.
Fichier principal
Vignette du fichier
PEER_stage2_10.1016%2Fj.csl.2009.10.001.pdf (1.25 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00508471 , version 1 (04-08-2010)

Identifiants

Citer

Stéphane Huet, Guillaume Gravier, Pascale Sébillot. Morpho-syntactic post-processing of N-best lists for improved French automatic speech recognition. Computer Speech and Language, 2010, 24 (4), pp.663-684. ⟨10.1016/j.csl.2009.10.001⟩. ⟨hal-00508471⟩
366 Consultations
319 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More