An N-best strategy, dynamic grammars and selectively trained neural networks for real-time recognition of continuously spelled names over the telephone

Junqua, Jean-Claude, Valente, Stéphane; Fohr, Dominique; Mari, Jean-François
ICASSP 1995, IEEE International Conference on Acoustics, Speech, and Signal Processing, May 9-12, 1995, Detroit, Michigan, USA

We introduce SmarTspelL, a new speaker-independent algorithm to recognize continuously spelled names over the telephone. Our method is based on an N-best multi-pass recognition strategy applying costly constraints when the number of possible candidates is low. This strategy outperforms an HMM recognizer using a grammar containing all the possible names. It is also more suitable to real-time implementation. For a 3388 name dictionary, a 95.3% name recognition rate is obtained. A real-time prototype has been implemented on a workstation. We also present comparisons of different feature sets for speech representation, and two speech recognition approaches based on first- and second-order HMMs.


DOI
Type:
Conférence
City:
Detroit
Date:
1995-05-09
Department:
Sécurité numérique
Eurecom Ref:
5158
Copyright:
© 1995 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK : https://www.eurecom.fr/publication/5158