Large Vocabulary Continuous Speech Recognition Using Weighted Finite-State Transducers

Caseiro, Diamantino; Trancoso, Isabel

doi:10.1007/3-540-45433-0_15

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2389)

Included in the following conference series:

International Conference for Natural Language Processing in Portugal

Abstract

Weighted finite-state transducers are a unifying formalism for the implementation and integration of the various knowledge sources and structures typical of a large vocabulary continuous speech recognition system.

In this work we show how those knowledge sources can be converted to this formalism, and how they can be integrated in an optimized network, using our finite-state library and tools.

Experiments performed using our system showed the importance of the optimization of the integrated network, and allowed us to obtain very significant improvements in the speed of the recognizer.
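The integration described above rests on transducer composition. The following is a minimal sketch of that operation, not the authors' library: it assumes epsilon-free transducers over the tropical semiring (weights add along a path, the lowest total wins), and the dictionary layout, the function names `compose` and `best_path`, and the toy machines `LEX` and `GRAM` are all hypothetical. It omits the optimization steps (such as determinization and minimization) that the abstract identifies as important.

```python
from collections import deque


def compose(a, b):
    """Compose two epsilon-free weighted transducers (tropical semiring).

    Assumed layout (illustrative only): each transducer is a dict with
      "start":  start state,
      "finals": {state: final weight},
      "arcs":   {state: [(ilabel, olabel, weight, next_state), ...]}.
    """
    start = (a["start"], b["start"])
    arcs, finals = {}, {}
    queue, seen = deque([start]), {start}
    while queue:
        state = queue.popleft()
        qa, qb = state
        arcs[state] = []
        # A pair state is final only if both components are final.
        if qa in a["finals"] and qb in b["finals"]:
            finals[state] = a["finals"][qa] + b["finals"][qb]
        # Match output labels of `a` against input labels of `b`.
        for ia, oa, wa, na in a["arcs"].get(qa, []):
            for ib, ob, wb, nb in b["arcs"].get(qb, []):
                if oa == ib:
                    nxt = (na, nb)
                    arcs[state].append((ia, ob, wa + wb, nxt))
                    if nxt not in seen:
                        seen.add(nxt)
                        queue.append(nxt)
    return {"start": start, "finals": finals, "arcs": arcs}


def best_path(t, inputs):
    """Return (cost, outputs) of the cheapest accepting path, or None."""
    hyps = {t["start"]: (0.0, [])}
    for sym in inputs:
        nxt = {}
        for state, (cost, out) in hyps.items():
            for i, o, w, n in t["arcs"].get(state, []):
                if i == sym:
                    cand = (cost + w, out + [o])
                    if n not in nxt or cand[0] < nxt[n][0]:
                        nxt[n] = cand
        hyps = nxt
    done = [(c + t["finals"][s], out)
            for s, (c, out) in hyps.items() if s in t["finals"]]
    return min(done, key=lambda x: x[0]) if done else None


# Toy "lexicon": maps input symbols to unit labels, with a cost per choice.
LEX = {"start": 0, "finals": {0: 0.0},
       "arcs": {0: [("a", "X", 0.5, 0), ("a", "Y", 1.0, 0), ("b", "Y", 0.25, 0)]}}

# Toy bigram-style "grammar" over the unit labels (input label == output label).
GRAM = {"start": 0, "finals": {1: 0.0, 2: 0.0},
        "arcs": {0: [("X", "X", 1.0, 1), ("Y", "Y", 2.0, 2)],
                 1: [("X", "X", 3.0, 1), ("Y", "Y", 0.5, 2)],
                 2: [("X", "X", 1.0, 1), ("Y", "Y", 1.5, 2)]}}

NETWORK = compose(LEX, GRAM)
print(best_path(NETWORK, ["a", "b"]))  # (2.25, ['X', 'Y'])
```

Composing once ahead of time lets the search run over a single network in which the lexicon and grammar costs are already summed on each arc, which is the property that makes later optimization of the integrated network possible.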

Author information

Authors and Affiliations

INESC-ID/IST, Rua Alves Redol 9, Lisbon, Portugal
Diamantino Caseiro & Isabel Trancoso

Editor information

Editors and Affiliations

Universidade de Lisboa e CAUTL (IST), Av. Rovisco Pais, 1049-001, Lisboa, Portugal
Elisabete Ranchhod
L2F/INESC ID Lisboa, Technical University of Lisbon, Av. Rovisco Pais, 1049-001, Lisboa, Portugal
Nuno J. Mamede

About this paper

Cite this paper

Caseiro, D., Trancoso, I. (2002). Large Vocabulary Continuous Speech Recognition Using Weighted Finite-State Transducers. In: Ranchhod, E., Mamede, N.J. (eds) Advances in Natural Language Processing. PorTAL 2002. Lecture Notes in Computer Science, vol 2389. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45433-0_15

DOI: https://doi.org/10.1007/3-540-45433-0_15
Published: 21 June 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43829-8
Online ISBN: 978-3-540-45433-5
eBook Packages: Springer Book Archive
