Abstract
In this paper, we propose a novel method for rapid speaker adaptation called speaker support vector selection (SSVS). By taking gaussian mixture model (GMM) as speaker model, the speakers acoustically close to the test speaker are selected .Different from other selection method, just computing the likelihood between models, we utilizing support vector machines (SVM) to obtain a ‘more optimal speaker subset’. Such selection is dynamically determined according to the distribution of reference speakers close the test. Furthermore, a single-pass re-estimation procedure conditioned on the selected speakers is shown. This adaptation strategy was evaluated in a large vocabulary speech recognition task. The presented method improves the relative accuracy rates by 13% compared to the baseline system.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Legetter, C.J., Woodland, P.C.: Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density HMM’s in Compute. Speech Lang. 9, 171–186 (1996)
Gauvain, J.L., Lee, C.H.: Maximum a posterior estimation for multivariate Gaussian observations of Markov chains. IEEE Trans. Speech Audio Processing 2, 291–298 (1994)
Sankar, A., Beaufays, F., Digalakis, V.: Training data clustering for improved speech recognition. In: Proc. Eurospeech, pp. 502–505 (1995)
Huang, C., Chen, T., Chang, E.: Speaker Selection Training for Large Vocabulary Continuous Speech Recognition. In: Proc. ICASSP (2002)
Reynolds, D.A., Rose, R.C.: Robust text dependent speaker identification using Gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing 3, 72–83 (1995)
Reynolds, D.A., Quatieri, T., Dunn, R.B.: Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing 10, 19–41 (2000)
Vapnik, V.N.: Statistical Learning Theory. Wiley, New York (1998)
Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Knowledge Discovery Data Mining 2(2), 121–167 (1998)
Gunn, S.R.: Support vector machines for classification and regression. Technical Report Image Speech and Intelligent Systems Research Group, University of Southampton (1997)
Schmidt, M., Gish, H.: Speaker identification via support vector classifiers. In: Proc. ICASSP, pp. 105–108 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, J., Lei, J., Guo, J., Yang, Z. (2006). SVM Based Speaker Selection Using GMM Supervector for Rapid Speaker Adaptation. In: Wang, TD., et al. Simulated Evolution and Learning. SEAL 2006. Lecture Notes in Computer Science, vol 4247. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11903697_78
Download citation
DOI: https://doi.org/10.1007/11903697_78
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-47331-2
Online ISBN: 978-3-540-47332-9
eBook Packages: Computer ScienceComputer Science (R0)