No data available.
Please log in to see this content.
You have no subscription access to this content.
No metrics data to plot.
The attempt to load metrics for this article has failed.
The attempt to plot a graph for these metrics has failed.
Selection of spectral compressive operator for vector Taylor series-based model adaptation in noisy environments
1. P. J. Moreno, B. Raj, and R. Stern, “ A vector Taylor series approach for environment independent speech recognition,” in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (1996), pp. 733–736.
2. R. Stern, P. J. Moreno, and B. Raj, “ Compensation for speech recognition in degraded acoustical environments,” J. Acoust. Soc. Am. 100, 2792 (1996).
3. A. Acero, L. Deng, T. Kristjansson, and J. Zhang, “ HMM adaptation using vector Taylor series for noisy speech recognition,” in Proceedings of International Conference on Spoken Language Processing (2000), Vol. 3, pp. 869–872.
4. J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, “ A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions,” Comput. Speech Lang. 23, 389–405 (2009).
5. S. Baek and H. Kang, “ Vector Taylor series based HMM adaptation for generalized cepstrum in noisy environment,” in Proceedings of Automatic Speech Recognition and Understanding (2013), pp. 186–191.
7. C. Kim and R. M. Stern, “ Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction,” in Proceedings of INTERSPEECH (2009), pp. 28–31.
8. C. Kim and R. M. Stern, “ Power-normalized cepstral coefficients (pncc) for robust speech recognition,” in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (2012), pp. 4101–4104.
9.ETSI standard doc., “ Speech Processing, Transmission and Quality Aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms,” ETSI ES 202 050 v. 1.1.5 (2007).
10. J. Lee, S. Baek, and H. Kang, “ Signal and feature domain enhancement approaches for robust speech recognition,” in Proceedings of ICICS (2011), pp. 1–4.
13. K. Koishida, K. Tokuda, T. Kobayashi, and S. Imai, “ Spectral representation of speech using mel-generalized cepstral coefficients,” J. Acoust. Soc. Am. 100, 2756 (1996).
14. R. A. Gopinath, M. J. F. Gales, P. S. Gopalakrishnan, S. Balakrishnan-Aiyer, and M. A. Picheny, “ Robust speech recognition in noise: Performance of the IBM continuous speech recognizer on the ARPA noise spoke task,” in Proceedings of the ARPA Workshop on Spoken Language Systems Technology (1995).
15. H. G. Hirsch and D. Pearce, “ The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions,” in Proceedings of ISCA ITRW ASR (2000), pp. 181–188.
16. D. A. Kapilow, Y. Stylianou, and J. Schroeter, “ Detection of non-stationarity in speech signals and its application to time-scaling,” in Proceedings of EUROSPEECH (1999).
Article metrics loading...
This letter investigates the impact of spectral compression on the vector Taylor series-based model adaptation algorithm. Unlike mel-frequency cepstral coefficients obtained by the logarithmic compression, the fractional power compression is used for extracting features. Since the relationship between acoustic models for clean and noisy speech depends on nonlinearity of the spectrum, it is important to select an appropriate compressive operator in the model adaptation. In this letter, the dependency of spectral nonlinearity on the speech recognition system is analyzed in various noisy environments. Experimental results confirm that the replacement of the compressive operator improves the performance of the model adaptation.
Full text loading...
Most read this month