Robust speech recognition from binary masks
1. Bregman, A. S. (1990). Auditory Scene Analysis (MIT, Cambridge, MA).
3. Ephraim, Y., and Malah, D. (1985). “Speech enhancement using a minimum mean-square error log-spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process. 33, 443–445.
4. Hermansky, H., Ellis, D., and Sharma, S. (2000). “Tandem connectionist feature extraction for conventional HMM systems,” in Proceedings of ICASSP, pp. 1635–1638.
5. […], and Wang, D. L. ([…]). “Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction,” Technical Report No. TR51, Department of Computer Science and Engineering, The Ohio State University, Columbus, OH (available online: ftp://ftp.cse.ohio-state.edu/pub/tech-report/2009/TR51.pdf).
6. […], S. G., […], M. S., and Boldt, J. B. ([…]). “Robust isolated speech recognition using ideal binary masks,” Technical Report No. 5780, Department of Informatics and Mathematical Modelling, Technical University of Denmark, Kgs. Lyngby, Denmark; available at http://isp.imm.dtu.dk/staff/jlarsen/pubs/frame.htm (Last viewed 10/11/2010).
7. LeCun, Y., Bottou, L., Bengio, Y., and Haffner, P. (1998). “Gradient-based learning applied to document recognition,” Proc. IEEE 86, 2278–2324.
8. Leonard, R. G. (1984). “A database for speaker-independent digit recognition,” in Proceedings of ICASSP, pp. 111–114.
9. Simard, P. Y., Steinkraus, D., and Platt, J. C. (2003). “Best practices for convolutional neural networks applied to visual document analysis,” in Proceedings of ICDAR, pp. 958–963.
10. Srinivasan, S., and Wang, D. L. (2007). “Transforming binary uncertainties for robust speech recognition,” IEEE Trans. Audio, Speech, Lang. Process. 15, 2130–2140.
11. Wang, D. L., and Brown, G. J. (2006). Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, edited by D. L. Wang and G. J. Brown (Wiley/IEEE, Hoboken, NJ).
12. Wang, D. L., Kjems, U., Pedersen, M. S., Boldt, J. B., and Lunner, T. (2008). “Speech perception of noise with binary gains,” J. Acoust. Soc. Am. 124, 2303–2307.
13. Wang, D. L., Kjems, U., Pedersen, M. S., Boldt, J. B., and Lunner, T. (2009). “Speech intelligibility in background noise with ideal binary time-frequency masking,” J. Acoust. Soc. Am. 125, 2336–2347.
14. Young, S., Kershaw, D., Odell, J., Valtchev, V., and Woodland, P. (2009). The HTK Book (for HTK Version 3.4) (Microsoft Corp., Redmond, WA).