1887
banner image
No data available.
Please log in to see this content.
You have no subscription access to this content.
No metrics data to plot.
The attempt to load metrics for this article has failed.
The attempt to plot a graph for these metrics has failed.
oa
Effect of temporal modulation rate on the intelligibility of phase-based speech
Rent:
Rent this article for
Access full text Article
/content/asa/journal/jasa/134/6/10.1121/1.4828978
1.
1. ANSI (1997). ANSI S3.5, American National Standards Methods for Calculation of the Speech Intelligibility Index (Acoustical Society of America, New York).
2.
2. Chen, F. , and Loizou, P. (2011). “ Predicting the intelligibility of vocoded speech,” Ear Hear. 32, 331338.
http://dx.doi.org/10.1097/AUD.0b013e3181ff3515
3.
3. Dolson, M. (1986). “ The phase vocoder: A tutorial,” Comput. Music J. 10, 1427.
http://dx.doi.org/10.2307/3680093
4.
4. Dorman, M. , Loizou, P. , and Rainey, D. (1997). “ Speech intelligibility as a function of the number of channels of stimulation for signal processors using sine-wave and noise-band outputs,” J. Acoust. Soc. Am. 102, 24032411.
http://dx.doi.org/10.1121/1.419603
5.
5. Gilbert, G. , and Lorenzi, C. (2006). “ The ability of listeners to use recovered envelope cues from speech fine structure,” J. Acoust. Soc. Am. 119, 24382444.
http://dx.doi.org/10.1121/1.2173522
6.
6. Greenwood, D. D. (1990). “ A cochlear frequency-position function for several species—29 years later,” J. Acoust. Soc. Am. 87, 25922605.
http://dx.doi.org/10.1121/1.399052
7.
7. Kazama, M. , Gotoh, S. , Tohyama, M. , and Houtgast, T. (2010). “ On the significance of phase in the short term Fourier spectrum for speech intelligibility,” J. Acoust. Soc. Am. 127, 14321439.
http://dx.doi.org/10.1121/1.3294554
8.
8. Lorenzi, C. , Gilbert, G. , Carn, H. , Garnier, S. , and Moore, B. C. (2006). “ Speech perception problems of the hearing impaired reflect inability to use temporal fine structure,” Proc. Natl. Acad. Sci. U.S.A. 103, 1886618869.
http://dx.doi.org/10.1073/pnas.0607364103
9.
9. McAulay, R. , and Quatieri, T. (1995). “ Sinusoidal coding,” in Speech Coding and Synthesis, edited by W. Kleijn and K. Paliwal (Elsevier Science, New York).
10.
10. Moore, B. C. (2008). “ The role of temporal fine structure processing in pitch perception, masking, speech perception for normal-hearing hearing-impaired people,” J. Assoc. Res. Otolaryngol. 9, 399406.
11.
11. Shannon, R. V. , Zeng, F. G. , Kamath, V. , Wygonski, J. , and Ekelid, M. (1995). “ Speech recognition with primarily temporal cues,” Science 270, 303304.
http://dx.doi.org/10.1126/science.270.5234.303
12.
12. Smith, Z. M. , Delgutte, B. , and Oxenham, A. J. (2002). “ Chimaeric sounds reveal dichotomies in auditory perception,” Nature 416, 8790.
http://dx.doi.org/10.1038/416087a
13.
13. Studebaker, G. A. (1985). “ A ‘rationalized’ arcsine transform,” J. Speech Hear. Res. 28, 455462.
14.
14. Wong, L. L. , Soli, S. D. , Liu, S. , Han, N. , and Huang, M. W. (2007). “ Development of the Mandarin hearing in noise test (MHINT),” Ear Hear. 28, 70S74S.
http://dx.doi.org/10.1097/AUD.0b013e31803154d0
15.
15. Xu, L. , Thompson, C. S. , and Pfingst, B. E. (2005). “ Relative contributions of spectral and temporal cues for phoneme recognition,” J. Acoust. Soc. Am. 117, 32553267.
http://dx.doi.org/10.1121/1.1886405
16.
16. Zeng, F. G. , Nie, K. , Liu, S. , Stickney, G. , Del Rio, E. , Kong, Y. Y. , and Chen, H. (2004). “ On the dichotomy in auditory perception between temporal envelope and fine structure cues,” J. Acoust. Soc. Am. 116, 13511354.
http://dx.doi.org/10.1121/1.1777938
http://aip.metastore.ingenta.com/content/asa/journal/jasa/134/6/10.1121/1.4828978
Loading
/content/asa/journal/jasa/134/6/10.1121/1.4828978
Loading

Data & Media loading...

Loading

Article metrics loading...

/content/asa/journal/jasa/134/6/10.1121/1.4828978
2013-11-06
2014-07-31

Abstract

This study investigated the effect of temporal modulation rate on the intelligibility of speech synthesized with primarily phase information using two methods: Phase-based vocoded speech (preserving phase cues and discarding envelope cues) and Hilbert fine-structure stimuli (summing up the multi-channel Hilbert fine-structure waveforms). Listening experiments with normal-hearing participants showed that the intelligibility of the two types of phase-based speech was significantly improved when synthesized using a high temporal modulation rate (or short frame) compared to that synthesized using the whole speech segment. This intelligibility advantage appears to be attributed to better preservation of the temporal envelope cues in phase-based speech.

Loading

Full text loading...

/deliver/fulltext/asa/journal/jasa/134/6/1.4828978.html;jsessionid=dnbq9q6c30cpk.x-aip-live-02?itemId=/content/asa/journal/jasa/134/6/10.1121/1.4828978&mimeType=html&fmt=ahah&containerItemId=content/asa/journal/jasa
true
true
This is a required field
Please enter a valid email address
This feature is disabled while Scitation upgrades its access control system.
This feature is disabled while Scitation upgrades its access control system.
752b84549af89a08dbdd7fdb8b9568b5 journal.articlezxybnytfddd
Scitation: Effect of temporal modulation rate on the intelligibility of phase-based speech
http://aip.metastore.ingenta.com/content/asa/journal/jasa/134/6/10.1121/1.4828978
10.1121/1.4828978
SEARCH_EXPAND_ITEM