Full text loading...
No data available.
Please log in to see this content.
You have no subscription access to this content.
No metrics data to plot.
The attempt to load metrics for this article has failed.
The attempt to plot a graph for these metrics has failed.
The processing and perception of size information in speech sounds
1.Alcántara, J. I. , and Moore, B. C. J. (1995). “The identification of vowel-like harmonic complexes: Effect of component phase, level, and fundamental frequency,” J. Acoust. Soc. Am. 97, 3813–3824.
2.Assmann, P. F., Nearey, T. M., and Scott, J. M. (2002). “Modeling the perception of frequency-shifted vowels,” in “Proceedings of the 7th Int. Conference on Spoken Language Perception,” ICSLP, 425–428.
3.Assmann, P. F., and Nearey, T. M. (2003). “Frequency shifts and vowel identification,” in “Proceedings of the 15th Int. Congress of Phonetic Sciences,” Barcelona ICPhS.
4.Clutton-Brock, T. H. , and Albon, S. D. (1979). “The roaring of red deer and the evolution of honest advertising,” Behaviour 69, 145–170.
5.Cohen, L. (1993). “The scale transform,” IEEE Trans. Acoust., Speech, Signal Process. 41, 3275–3292.
6.Cornsweet, T. N. , and Pinsker, H. M. (1965). “Luminance discrimination of brief flashes under various conditions of adaptation,” J. Physiol. (London) 176, 294–310.
7.Drennan, W. (1998). “Sources of variation in profile analysis: Individual differences, extended training, roving level, component spacing, and dynamic contour,” Ph.D. dissertation, Indiana University.
8.Dudley, H. (1939). “Remaking speech,” J. Acoust. Soc. Am. 11, 169–177.
9.Fant, G. (1960). Acoustic Theory of Speech Production (Mouton, The Hague).
10.Fairchild, L. (1981). “Mate selection and behavioural thermoregulation in Fowler’s toads,” Science 212, 950–951.
11.Fitch, W. T. (1994). “Vocal tract length perception and the evolution of language,” Ph.D. dissertation, Brown University.
12.Fitch, W. T. (1997). “Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques,” J. Acoust. Soc. Am. 102, 1213–1222.
13.Fitch, W. T. , and Giedd, J. (1999). “Morphology and development of the human vocal tract: A study using magnetic resonance imaging,” J. Acoust. Soc. Am. 106, 1511–1522.
14.Foster, D. H. , and Bischof, W. F. (1997). “Bootstrap estimates of the statistical accuracy of thresholds obtained from psychometric functions,” Spatial Vis. 11, 135–139.
15.Gescheider, G. A. (1976). Psychophysics; Method and Theory (Erlbaum, Hillsdale, NJ).
16.González, J. (2004). “Formant frequencies and body size of speaker: A weak relationship in adult humans,” J. Phonetics 32, 277–287.
17.Green, D. M. (1988). Profile Analysis (Oxford University Press, London).
18.Huber, J. E. , Stathopoulos, E. T. , Curione, G. M. , Ash, T. A. , and Johnson, K. (1999). “Formants of children, women, and men: The effects of vocal intensity variation,” J. Acoust. Soc. Am. 106, 1532–1542.
19.Irino, T. , and Patterson, R. D. (1997). “A time-domain. level-dependent auditory filter: The gammachirp,” J. Acoust. Soc. Am. 101, 412–419.
20.Irino, T., and Patterson, R. D. (1999a). “Extracting size and shape information of sound sources in an optimal auditory processing model,” CASA Workshop, IJCAI-99, Stockholm, 1–4 Aug., 1999.
21.Irino, T., and Patterson, R. D. (1999b). “Stabilised wavelet Mellin transform: An auditory strategy for normalising sound-source size,” Eurospeech 99, Budapest, Hungary, Sept., 1899–1902.
22.Irino, T., and Patterson, R. D. (1999c). “An auditory strategy for separating size and shape information of sound sources,” Japan Soc. Artificial Intell., Tech. Rep., SIG-Challenge-9907-6, 33–38.
23.Irino, T. , and Patterson, R. D. (2002). “Segregating information about the size and shape of the vocal tract using a time-domain auditory model: The stabilized wavelet-Mellin transform,” Speech Commun. 36, 181–203.
24.Kawahara, H. , Masuda-Kasuse, I. , and de Cheveigne, A. (1999). “Restructuring speech representations using pitch-adaptive time-frequency smoothing and instantaneous-frequency-based extraction: Possible role of repetitive structure in sounds,” Speech Commun. 27(3–4), 187–207.
25.Kawahara, H., and Matsui, H. (2003). “Auditory morphing based on an elastic perceptual distance metric in an interference-free, time-frequency representation,” in Proceedings IEEE Int. Conference on Acoustics, Speech & Signal Processing (ICASSP ’03) 1, 256–259.
26.Kawahara, H. (2003). “Exemplar-based voice quality analysis and control using a high quality auditory morphing procedure based on STRAIGHT,” VOQUAL’03, ESCA Tutorial and Research Workshop, Geneva, 27–29 August, 2003, 109–114.
27.Krumbholz, K. , Patterson, R. D. , and Pressnitzer, D. (2000). “The lower limit of pitch as determined by rate discrimination,” J. Acoust. Soc. Am. 108, 1170–1180.
28.Lass, N. J. , and Davis, M. (1976). “An investigation of speaker height and weight identification,” J. Acoust. Soc. Am. 60, 700–703.
29.LeCun, Y., and Bengio, Y. (1995). “Convolutional networks for images, speech, and time-series,” in The Handbook of Brain Theory and Neural Networks, edited by M. A. Arbib (MIT Press, Cambridge, MA).
30.Leek, M. R. , Dorman, M. F. , and Summerfield, Q. (1987). “Minimum spectral contrast for vowel identification by normal-hearing and hearing-impaired listeners,” J. Acoust. Soc. Am. 81, 148–154.
31.Liu, C. , and Kewley-Port, D. (2004). “STRAIGHT: a new speech synthesizer for vowel formant discrimination,” ARLO 5, 31–36.
32.Miller, G. A. (1947). “Sensitivity to changes in the intensity of white noise and its relation to masking and loudness,” J. Acoust. Soc. Am. 19, 609–619.
33.Narins, P. M. , and Smith, S. L. (1986). “Clinal variation in anuran advertisement calls—basis for acoustic isolation,” Behav. Ecol. Sociobiol. 19, 135–141.
34.Patterson, R.D., Robinson, K., Holdsworth, J., McKeown, D., Zhang, C., and Allerhand M. (1992). “Complex sounds and auditory images,” in Auditory Physiology and Perception, Proceedings of the 9th International Symposium on Hearing, edited by Y. Cazals, L. Demany, and K. Horner (Pergamon, Oxford), pp. 429–446.
35.Patterson, R. D. , Allerhand, M. H. , and Giguère, C. (1995). “Time-domain modeling of peripheral auditory processing: A modular architecture and a software platform,” J. Acoust. Soc. Am. 98, 1890–1894.
36.Patterson, R. D. (2000). “Auditory images: How complex sounds are represented in the auditory system,” J. Acoust. Soc. Jpn. (E) 21, 183–190.
37.Peterson, G. E. , and Barney, H. I. (1952). “Control methods used in the study of vowels,” J. Acoust. Soc. Am. 24, 75–184.
38.Pressnitzer, D. , Patterson, R. D. , and Krumbholz, K. (2001). “The lower limit of pitch,” J. Acoust. Soc. Am. 109, 2074–2084.
39.Riede, T. , and Fitch, W. T. (1999). “Vocal tract length and acoustics of vocalization in the domestic dog, Canis familiris,” J. Exp. Biol. 202, 2859–2869.
40.Sek, A. , and Moore, B. C. J. (1995). “Frequency discrimination as a function of frequency, measured in several ways,” J. Acoust. Soc. Am. 97, 2479–2486.
41.Smith, D. R. R., Patterson, R. D., and Jefferis, J. (2003). “The perception of scale in vowel sounds,” British Society of Audiology, Nottingham P35.
42.Smith, D. R. R., and Patterson, R. D. (2004). “The existence region of scaled vowels in pitch-VTL space,” 18th Int. Conference on Acoustics, Kyoto Japan, Vol. I, 453–456.
43.Spiegel, M. F. , Picardi, M. C. , and Green, D. M. (1981). “Signal and masker uncertainty in intensity discrimination,” J. Acoust. Soc. Am. 70, 1015–1019.
44.Titchmarsh, E. C. (1948). Introduction to the Theory of Fourier Integrals, 2nd ed. (Oxford University Press, London).
45.Titze, I. R. (1989). “Physiologic and acoustic differences between male and female voices,” J. Acoust. Soc. Am. 85, 1699–1707.
46.Turner, R. E. , and Patterson, R. D. (2003). “An analysis of the size information in classical formant data: Peterson and Barney (1952) revisited,” J. Acoust. Soc. Jpn. 33, 585–589.
47.Turner, R. E., Al-Hames, M. A., Smith, D. R. R., Kawahara, H., Irino, T., and Patterson, R. D. (2004) “Vowel normalisation: Time-domain processing of the internal dynamics of speech,” in Dynamics of Speech Production and Perception, edited by P. Divenyi (in press).
48.Wolpert, D. H. (1996a). “The lack of a priori distinctions between learning algorithms,” Neural Comput. 8, 1341–1390.
49.Wolpert, D. H. (1996b). “The existence of a priori distinctions between learning algorithms,” Neural Comput. 8, 1391–1420.
50.Welling, L. , and Ney, H. (2002). “Speaker adaptive modeling by vocal tract normalization,” IEEE Trans. Speech Audio Process. 10, 415–426.
Article metrics loading...