No data available.
Please log in to see this content.
You have no subscription access to this content.
No metrics data to plot.
The attempt to load metrics for this article has failed.
The attempt to plot a graph for these metrics has failed.
The full text of this article is not currently available.
Voice source characterization using pitch synchronous discrete cosine transform for speaker identification
4. J. Gudnason and M. Brookes, “ Voice source cepstrum coefficients for speaker identification,” in Proceedings of ICASSP (2008), pp. 4821–4824.
5. M. D. Plumpe, T. F. Quatieri, and D. A. Reynolds, “ Modeling of the glottal flow derivative waveform with application to speaker identification,” IEEE Trans. Speech Audio Process. 7, 569–586 (1999).
6. J. Wang and M. T. Johnson, “ Physiologically-motivated feature extraction for speaker identification,” in Proceedings of ICASSP (2014), pp. 1690–1694.
7. T. V. Ananthapadmanabha, “ Acoustic factors determining perceived voice quality,” in Vocal Fold Physiology: Voice Quality Control, edited by O. Fujimura and M. Hirano ( Singular Publishing Group, San Diego, CA, 1995) Chap. 7, pp. 113–126.
8. G. Fant, J. Liljencrants, and Q. Lin, “ A four-parameter model of glottal flow,” Speech Trans. Lab. Q. Prog. Status Rep. 26, 1–13 (1985).
10. S. R. M. Prasanna, C. S. Gupta, and B. Yegnanarayana, “ Extraction of speaker-specific excitation information from linear prediction residual of speech,” Speech Commun. 48, 1243–1261 (2006).
11. D. Y. Wong, J. D. Markel, and A. H. Gray, Jr., “ Least squares glottal inverse filtering from the acoustic speech waveform,” IEEE Trans. Acoust., Speech. Signal Process. 27, 350–355 (1979).
13. T. V. Ananthapadmanabha, “ Acoustic analysis of voice source dynamics,” Speech Trans. Lab. Q. Prog. Status Rep. 25(2–3), 1–24 (1984).
17. A. P. Prathosh, T. V. Ananthapadmanabha, and A. G. Ramakrishnan, “ Epoch extraction based on integrated linear prediction residual using plosion index,” IEEE Trans. Audio, Speech, Lang. Process. 21(12), 2471–2480 (2013).
18. T. V. Ananthapadmanabha, A. P. Prathosh, and A. G. Ramakrishnan, “ Detection of closure-burst transitions of stops and affricates in continuous speech using plosion index,” J. Acoust. Soc. Am. 135(1), 460–471 (2014).
19. D. G. Childers and C. Ahn, “ Modeling the glottal volume velocity waveform for three voice types,” J. Acoust. Soc. Am. 97(1), 505–519 (1995).
20. D. A. Reynolds and R. C. Rose, “ Robust text-independent speaker identification using Gaussian mixture speaker models,” IEEE Trans. Speech Audio Process. 3(1), 72–83 (1995).
21. W. Fisher, G. Doddington, and K. Goudie-Marshall, “ The DARPA speech recognition research database: Specifications and status,” in Proceedings of DARPA Workshop on Speech Recognition (1986), pp. 93–99.
22. T. Drugman and T. Dutoit, “ The deterministic plus stochastic model of the residual signal and its applications,” IEEE Trans. Audio, Speech, Lang. Process. 20, 968–981 (2012).
23. J. Campbell, “ Testing with the YOHO CD-ROM voice verification corpus,” in Proceedings of ICASSP (1995), pp. 341–344.
24.NIST Multimodal Information Group, 2003 NIST Speaker Recognition Evaluation ( Linguistic Data Consortium, Philadelphia).
25. A. Hatch, S. Kajarekar, and A. Stolcke, “ Within-class covariance normalization for SVM-based speaker recognition,” in Proceedings of the International Conference on Spoken Language Processing (2006).
Article metrics loading...
A characterization of the voice source (VS) signal by the pitch synchronous (PS) discrete cosine transform (DCT) is proposed. With the integrated linear prediction residual (ILPR) as the VS estimate, the PS DCT of the ILPR is evaluated as a feature vector for speaker identification (SID). On TIMIT and YOHO databases, using a Gaussian mixture model (GMM)-based classifier, it performs on par with existing VS-based features. On the NIST 2003 database, fusion with a GMM-based classifier using MFCC features improves the identification accuracy by 12% in absolute terms, proving that the proposed characterization has good promise as a feature for SID studies.
Full text loading...
Most read this month