Predicting fundamental frequency from mel-frequency cepstral coefficients to enable speech reconstruction
10.1121/1.1953269
http://aip.metastore.ingenta.com/content/asa/journal/jasa/118/2/10.1121/1.1953269

Figures

FIG. 1.

Distributed speech-recognition-based speech reconstruction from MFCC vectors with fundamental frequency prediction.

FIG. 2.

Modeling of the joint MFCC and fundamental frequency feature space using (a) GMM clustering; (b) a series of GMMs, each located within a state of a set of HMMs.

FIG. 3.

Examples of prior voicing probabilities for the digits (a) six; (b) three, computed from the proportion of voiced vectors allocated to each state within the respective HMMs.

FIG. 4.

Comparison of the predicted fundamental frequency contour (solid line) and reference fundamental frequency contour (dashed line) for the utterance “nine six oh.” A value of zero indicates unvoiced speech or nonspeech.

FIG. 5.

Comparison of narrow-band spectrograms of the utterance “nine six oh” for (a) original speech signal; (b) reconstructed speech using the reference fundamental frequency; (c) reconstructed speech using the predicted fundamental frequency.

Tables

TABLE I.

Classification accuracy and percentage fundamental frequency error for male speech.

TABLE II.

Classification accuracy and percentage fundamental frequency error for female speech.

2005-08-01
2014-04-25