Volume 116, Issue 4, October 2004
Index of contents:
- SPEECH PROCESSING AND COMMUNICATION SYSTEMS 
Speech activity detection and enhancement of a moving speaker based on the wideband generalized likelihood ratio and microphone arrays
116 (2004); http://dx.doi.org/10.1121/1.1781622
The subject of this work is a unified treatment of estimating the Direction of Arrival (DOA), detecting speech activity, and suppressing noise in the case of a moving speaker, using a linear microphone array. The approach is based on the generalized likelihood ratio test applied to the framework of far-field, wideband moving sources (W-GLRT). It is shown that, under certain distributional assumptions, the W-GLRT provides a framework for evaluating DOA measurements against spurious DOAs, for probabilistic speech activity detection, and for speech enhancement. As regards speech enhancement, we demonstrate the direct connection of the W-GLRT with enhancement based on subspace methods. In addition, through the concept of directive a priori SNR, we demonstrate its indirect connection with Minimum Mean Square Error spectral (MMSE_SA) and log-spectral (MMSE_LSA) gain modification. The efficiency of the approach is illustrated on a moving speaker at very low SNRs, with either additive white Gaussian noise or babble noise present in the acoustic field.
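To make the far-field, wideband linear-array setting in the abstract concrete, the following minimal sketch estimates a DOA with a conventional steered-response-power (delay-and-sum) scan. This is not the W-GLRT of the paper. The function names, the uniform microphone spacing, the sampling rate, the static source, and the white-noise stand-in for speech are all assumptions chosen for illustration.

```python
import numpy as np

C = 343.0  # assumed speed of sound in air, m/s


def steering_delays(n_mics, spacing, theta_deg):
    """Far-field delays (s) relative to mic 0 for a plane wave from theta (0 = broadside)."""
    return np.arange(n_mics) * spacing * np.sin(np.deg2rad(theta_deg)) / C


def simulate_array(signal, fs, n_mics, spacing, theta_deg, noise_std, rng):
    """Delay one source signal per microphone (frequency domain) and add white sensor noise."""
    n = len(signal)
    freqs = np.fft.rfftfreq(n, 1.0 / fs)
    spectrum = np.fft.rfft(signal)
    delays = steering_delays(n_mics, spacing, theta_deg)
    shifted = spectrum[None, :] * np.exp(-2j * np.pi * freqs[None, :] * delays[:, None])
    x = np.fft.irfft(shifted, n=n, axis=1)
    return x + noise_std * rng.standard_normal(x.shape)


def srp_doa(x, fs, spacing, angles_deg):
    """Scan candidate angles; return the angle with maximum delay-and-sum output power."""
    n_mics, n = x.shape
    freqs = np.fft.rfftfreq(n, 1.0 / fs)
    X = np.fft.rfft(x, axis=1)
    power = np.empty(len(angles_deg))
    for i, theta in enumerate(angles_deg):
        delays = steering_delays(n_mics, spacing, theta)
        # Undo the hypothesised delays, then sum coherently across microphones.
        aligned = X * np.exp(2j * np.pi * freqs[None, :] * delays[:, None])
        power[i] = np.sum(np.abs(aligned.sum(axis=0)) ** 2)
    return angles_deg[int(np.argmax(power))], power


# Hypothetical scenario: 8 microphones, 5 cm apart, source at 30 degrees, about 0 dB SNR.
rng = np.random.default_rng(0)
fs = 16000
source = rng.standard_normal(4096)  # wideband stand-in for a speech frame
x = simulate_array(source, fs, n_mics=8, spacing=0.05, theta_deg=30.0, noise_std=1.0, rng=rng)
angles = np.arange(-90.0, 91.0, 1.0)
est, power = srp_doa(x, fs, spacing=0.05, angles_deg=angles)
print("estimated DOA (degrees):", est)
```

For a moving speaker, a scan like this would be repeated on short successive frames, which is where evaluating each DOA measurement against spurious peaks, as the abstract describes, becomes relevant.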