Abstract
The detection of speech from silence (actually background noise) is essential in many speech-processing systems. In real-field applications, the correct determination of voice segments highly improves the overall system accuracy and minimises the total computation time. This paper1 presents a novel robust and reliable speech detection algorithm to be used in a speaker recognition system. The paper first introduces some basic concepts on speech activity detection and reviews the techniques currently used in speech detection tasks. Then, the proposed speech/non-speech detection algorithm is described and experimental results are discussed. Conclusions about the algorithm performances are finally presented.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Cohn, R.P.: Robust Voiced/Unvoiced Speech Classification Using a Neural Net. Proceedings of ICASSP’91, vol. 1, Paper S7.6 (1991).
Draganescu, M., Stefan G., Burileanu C.: Electronica functionala. Ed. Tehnica, Bucharest (1991), 443–480.
Herrera, A., Ramos A, Yamasaki K.: Speech Detection in High Noise Condition. Proceedings of ICSPAT’ 96, Boston, vol. 2 (1996), 1774–1778.
Le Bouquin Jeannes, R., Faucon G.: Voice Activity Detector Based on the Averaged Magnitude Squared Coherence. Proceedings of ICSPAT’ 95, Boston, vol. 2 (1995), 1964–1968.
Rabiner, L.R., Sambur M.R.: An Algorithm for Determining the Endpoints of Isolated Utterances. The Bell System Technical Journal, vol. 54 (1975), 297–315.
Tolba, H., O’Shaughnessy D.: Voiced-Unvoiced Classification Using the First Mel Frequency Cepstral Coefficient. Proceedings of ICSP’ 97, Seoul, vol. 1 (1997), 137–142.
Tucker, G.B., Spanias A.S., Loizou P.C.: A HMM-Based Endpoint Detector for Computer Communication Application. Proceedings of ICSPAT’ 95, Boston, vol. 2 (1995), 1969–1973.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Burileanu, D., Pascalin, L., Burileanu, C., Puchiu, M. (2000). An Adaptive and Fast Speech Detection Algorithm. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science(), vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_30
Download citation
DOI: https://doi.org/10.1007/3-540-45323-7_30
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41042-3
Online ISBN: 978-3-540-45323-9
eBook Packages: Springer Book Archive