Abstract
This paper presents a comparison between two parametric methods for Signal Enhancement in order to address the problem of robust Automatic Speech Recognition (ASR). An SVD–based technique (ISE) and a non-linear spectral subtraction method (NSS), have been evaluated by means of the Continuous Speech Recognition system that is used in the ERMIS project. The input signal is corrupted with coloured noise with variable signal-to-noise ratio. It was found that fine-tuning of the various parameters of the enhancement techniques is crucial for efficient optimisation of their performance. Both methods provide significant improvement of the speech recogniser performance in the presence of coloured noise, with the NSS method being slightly better.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Boersma, P.: Accurate short-term analysis of the fundamental frequency and the harmonicsto-noise ratio of a sampled sound. Proceedings of the Institute of Phonetic Sciences 17 (1993) 97–110
Dendrinos, M., Bakamidis, S., Carayannis, G.: Speech enhancement from noise: A regenerative approach. Speech Communication, Vol. 10,no.2, February (1991) 45–57
Doclo, S., Dologlou, I., Moonen, M.: A novel iterative signal enhancement algorithm for noise reduction in speech, Proceedings of ICSLP-98, Sydney, Australia, (1998) 1435–1439
Kyriakou, C., Bakamidis, S., Dologlou, I,, Carayannis, G.: Robust Continuous Speech Recognition in the Presence of Coloured Noise., Proceedings of 4th European Conference on Noise Control EURONOISE2001, Vol. 2, Patra, January 14-17 (2001) 702–705
Pellom, B. L., Hansen, J.H.L.: Voice Analysis in Adverse Conditions: The Centennial Olympic Park Bombing 911 Call, Proceedings of IEEE Midwest Symposium on Circuits & Systems, August (1997) 125–128
Uhl, C., and Leib, M.: Experiments with an Extended Adaptive SVD Enhancement Scheme for Speech Enhancement, Proceedings of IEEE ICASSP, Vol. 1, Salt Lake City, Utah, USA, May (2001) 281–284
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Athanaselis, T., Fotinea, SE., Bakamidis, S., Dologlou, I., Giannopoulos, G. (2003). Signal Enhancement for Continuous Speech Recognition. In: Kaynak, O., Alpaydin, E., Oja, E., Xu, L. (eds) Artificial Neural Networks and Neural Information Processing — ICANN/ICONIP 2003. ICANN ICONIP 2003 2003. Lecture Notes in Computer Science, vol 2714. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44989-2_133
Download citation
DOI: https://doi.org/10.1007/3-540-44989-2_133
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40408-8
Online ISBN: 978-3-540-44989-8
eBook Packages: Springer Book Archive