A neural clustering algorithm for estimating visible articulatory trajectory

Fabio Vignoli¹, Sergio Curinga¹ & Fabio Lavagetto¹

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1112)

Included in the following conference series:

International Conference on Artificial Neural Networks

Abstract

The bimodal acoustic-visual nature of speech establishes sound correlations between its audio component and the corresponding articulatory information associated with the time-varying geometry of the vocal tract. In this paper we propose an estimation structure consisting of a simplified Time-Delay Neural Network (TDNN) working on 4- to 5-dimensional cepstrum trajectories provided by a preceding clustering layer based on a Self-Organizing Map (SOM). This pre-processing layer allows an effective non-linear clustering of the cepstrum vectors, reducing the complexity of the resulting system by one order of magnitude while leaving the overall estimation performance unchanged. The results are presented in terms of estimation precision and robustness, with reference to previously published results.
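
The two-stage structure described in the abstract can be illustrated with a minimal sketch. The abstract does not give the architecture in detail, so everything specific here is an assumption made for illustration: synthetic 12-coefficient cepstrum frames, a SOM with a 4-dimensional lattice whose best-matching-unit coordinates serve as the reduced trajectory, and a simple delay-line windowing standing in for the input layer of a TDNN. The function names `train_som`, `project` and `delay_windows` are hypothetical and not taken from the paper.

```python
import numpy as np

def train_som(data, grid_shape, epochs=20, lr0=0.5, sigma0=1.5, seed=0):
    """Train a small SOM (assumed setup); returns (weights, lattice coords)."""
    rng = np.random.default_rng(seed)
    # Lattice coordinates of every unit, shape (n_units, len(grid_shape)).
    axes = [np.arange(n) for n in grid_shape]
    coords = np.stack(np.meshgrid(*axes, indexing="ij"), axis=-1)
    coords = coords.reshape(-1, len(grid_shape)).astype(float)
    # Initialise unit weights from randomly chosen data samples.
    weights = data[rng.choice(len(data), size=len(coords))].copy()
    n_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in data[rng.permutation(len(data))]:
            frac = step / n_steps
            lr = lr0 * (1.0 - frac)                 # decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 0.3     # shrinking neighbourhood
            bmu = np.argmin(np.sum((weights - x) ** 2, axis=1))
            d2 = np.sum((coords - coords[bmu]) ** 2, axis=1)
            h = np.exp(-d2 / (2.0 * sigma ** 2))    # Gaussian neighbourhood
            weights += lr * h[:, None] * (x - weights)
            step += 1
    return weights, coords

def project(data, weights, coords):
    """Map each frame to the lattice coordinates of its best-matching unit."""
    d2 = ((data[:, None, :] - weights[None, :, :]) ** 2).sum(axis=2)
    return coords[np.argmin(d2, axis=1)]

def delay_windows(traj, n_delays):
    """Stack n_delays consecutive frames, as a time-delay input layer sees them."""
    n = len(traj) - n_delays + 1
    return np.stack([traj[i:i + n_delays] for i in range(n)])

# Synthetic stand-in for cepstrum frames (assumption: 200 frames, 12 coefficients).
rng = np.random.default_rng(1)
cepstra = rng.normal(size=(200, 12))

weights, coords = train_som(cepstra, grid_shape=(3, 3, 3, 3), epochs=5)
traj = project(cepstra, weights, coords)      # 4-dimensional trajectory
windows = delay_windows(traj, n_delays=5)     # inputs for a TDNN-style estimator
print(traj.shape, windows.shape)
```

In this sketch the estimator that follows the SOM would receive 5 × 4 = 20 values per window instead of 5 × 12 = 60, which is one way a clustering layer can shrink the downstream network. Whether this matches the reduction reported by the authors cannot be determined from the abstract alone.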

Author information

Authors and Affiliations

DIST University of Genova, Via Opera Pia 13, 16145, Genova, Italy
Fabio Vignoli, Sergio Curinga & Fabio Lavagetto

Editor information

Christoph von der Malsburg, Werner von Seelen, Jan C. Vorbrüggen, Bernhard Sendhoff

About this paper

Cite this paper

Vignoli, F., Curinga, S., Lavagetto, F. (1996). A neural clustering algorithm for estimating visible articulatory trajectory. In: von der Malsburg, C., von Seelen, W., Vorbrüggen, J.C., Sendhoff, B. (eds) Artificial Neural Networks — ICANN 96. ICANN 1996. Lecture Notes in Computer Science, vol 1112. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61510-5_145

DOI: https://doi.org/10.1007/3-540-61510-5_145
Published: 09 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61510-1
Online ISBN: 978-3-540-68684-2
eBook Packages: Springer Book Archive
