Semi-Automatic Segmentation of Speech: Manual Segmentation Strategy. Problem Space Analysis

Marcin Szymanski³ &
Stefan Grocholewski³

Part of the book series: Advances in Soft Computing ((AINSC,volume 30))

Abstract

The important element connected with today’s speech recognition/ synthesis systems is the speech database — the set of fully annotated wavefiles. Since the manual segmentation of speech is a very time-consuming task, the automatic segmentation algorithms are needed. However, the manual segmentation still outperforms the automatic one and at the same time the quality of resulting synthetic voice highly depends on the accuracy of the phonetic segmentation. In this paper we concentrate on a semi-automatic approach, in which a human expert, unlike in the common approach, manually allocates the selected boundaries prior to the automatic segmentation of the rest of the corpus. In the paper we quest for the appropriate strategy for an expert. We check if locating some boundary classes influence the rest of the annotations. It is done for two difierent quality measures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 199.50; Price includes VAT (United Kingdom)

Softcover Book: GBP 249.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation

Article 26 November 2020

Automatic Detection of the Prosodic Structures of Speech Utterances

Segmentation of Telephone Speech Based on Speech and Non-speech Models

References

Grocholewski S. (1997), CORPORA — Speech Database for Polish Diphones, Proc. Eurospeech’97, pp. 1735–1738
Google Scholar
Kvale K. (1993), Segmentation and Labelling of Speech, Ph.D. Thesis, Institutt for Teleteknikk, Trondheim
Google Scholar
Matousek J., Tihelka D., Psutka J. (2003), Automatic Segmentation for Czech Concatenative Speech Synthesis Using Statistical Approach with Boundary-Specific Correction, Proc. Eurospeech 2003, pp. 301–304, Geneva
Google Scholar
Ostendorf M., Digalakis V.V., Kimball O.A. (1996), From HMM’s to Segment Models: A Unified View of Stochastic Modeling for Speech Recognition, IEEE Trans. on Speech and Audio Proc., Vol. 4, No. 5, September 1996
Google Scholar
Steuer R.E. (1986), Multiple Criteria Optimization Ű Theory, Computation and Application, Wiley, New York
MATH Google Scholar
Szymañski M., Grocholewski S. (2003), Automatic Speech Segmentation Based on Transcription, RB-023/03, Poznan Univ. of Tech., Inst. of Computing Sc. (in Polish)
Google Scholar
Szymañski M., Grocholewski S. (2005), Implementation of Speech Segmentation Algorithm with Statistical Duration Models. Tuning the Model Parameters, RB-004/05, Poznan Univ. of Technology, Inst. of Computing Science (in Polish)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computing Science, Poznan University of Technology, ul. Piotrowo 3a, 60-965, Poznañ, Poland
Marcin Szymanski & Stefan Grocholewski

Authors

Marcin Szymanski
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Grocholewski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Electronics, Wroclaw University of Technology, Wybrzeze Wyspianskiego 27, 50-370, Wroclaw, Poland
Marek Kurzyński , Edward Puchała , Michał Woźniak & Andrzej żołnierek , , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Szymanski, M., Grocholewski, S. (2005). Semi-Automatic Segmentation of Speech: Manual Segmentation Strategy. Problem Space Analysis. In: Kurzyński, M., Puchała, E., Woźniak, M., żołnierek, A. (eds) Computer Recognition Systems. Advances in Soft Computing, vol 30. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-32390-2_88

Download citation

DOI: https://doi.org/10.1007/3-540-32390-2_88
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25054-8
Online ISBN: 978-3-540-32390-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Semi-Automatic Segmentation of Speech: Manual Segmentation Strategy. Problem Space Analysis

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation

Automatic Detection of the Prosodic Structures of Speech Utterances

Segmentation of Telephone Speech Based on Speech and Non-speech Models

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Semi-Automatic Segmentation of Speech: Manual Segmentation Strategy. Problem Space Analysis

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Guaranteed Significance Level Criterion in Automatic Speech Signal Segmentation

Automatic Detection of the Prosodic Structures of Speech Utterances

Segmentation of Telephone Speech Based on Speech and Non-speech Models

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation