[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1007/978-3-031-24340-0_26guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Audio Summarization with Audio Features and Probability Distribution Divergence

Published: 26 February 2023 Publication History

Abstract

The automatic summarization of multimedia sources is an important task that facilitates the understanding of an individual by condensing the source while maintaining relevant information. In this paper we focus on audio summarization based on audio features and the probability of distribution divergence. Our method, based on an extractive summarization approach, aims to select the most relevant segments until a time threshold is reached. It takes into account the segment’s length, position and informativeness value. Informativeness of each segment is obtained by mapping a set of audio features issued from its Mel-frequency Cepstral Coefficients and their corresponding Jensen-Shannon divergence score. Results over a multi-evaluator scheme shows that our approach provides understandable and informative summaries.

References

[1]
Christensen H, Gotoh Y, and Renals S A cascaded broadcast news highlighter IEEE Trans. Audio Speech Lang. Process. 2008 16 1 151-161
[2]
Duxans, H., Anguera, X., Conejero, D.: Audio based soccer game summarization. In: 2009 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting. BMSB’09, pp. 1–6. IEEE (2009)
[3]
Jouvet D, Langlois D, Menacer M, Fohr D, Mella O, and Smaïli K Adaptation of speech recognition vocabularies for improved transcription of youtube videos J. Int. Sci. Gen. Appl. 2018 1 1 1-9
[4]
Kullback S and Leibler RA On information and sufficiency Ann. Math. Stat. 1951 22 1 79-86
[5]
Leszczuk M, Grega M, Koźbiał A, Gliwski J, Wasieczko K, and Smaïli K Dziech A and Czyżewski A Video summarization framework for newscasts and reports – work in progress Multimedia Communications, Services and Security 2017 Cham Springer 86-97
[6]
Louis, A., Nenkova, A.: Automatic summary evaluation without human models. In: TAC (2008)
[7]
Louis, A., Nenkova, A.: Automatically evaluating content selection in summarization without human models. In: 2009 Conference on Empirical Methods in Natural Language Processing, Vol, 1. pp. 306–314. ACL (2009)
[8]
Manning CD and Schütze H Foundations of Statistical Natural Language Processing 1999 Cambridge MIT Press
[9]
Maskey, S., Hirschberg, J.: Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization. In: 9th European Conference on Speech Communication and Technology (2005)
[10]
Maskey, S., Hirschberg, J.: Summarizing speech without text using hidden markov models. In: Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, pp. 89–92. Association for Computational Linguistics (2006)
[11]
McFee, B., et al.: librosa: audio and music signal analysis in python. In: 14th Python in Science Conference, pp. 18–25 (2015)
[12]
Rafii, Z., Pardo, B.: Music/voice separation using the similarity matrix. In: ISMIR, pp. 583–588 (2012)
[13]
Rott M and Červa P Sojka P, Horák A, Kopeček I, and Pala K Speech-to-text summarization using automatic phrase extraction from recognized text Text Speech Dialogue 2016 Cham Springer 101-108
[14]
Saggion, H., Torres-Moreno, J.M., Cunha, I.d., SanJuan, E.: Multilingual summarization evaluation without human models. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pp. 1059–1067. COLING’10, Association for Computational Linguistics, Stroudsburg, PA, USA (2010). https://dl.acm.org/doi/10.5555/1944566.1944688
[15]
Szaszák, G., Tündik, M.Á., Beke, A.: Summarization of spontaneous speech using automatic speech recognition and a speech prosody based tokenizer. In: KDIR, pp. 221–227 (2016)
[16]
Taskiran CM, Pizlo Z, Amir A, Ponceleon D, and Delp EJ Automated video program summarization using speech transcripts IEEE Trans. Multimedia 2006 8 4 775-791
[17]
Torres-Moreno JM Automatic Text Summarization 2014 Hoboken John Wiley & Sons
[18]
Torres-Moreno, J., Saggion, H., da Cunha, I., SanJuan, E., Velázquez-Morales, P.: Summary evaluation with and without references. Polibits 42, 13–19 (2010). https://polibits.cidetec.ipn.mx/ojs/index.php/polibits/article/view/42-2/1781
[19]
Zechner, K.: Spoken language condensation in the 21st century. In: 8th European Conference on Speech Communication and Technology (2003)
[20]
Zlatintsi, A., Iosif, E., Marago, P., Potamianos, A.: Audio salient event detection and summarization using audio and text modalities. In: 2015 23rd European Signal Processing Conference (EUSIPCO), pp. 2311–2315. IEEE (2015)
[21]
Zlatintsi, A., Maragos, P., Potamianos, A., Evangelopoulos, G.: A saliency-based approach to audio event detection and summarization. In: 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO), pp. 1294–1298. IEEE (2012)

Index Terms

  1. Audio Summarization with Audio Features and Probability Distribution Divergence
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Guide Proceedings
      Computational Linguistics and Intelligent Text Processing: 20th International Conference, CICLing 2019, La Rochelle, France, April 7–13, 2019, Revised Selected Papers, Part II
      Apr 2019
      682 pages
      ISBN:978-3-031-24339-4
      DOI:10.1007/978-3-031-24340-0

      Publisher

      Springer-Verlag

      Berlin, Heidelberg

      Publication History

      Published: 26 February 2023

      Author Tags

      1. Audio summarization
      2. JS divergence
      3. Informativeness
      4. Human language understanding

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 0
        Total Downloads
      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 12 Dec 2024

      Other Metrics

      Citations

      View Options

      View options

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media