[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index

Published: 01 December 2013 Publication History

Abstract

Epoch is defined as the instant of significant excitation within a pitch period of voiced speech. Epoch extraction continues to attract the interest of researchers because of its significance in speech analysis. Existing high performance epoch extraction algorithms require either dynamic programming techniques or a priori information of the average pitch period. An algorithm without such requirements is proposed based on integrated linear prediction residual (ILPR) which resembles the voice source signal. Half wave rectified and negated ILPR (or Hilbert transform of ILPR) is used as the pre-processed signal. A new non-linear temporal measure named the plosion index (PI) has been proposed for detecting ‘transients’ in speech signal. An extension of PI, called the dynamic plosion index (DPI) is applied on pre-processed signal to estimate the epochs. The proposed DPI algorithm is validated using six large databases which provide simultaneous EGG recordings. Creaky and singing voice samples are also analyzed. The algorithm has been tested for its robustness in the presence of additive white and babble noise and on simulated telephone quality speech. The performance of the DPI algorithm is found to be comparable or better than five state-of-the-art techniques for the experiments considered.

Cited By

View all
  1. Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image IEEE Transactions on Audio, Speech, and Language Processing
      IEEE Transactions on Audio, Speech, and Language Processing  Volume 21, Issue 12
      December 2013
      170 pages

      Publisher

      IEEE Press

      Publication History

      Published: 01 December 2013

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 20 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Epoch extraction in real-world scenarioInternational Journal of Speech Technology10.1007/s10772-024-10137-127:3(831-845)Online publication date: 1-Sep-2024
      • (2023)Usefulness of glottal excitation source information for audio-visual speech recognition systemInternational Journal of Speech Technology10.1007/s10772-023-10060-x26:4(933-945)Online publication date: 1-Dec-2023
      • (2023)Epoch Extraction from Telephonic Speech Signal using Stockwell TransformCircuits, Systems, and Signal Processing10.1007/s00034-023-02312-742:7(4238-4251)Online publication date: 26-Feb-2023
      • (2023)Improvement of Audio-Visual Keyword Spotting System Accuracy Using Excitation Source FeatureSpeech and Computer10.1007/978-3-031-48312-7_28(344-356)Online publication date: 29-Nov-2023
      • (2022)New replay attack detection using iterative adaptive inverse filtering and high frequency bandExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.116597195:COnline publication date: 1-Jun-2022
      • (2022)Encrypted speech Biohashing authentication algorithm based on 4D hyperchaotic Bao system and feature fusionMultimedia Tools and Applications10.1007/s11042-022-13933-682:11(16767-16792)Online publication date: 8-Oct-2022
      • (2022)Encrypted speech perceptual hashing authentication algorithm based on improved 2D-Henon encryption and harmonic product spectrumMultimedia Tools and Applications10.1007/s11042-022-12746-x81:18(25829-25852)Online publication date: 1-Jul-2022
      • (2022)Sequence-to-Sequence CNN-BiLSTM Based Glottal Closure Instant Detection from Raw SpeechArtificial Neural Networks in Pattern Recognition10.1007/978-3-031-20650-4_9(107-120)Online publication date: 24-Nov-2022
      • (2021)Detection of replay signals using excitation source and shifted CQCC featuresInternational Journal of Speech Technology10.1007/s10772-021-09810-624:2(497-507)Online publication date: 1-Jun-2021
      • (2021)Event-Based Transformation of Misarticulated Stops in Cleft Lip and Palate SpeechCircuits, Systems, and Signal Processing10.1007/s00034-021-01663-340:8(4064-4088)Online publication date: 1-Aug-2021
      • Show More Cited By

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media