[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models

Published: 01 May 2011 Publication History

Abstract

We present a simple and efficient feature modeling approach for tracking the pitch of two simultaneously active speakers. We model the spectrogram features of single speakers using Gaussian mixture models in combination with the minimum description length model selection criterion. To obtain a probabilistic representation for the speech mixture spectrogram features of both speakers, we employ the mixture maximization model (MIXMAX) and, as an alternative, a linear interaction model. A factorial hidden Markov model is applied for tracking pitch over time. This statistical model can be used for applications beyond speech, whenever the interaction between individual sources can be represented as MIXMAX or linear model. For tracking, we use the loopy max-sum algorithm, and provide empirical comparisons to exact methods. Furthermore, we discuss a scheduling mechanism of loopy belief propagation for online tracking. We demonstrate experimental results using Mocha-TIMIT as well as data from the speech separation challenge provided by Cooke We show the excellent performance of the proposed method in comparison to a well known multipitch tracking algorithm based on correlogram features. Using speaker-dependent models, the proposed method improves the accuracy of correct speaker assignment, which is important for single-channel speech separation. In particular, we are able to reduce the overall tracking error by 51% relative for the speaker-dependent case. Moreover, we use the estimated pitch trajectories to perform single-channel source separation, and demonstrate the beneficial effect of correct speaker assignment on speech separation performance.

Cited By

View all
  • (2023)PGSSProceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v37i11.26542(13130-13138)Online publication date: 7-Feb-2023
  • (2021)Adaptive Multi-Trace Carving for Robust Frequency Tracking in Forensic ApplicationsIEEE Transactions on Information Forensics and Security10.1109/TIFS.2020.303018216(1174-1189)Online publication date: 1-Jan-2021
  • (2020)Multi-task learning using a hybrid representation for text classificationNeural Computing and Applications10.1007/s00521-018-3934-y32:11(6467-6480)Online publication date: 1-Jun-2020
  • Show More Cited By
  1. A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image IEEE Transactions on Audio, Speech, and Language Processing
    IEEE Transactions on Audio, Speech, and Language Processing  Volume 19, Issue 4
    May 2011
    391 pages

    Publisher

    IEEE Press

    Publication History

    Published: 01 May 2011

    Author Tags

    1. Factorial hidden Markov model (FHMM)
    2. Gaussian mixture model (GMM)
    3. mixture maximization
    4. multipitch tracking
    5. speech analysis

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 20 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)PGSSProceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v37i11.26542(13130-13138)Online publication date: 7-Feb-2023
    • (2021)Adaptive Multi-Trace Carving for Robust Frequency Tracking in Forensic ApplicationsIEEE Transactions on Information Forensics and Security10.1109/TIFS.2020.303018216(1174-1189)Online publication date: 1-Jan-2021
    • (2020)Multi-task learning using a hybrid representation for text classificationNeural Computing and Applications10.1007/s00521-018-3934-y32:11(6467-6480)Online publication date: 1-Jun-2020
    • (2018)Tracking of Multiple Fundamental Frequencies in Diplophonic VoicesIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2017.276123326:2(330-341)Online publication date: 1-Feb-2018
    • (2018)Permutation Invariant Training for Speaker-Independent Multi-Pitch Tracking2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2018.8461526(5594-5598)Online publication date: 15-Apr-2018
    • (2017)Visually informed multi-pitch analysis of string ensembles2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2017.7952711(3021-3025)Online publication date: 5-Mar-2017
    • (2017)Multi-pitch streaming of interwoven streams2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2017.7952168(311-315)Online publication date: 5-Mar-2017
    • (2016)Single microphone speech separation by diffusion-based HMM estimationEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-016-0094-92016:1(1-19)Online publication date: 1-Dec-2016
    • (2014)Deep LearningFoundations and Trends in Signal Processing10.1561/20000000397:3–4(197-387)Online publication date: 30-Jun-2014
    • (2014)Informed single-channel speech separation using HMM-GMM user-generated exemplar sourceIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2014.235767722:12(2087-2100)Online publication date: 1-Dec-2014
    • Show More Cited By

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media