[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3649329.3657339acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article
Open access

Deep Harmonic Finesse: Signal Separation in Wearable Systems with Limited Data

Published: 07 November 2024 Publication History

Abstract

We present a method, referred to as Deep Harmonic Finesse (DHF), for separation of non-stationary quasi-periodic signals when limited data is available. The problem frequently arises in wearable systems in which, a combination of quasi-periodic physiological phenomena give rise to the sensed signal, and excessive data collection is prohibitive. Our approach utilizes prior knowledge of time-frequency patterns in the signals to mask and in-paint spectrograms. This is achieved through an application-inspired deep harmonic neural network coupled with an integrated pattern alignment component. The network's structure embeds the implicit harmonic priors within the time-frequency domain, while the pattern-alignment method transforms the sensed signal, ensuring a strong alignment with the network. The effectiveness of the algorithm is demonstrated in the context of non-invasive fetal monitoring using both synthesized and in vivo data. When applied to the synthesized data, our method exhibits significant improvements in signal-to-distortion ratio (26% on average) and mean squared error (80% on average), compared to the best competing method. When applied to in vivo data captured in pregnant animal studies, our method improves the correlation error between estimated fetal blood oxygen saturation and the ground truth by 80.5% compared to the state of the art.

References

[1]
Konstantin Dragomiretskiy and Dominique Zosso. 2013. Variational mode decomposition. IEEE transactions on signal processing 62, 3 (2013), 531--544.
[2]
Daniel D Fong, Kaeli J Yamashiro, Kourosh Vali, Laura A Galganski, Jameson Thies, Rasta Moeinzadeh, Christopher Pivetti, André Knoesen, Vivek J Srinivasan, Herman L Hedriana, et al. 2020. Design and in vivo evaluation of a non-invasive transabdominal fetal pulse oximeter. IEEE Transactions on Biomedical Engineering 68, 1 (2020), 256--266.
[3]
Timo Gerkmann and Emmanuel Vincent. 2018. Spectral masking and filtering. Audio source separation and speech enhancement (2018), 65--85.
[4]
Ary L Goldberger, Luis AN Amaral, Leon Glass, Jeffrey M Hausdorff, Plamen Ch Ivanov, Roger G Mark, Joseph E Mietus, George B Moody, Chung-Kang Peng, and H Eugene Stanley. 2000. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. circulation 101, 23 (2000), e215--e220.
[5]
Norden E Huang, Zheng Shen, Steven R Long, Manli C Wu, Hsing H Shih, Quanan Zheng, Nai-Chyuan Yen, Chi Chao Tung, and Henry H Liu. 1998. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proceedings of the Royal Society of London. Series A: mathematical, physical and engineering sciences 454, 1971 (1998), 903--995.
[6]
Alistair Johnson, Lucas Bulgarelli, Tom Pollard, Steven Horng, Leo Anthony Celi, and Roger Mark. 2020. Mimic-iv. PhysioNet. Available online at: https://physionet.org/content/mimiciv/1.0/(accessed August 23, 2021) (2020).
[7]
Begum Kasap, Kourosh Vali, Weitai Qian, Mahya Saffarpour, and Soheil Ghiasi. 2023. KUBAI: Sensor Fusion for Non-Invasive Fetal Heart Rate Tracking. IEEE Transactions on Biomedical Engineering 70, 7 (2023), 2193--2202.
[8]
Qiuqiang Kong, Yong Xu, Wenwu Wang, Philip JB Jackson, and Mark D Plumbley. 2019. Single-channel signal separation and deconvolution with generative adversarial networks. arXiv preprint arXiv:1906.07552 (2019).
[9]
Daniel D Lee and H Sebastian Seung. 1999. Learning the parts of objects by non-negative matrix factorization. Nature 401, 6755 (1999), 788--791.
[10]
Yi Luo and Nima Mesgarani. 2019. Conv-tasnet: Surpassing ideal time-frequency magnitude masking for speech separation. IEEE/ACM transactions on audio, speech, and language processing 27, 8 (2019), 1256--1266.
[11]
Vivek Narayanaswamy, Jayaraman J Thiagarajan, Rushil Anirudh, and Andreas Spanias. 2020. Unsupervised audio source separation using generative priors. arXiv preprint arXiv:2005.13769 (2020).
[12]
Maciej Niedźwiecki and Piotr Kaczmarek. 2005. Estimation and tracking of complex-valued quasi-periodically varying systems. Automatica 41, 9 (2005), 1503--1516.
[13]
Meir Nitzan, Ayal Romem, and Robert Koppel. 2014. Pulse oximetry: fundamentals and technology update. Medical Devices: Evidence and Research (2014), 231--239.
[14]
Zafar Rafii and Bryan Pardo. 2012. Repeating pattern extraction technique (REPET): A simple method for music/voice separation. IEEE transactions on audio, speech, and language processing 21, 1 (2012), 73--84.
[15]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer, 234--241.
[16]
Yapeng Tian, Chenliang Xu, and Dingzeyu Li. 2020. Deep audio prior: Learning sound source separation from a single audio mixture. In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.
[17]
Dmitry Ulyanov, Andrea Vedaldi, and Victor Lempitsky. 2018. Deep image prior. In Proceedings of the IEEE conference on computer vision and pattern recognition. 9446--9454.
[18]
Kourosh Vali, Begum Kasap, Weitai Qian, Ata Vafi, Mahya Saffarpour, and Soheil Ghiasi. 2021. Estimation of fetal blood oxygen saturation from transabdominally acquired photoplethysmogram waveforms. In 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE, 1100--1103.
[19]
DeLiang Wang and Jitong Chen. 2018. Supervised speech separation based on deep learning: An overview. IEEE/ACM Transactions on Audio, Speech, and Language Processing 26, 10 (2018), 1702--1726.
[20]
Geliang Zhang and Simon Godsill. 2016. Fundamental frequency estimation in speech signals with variable rate particle filters. IEEE/ACM Transactions on Audio, Speech, and Language Processing 24, 5 (2016), 890--900.
[21]
Zhoutong Zhang, Yunyun Wang, Chuang Gan, Jiajun Wu, Joshua B Tenenbaum, Antonio Torralba, and William T Freeman. 2019. Deep audio priors emerge from harmonic convolutional networks. In International Conference on Learning Representations.

Index Terms

  1. Deep Harmonic Finesse: Signal Separation in Wearable Systems with Limited Data
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    DAC '24: Proceedings of the 61st ACM/IEEE Design Automation Conference
    June 2024
    2159 pages
    ISBN:9798400706011
    DOI:10.1145/3649329
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 November 2024

    Check for updates

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    DAC '24
    Sponsor:
    DAC '24: 61st ACM/IEEE Design Automation Conference
    June 23 - 27, 2024
    CA, San Francisco, USA

    Acceptance Rates

    Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

    Upcoming Conference

    DAC '25
    62nd ACM/IEEE Design Automation Conference
    June 22 - 26, 2025
    San Francisco , CA , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 27
      Total Downloads
    • Downloads (Last 12 months)27
    • Downloads (Last 6 weeks)27
    Reflects downloads up to 11 Dec 2024

    Other Metrics

    Citations

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media