[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

An fMRI-based visual decoding framework combined with two-stage learning and asynchronous iterative update strategy

Published: 01 October 2024 Publication History

Abstract

Reconstructing visual stimulus information from evoked brain activity is a significant task of visual decoding of the human brain. However, due to the limited size of published functional magnetic resonance imaging (fMRI) datasets, it is difficult to adequately train a complex network with a large number of parameters. Furthermore, the dimensions of fMRI data in existing datasets are extremely high, and the signal-to-noise ratio of the data is relatively low. To address these issues, we design an fMRI-based visual decoding framework that incorporates additional self-supervised training on the encoder and decoder to alleviate the problem of insufficient model training due to limited datasets. Furthermore, we propose a iterative Two Part and Two Stage learning method involving a teacher (supervised)-student (self-supervised) setup and encoder-decoder asynchronous updates strategy. This approach allows the encoder and decoder to mutually reinforce and iteratively update each other under the guidance of a teacher model. The analysis of the ablation experiments demonstrates that the proposed framework can effectively improve the reconstruction accuracy. The experimental results show that the proposed method achieves better visual reconstruction from evoked brain activity of the human brain and that its reconstruction accuracy is superior to that of existing methods.

References

[1]
Logothetis NK and Sheinberg DL Visual object recognition Annu Rev Neurosci 1996 19 1 577-621
[2]
Ogawa S, Lee TM, Kay AR, et al. Brain magnetic resonance imaging with contrast dependent on blood oxygenation Proc Natl Acad Sci 1990 87 24 9868-9872
[3]
Horikawa T and Kamitani Y Generic decoding of seen and imagined objects using hierarchical visual features[J] Nat Commun 2017 8 1 15037
[4]
Naselaris T, Olman CA, Stansbury DE, et al. A voxel-wise encoding model for early visual areas decodes mental images of remembered scenes[J] Neuroimage 2015 105 215-228
[5]
Miyawaki Y, Uchida H, Yamashita O, et al. Visual image reconstruction from human brain activity using a combination of multiscale local image decoders[J] Neuron 2008 60 5 915-929
[6]
Du C, Du C, He H (2017) Sharing deep generative representation for perceived image reconstruction from human brain activity[C]//2017 International Joint Conference on Neural Networks (IJCNN). IEEE, 1049-1056
[7]
Van Gerven MAJ, De Lange FP, and Heskes T Neural decoding with hierarchical generative models[J] Neural Comput 2010 22 12 3127-3142
[8]
Vu VQ, Ravikumar P, Naselaris T, et al. Encoding and decoding V1 fMRI responses to natural images with sparse nonparametric models[J] Ann Appl Stat 2011 5 2B 1159
[9]
Schoenmakers S, Barth M, Heskes T, et al. Linear reconstruction of perceived images from human brain activity[J] Neuroimage 2013 83 951-961
[10]
Miyawaki Y, Uchida H, Yamashita O, et al. Visual image reconstruction from human brain activity using a combination of multiscale local image decoders[J] Neuron 2008 60 5 915-929
[11]
Kay KN, Naselaris T, Prenger RJ, et al. Identifying natural images from human brain activity[J] Nature 2008 452 7185 352-355
[12]
Fujiwara Y, Miyawaki Y, and Kamitani Y Modular encoding and decoding models derived from Bayesian canonical correlation analysis[J] Neural Comput 2013 25 4 979-1005
[13]
Rakhimberdina Z, Jodelet Q, Liu X, and Murata T Natural image reconstruction from fMRI using deep learning: a survey Front Neurosci 2021 15 795488
[14]
Shen G, Dwivedi K, Majima K, et al. End-to-end deep image reconstruction from human brain activity[J] Front Comput Neurosci 2019 13 21
[15]
Li C, Xu J, and Liu B Decoding natural images from evoked brain activities using encoding models with invertible mapping[J] Neural Netw 2018 105 227-235
[16]
Han K, Wen H, Shi J, et al. Variational autoencoder: An unsupervised model for encoding and decoding fMRI activity in visual cortex[J] Neuroimage 2019 198 125-136
[17]
Beliy R, Gaziv G, Hoogi A, et al. (2019) From voxels to pixels and back: Self-supervision in natural-image reconstruction from fMRI. Adv Neural Inf Process Syst, 32
[18]
Gaziv G, Beliy R, Granot N, et al. (2022) Self-Supervised Natural Image Reconstruction and Rich Semantic Classification from Brain Activity. NeuroImage
[19]
St-Yves G, Naselaris T (2018) Generative adversarial networks conditioned on brain activity reconstruct seen images[C]//2018 IEEE international conference on systems, man, and cybernetics (SMC). IEEE: 1054-1061
[20]
Mozafari M, Reddy L, VanRullen R (2020) Reconstructing natural scenes from fmri patterns using bigbigan//2020 International joint conference on neural networks (IJCNN). IEEE, 1-8
[21]
Ren Z, Li J, Xue X, et al. Reconstructing seen image from brain activity by visually-guided cognitive representation and adversarial learning[J] Neuroimage 2021 228 117602
[22]
Ozcelik F, Choksi B, Mozafari M, et al. (2022) Reconstruction of perceived images from fmri patterns and semantic brain exploration using instance-conditioned gans//2022 International Joint Conference on Neural Networks (IJCNN). IEEE, 1-8
[23]
Chen Z, Qing J, Xiang T, et al. (2023) Seeing beyond the brain: Conditional diffusion model with sparse masked modeling for vision decoding[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR). 22710-22720
[24]
Takagi Y, Nishimoto S (2023) High-resolution image reconstruction with latent diffusion models from human brain activity[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR)
[25]
Zeng B, Li S, Liu X, et al. (2024) Controllable Mind Visual Diffusion Model[C]//Association for the Advancement of Artificial Intelligence (AAAI)
[26]
Chen LC, Lopes RG, Cheng B, Naive-student: Leveraging semi-supervised learning in video sequences for urban scene segmentation[C], Computer Vision-ECCV, et al (2020) 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part IX 16. Springer International Publishing 2020:695–714
[27]
Fang T, Qi Y, and Pan G Reconstructing perceptive images from brain activity by shape-semantic GAN[J] Adv Neural Inf Process Syst 2020 33 13038-13048
[28]
Zeki S, Watson JD, Lueck CJ, et al. A direct demonstration of functional specialization in human visual cortex[J] J Neurosci 1991 11 3 641-649
[29]
Güçlü U and van Gerven MAJ Deep neural networks reveal a gradient in the complexity of neural representations across the ventral stream[J] J Neurosci 2015 35 27 10005-10014
[30]
Qiao K, Chen J, Wang L, et al. BigGAN-based bayesian reconstruction of natural images from human brain activity[J] Neuroscience 2020 444 92-105
[31]
Wen H (2018) Neural Encoding and Decoding with Deep Learning for Natural Vision[D]. Purdue University

Index Terms

  1. An fMRI-based visual decoding framework combined with two-stage learning and asynchronous iterative update strategy
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Please enable JavaScript to view thecomments powered by Disqus.

            Information & Contributors

            Information

            Published In

            cover image Pattern Analysis & Applications
            Pattern Analysis & Applications  Volume 27, Issue 4
            Dec 2024
            719 pages

            Publisher

            Springer-Verlag

            Berlin, Heidelberg

            Publication History

            Published: 01 October 2024
            Accepted: 16 September 2024
            Received: 29 August 2023

            Author Tags

            1. Asynchronous iterative update strategy
            2. Brain activity
            3. fMRI
            4. Two-stage learning

            Author Tags

            1. Medical and Health Sciences
            2. Neurosciences
            3. Psychology and Cognitive Sciences
            4. Psychology

            Qualifiers

            • Research-article

            Funding Sources

            • the National Natural Science Foundation of China
            • the National Key Research and Development Program of China
            • the Fundamental Research Funds for the Central Universities of China

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • 0
              Total Citations
            • 0
              Total Downloads
            • Downloads (Last 12 months)0
            • Downloads (Last 6 weeks)0
            Reflects downloads up to 06 Jan 2025

            Other Metrics

            Citations

            View Options

            View options

            Media

            Figures

            Other

            Tables

            Share

            Share

            Share this Publication link

            Share on social media