MUME: Vol 30, No 2

Volume 30, Issue 2Apr 2024

Volume 30, Issue 2

Apr 2024

Publisher:

Springer-Verlag
Berlin, Heidelberg

ISSN:0942-4962

Tags:

Bibliometrics

Select All

Export Citations Save to Binder

research-article

Optimization of segmentation model based on maximization information fusion and its application in nuclear image analysis

https://doi.org/10.1007/s00530-023-01231-6

Abstract

The Whole Slide Image (WSI) is a pathological image with Hematoxylin & Eosin staining. The low-contrast color staining will bring a challenge on analysis. We propose SNSeg (Staining Nuclear Segmentation) to improve the segmentation performance in ...

research-article

Towards domain adaptation underwater image enhancement and restoration

https://doi.org/10.1007/s00530-023-01246-z

Abstract

Currently, deep convolutional neural networks have made significant research progress in the field of underwater image enhancement and restoration. However, most of the existing methods use fixed-scale convolutional kernels, which are easily ...

research-article

FMR-Net: a fast multi-scale residual network for low-light image enhancement

https://doi.org/10.1007/s00530-023-01252-1

Abstract

The low-light image enhancement algorithm aims to solve the problem of poor contrast and low brightness of images in low-light environments. Although many image enhancement algorithms have been proposed, they still face the problems of loss of ...

research-article

Illustrated character face super-deformation via unsupervised image-to-image translation

https://doi.org/10.1007/s00530-023-01255-y

Abstract

Super-deformation in character design refers to a simplified modeling of character illustrations that are drawn in detail. Such super-deformation requires both texture and geometrical translation. However, directly adopting conventional image-to-...

research-article

Reducing blind spots in esophagogastroduodenoscopy examinations using a novel deep learning model

https://doi.org/10.1007/s00530-024-01259-2

Abstract

The intricate architecture of gastric anatomy coupled with the complexities inherent in esophagogastroduodenoscopy (EGD) procedures can lead to blind spots during examinations. These blind spots refer to anatomical locations not visualized during ...

research-article

Mmy-net: a multimodal network exploiting image and patient metadata for simultaneous segmentation and diagnosis

https://doi.org/10.1007/s00530-024-01260-9

Abstract

Accurate medical image segmentation can effectively assist disease diagnosis and treatment. While neural networks were often applied to solve the segmentation problem in recent computer-aided diagnosis, the metadata of patients was usually ...

research-article

Integrating user-side information into matrix factorization to address data sparsity of collaborative filtering

https://doi.org/10.1007/s00530-024-01261-8

Abstract

Recommendation techniques play a vital role in recommending an actual product to an intended user. The recommendation also supports the user in the decision-making process. In recent years, collaborative filtering has been a widely used technique ...

research-article

A dual-branch hybrid network of CNN and transformer with adaptive keyframe scheduling for video semantic segmentation

https://doi.org/10.1007/s00530-024-01262-7

Abstract

Video semantic segmentation (VSS) plays a crucial role in various realistic applications, such as unmanned vehicles, autonomous robots, and augmented reality. Despite the significant progress achieved in this field, balancing accuracy and ... $^{}$ $^{}$

research-article

RKSeg+: make full use of Runge–Kutta methods in medical image segmentation

https://doi.org/10.1007/s00530-024-01263-6

Abstract

The dynamical system perspective has been used to build efficient image classification networks and semantic segmentation networks. Furthermore, the Runge–Kutta (RK) methods are powerful tools for building networks from the dynamical systems ...

research-article

Pancreas segmentation in CT based on RC-3DUNet with SOM

https://doi.org/10.1007/s00530-024-01264-5

Abstract

Deep learning-based automatic and accurate 3D pancreas segmentation plays a significant role in medical diagnosis and disease treatment, which has received a lot of attention from the medical image processing community. 3D pancreas segmentation ...

research-article

Cascaded refinement residual attention network for image outpainting

https://doi.org/10.1007/s00530-024-01265-4

Abstract

The image outpainting based on deep learning shows good performance and has a wide range of applications in many fields. The previous image outpainting methods mostly used a single image as input. In this paper, we use the left and right images as ...

research-article

GHCL: Gaussian heuristic curriculum learning for Brain CT report generation

https://doi.org/10.1007/s00530-024-01266-3

Abstract

Brain computed tomography (CT) report generation, which aims at generating accurate and descriptive reports for Brain CT imaging, has gained growing attention from researchers. Existing works mainly train a language-generation model with complex ...

research-article

Infant head and brain segmentation from magnetic resonance images using fusion-based deep learning strategies

https://doi.org/10.1007/s00530-024-01267-2

Abstract

Magnetic resonance (MR) imaging is widely used for assessing infant head and brain development and for diagnosing pathologies. The main goal of this work is the development of a segmentation framework to create patient-specific head and brain ...

research-article

A secure video data streaming model using modified firefly and SVD technique

https://doi.org/10.1007/s00530-024-01268-1

Abstract

Due to the expression of sharing information, there has been an increase in interest in safeguarding multimedia information and copyrights in recent times. Attackers are attempting to obtain sensitive information from a variety of sources, ...

research-article

Same-clothes person re-identification with dual-stream network

https://doi.org/10.1007/s00530-024-01269-0

Abstract

Person re-identification (Re-ID) has long been a pressing challenge in the field of computer vision, with researchers primarily focusing on issues such as occlusion, clothing changes, and cross-modality scenarios. However, there has been a lack of ...

research-article

Target aware network architecture search and compression for efficient knowledge transfer

https://doi.org/10.1007/s00530-024-01271-6

Abstract

Transfer learning enables convolutional neural networks (CNN) to acquire knowledge from a source domain and transfer it to a target domain, where collecting large-scale annotated examples is time-consuming and expensive. Conventionally, while ...

research-article

Indirect: invertible and discrete noisy image rescaling with enhancement from case-dependent textures

https://doi.org/10.1007/s00530-024-01272-5

Abstract

Rescaling digital images for display on various devices, while simultaneously removing noise, has increasingly become a focus of attention. However, limited research has been done on a unified framework that can efficiently perform both tasks. In ...

research-article

Zero-shot image classification via Visual–Semantic Feature Decoupling

https://doi.org/10.1007/s00530-024-01273-4

Abstract

Zero-shot image classification refers to the use of labeled images to train a classification model that can correctly classify images of unseen categories. Traditional zero-shot methods use attribute labels as supervisory information and map the ...

research-article

Virtual human pose estimation in a fire education system for children with autism spectrum disorders

https://doi.org/10.1007/s00530-024-01274-3

Abstract

Children with autism face challenges in areas like language and social skills, which hinder their ability to undergo regular fire training. Fire is one of the most common and dangerous disaster in real life, making it essential to provide children ...

research-article

Severity of lung infection identification and classification using optimization-enabled deep learning with IoT

https://doi.org/10.1007/s00530-024-01275-2

Abstract

A major disease affecting individuals irrespective of the different ages is lung disease and this problem is a result of different causes. The recent spread of COVID-19 caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has ...

research-article

MadFormer: multi-attention-driven image super-resolution method based on Transformer

https://doi.org/10.1007/s00530-024-01276-1

Abstract

While the Transformer-based method has demonstrated exceptional performance in low-level visual processing tasks, it has a strong modeling ability only locally, thereby neglecting the importance of spatial feature information and high-frequency ...

research-article

Assessing the adoption of the Yavuz Battleship application in the mixed reality environment using the technology acceptance model

https://doi.org/10.1007/s00530-024-01277-0

Abstract

This study concentrates on developing a mixed reality (MR) app for the historic ship Yavuz Battleship, also known as “SMS Goeben,” and assessing its acceptance using the technology acceptance model (TAM). Mixed reality blends real and virtual ...

research-article

Unsupervised domain adaptation of dynamic extension networks based on class decision boundaries

https://doi.org/10.1007/s00530-024-01278-z

Abstract

In response to the problems of inaccurate feature alignment, loss of source domain information, imbalanced sample distribution, and biased class decision boundaries in traditional unsupervised domain adaptation methods, this paper proposes a class ...

research-article

Scalable image coding with enhancement features for human and machine

https://doi.org/10.1007/s00530-024-01279-y

Abstract

The past decade has seen significant advancements in computer vision technologies, resulting in an increasing consumption of images and videos by both human and machine. Although machines are usually the primary consumers, there are many ...

research-article

Iris-LAHNet: a lightweight attention-guided high-resolution network for iris segmentation and localization

https://doi.org/10.1007/s00530-024-01280-5

Abstract

Iris recognition models that can be deployed on mobile devices have further requirements for both model scale and accuracy. We note that iris segmentation and localization tasks are the basis of iris recognition. Therefore, to better meet the ...

research-article

An efficient black widow optimization-based faster R-CNN for classification of COVID-19 from CT images

https://doi.org/10.1007/s00530-024-01281-4

Abstract

The coronavirus diseases (COVID-19) are transmittable diseases which are caused by Severe Acute Respiratory Syndrome human coronavirus (SARS-CoV). This paper describes the identification of coronavirus disease infections and better treatments ...

research-article

Driver intention prediction based on multi-dimensional cross-modality information interaction

https://doi.org/10.1007/s00530-024-01282-3

Abstract

Driver intention prediction allows drivers to perceive possible dangers in the fastest time and has become one of the most important research topics in the field of self-driving in recent years. In this study, we propose a driver intention ...

research-article

An improved non-local means algorithm for CT image denoising

https://doi.org/10.1007/s00530-024-01283-2

Abstract

The non-local means (NLM) algorithm is a classical image denoising algorithm. However, the denoising effect of the NLM algorithm is easily affected by the noise level of neighboring pixels, which leads to poor denoising effect for high noise level ...

research-article

Multiscale image denoising algorithm based on UNet3+

https://doi.org/10.1007/s00530-024-01284-1

Abstract

To fully exploit the multiscale information for image denoising, we introduce the idea of full-scale skip connections in the image segmentation network UNet3+. However, existing UNet3+ networks aggregate multiscale information by directly ...

research-article

360° video quality assessment based on saliency-guided viewport extraction

https://doi.org/10.1007/s00530-024-01285-0

Abstract

Due to the distortion of projection generated during the production of $360^{\circ}$ video, most quality assessment algorithms used for 2D video have the problem of performance degradation. In this paper, we propose a full-reference $360^{\circ}$ video quality ... $^{}$ $^{}$

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Multimedia Systems

Sections

Optimization of segmentation model based on maximization information fusion and its application in nuclear image analysis

Towards domain adaptation underwater image enhancement and restoration

FMR-Net: a fast multi-scale residual network for low-light image enhancement

Illustrated character face super-deformation via unsupervised image-to-image translation

Reducing blind spots in esophagogastroduodenoscopy examinations using a novel deep learning model

Mmy-net: a multimodal network exploiting image and patient metadata for simultaneous segmentation and diagnosis

Integrating user-side information into matrix factorization to address data sparsity of collaborative filtering

A dual-branch hybrid network of CNN and transformer with adaptive keyframe scheduling for video semantic segmentation

RKSeg+: make full use of Runge–Kutta methods in medical image segmentation

Pancreas segmentation in CT based on RC-3DUNet with SOM

Cascaded refinement residual attention network for image outpainting

GHCL: Gaussian heuristic curriculum learning for Brain CT report generation

Infant head and brain segmentation from magnetic resonance images using fusion-based deep learning strategies

A secure video data streaming model using modified firefly and SVD technique

Same-clothes person re-identification with dual-stream network

Target aware network architecture search and compression for efficient knowledge transfer

Indirect: invertible and discrete noisy image rescaling with enhancement from case-dependent textures

Zero-shot image classification via Visual–Semantic Feature Decoupling

Virtual human pose estimation in a fire education system for children with autism spectrum disorders

Severity of lung infection identification and classification using optimization-enabled deep learning with IoT

MadFormer: multi-attention-driven image super-resolution method based on Transformer

Assessing the adoption of the Yavuz Battleship application in the mixed reality environment using the technology acceptance model

Unsupervised domain adaptation of dynamic extension networks based on class decision boundaries

Scalable image coding with enhancement features for human and machine

Iris-LAHNet: a lightweight attention-guided high-resolution network for iris segmentation and localization

An efficient black widow optimization-based faster R-CNN for classification of COVID-19 from CT images

Driver intention prediction based on multi-dimensional cross-modality information interaction

An improved non-local means algorithm for CT image denoising

Multiscale image denoising algorithm based on UNet3+

360° video quality assessment based on saliency-guided viewport extraction