Optimization of segmentation model based on maximization information fusion and its application in nuclear image analysis
The Whole Slide Image (WSI) is a pathological image with Hematoxylin & Eosin staining. The low-contrast color staining will bring a challenge on analysis. We propose SNSeg (Staining Nuclear Segmentation) to improve the segmentation performance in ...
Towards domain adaptation underwater image enhancement and restoration
Currently, deep convolutional neural networks have made significant research progress in the field of underwater image enhancement and restoration. However, most of the existing methods use fixed-scale convolutional kernels, which are easily ...
FMR-Net: a fast multi-scale residual network for low-light image enhancement
The low-light image enhancement algorithm aims to solve the problem of poor contrast and low brightness of images in low-light environments. Although many image enhancement algorithms have been proposed, they still face the problems of loss of ...
Illustrated character face super-deformation via unsupervised image-to-image translation
Super-deformation in character design refers to a simplified modeling of character illustrations that are drawn in detail. Such super-deformation requires both texture and geometrical translation. However, directly adopting conventional image-to-...
Reducing blind spots in esophagogastroduodenoscopy examinations using a novel deep learning model
The intricate architecture of gastric anatomy coupled with the complexities inherent in esophagogastroduodenoscopy (EGD) procedures can lead to blind spots during examinations. These blind spots refer to anatomical locations not visualized during ...
Integrating user-side information into matrix factorization to address data sparsity of collaborative filtering
Recommendation techniques play a vital role in recommending an actual product to an intended user. The recommendation also supports the user in the decision-making process. In recent years, collaborative filtering has been a widely used technique ...
RKSeg+: make full use of Runge–Kutta methods in medical image segmentation
The dynamical system perspective has been used to build efficient image classification networks and semantic segmentation networks. Furthermore, the Runge–Kutta (RK) methods are powerful tools for building networks from the dynamical systems ...
Pancreas segmentation in CT based on RC-3DUNet with SOM
Deep learning-based automatic and accurate 3D pancreas segmentation plays a significant role in medical diagnosis and disease treatment, which has received a lot of attention from the medical image processing community. 3D pancreas segmentation ...
Cascaded refinement residual attention network for image outpainting
The image outpainting based on deep learning shows good performance and has a wide range of applications in many fields. The previous image outpainting methods mostly used a single image as input. In this paper, we use the left and right images as ...
GHCL: Gaussian heuristic curriculum learning for Brain CT report generation
Brain computed tomography (CT) report generation, which aims at generating accurate and descriptive reports for Brain CT imaging, has gained growing attention from researchers. Existing works mainly train a language-generation model with complex ...
Infant head and brain segmentation from magnetic resonance images using fusion-based deep learning strategies
- Helena R. Torres,
- Bruno Oliveira,
- Pedro Morais,
- Anne Fritze,
- Gabriele Hahn,
- Mario Rüdiger,
- Jaime C. Fonseca,
- João L. Vilaça
Magnetic resonance (MR) imaging is widely used for assessing infant head and brain development and for diagnosing pathologies. The main goal of this work is the development of a segmentation framework to create patient-specific head and brain ...
A secure video data streaming model using modified firefly and SVD technique
Due to the expression of sharing information, there has been an increase in interest in safeguarding multimedia information and copyrights in recent times. Attackers are attempting to obtain sensitive information from a variety of sources, ...
Same-clothes person re-identification with dual-stream network
Person re-identification (Re-ID) has long been a pressing challenge in the field of computer vision, with researchers primarily focusing on issues such as occlusion, clothing changes, and cross-modality scenarios. However, there has been a lack of ...
Target aware network architecture search and compression for efficient knowledge transfer
Transfer learning enables convolutional neural networks (CNN) to acquire knowledge from a source domain and transfer it to a target domain, where collecting large-scale annotated examples is time-consuming and expensive. Conventionally, while ...
Indirect: invertible and discrete noisy image rescaling with enhancement from case-dependent textures
Rescaling digital images for display on various devices, while simultaneously removing noise, has increasingly become a focus of attention. However, limited research has been done on a unified framework that can efficiently perform both tasks. In ...
Zero-shot image classification via Visual–Semantic Feature Decoupling
Zero-shot image classification refers to the use of labeled images to train a classification model that can correctly classify images of unseen categories. Traditional zero-shot methods use attribute labels as supervisory information and map the ...
Virtual human pose estimation in a fire education system for children with autism spectrum disorders
Children with autism face challenges in areas like language and social skills, which hinder their ability to undergo regular fire training. Fire is one of the most common and dangerous disaster in real life, making it essential to provide children ...
Severity of lung infection identification and classification using optimization-enabled deep learning with IoT
A major disease affecting individuals irrespective of the different ages is lung disease and this problem is a result of different causes. The recent spread of COVID-19 caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has ...
MadFormer: multi-attention-driven image super-resolution method based on Transformer
While the Transformer-based method has demonstrated exceptional performance in low-level visual processing tasks, it has a strong modeling ability only locally, thereby neglecting the importance of spatial feature information and high-frequency ...
Assessing the adoption of the Yavuz Battleship application in the mixed reality environment using the technology acceptance model
This study concentrates on developing a mixed reality (MR) app for the historic ship Yavuz Battleship, also known as “SMS Goeben,” and assessing its acceptance using the technology acceptance model (TAM). Mixed reality blends real and virtual ...
Unsupervised domain adaptation of dynamic extension networks based on class decision boundaries
In response to the problems of inaccurate feature alignment, loss of source domain information, imbalanced sample distribution, and biased class decision boundaries in traditional unsupervised domain adaptation methods, this paper proposes a class ...
Iris-LAHNet: a lightweight attention-guided high-resolution network for iris segmentation and localization
Iris recognition models that can be deployed on mobile devices have further requirements for both model scale and accuracy. We note that iris segmentation and localization tasks are the basis of iris recognition. Therefore, to better meet the ...
An efficient black widow optimization-based faster R-CNN for classification of COVID-19 from CT images
The coronavirus diseases (COVID-19) are transmittable diseases which are caused by Severe Acute Respiratory Syndrome human coronavirus (SARS-CoV). This paper describes the identification of coronavirus disease infections and better treatments ...
Driver intention prediction based on multi-dimensional cross-modality information interaction
Driver intention prediction allows drivers to perceive possible dangers in the fastest time and has become one of the most important research topics in the field of self-driving in recent years. In this study, we propose a driver intention ...
An improved non-local means algorithm for CT image denoising
The non-local means (NLM) algorithm is a classical image denoising algorithm. However, the denoising effect of the NLM algorithm is easily affected by the noise level of neighboring pixels, which leads to poor denoising effect for high noise level ...
Multiscale image denoising algorithm based on UNet3+
To fully exploit the multiscale information for image denoising, we introduce the idea of full-scale skip connections in the image segmentation network UNet3+. However, existing UNet3+ networks aggregate multiscale information by directly ...