Kanwal et al., 2023 - Google Patents
CVit-Net: A conformer driven RGB-D salient object detector with operation-wise attention learningKanwal et al., 2023
View PDF- Document ID
- 15370231868977601588
- Author
- Kanwal S
- Taj I
- Publication year
- Publication venue
- Expert Systems with Applications
External Links
Snippet
Salient object detection (SOD) on Red Green Blue Depth (RGB-D) data is often confronted with ambiguous cross-modality fusion, due to three major challenges:(i) How to select complementarity of RGB and depth modalities,(ii) How to alleviate the negative affect on …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhou et al. | LSNet: Lightweight spatial boosting network for detecting salient objects in RGB-thermal images | |
Zhou et al. | CCAFNet: Crossflow and cross-scale adaptive fusion network for detecting salient objects in RGB-D images | |
Zhou et al. | HFNet: Hierarchical feedback network with multilevel atrous spatial pyramid pooling for RGB-D saliency detection | |
Liang et al. | Multi-modal interactive attention and dual progressive decoding network for RGB-D/T salient object detection | |
Yang et al. | Bi-directional progressive guidance network for RGB-D salient object detection | |
Bi et al. | Cross-modal hierarchical interaction network for RGB-D salient object detection | |
Zhou et al. | BCINet: Bilateral cross-modal interaction network for indoor scene understanding in RGB-D images | |
Feng et al. | Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection | |
Wang et al. | Thermal images-aware guided early fusion network for cross-illumination RGB-T salient object detection | |
Fang et al. | M2RNet: Multi-modal and multi-scale refined network for RGB-D salient object detection | |
Yao et al. | Transformers and CNNs fusion network for salient object detection | |
Chen et al. | BINet: Bidirectional interactive network for salient object detection | |
Wu et al. | Aggregate interactive learning for RGB-D salient object detection | |
Kanwal et al. | CVit-Net: A conformer driven RGB-D salient object detector with operation-wise attention learning | |
Chen et al. | : Edge-Aware Multimodal Transformer for RGB-D Salient Object Detection | |
Wang et al. | Global contextual guided residual attention network for salient object detection | |
Wei et al. | Bidirectional attentional interaction networks for rgb-d salient object detection | |
Zhao et al. | Towards diverse binary segmentation via a simple yet general gated network | |
Zhao et al. | Adaptive spatiotemporal graph convolutional network with intermediate aggregation of multi-stream skeleton features for action recognition | |
Zhou et al. | Effective dual-feature fusion network for transmission line detection | |
Yuan et al. | Attention-based bi-directional refinement network for salient object detection | |
Chen et al. | Hybrid attention fusion embedded in transformer for remote sensing image semantic segmentation | |
Zhong et al. | MAGNet: Multi-scale Awareness and Global fusion Network for RGB-D salient object detection | |
Xia et al. | Imsfnet: integrated multi-source feature network for salient object detection | |
Yao et al. | Double cross-modality progressively guided network for RGB-D salient object detection |