Xue et al., 2021 - Google Patents
Boundary-induced and scene-aggregated network for monocular depth predictionXue et al., 2021
View PDF- Document ID
- 1048121400871633009
- Author
- Xue F
- Cao J
- Zhou Y
- Sheng F
- Wang Y
- Ming A
- Publication year
- Publication venue
- Pattern Recognition
External Links
Snippet
Monocular depth prediction is an important task in scene understanding. It aims to predict the dense depth of a single RGB image. With the development of deep learning, the performance of this task has made great improvements. However, two issues remain …
- 230000004927 fusion 0 abstract description 14
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G06K9/34—Segmentation of touching or overlapping patterns in the image field
- G06K9/342—Cutting or merging image elements, e.g. region growing, watershed, clustering-based techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G06K9/32—Aligning or centering of the image pick-up or image-field
- G06K9/3233—Determination of region of interest
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Xue et al. | Boundary-induced and scene-aggregated network for monocular depth prediction | |
Zhou et al. | Review of stereo matching algorithms based on deep learning | |
Peng et al. | Excavating the potential capacity of self-supervised monocular depth estimation | |
EP3815047B1 (en) | Image colorization based on reference information | |
Wang et al. | RGB-D salient object detection via minimum barrier distance transform and saliency fusion | |
Zhou et al. | BCINet: Bilateral cross-modal interaction network for indoor scene understanding in RGB-D images | |
US20230186486A1 (en) | Vehicle tracking method and apparatus, and electronic device | |
Hoyer et al. | Improving semi-supervised and domain-adaptive semantic segmentation with self-supervised depth estimation | |
US20210027526A1 (en) | Lighting estimation | |
US20230260247A1 (en) | System and method for dual-value attention and instance boundary aware regression in computer vision system | |
EP3836083B1 (en) | Disparity estimation system and method, electronic device and computer program product | |
Feng et al. | Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection | |
CN109271842B (en) | General object detection method, system, terminal and storage medium based on key point regression | |
CN108734161A (en) | Recognition methods, device, equipment and the storage medium in crown word number region | |
CN114359361A (en) | Depth estimation method, depth estimation device, electronic equipment and computer-readable storage medium | |
US11694342B2 (en) | Apparatus and method for tracking multiple objects | |
Islam et al. | Depth estimation from a single RGB image using fine-tuned generative adversarial network | |
CN114092931B (en) | Scene character recognition method and device, electronic equipment and storage medium | |
US11847725B2 (en) | Performing interactive digital image operations utilizing modified machine learning models | |
Shi et al. | AdaFI-FCN: an adaptive feature integration fully convolutional network for predicting driver’s visual attention | |
Wang et al. | G-NET: Accurate lane detection model for autonomous vehicle | |
CN107635127B (en) | Stereoscopic image matching apparatus and method requiring small amount of calculation | |
Shen et al. | Stereo matching using random walks | |
US20240005685A1 (en) | Geospatial image data processing to detect nodes and interconnections | |
Kim et al. | Robust appearance feature learning using pixel‐wise discrimination for visual tracking |