Liu et al., 2024 - Google Patents
Joint estimation of pose, depth, and optical flow with a competition–cooperation transformer networkLiu et al., 2024
- Document ID
- 7364927092911636311
- Author
- Liu X
- Zhang T
- Liu M
- Publication year
- Publication venue
- Neural Networks
External Links
Snippet
Estimating depth, ego-motion, and optical flow from consecutive frames is a critical task in robot navigation and has received significant attention in recent years. In this study, we propose PDF-Former, an unsupervised joint estimation network comprising a full transformer …
- 230000003287 optical effect 0 title abstract description 96
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/0068—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image for image registration, e.g. elastic snapping
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Khan et al. | A survey of the vision transformers and their CNN-transformer based variants | |
Luo et al. | Real-time dense monocular SLAM with online adapted depth prediction network | |
Liu et al. | Joint estimation of pose, depth, and optical flow with a competition–cooperation transformer network | |
Liu et al. | Vst++: Efficient and stronger visual saliency transformer | |
Hwang et al. | Self-supervised monocular depth estimation using hybrid transformer encoder | |
CN114863539A (en) | Portrait key point detection method and system based on feature fusion | |
CN117612204A (en) | A method and system for constructing a three-dimensional hand pose estimator | |
Deng et al. | Ternary symmetric fusion network for camouflaged object detection | |
Lin et al. | Transpose: 6d object pose estimation with geometry-aware transformer | |
Gao et al. | Edge devices friendly self-supervised monocular depth estimation via knowledge distillation | |
Li et al. | Feature pre-inpainting enhanced transformer for video inpainting | |
CN116580040A (en) | A Medical Image Segmentation Method Based on Transformer-like Network | |
Li et al. | TSwinPose: Enhanced monocular 3D human pose estimation with JointFlow | |
CN120088814A (en) | 3D human body posture estimation method based on diffusion model | |
Zhang et al. | Dyna-depthformer: Multi-frame transformer for self-supervised depth estimation in dynamic scenes | |
CN117788544A (en) | An image depth estimation method based on lightweight attention mechanism | |
Yan et al. | EMTNet: efficient mobile transformer network for real-time monocular depth estimation | |
CN113723237B (en) | Three-dimensional human body posture estimation method and device based on relative information | |
CN116152298A (en) | A Target Tracking Method Based on Adaptive Local Mining | |
Zheng et al. | A dual encoder–decoder network for self-supervised monocular depth estimation | |
Zhang et al. | Combining self-attention and depth-wise convolution for human pose estimation | |
Hu et al. | Monocular depth estimation with boundary attention mechanism and Shifted Window Adaptive Bins | |
Wang et al. | Lightweight Self-Supervised Monocular Depth Estimation Through CNN and Transformer Integration | |
CN120014713B (en) | 3D human body posture estimation method, system, electronic device and storage medium | |
Shi et al. | IMedSeg: Towards efficient interactive medical segmentation |