Liu et al., 2024 - Google Patents

Joint estimation of pose, depth, and optical flow with a competition–cooperation transformer network

Liu et al., 2024

Document ID: 7364927092911636311
Author: Liu X; Zhang T; Liu M
Publication year: 2024
Publication venue: Neural Networks

External Links

Cited by

Snippet

Estimating depth, ego-motion, and optical flow from consecutive frames is a critical task in robot navigation and has received significant attention in recent years. In this study, we propose PDF-Former, an unsupervised joint estimation network comprising a full transformer …

Continue reading at www.sciencedirect.com (other versions)

230000003287 optical effect 0 title abstract description 96

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/0068—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image for image registration, e.g. elastic snapping
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image

Similar Documents

Publication	Publication Date	Title
Khan et al.	2023	A survey of the vision transformers and their CNN-transformer based variants
Luo et al.	2018	Real-time dense monocular SLAM with online adapted depth prediction network
Liu et al.	2024	Joint estimation of pose, depth, and optical flow with a competition–cooperation transformer network
Liu et al.	2024	Vst++: Efficient and stronger visual saliency transformer
Hwang et al.	2022	Self-supervised monocular depth estimation using hybrid transformer encoder
CN114863539A (en)	2022-08-05	Portrait key point detection method and system based on feature fusion
CN117612204A (en)	2024-02-27	A method and system for constructing a three-dimensional hand pose estimator
Deng et al.	2023	Ternary symmetric fusion network for camouflaged object detection
Lin et al.	2024	Transpose: 6d object pose estimation with geometry-aware transformer
Gao et al.	2023	Edge devices friendly self-supervised monocular depth estimation via knowledge distillation
Li et al.	2023	Feature pre-inpainting enhanced transformer for video inpainting
CN116580040A (en)	2023-08-11	A Medical Image Segmentation Method Based on Transformer-like Network
Li et al.	2024	TSwinPose: Enhanced monocular 3D human pose estimation with JointFlow
CN120088814A (en)	2025-06-03	3D human body posture estimation method based on diffusion model
Zhang et al.	2023	Dyna-depthformer: Multi-frame transformer for self-supervised depth estimation in dynamic scenes
CN117788544A (en)	2024-03-29	An image depth estimation method based on lightweight attention mechanism
Yan et al.	2023	EMTNet: efficient mobile transformer network for real-time monocular depth estimation
CN113723237B (en)	2023-12-05	Three-dimensional human body posture estimation method and device based on relative information
CN116152298A (en)	2023-05-23	A Target Tracking Method Based on Adaptive Local Mining
Zheng et al.	2023	A dual encoder–decoder network for self-supervised monocular depth estimation
Zhang et al.	2024	Combining self-attention and depth-wise convolution for human pose estimation
Hu et al.	2024	Monocular depth estimation with boundary attention mechanism and Shifted Window Adaptive Bins
Wang et al.	2024	Lightweight Self-Supervised Monocular Depth Estimation Through CNN and Transformer Integration
CN120014713B (en)	2025-06-24	3D human body posture estimation method, system, electronic device and storage medium
Shi et al.	2025	IMedSeg: Towards efficient interactive medical segmentation