[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Liu et al., 2024 - Google Patents

Joint estimation of pose, depth, and optical flow with a competition–cooperation transformer network

Liu et al., 2024

Document ID
7364927092911636311
Author
Liu X
Zhang T
Liu M
Publication year
Publication venue
Neural Networks

External Links

Snippet

Estimating depth, ego-motion, and optical flow from consecutive frames is a critical task in robot navigation and has received significant attention in recent years. In this study, we propose PDF-Former, an unsupervised joint estimation network comprising a full transformer …
Continue reading at www.sciencedirect.com (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30244Information retrieval; Database structures therefor; File system structures therefor in image databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6201Matching; Proximity measures
    • G06K9/6202Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
    • G06T3/0068Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image for image registration, e.g. elastic snapping
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image

Similar Documents

Publication Publication Date Title
Khan et al. A survey of the vision transformers and their CNN-transformer based variants
Luo et al. Real-time dense monocular SLAM with online adapted depth prediction network
Liu et al. Joint estimation of pose, depth, and optical flow with a competition–cooperation transformer network
Liu et al. Vst++: Efficient and stronger visual saliency transformer
Hwang et al. Self-supervised monocular depth estimation using hybrid transformer encoder
CN114863539A (en) Portrait key point detection method and system based on feature fusion
CN117612204A (en) A method and system for constructing a three-dimensional hand pose estimator
Deng et al. Ternary symmetric fusion network for camouflaged object detection
Lin et al. Transpose: 6d object pose estimation with geometry-aware transformer
Gao et al. Edge devices friendly self-supervised monocular depth estimation via knowledge distillation
Li et al. Feature pre-inpainting enhanced transformer for video inpainting
CN116580040A (en) A Medical Image Segmentation Method Based on Transformer-like Network
Li et al. TSwinPose: Enhanced monocular 3D human pose estimation with JointFlow
CN120088814A (en) 3D human body posture estimation method based on diffusion model
Zhang et al. Dyna-depthformer: Multi-frame transformer for self-supervised depth estimation in dynamic scenes
CN117788544A (en) An image depth estimation method based on lightweight attention mechanism
Yan et al. EMTNet: efficient mobile transformer network for real-time monocular depth estimation
CN113723237B (en) Three-dimensional human body posture estimation method and device based on relative information
CN116152298A (en) A Target Tracking Method Based on Adaptive Local Mining
Zheng et al. A dual encoder–decoder network for self-supervised monocular depth estimation
Zhang et al. Combining self-attention and depth-wise convolution for human pose estimation
Hu et al. Monocular depth estimation with boundary attention mechanism and Shifted Window Adaptive Bins
Wang et al. Lightweight Self-Supervised Monocular Depth Estimation Through CNN and Transformer Integration
CN120014713B (en) 3D human body posture estimation method, system, electronic device and storage medium
Shi et al. IMedSeg: Towards efficient interactive medical segmentation