A ViTDet based dual-source fusion object detection method of UAV (Fang et al., 2022)
- Document ID
- 16147400462001506316
- Author
- Fang Z
- Zhang T
- Fan X
- Publication year
- 2022
- Publication venue
- 2022 International Conference on Image Processing, Computer Vision and Machine Learning (ICICML)
Snippet
Dual-source fusion detection can effectively address UAV imaging problems such as the limited usability of visible-light imaging and misdetection in infrared imaging. To address the low detection accuracy of current dual-source detectors, we analyzed the influence of the …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/0063—Recognising patterns in remote scenes, e.g. aerial images, vegetation versus urban areas
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00664—Recognising scenes such as could be captured by a camera operated by a pedestrian or robot, including objects at substantially different ranges from the camera
- G06K9/00684—Categorising the entire scene, e.g. birthday party or wedding scene
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00791—Recognising scenes perceived from the perspective of a land vehicle, e.g. recognising lanes, obstacles or traffic signs on road scenes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10032—Satellite or aerial image; Remote sensing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/40—Analysis of texture
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G06K9/32—Aligning or centering of the image pick-up or image-field
- G06K9/3233—Determination of region of interest
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112818903B (en) | | Small sample remote sensing image target detection method based on meta-learning and cooperative attention |
Luo et al. | | Multi-scale traffic vehicle detection based on faster R–CNN with NAS optimization and feature enrichment |
CN110069986A (en) | | A kind of traffic lights recognition methods and system based on mixed model |
CN113888754B (en) | | Vehicle multi-attribute identification method based on radar vision fusion |
CN115641507B (en) | | Remote sensing image small-scale surface target detection method based on self-adaptive multi-level fusion |
CN113111727A (en) | | Method for detecting rotating target in remote sensing scene based on feature alignment |
Fang et al. | | A ViTDet based dual-source fusion object detection method of UAV |
Wu et al. | | Vehicle detection based on adaptive multi-modal feature fusion and cross-modal vehicle index using RGB-T images |
CN117372898A (en) | | Unmanned aerial vehicle aerial image target detection method based on improved yolov8 |
CN113052108A (en) | | Multi-scale cascade aerial photography target detection method and system based on deep neural network |
Yan et al. | | A traffic sign recognition method under complex illumination conditions |
CN117011722A (en) | | License plate recognition method and device based on unmanned aerial vehicle real-time monitoring video |
CN116503709A (en) | | Vehicle detection method based on improved YOLOv5 in haze weather |
Huang et al. | | Change detection with absolute difference of multiscale deep features |
Ren et al. | | Environment influences on uncertainty of object detection for automated driving systems |
CN110909656B (en) | | Pedestrian detection method and system integrating radar and camera |
CN113269119B (en) | | Night vehicle detection method and device |
CN114359196A (en) | | Fog detection method and system |
CN114048536A (en) | | Road structure prediction and target detection method based on multitask neural network |
Mo et al. | | Research on expressway traffic event detection at night based on Mask-SpyNet |
Luo et al. | | Memory-guided collaborative attention for nighttime thermal infrared image colorization |
Li et al. | | Testing ground-truth errors in an automotive dataset for a DNN-based object detector |
Zhu et al. | | Small target detection algorithm based on multi-target detection head and attention mechanism |
Liu et al. | | FSFM: A feature square tower fusion module for multimodal object detection |
Zhang et al. | | LL-WSOD: Weakly supervised object detection in low-light |