Zhang et al., 2022 - Google Patents
Wide-area crowd counting: Multi-view fusion networks for counting in large scenesZhang et al., 2022
View PDF- Document ID
- 13467930642942673943
- Author
- Zhang Q
- Chan A
- Publication year
- Publication venue
- International Journal of Computer Vision
External Links
Snippet
Crowd counting in single-view images has achieved outstanding performance on existing counting datasets. However, single-view counting is not applicable to large and wide scenes (eg, public parks, long subway platforms, or event spaces) because a single camera cannot …
- 230000004927 fusion 0 title abstract description 130
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wang et al. | Multi-view stereo in the deep learning era: A comprehensive review | |
Wan et al. | Kernel-based density map generation for dense object counting | |
Hu et al. | Revisiting single image depth estimation: Toward higher resolution maps with accurate object boundaries | |
Hu et al. | Deep depth completion from extremely sparse data: A survey | |
Zhang et al. | Wide-area crowd counting: Multi-view fusion networks for counting in large scenes | |
CN101894366B (en) | Method and device for acquiring calibration parameters and video monitoring system | |
Matzen et al. | Nyc3dcars: A dataset of 3d vehicles in geographic context | |
Zou et al. | Manhattan Room Layout Reconstruction from a Single 360^ ∘ 360∘ Image: A Comparative Study of State-of-the-Art Methods | |
Ai et al. | Deep learning for omnidirectional vision: A survey and new perspectives | |
Zhang et al. | 3d crowd counting via multi-view fusion with 3d gaussian kernels | |
Tang et al. | ESTHER: Joint camera self-calibration and automatic radial distortion correction from tracking of walking humans | |
Wan et al. | Face image reflection removal | |
Gao et al. | Exploiting key points supervision and grouped feature fusion for multiview pedestrian detection | |
Cheng et al. | Learning to refine depth for robust stereo estimation | |
CN107948586A (en) | Trans-regional moving target detecting method and device based on video-splicing | |
Xu et al. | Learning inverse depth regression for pixelwise visibility-aware multi-view stereo networks | |
Meng et al. | A block object detection method based on feature fusion networks for autonomous vehicles | |
Song et al. | Adastereo: An efficient domain-adaptive stereo matching approach | |
Tao et al. | An efficient 3D object detection method based on fast guided anchor stereo RCNN | |
Zhang et al. | 3D crowd counting via geometric attention-guided multi-view fusion | |
Yuan et al. | Structure flow-guided network for real depth super-resolution | |
Xiuling et al. | Starting from the structure: A review of small object detection based on deep learning | |
Ju et al. | Stereosnakes: contour based consistent object extraction for stereo images | |
Chen et al. | Multi-scale and multi-column convolutional neural network for crowd density estimation | |
Yang et al. | Survey on algorithms of people counting in dense crowd and crowd density estimation |