Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.05274 (cs)

[Submitted on 17 Sep 2024]

Title:Scale-Invariant Object Detection by Adaptive Convolution with Unified Global-Local Context

Authors:Amrita Singh, Snehasis Mukherjee

Abstract:Dense features are important for detecting minute objects in images. Despite the remarkable efficacy of CNN models in multi-scale object detection, they often fail to detect smaller objects because dense features are lost during pooling. Atrous convolution addresses this issue by applying sparse kernels. However, sparse kernels can reduce the multi-scale detection efficacy of the CNN model. In this paper, we propose an object detection model using a Switchable (adaptive) Atrous Convolutional Network (SAC-Net) based on the EfficientDet model. A fixed atrous rate in the convolutional layers limits the performance of CNN models. To overcome this limitation, we introduce a switchable mechanism that dynamically adjusts the atrous rate during the forward pass. The proposed SAC-Net combines the benefits of both low-level and high-level features to achieve improved performance on multi-scale object detection tasks without losing the dense features. Further, we apply a depth-wise switchable atrous rate to the proposed network to improve the scale-invariant features. Finally, we apply global context to the proposed model. Our extensive experiments on benchmark datasets demonstrate that the proposed SAC-Net outperforms the state-of-the-art models by a significant margin in terms of accuracy.
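The idea of switching between atrous rates can be illustrated with a small, dependency-free sketch. This is not the SAC-Net architecture: the abstract does not specify how the switch is computed, so the example assumes a per-position soft switch (a sigmoid over hypothetical `switch_logits`) that blends the outputs of one shared kernel applied at two atrous rates. The function names, the 1D setting, and the rates `(1, 3)` are all illustrative assumptions.

```python
from math import exp


def atrous_conv1d(signal, kernel, rate):
    """1D atrous (dilated) convolution with zero padding.

    The kernel must have odd length; the output has the same length
    as the input. `rate` is the spacing between kernel taps.
    """
    center = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = i + (j - center) * rate  # taps are `rate` samples apart
            if 0 <= idx < len(signal):     # positions outside the signal count as zero
                acc += w * signal[idx]
        out.append(acc)
    return out


def sigmoid(x):
    return 1.0 / (1.0 + exp(-x))


def switchable_atrous_conv1d(signal, kernel, switch_logits, rates=(1, 3)):
    """Blend two atrous rates per position (hypothetical switch).

    A switch value near 1 favours the small rate (dense, local context);
    a value near 0 favours the large rate (sparse, wider context).
    """
    small = atrous_conv1d(signal, kernel, rates[0])
    large = atrous_conv1d(signal, kernel, rates[1])
    blended = []
    for logit, a, b in zip(switch_logits, small, large):
        s = sigmoid(logit)
        blended.append(s * a + (1.0 - s) * b)
    return blended


if __name__ == "__main__":
    impulse = [0, 0, 0, 1, 0, 0, 0]
    box = [1, 1, 1]
    print(atrous_conv1d(impulse, box, 1))
    print(atrous_conv1d(impulse, box, 3))
    print(switchable_atrous_conv1d(impulse, box, [0.0] * 7))
```

In a learned model the switch would typically be predicted from the input features rather than passed in by hand; here it is an explicit argument so the effect of each rate is easy to inspect.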

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2410.05274 [cs.CV]
	(or arXiv:2410.05274v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.05274

Submission history

From: Amrita Singh
[v1] Tue, 17 Sep 2024 10:08:37 UTC (1,658 KB)
