[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2964284.2967274acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

UnitBox: An Advanced Object Detection Network

Published: 01 October 2016 Publication History

Abstract

In present object detection systems, the deep convolutional neural networks (CNNs) are utilized to predict bounding boxes of object candidates, and have gained performance advantages over the traditional region proposal methods. However, existing deep CNN methods assume the object bounds to be four independent variables, which could be regressed by the l2 loss separately. Such an oversimplified assumption is contrary to the well-received observation, that those variables are correlated, resulting to less accurate localization. To address the issue, we firstly introduce a novel Intersection over Union (IoU) loss function for bounding box prediction, which regresses the four bounds of a predicted box as a whole unit. By taking the advantages of IoU loss and deep fully convolutional networks, the UnitBox is introduced, which performs accurate and efficient localization, shows robust to objects of varied shapes and scales, and converges fast. We apply UnitBox on face detection task and achieve the best performance among all published methods on the FDDB benchmark.

References

[1]
V. Belagiannis, X. Wang, H. Beny Ben Shitrit, K. Hashimoto, R. Stauder, Y. Aoki, M. Kranzfelder, A. Schneider, P. Fua, S. Ilic, H. Feussner, and N. Navab. Parsing human skeletons in an operating room. Machine Vision and Applications, 2016.
[2]
R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In Computer Vision and Pattern Recognition, 2014.
[3]
K. He, X. Zhang, S. Ren, and J. Sun. Deep Residual Learning for Image Recognition. ArXiv e-prints, Dec. 2015.
[4]
J. Hosang, M. Omran, R. Benenson, and B. Schiele. Taking a deeper look at pedestrians. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4073--4082, 2015.
[5]
L. Huang, Y. Yang, Y. Deng, and Y. Yu. DenseBox: Unifying Landmark Localization with End to End Object Detection. ArXiv e-prints, Sept. 2015.
[6]
V. Jain and E. Learned-Miller. Fddb: A benchmark for face detection in unconstrained settings. Technical Report UM-CS-2010-009, University of Massachusetts, Amherst, 2010.
[7]
Z. Jie, X. Liang, J. Feng, W. F. Lu, E. H. F. Tay, and S. Yan. Scale-aware Pixel-wise Object Proposal Networks. ArXiv e-prints, Jan. 2016.
[8]
H. Li, Z. Lin, X. Shen, J. Brandt, and G. Hua. A convolutional neural network cascade for face detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 5325--5334, 2015.
[9]
J. Li, X. Liang, S. Shen, T. Xu, and S. Yan. Scale-aware Fast R-CNN for Pedestrian Detection. ArXiv e-prints, Oct. 2015.
[10]
S. Ren, K. He, R. Girshick, and J. Sun. Faster R-CNN: Towards real-time object detection with region proposal networks. In Advances in Neural Information Processing Systems (NIPS), 2015.
[11]
K. Simonyan and A. Zisserman. Very Deep Convolutional Networks for Large-Scale Image Recognition. ArXiv e-prints, Sept. 2014.
[12]
J. R. R. Uijlings, K. E. A. van de Sande, T. Gevers, and A. W. M. Smeulders. Selective search for object recognition. International Journal of Computer Vision, 104(2):154--171, 2013.
[13]
Z. Wang, S. Chang, Y. Yang, D. Liu, and T. S. Huang. Studying very low resolution recognition using deep networks. CoRR, abs/1601.04153, 2016.
[14]
S. Yang, P. Luo, C. C. Loy, and X. Tang. Wider face: A face detection benchmark. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
[15]
C. L. Zitnick and P. Dollár. Edge boxes: Locating object proposals from edges. In ECCV. European Conference on Computer Vision, September 2014.

Cited By

View all
  • (2025)Review of Object Detection Algorithms for Sonar Images based on Deep LearningRecent Patents on Engineering10.2174/011872212125714523092704194919:3Online publication date: Apr-2025
  • (2025)DetailCaptureYOLO: Accurately Detecting Small Targets in UAV Aerial ImagesJournal of Visual Communication and Image Representation10.1016/j.jvcir.2024.104349106(104349)Online publication date: Feb-2025
  • (2025)High performance RGB-Thermal Video Object Detection via hybrid fusion with progressive interaction and temporal-modal differenceInformation Fusion10.1016/j.inffus.2024.102665114(102665)Online publication date: Feb-2025
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '16: Proceedings of the 24th ACM international conference on Multimedia
October 2016
1542 pages
ISBN:9781450336031
DOI:10.1145/2964284
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. IoU loss
  2. bounding box prediction
  3. object detection

Qualifiers

  • Short-paper

Conference

MM '16
Sponsor:
MM '16: ACM Multimedia Conference
October 15 - 19, 2016
Amsterdam, The Netherlands

Acceptance Rates

MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)315
  • Downloads (Last 6 weeks)51
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2025)Review of Object Detection Algorithms for Sonar Images based on Deep LearningRecent Patents on Engineering10.2174/011872212125714523092704194919:3Online publication date: Apr-2025
  • (2025)DetailCaptureYOLO: Accurately Detecting Small Targets in UAV Aerial ImagesJournal of Visual Communication and Image Representation10.1016/j.jvcir.2024.104349106(104349)Online publication date: Feb-2025
  • (2025)High performance RGB-Thermal Video Object Detection via hybrid fusion with progressive interaction and temporal-modal differenceInformation Fusion10.1016/j.inffus.2024.102665114(102665)Online publication date: Feb-2025
  • (2025)Deep-learning-empowered visual ship detection and tracking: Literature review and future directionEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109754141(109754)Online publication date: Feb-2025
  • (2024)Low-altitude UAV obstacle detection method based on position constraint and attentionJournal of Applied Artificial Intelligence10.59782/aai.v1i2.3081:2(289-300)Online publication date: 18-Oct-2024
  • (2024)YOLOv7-BW: 基于遥感图像的密集小目标高效检测器智能机器人10.52810/JIR.2024.0041:1(39-54)Online publication date: 30-May-2024
  • (2024)Drug Recognition Detection Based on Deep Learning and Improved YOLOv8Journal of Organizational and End User Computing10.4018/JOEUC.35977036:1(1-21)Online publication date: 7-Nov-2024
  • (2024)基于多跳深度网络的红外微小目标检测方法Laser & Optoelectronics Progress10.3788/LOP24061061:22(2237008)Online publication date: 2024
  • (2024)基于深度学习的小目标检测技术研究进展(特邀)Infrared and Laser Engineering10.3788/IRLA2024025353:9(20240253)Online publication date: 2024
  • (2024)APD-YOLOv7: Enhancing Sustainable Farming through Precise Identification of Agricultural Pests and Diseases Using a Novel Diagonal Difference Ratio IOU LossSustainability10.3390/su1620885516:20(8855)Online publication date: 13-Oct-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media