[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/CVPRW.2014.131guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

Published: 23 June 2014 Publication History

Abstract

Recent results indicate that the generic descriptors extracted from the convolutional neural networks are very powerful. This paper adds to the mounting evidence that this is indeed the case. We report on a series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13. We use features extracted from the OverFeat network as a generic image representation to tackle the diverse range of recognition tasks of object image classification, scene recognition, fine grained recognition, attribute detection and image retrieval applied to a diverse set of datasets. We selected these tasks and datasets as they gradually move further away from the original task and data the OverFeat network was trained to solve. Astonishingly, we report consistent superior results compared to the highly tuned state-of-the-art systems in all the visual classification tasks on various datasets. For instance retrieval it consistently outperforms low memory footprint methods except for sculptures dataset. The results are achieved using a linear SVM classifier (or L2 distance in case of retrieval) applied to a feature representation of size 4096 extracted from a layer in the net. The representations are further modified using simple augmentation techniques e.g. jittering. The results strongly suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.

Cited By

View all
  • (2024)Efficiently localizing system anomalies for cloud infrastructures: a novel Dynamic Graph Transformer based Parallel FrameworkJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-024-00677-x13:1Online publication date: 4-Jun-2024
  • (2024)Optimization of the YOLOv5 deep learning model for peripheral blood cell detection and recognitionProceedings of the 2024 4th International Conference on Bioinformatics and Intelligent Computing10.1145/3665689.3665756(401-407)Online publication date: 26-Jan-2024
  • (2024)Component-Level Oracle Bone Inscription RetrievalProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658116(647-656)Online publication date: 30-May-2024
  • Show More Cited By
  1. CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    CVPRW '14: Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops
    June 2014
    830 pages
    ISBN:9781479943081

    Publisher

    IEEE Computer Society

    United States

    Publication History

    Published: 23 June 2014

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Efficiently localizing system anomalies for cloud infrastructures: a novel Dynamic Graph Transformer based Parallel FrameworkJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-024-00677-x13:1Online publication date: 4-Jun-2024
    • (2024)Optimization of the YOLOv5 deep learning model for peripheral blood cell detection and recognitionProceedings of the 2024 4th International Conference on Bioinformatics and Intelligent Computing10.1145/3665689.3665756(401-407)Online publication date: 26-Jan-2024
    • (2024)Component-Level Oracle Bone Inscription RetrievalProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658116(647-656)Online publication date: 30-May-2024
    • (2024)Top-tuningImage and Vision Computing10.1016/j.imavis.2023.104894142:COnline publication date: 16-May-2024
    • (2024)ISMSFuseComputers and Electronics in Agriculture10.1016/j.compag.2024.109089223:COnline publication date: 1-Aug-2024
    • (2024)Industrial object detection with multi-modal SSD: closing the gap between synthetic and real imagesMultimedia Tools and Applications10.1007/s11042-023-15367-083:4(12111-12138)Online publication date: 1-Jan-2024
    • (2024)Comprehensive comparison of modified deep convolutional neural networks for automated detection of external and middle ear conditionsNeural Computing and Applications10.1007/s00521-023-09365-436:10(5529-5544)Online publication date: 1-Apr-2024
    • (2023)A high-resolution dataset for instance detection with multi-view instance captureProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667944(42064-42076)Online publication date: 10-Dec-2023
    • (2023)Does progress on ImageNet transfer to real-world datasets?Proceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667211(25050-25080)Online publication date: 10-Dec-2023
    • (2023)Simple transferability estimation for regression tasksProceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence10.5555/3625834.3625976(1510-1521)Online publication date: 31-Jul-2023
    • Show More Cited By

    View Options

    View options

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media