[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding

Published: 01 November 2021 Publication History

Abstract

The ubiquitous camera networks in the city brain system grow at a rapid pace, creating massive amounts of images and videos at a range of spatial-temporal scales and thereby forming the &#x201C;biggest&#x201D; big data. However, the sensing system often lags behind the construction of the fast-growing city brain system, in the sense that such exponentially growing data far exceed today&#x2019;s sensing capabilities. Therefore, critical issues arise regarding how to better leverage the existing city brain system and significantly improve the city-scale performance in intelligent applications. To tackle the unprecedented challenges, we articulate a vision towards a novel visual computing framework, termed as <italic>digital retina</italic>, which aligns high-efficiency sensing models with the emerging Visual Coding for Machine (VCM) paradigm. In particular, digital retina may consist of video coding, feature coding, model coding, as well as their joint optimization. The digital retina is biologically-inspired, rooted on the widely accepted view that the retina encodes the visual information for human perception, and extracts features by the brain downstream areas to disentangle the visual objects. Within the digital retina framework, three streams, i.e., video stream, feature stream, and model stream, work collaboratively over the end-edge-cloud platform. In particular, the compressed video stream serves for human vision, the compact feature stream targets for machine vision, and the model stream incrementally updates deep learning models to improve the performance of human/machine vision tasks. We have developed a prototype to demonstrate the technical advantages of digital retina, and extensive experiments have been conducted to validate that it is able to effectively support the video big data analysis and retrieval in the intelligent city system. In particular, up to <inline-formula> <tex-math notation="LaTeX">$7000\times $ </tex-math></inline-formula> compression ratio could be realized for visual data compression while maintaining competitive performance with pristine signal in a series of visual analysis tasks.

Cited By

View all
  • (2024)A Unified Framework for Jointly Compressing Visual and Semantic DataACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365480020:7(1-24)Online publication date: 28-Mar-2024
  • (2024)Compact Visual Data Representation for Multimedia Search and AnalyticsProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658597(1326-1327)Online publication date: 30-May-2024
  • (2024)Overview of Visual Signal Compression towards Machine VisionProceedings of the 3rd Mile-High Video Conference10.1145/3638036.3640291(126-127)Online publication date: 11-Feb-2024
  • Show More Cited By

Index Terms

  1. Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image IEEE Transactions on Circuits and Systems for Video Technology
    IEEE Transactions on Circuits and Systems for Video Technology  Volume 31, Issue 11
    Nov. 2021
    407 pages

    Publisher

    IEEE Press

    Publication History

    Published: 01 November 2021

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 28 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Unified Framework for Jointly Compressing Visual and Semantic DataACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365480020:7(1-24)Online publication date: 28-Mar-2024
    • (2024)Compact Visual Data Representation for Multimedia Search and AnalyticsProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658597(1326-1327)Online publication date: 30-May-2024
    • (2024)Overview of Visual Signal Compression towards Machine VisionProceedings of the 3rd Mile-High Video Conference10.1145/3638036.3640291(126-127)Online publication date: 11-Feb-2024
    • (2024)Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative AnalyticsIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.336729346:7(5174-5191)Online publication date: 20-Feb-2024
    • (2024)CAU: A Causality Attention Unit for Spatial-Temporal Sequence ForecastIEEE Transactions on Multimedia10.1109/TMM.2023.332628926(4749-4763)Online publication date: 1-Jan-2024
    • (2024)Sensitivity Decouple Learning for Image Compression Artifacts ReductionIEEE Transactions on Image Processing10.1109/TIP.2024.340303433(3620-3633)Online publication date: 24-May-2024
    • (2024)Digital Retina for IoV Towards 6G: Architecture, Opportunities, and ChallengesIEEE Network: The Magazine of Global Internetworking10.1109/MNET.2024.335483638:2(62-69)Online publication date: 16-Jan-2024
    • (2024)Rate-Distortion-Cognition Controllable Versatile Neural Image CompressionComputer Vision – ECCV 202410.1007/978-3-031-72992-8_19(329-348)Online publication date: 29-Sep-2024
    • (2023)A Survey on Perceptually Optimized Video CodingACM Computing Surveys10.1145/357172755:12(1-37)Online publication date: 2-Mar-2023
    • (2023)STAM: A SpatioTemporal Attention Based Memory for Video PredictionIEEE Transactions on Multimedia10.1109/TMM.2022.314672125(2354-2367)Online publication date: 1-Jan-2023
    • Show More Cited By

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media