research-article

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding

Authors:

Wen Gao,

Tiejun HuangAuthors Info & Claims

IEEE Transactions on Circuits and Systems for Video Technology, Volume 31, Issue 11

Pages 4147 - 4161

https://doi.org/10.1109/TCSVT.2021.3104305

Published: 01 November 2021 Publication History

Abstract

The ubiquitous camera networks in the city brain system grow at a rapid pace, creating massive amounts of images and videos at a range of spatial-temporal scales and thereby forming the “biggest” big data. However, the sensing system often lags behind the construction of the fast-growing city brain system, in the sense that such exponentially growing data far exceed today’s sensing capabilities. Therefore, critical issues arise regarding how to better leverage the existing city brain system and significantly improve the city-scale performance in intelligent applications. To tackle the unprecedented challenges, we articulate a vision towards a novel visual computing framework, termed as <italic>digital retina</italic>, which aligns high-efficiency sensing models with the emerging Visual Coding for Machine (VCM) paradigm. In particular, digital retina may consist of video coding, feature coding, model coding, as well as their joint optimization. The digital retina is biologically-inspired, rooted on the widely accepted view that the retina encodes the visual information for human perception, and extracts features by the brain downstream areas to disentangle the visual objects. Within the digital retina framework, three streams, i.e., video stream, feature stream, and model stream, work collaboratively over the end-edge-cloud platform. In particular, the compressed video stream serves for human vision, the compact feature stream targets for machine vision, and the model stream incrementally updates deep learning models to improve the performance of human/machine vision tasks. We have developed a prototype to demonstrate the technical advantages of digital retina, and extensive experiments have been conducted to validate that it is able to effectively support the video big data analysis and retrieval in the intelligent city system. In particular, up to <inline-formula> <tex-math notation="LaTeX">$7000\times $ </tex-math></inline-formula> compression ratio could be realized for visual data compression while maintaining competitive performance with pristine signal in a series of visual analysis tasks.

Cited By

View all

Liu SLin WChen YZhang YDai WSee JXiong H(2024)A Unified Framework for Jointly Compressing Visual and Semantic DataACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365480020:7(1-24)Online publication date: 28-Mar-2024
https://dl.acm.org/doi/10.1145/3654800
Wang SZhang XGurrin CKongkachandra RSchoeffmann KDang-Nguyen DRossetto LSatoh SZhou L(2024)Compact Visual Data Representation for Multimedia Search and AnalyticsProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658597(1326-1327)Online publication date: 30-May-2024
https://dl.acm.org/doi/10.1145/3652583.3658597
Wang SWang SYe Y(2024)Overview of Visual Signal Compression towards Machine VisionProceedings of the 3rd Mile-High Video Conference10.1145/3638036.3640291(126-127)Online publication date: 11-Feb-2024
https://dl.acm.org/doi/10.1145/3638036.3640291
Show More Cited By

Index Terms

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks

Index terms have been assigned to the content through auto-classification.

Recommendations

Sharpening of directional selectivity from neural output of rabbit retina

The estimation of motion direction from time varying retinal images is a fundamental task of visual systems. Neurons that selectively respond to directional visual motion are found in almost all species. In many of them already in the retina direction ...
Digital image coding techniques: CODEC design using vector quantization method
Initial processing of visual information within the retina and the LGN

The initial stage of information processing by the visual system reduces the information contained in the continuous image on the retina into a discrete set of responses which are carried from the lateral geniculate nucleus (LGN) to the visual cortex.-...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Circuits and Systems for Video Technology

IEEE Transactions on Circuits and Systems for Video Technology Volume 31, Issue 11

Nov. 2021

407 pages

ISSN:1051-8215

Issue’s Table of Contents

Publisher

IEEE Press

Publication History

Published: 01 November 2021

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 26 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Liu SLin WChen YZhang YDai WSee JXiong H(2024)A Unified Framework for Jointly Compressing Visual and Semantic DataACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365480020:7(1-24)Online publication date: 28-Mar-2024
https://dl.acm.org/doi/10.1145/3654800
Wang SZhang XGurrin CKongkachandra RSchoeffmann KDang-Nguyen DRossetto LSatoh SZhou L(2024)Compact Visual Data Representation for Multimedia Search and AnalyticsProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3658597(1326-1327)Online publication date: 30-May-2024
https://dl.acm.org/doi/10.1145/3652583.3658597
Wang SWang SYe Y(2024)Overview of Visual Signal Compression towards Machine VisionProceedings of the 3rd Mile-High Video Conference10.1145/3638036.3640291(126-127)Online publication date: 11-Feb-2024
https://dl.acm.org/doi/10.1145/3638036.3640291
Yang WHuang HHu YDuan LLiu J(2024)Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative AnalyticsIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.336729346:7(5174-5191)Online publication date: 20-Feb-2024
https://dl.acm.org/doi/10.1109/TPAMI.2024.3367293
Qin BMeng FYuan SMu B(2024)CAU: A Causality Attention Unit for Spatial-Temporal Sequence ForecastIEEE Transactions on Multimedia10.1109/TMM.2023.332628926(4749-4763)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TMM.2023.3326289
Ma LZhao YPeng PTian Y(2024)Sensitivity Decouple Learning for Image Compression Artifacts ReductionIEEE Transactions on Image Processing10.1109/TIP.2024.340303433(3620-3633)Online publication date: 24-May-2024
https://dl.acm.org/doi/10.1109/TIP.2024.3403034
Zheng KMei JYang HHou LMa S(2024)Digital Retina for IoV Towards 6G: Architecture, Opportunities, and ChallengesIEEE Network: The Magazine of Global Internetworking10.1109/MNET.2024.335483638:2(62-69)Online publication date: 16-Jan-2024
https://dl.acm.org/doi/10.1109/MNET.2024.3354836
Liu JFeng RQi YChen QChen ZZeng WJin X(2024)Rate-Distortion-Cognition Controllable Versatile Neural Image CompressionComputer Vision – ECCV 202410.1007/978-3-031-72992-8_19(329-348)Online publication date: 29-Sep-2024
https://dl.acm.org/doi/10.1007/978-3-031-72992-8_19
Zhang YZhu LJiang GKwong SKuo C(2023)A Survey on Perceptually Optimized Video CodingACM Computing Surveys10.1145/357172755:12(1-37)Online publication date: 2-Mar-2023
https://dl.acm.org/doi/10.1145/3571727
Chang ZZhang XWang SMa SGao W(2023)STAM: A SpatioTemporal Attention Based Memory for Video PredictionIEEE Transactions on Multimedia10.1109/TMM.2022.314672125(2354-2367)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.1109/TMM.2022.3146721
Show More Cited By

Abstract

Cited By

Index Terms

Recommendations

Sharpening of directional selectivity from neural output of rabbit retina

Digital image coding techniques: CODEC design using vector quantization method

Initial processing of visual information within the retina and the LGN

Comments

Information

Published In

Publisher

Publication History

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations