More Web Proxy on the site http://driver.im/

research-article

Controlled Intentional Degradation in Analytical Video Systems

Authors:

Michael CafarellaAuthors Info & Claims

SIGMOD '22: Proceedings of the 2022 International Conference on Management of Data

Pages 2105 - 2119

https://doi.org/10.1145/3514221.3517899

Published: 11 June 2022 Publication History

Abstract

It is increasingly affordable for governments to collect video data of public locations. This video can be used for a range of broadly valuable analytical tasks, such as counting traffic, measuring commerce, or detecting accidents. Governments also have a range of policy goals --- preserving privacy, reducing bandwidth use, and legal compliance --- that may be obtained by degrading the video at some potential cost to analytical accuracy. Ideally, public administrators could employ controlled intentional video degradation to achieve policy goals while still obtaining the required analytical accuracy. Unfortunately, the optimal amount of induced degradation is data- and query-dependent, and so is difficult to determine even when public policy preferences are well-known. We propose a video degradation-accuracy profiling model for the problem of controlling the appropriate amount of degradation. It offers administrators a profile that illustrates the tradeoff between increased analytical accuracy and increased amounts of degradation. Computing the true tradeoff curves requires full access to the non-degraded video stream, so a primary technical contribution of this work lies in methods for accurately approximating the curves with only limited information. In addition, we propose a profile repair policy to further improve tradeoff curves' accuracy. We describe our prototype system, Smokescreen, plus experiments on two video datasets, two detection models and four aggregate query types. Compared with competing methods, we show our upper bound estimation of analytical error is up to 155% tighter, and Smokescreen enables 88% more accurate tradeoffs.

Supplemental Material

PDF File

Read me

Download
41.50 KB

ZIP File

Source Code

Download
1.05 MB

References

[1]

Waleed Abdulla. 2017. Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow. https://github.com/matterport/Mask_RCNN.

[2]

Sameer Agarwal, Henry Milner, Ariel Kleiner, Ameet Talwalkar, Michael Jordan, Samuel Madden, Barzan Mozafari, and Ion Stoica. 2014. Knowing when you're wrong: building fast and reliable approximate query processing systems. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. 481--492.

Digital Library

[3]

Sameer Agarwal, Barzan Mozafari, Aurojit Panda, Henry Milner, Samuel Madden, and Ion Stoica. 2013. BlinkDB: queries with bounded errors and bounded response times on very large data. In Proceedings of the 8th ACM European Conference on Computer Systems. 29--42.

Digital Library

[4]

Michael R Anderson, Michael Cafarella, German Ros, and Thomas F Wenisch. 2019. Physical representation-based predicate optimization for a visual analytics database. In 2019 IEEE 35th International Conference on Data Engineering (ICDE). IEEE, 1466--1477.

[5]

Ronnie T Apteker, James A Fisher, Valentin S Kisimov, and Hanoch Neishlos. 1995. Video acceptability and frame rate. IEEE multimedia, Vol. 2, 3 (1995), 32--40.

Digital Library

[6]

Azeem Aqil, Ahmed OF Atya, Srikanth V Krishnamurthy, and George Papageorgiou. 2015. Streaming lower quality video over LTE: How much energy can you save?. In 2015 IEEE 23rd International Conference on Network Protocols (ICNP). IEEE, 156--167.

[7]

Jean-Yves Audibert, Rémi Munos, and Csaba Szepesvári. 2007. Tuning bandit algorithms in stochastic environments. In International conference on algorithmic learning theory. Springer, 150--165.

Digital Library

[8]

Rémi Bardenet, Odalric-Ambrym Maillard, et al. 2015. Concentration inequalities for sampling without replacement. Bernoulli, Vol. 21, 3 (2015), 1361--1385.

[9]

Favyen Bastani, Songtao He, Arjun Balasingam, Karthik Gopalakrishnan, Mohammad Alizadeh, Hari Balakrishnan, Michael Cafarella, Tim Kraska, and Sam Madden. 2020. MIRIS: Fast Object Track Queries in Video. In Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 1907--1921.

Digital Library

[10]

Avrim Blum, Cynthia Dwork, Frank McSherry, and Kobbi Nissim. 2005. Practical privacy: the SuLQ framework. In Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. 128--138.

Digital Library

[11]

Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao. 2020. YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv preprint arXiv:2004.10934 (2020).

[12]

Badrish Chandramouli, Jonathan Goldstein, and Abdul Quamar. 2013. Scalable progressive analytics on big data in the cloud. Proceedings of the VLDB Endowment, Vol. 6, 14 (2013), 1726--1737.

Digital Library

[13]

Surajit Chaudhuri, Bolin Ding, and Srikanth Kandula. 2017. Approximate query processing: No silver bullet. In Proceedings of the 2017 ACM International Conference on Management of Data. 511--519.

Digital Library

[14]

Rui Chen, Noman Mohammed, Benjamin CM Fung, Bipin C Desai, and Li Xiong. 2011. Publishing set-valued data via differential privacy. Proceedings of the VLDB Endowment, Vol. 4, 11 (2011), 1087--1098.

Digital Library

[15]

Maggie Xiaoyan Cheng, Lu Ruan, and Weili Wu. 2005. Achieving minimum coverage breach under bandwidth constraints in wireless sensor networks. In Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies., Vol. 4. IEEE, 2638--2645.

[16]

Ji Dai, Jonathan Wu, Behrouz Saghafi, Janusz Konrad, and Prakash Ishwar. 2015. Towards privacy-preserving activity recognition using extremely low temporal and spatial resolution cameras. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 68--76.

[17]

Maureen Daum, Brandon Haynes, Dong He, Amrita Mazumdar, Magdalena Balazinska, and Alvin Cheung. 2020. TASM: A Tile-Based Storage Manager for Video Analytics. arXiv preprint arXiv:2006.02958 (2020).

[18]

Cynthia Dwork. 2008. Differential privacy: A survey of results. In International conference on theory and applications of models of computation. Springer, 1--19.

Digital Library

[19]

Willliam Feller. 2008. An introduction to probability theory and its applications, vol 2 .John Wiley & Sons.

[20]

Douglas A Fidaleo, Hoang-Anh Nguyen, and Mohan Trivedi. 2004. The networked sensor tapestry (NeST) a privacy enhanced software architecture for interactive analysis of data in video-sensor networks. In Proceedings of the ACM 2nd international workshop on Video surveillance & sensor networks. 46--53.

Digital Library

[21]

Benjamin CM Fung, Ke Wang, Rui Chen, and Philip S Yu. 2010. Privacy-preserving data publishing: A survey of recent developments. ACM Computing Surveys (Csur), Vol. 42, 4 (2010), 1--53.

Digital Library

[22]

Edward Gan, Jialin Ding, Kai Sheng Tai, Vatsal Sharan, and Peter Bailis. 2018. Moment-based quantile sketches for efficient high cardinality aggregation queries. arXiv preprint arXiv:1803.01969 (2018).

[23]

Wilson S Geisler and Jeffrey S Perry. 1998. Real-time foveated multiresolution system for low-bandwidth video communication. In Human vision and electronic imaging III, Vol. 3299. International Society for Optics and Photonics, 294--305.

[24]

Jonathan Goldstein, Raghu Ramakrishnan, and Uri Shaft. 1998. Compressing relations and indexes. In Proceedings 14th International Conference on Data Engineering. IEEE, 370--379.

[25]

Michael Greenwald and Sanjeev Khanna. 2001. Space-efficient online computation of quantile summaries. ACM SIGMOD Record, Vol. 30, 2 (2001), 58--66.

Digital Library

[26]

Michael B Greenwald and Sanjeev Khanna. 2004. Power-conserving computation of order-statistics over sensor networks. In Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. 275--285.

Digital Library

[27]

Ajit D Gupte, Bharadwaj Amrutur, Mahesh M Mehendale, Ajit V Rao, and Madhukar Budagavi. 2011. Memory bandwidth and power reduction using lossy reference frame compression in video encoding. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 21, 2 (2011), 225--230.

Digital Library

[28]

Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision. 2961--2969.

[29]

Wenjia He, Michael R Anderson, Maxwell Strome, and Michael Cafarella. 2020. A Method for Optimizing Opaque Filter Queries. In Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 1257--1272.

Digital Library

[30]

Joseph M Hellerstein, Peter J Haas, and Helen J Wang. 1997. Online aggregation. In Proceedings of the 1997 ACM SIGMOD international conference on Management of data. 171--182.

Digital Library

[31]

Wassily Hoeffding. 1994. Probability inequalities for sums of bounded random variables. In The Collected Works of Wassily Hoeffding. Springer, 409--426.

[32]

Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Shivaram Venkataraman, Paramvir Bahl, Matthai Philipose, Phillip B Gibbons, and Onur Mutlu. 2018. Focus: Querying large video datasets with low latency and low cost. In 13th $$USENIX$$ Symposium on Operating Systems Design and Implementation ($$OSDI$$ 18). 269--286.

[33]

Junchen Jiang, Ganesh Ananthanarayanan, Peter Bodik, Siddhartha Sen, and Ion Stoica. 2018. Chameleon: scalable adaptation of video analytics. In Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication. 253--266.

Digital Library

[34]

Daniel Kang, Peter Bailis, and Matei Zaharia. 2018. BlazeIt: optimizing declarative aggregation and limit queries for neural network-based video analytics. arXiv preprint arXiv:1805.01046 (2018).

[35]

Daniel Kang, John Emmons, Firas Abuzaid, Peter Bailis, and Matei Zaharia. 2017. Noscope: optimizing neural network queries over video at scale. arXiv preprint arXiv:1703.02529 (2017).

[36]

Vladimir Koltchinskii and Karim Lounici. 2017. Concentration inequalities and moment bounds for sample covariance operators. Bernoulli, Vol. 23, 1 (2017), 110--133.

[37]

Michał Koziarski and Bogusław Cyganek. 2018. Impact of low resolution on image recognition with deep neural networks: An experimental study. International Journal of Applied Mathematics and Computer Science, Vol. 28, 4 (2018), 735--744.

Digital Library

[38]

Tony CT Kuo and Arbee LP Chen. 2000. Content-based query processing for video databases. IEEE Transactions on Multimedia, Vol. 2, 1 (2000), 1--13.

Digital Library

[39]

Kristen LeFevre, David J DeWitt, and Raghu Ramakrishnan. 2006. Mondrian multidimensional k-anonymity. In 22nd International conference on data engineering (ICDE'06). IEEE, 25--25.

Digital Library

[40]

Ji Li, Siyao Cheng, Zhipeng Cai, Jiguo Yu, Chaokun Wang, and Yingshu Li. 2017. Approximate holistic aggregation in wireless sensor networks. ACM Transactions on Sensor Networks (TOSN), Vol. 13, 2 (2017), 1--24.

Digital Library

[41]

Kaiyu Li and Guoliang Li. 2018. Approximate query processing: What is new and where to go? Data Science and Engineering, Vol. 3, 4 (2018), 379--397.

[42]

Rui Liu, Tianyi Wu, and Barzan Mozafari. 2019. A bandit approach to maximum inner product search. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 4376--4383.

Digital Library

[43]

Yao Lu, Aakanksha Chowdhery, and Srikanth Kandula. 2016. Optasia: A relational platform for efficient large-scale video analytics. In Proceedings of the Seventh ACM Symposium on Cloud Computing. 57--70.

Digital Library

[44]

Gurmeet Singh Manku, Sridhar Rajagopalan, and Bruce G Lindsay. 1998. Approximate medians and other quantiles in one pass and with limited memory. ACM SIGMOD Record, Vol. 27, 2 (1998), 426--435.

Digital Library

[45]

Gurmeet Singh Manku, Sridhar Rajagopalan, and Bruce G Lindsay. 1999. Random sampling techniques for space efficient online computation of order statistics of large datasets. ACM SIGMOD Record, Vol. 28, 2 (1999), 251--262.

Digital Library

[46]

Amrita Mazumdar, Brandon Haynes, Magdalena Balazinska, Luis Ceze, Alvin Cheung, and Mark Oskin. 2019. Vignette: Perceptual Compression for Video Storage and Processing Systems. arXiv preprint arXiv:1902.01372 (2019).

[47]

Florence Merlevède, Magda Peligrad, and Emmanuel Rio. 2009. Bernstein inequality and moderate deviations under strong mixing conditions. In High dimensional probability V: the Luminy volume. Institute of Mathematical Statistics, 273--292.

[48]

Volodymyr Mnih, Csaba Szepesvári, and Jean-Yves Audibert. 2008. Empirical bernstein stopping. In Proceedings of the 25th international conference on Machine learning. 672--679.

Digital Library

[49]

Arvind Narayanan, Xumiao Zhang, Ruiyang Zhu, Ahmad Hassan, Shuowei Jin, Xiao Zhu, Xiaoxuan Zhang, Denis Rybkin, Zhengxuan Yang, Zhuoqing Morley Mao, et al. 2021. A variegated look at 5G in the wild: performance, power, and QoE implications. In Proceedings of the 2021 ACM SIGCOMM 2021 Conference. 610--625.

Digital Library

[50]

WL Nicholson et al. 1956. On the normal approximation to the hypergeometric distribution. The annals of mathematical statistics, Vol. 27, 2 (1956), 471--483.

[51]

Frank Olken. 1993. Random sampling from databases. Ph.D. Dissertation. University of California, Berkeley.

[52]

Alex Poms, Will Crichton, Pat Hanrahan, and Kayvon Fatahalian. 2018. Scanner: Efficient video analysis at scale. ACM Transactions on Graphics (TOG), Vol. 37, 4 (2018), 1--13.

Digital Library

[53]

Xin Qi, Qing Yang, David T Nguyen, Gang Zhou, and Ge Peng. 2015. LBVC: towards low-bandwidth video chat on smartphones. In Proceedings of the 6th ACM Multimedia Systems Conference. 1--12.

Digital Library

[54]

Qasim Mahmood Rajpoot and Christian Damsgaard Jensen. 2015. Video surveillance: Privacy issues and legal compliance. In Promoting Social Change and Democracy Through Information Technology. IGI global, 69--92.

[55]

Joseph Redmon. 2013. Darknet: Open source neural networks in c.

[56]

Mirek Riedewald, Divyakant Agrawal, et al. 2000. pCube: Update-efficient online aggregation with progressive feedback and error bounds. In Proceedings. 12th International Conference on Scientific and Statistica Database Management. IEEE, 95--108.

Digital Library

[57]

Mark A Roth and Scott J Van Horn. 1993. Database compression. ACM Sigmod Record, Vol. 22, 3 (1993), 31--39.

Digital Library

[58]

Mark Rudelson and Roman Vershynin. 2013. Hanson-wright inequality and sub-gaussian concentration. Electronic Communications in Probability, Vol. 18 (2013), 1--9.

[59]

Pierangela Samarati and Latanya Sweeney. 1998. Protecting privacy when disclosing information: k-anonymity and its enforcement through generalization and suppression. (1998).

[60]

Nisheeth Shrivastava, Chiranjeeb Buragohain, Divyakant Agrawal, and Subhash Suri. 2004. Medians and beyond: new aggregation techniques for sensor networks. In Proceedings of the 2nd international conference on Embedded networked sensor systems. 239--249.

Digital Library

[61]

Bhavani Thuraisingham. 2005. Privacy constraint processing in a privacy-enhanced database management system. Data & Knowledge Engineering, Vol. 55, 2 (2005), 159--188.

Digital Library

[62]

Dimitris Tsirogiannis, Stavros Harizopoulos, and Mehul A Shah. 2010. Analyzing the energy efficiency of a database server. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of data. 231--242.

Digital Library

[63]

Paul Voigt and Axel Von dem Bussche. 2017. The eu general data protection regulation (gdpr). A Practical Guide, 1st Ed., Cham: Springer International Publishing, Vol. 10 (2017), 3152676.

[64]

Longyin Wen, Dawei Du, Zhaowei Cai, Zhen Lei, Ming-Ching Chang, Honggang Qi, Jongwoo Lim, Ming-Hsuan Yang, and Siwei Lyu. 2020. UA-DETRAC: A New Benchmark and Protocol for Multi-Object Detection and Tracking. Computer Vision and Image Understanding (2020).

[65]

Takayuki Yamada, Seiichi Gohshi, and Isao Echizen. 2012. Use of invisible noise signals to prevent privacy invasion through face recognition from camera images. In Proceedings of the 20th ACM international conference on Multimedia. 1315--1316.

Digital Library

[66]

Jiadi Yu, Haofu Han, Hongzi Zhu, Yingying Chen, Jie Yang, Yanmin Zhu, Guangtao Xue, and Minglu Li. 2014. Sensing human-screen interaction for energy-efficient frame rate adaptation on smartphones. IEEE Transactions on Mobile Computing, Vol. 14, 8 (2014), 1698--1711.

Digital Library

[67]

Kai Zeng, Shi Gao, Barzan Mozafari, and Carlo Zaniolo. 2014. The analytical bootstrap: a new method for fast error estimation in approximate query processing. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. 277--288.

Digital Library

[68]

Haoyu Zhang, Ganesh Ananthanarayanan, Peter Bodik, Matthai Philipose, Paramvir Bahl, and Michael J Freedman. 2017. Live video analytics at scale with approximation and delay-tolerance. In 14th $$USENIX$$ Symposium on Networked Systems Design and Implementation ($$NSDI$$ 17). 377--392.

[69]

Kaipeng Zhang, Zhanpeng Zhang, Zhifeng Li, and Yu Qiao. 2016. Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Processing Letters, Vol. 23, 10 (2016), 1499--1503.

Cited By

He WSabek ILou YCafarella M(2024)Optimizing Video Selection LIMIT Queries with Commonsense KnowledgeProceedings of the VLDB Endowment10.14778/3654621.365463917:7(1751-1764)Online publication date: 30-May-2024
https://dl.acm.org/doi/10.14778/3654621.3654639
Sirin UIdreos S(2024)The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage FormatProceedings of the ACM on Management of Data10.1145/36393072:1(1-31)Online publication date: 26-Mar-2024
https://doi.org/10.1145/3639307

Index Terms

Controlled Intentional Degradation in Analytical Video Systems
1. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Video search
  2. Information systems applications
    1. Decision support systems
      1. Data analytics

Recommendations

OTIF: Efficient Tracker Pre-processing over Large Video Datasets
SIGMOD '22: Proceedings of the 2022 International Conference on Management of Data

Performing analytics tasks over large-scale video datasets is increasingly common in a wide range of applications, from traffic planning to sports analytics. These tasks generally involve object detection and tracking operations that require pre-...
Graceful degradation via versions: specifications and implementations
PODC '07: Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing

Correctness of a fault-tolerant system hinges on the failure model, which typically constrains the number of concurrent failures in the system. These assumptions are sometimes violated in practice, inevitably leading to degraded system behavior that ...
An analytical delay model for RLC interconnects

Elmore delay has been widely used to estimate interconnect delays in the performance-driven synthesis and layout of very-large-scale-integration (VLSI) routing topologies. For typical RLC interconnections, however, Elmore delay can deviate significantly ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGMOD '22: Proceedings of the 2022 International Conference on Management of Data

June 2022

2597 pages

ISBN:9781450392495

DOI:10.1145/3514221

General Chair:
Zachary Ives
University of Pennsylvania (USA)
,
Program Chairs:
Angela Bonifati
Lyon 1 University (France)
,
Amr El Abbadi
University of California, Santa Barbara (USA)

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMOD: ACM Special Interest Group on Management of Data

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Badges

Author Tags

Qualifiers

Research-article

Funding Sources

Federal Highway Administration

Conference

SIGMOD/PODS '22

Sponsor:

SIGMOD

SIGMOD/PODS '22: International Conference on Management of Data

June 12 - 17, 2022

PA, Philadelphia, USA

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
277
Total Downloads

Downloads (Last 12 months)28
Downloads (Last 6 weeks)6

Reflects downloads up to 13 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

He WSabek ILou YCafarella M(2024)Optimizing Video Selection LIMIT Queries with Commonsense KnowledgeProceedings of the VLDB Endowment10.14778/3654621.365463917:7(1751-1764)Online publication date: 30-May-2024
https://dl.acm.org/doi/10.14778/3654621.3654639
Sirin UIdreos S(2024)The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage FormatProceedings of the ACM on Management of Data10.1145/36393072:1(1-31)Online publication date: 26-Mar-2024
https://doi.org/10.1145/3639307

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents