[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3511808.3557172acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

CLEAR: A Fully User-side Image Search System

Published: 17 October 2022 Publication History

Abstract

We use many search engines on the Internet in our daily lives. However, they are not perfect. Their scoring function may not model our intent or they may accept only text queries even though we want to carry out a similar image search. In such cases, we need to make a compromise: We continue to use the unsatisfactory service or leave the service. Recently, a new solution, user-side search systems, has been proposed. In this framework, each user builds their own search system that meets their preference with a user-defined scoring function and user-defined interface. Although the concept is appealing, it is still not clear if this approach is feasible in practice. In this demonstration, we show the first fully user-side image search system, CLEAR, which realizes a similar-image search engine for Flickr. The challenge is that Flickr does not provide an official similar image search engine or corresponding API. Nevertheless, CLEAR realizes it fully on a user-side. CLEAR does not use a backend server at all nor store any images or build search indices. It is in contrast to traditional search algorithms that require preparing a backend server and building a search index. Therefore, each user can easily deploy their own CLEAR engine, and the resulting service is custom-made and privacy-preserving. The online demo is available at https://clear.joisino.net. The source code is available at https://github.com/joisino/clear.

References

[1]
Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. J. Mach. Learn. Res., 3:397--422, 2002.
[2]
Ricardo A. Baeza-Yates, Carlos Castillo, Mauricio Marín, and M. Andrea Rodríguez. Crawling a country: better strategies than breadth-first for web page ordering. In Proceedings of the 2005 World Wide Web Conference, WWW, pages 864--872, 2005.
[3]
Krisztian Balog, Filip Radlinski, and Shushan Arakelyan. Transparent, scrutable and explainable user models for personalized recommendation. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR, pages 265--274, 2019.
[4]
Luciano Barbosa and Juliana Freire. An adaptive crawler for locating hidden-web entry points. In Proceedings of the 2007 World Wide Web Conference, WWW, pages 441--450, 2007.
[5]
Asia J. Biega, Krishna P. Gummadi, and Gerhard Weikum. Equity of attention: Amortizing individual fairness in rankings. In Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR, pages 405--414. ACM, 2018.
[6]
Yue Cao, Mingsheng Long, Jianmin Wang, Qiang Yang, and Philip S. Yu. Deep visual-semantic hashing for cross-modal retrieval. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD, pages 1445--1454, 2016.
[7]
Soumen Chakrabarti, Martin van den Berg, and Byron Dom. Focused crawling: A new approach to topic-specific web resource discovery. Comput. Networks, 31 (11--16):1623--1640, 1999.
[8]
Tao Chen, Ming-Ming Cheng, Ping Tan, Ariel Shamir, and Shi-Min Hu. Sketch2photo: internet image montage. ACM Trans. Graph., 28(5):124, 2009.
[9]
Stephen J. Green, Paul Lamere, Jeffrey Alexander, François Maillet, Susanna Kirk, Jessica Holt, Jackie Bourque, and Xiao-Wen Mak. Generating transparent, steerable recommendations from textual descriptions of items. In Proceedings of the 2009 ACM Conference on Recommender Systems, RecSys, pages 281--284, 2009.
[10]
Ziyu Guan, Can Wang, Chun Chen, Jiajun Bu, and Junfeng Wang. Guide focused crawler efficiently and effectively using on-line topical importance estimation. In Proceedings of the 31st International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR, pages 757--758, 2008.
[11]
Jeff Johnson, Matthijs Douze, and Hervé Jégou. Billion-scale similarity search with gpus. IEEE Trans. Big Data, 7(3):535--547, 2021.
[12]
Judy Johnson, Kostas Tsioutsiouliklis, and C. Lee Giles. Evolving strategies for focused web crawling. In Proceedings of the Twentieth International Conference on Machine Learning, ICML, pages 298--305, 2003.
[13]
Saeid Balaneshin Kordan and Alexander Kotov. Deep neural architecture for multi-modal retrieval based on joint embedding space for text and images. In Proceedings of the 11th ACM International Conference on Web Search and Data Mining, WSDM, pages 28--36, 2018.
[14]
Alina Kuznetsova, Hassan Rom, Neil Alldrin, Jasper Uijlings, Ivan Krasin, Jordi Pont-Tuset, Shahab Kamali, Stefan Popov, Matteo Malloci, Alexander Kolesnikov, Tom Duerig, and Vittorio Ferrari. The open images dataset v4: Unified image classification, object detection, and visual relationship detection at scale. Int. J. Comput. Vis., 128(7):1956--1981, 2020.
[15]
Lihong Li, Wei Chu, John Langford, and Robert E. Schapire. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 2010 World Wide Web Conference, WWW, pages 661--670, 2010.
[16]
Andrew McCallum, Kamal Nigam, Jason Rennie, and Kristie Seymore. Automating the construction of internet portals with machine learning. Inf. Retr., 3(2): 127--163, 2000.
[17]
Robert Meusel, Peter Mika, and Roi Blanco. Focused crawling for structured data. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, CIKM, pages 1039--1048, 2014.
[18]
Anh Mai Nguyen, Alexey Dosovitskiy, Jason Yosinski, Thomas Brox, and Jeff Clune. Synthesizing the preferred inputs for neurons in neural networks via deep generator networks. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, NeurIPS, pages 3387--3395, 2016.
[19]
Jeffrey Pennington, Richard Socher, and Christopher D. Manning. Glove: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP, pages 1532--1543, 2014.
[20]
Kien Pham, Aécio S. R. Santos, and Juliana Freire. Bootstrapping domain-specific content discovery on the web. In Proceedings of the 2019 World Wide Web Conference, WWW, pages 1476--1486, 2019.
[21]
Mark Sandler, Andrew G. Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR, pages 4510--4520, 2018.
[22]
Ryoma Sato. Enumerating fair packages for group recommendations. In Proceedings of the 15th ACM International Conference on Web Search and Data Mining, WSDM, pages 870--878. ACM, 2022.
[23]
Ryoma Sato. Towards principled user-side recommender systems. In The 31st ACM International Conference on Information and Knowledge Management, CIKM, 2022.
[24]
Ryoma Sato. Private recommender systems: How can users build their own fair recommender systems without log data? In Proceedings of the 2022 SIAM International Conference on Data Mining, SDM, 2022.
[25]
Ryoma Sato. Retrieving black-box optimal images from external databases. In Proceedings of the 15th ACM International Conference on Web Search and Data Mining, WSDM, pages 879--887. ACM, 2022.
[26]
Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. Deep inside convolutional networks: Visualising image classification models and saliency maps. In Proceedings of the 2nd International Conference on Learning Representations, ICLR, Workshop Track Proceedings, 2014.
[27]
Ashudeep Singh and Thorsten Joachims. Fairness of exposure in rankings. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD, pages 2219--2228, 2018.
[28]
Daniel Smilkov, Nikhil Thorat, Yannick Assogba, Ann Yuan, Nick Kreeger, Ping Yu, Kangyi Zhang, Shanqing Cai, Eric Nielsen, David Soergel, Stan Bileschi, Michael Terry, Charles Nicholson, Sandeep N. Gupta, Sarah Sirajuddin, D. Sculley, Rajat Monga, Greg Corrado, Fernanda B. Viégas, and Martin Wattenberg. Tensorflow.js: Machine learning for the web and beyond. In Proceedings of Machine Learning and Systems 2019, MLSys. mlsys.org, 2019.
[29]
Hao Yuan, Jiliang Tang, Xia Hu, and Shuiwang Ji. XGNN: towards model-level explanations of graph neural networks. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD, pages 430--438, 2020.
[30]
Zheng-Jun Zha, Linjun Yang, Tao Mei, Meng Wang, and Zengfu Wang. Visual query suggestion. In Proceedings of the 17th International Conference on Multimedia, MM, pages 15--24. ACM, 2009.

Cited By

View all
  • (2022)Word Tour: One-dimensional Word Embeddings via the Traveling Salesman ProblemWord Tour: One-dimensional Word Embeddings via the Traveling Salesman ProblemJournal of Natural Language Processing10.5715/jnlp.29.129729:4(1297-1301)Online publication date: 2022
  • (2022)Towards Principled User-side Recommender SystemsProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557476(1757-1766)Online publication date: 17-Oct-2022

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management
October 2022
5274 pages
ISBN:9781450392365
DOI:10.1145/3511808
  • General Chairs:
  • Mohammad Al Hasan,
  • Li Xiong
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. information retrieval
  2. user-side systems
  3. web searching

Qualifiers

  • Short-paper

Funding Sources

Conference

CIKM '22
Sponsor:

Acceptance Rates

CIKM '22 Paper Acceptance Rate 621 of 2,257 submissions, 28%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)2
Reflects downloads up to 25 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Word Tour: One-dimensional Word Embeddings via the Traveling Salesman ProblemWord Tour: One-dimensional Word Embeddings via the Traveling Salesman ProblemJournal of Natural Language Processing10.5715/jnlp.29.129729:4(1297-1301)Online publication date: 2022
  • (2022)Towards Principled User-side Recommender SystemsProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557476(1757-1766)Online publication date: 17-Oct-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media