short-paper

CLEAR: A Fully User-side Image Search System

Author:

Ryoma Sato

CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

Pages 4970 - 4974

https://doi.org/10.1145/3511808.3557172

Published: 17 October 2022

Abstract

We use many search engines on the Internet in our daily lives. However, they are not perfect. Their scoring functions may not model our intent, or they may accept only text queries even though we want to carry out a similar-image search. In such cases, we must compromise: we either continue to use the unsatisfactory service or leave it. Recently, a new solution, user-side search systems, has been proposed. In this framework, each user builds their own search system that meets their preferences, with a user-defined scoring function and a user-defined interface. Although the concept is appealing, it is still not clear whether this approach is feasible in practice. In this demonstration, we show the first fully user-side image search system, CLEAR, which realizes a similar-image search engine for Flickr. The challenge is that Flickr provides neither an official similar-image search engine nor a corresponding API. Nevertheless, CLEAR realizes it entirely on the user side. CLEAR uses no backend server, stores no images, and builds no search indices. This contrasts with traditional search algorithms, which require preparing a backend server and building a search index. Therefore, each user can easily deploy their own CLEAR engine, and the resulting service is custom-made and privacy-preserving. The online demo is available at https://clear.joisino.net. The source code is available at https://github.com/joisino/clear.
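To make the idea of index-free, user-side ranking concrete, here is a minimal sketch. It is not CLEAR's actual implementation, which the abstract does not describe. It assumes a hypothetical `fetch_candidates` callable that stands in for whatever public API returns candidate images, and it assumes each candidate already comes with a feature vector computed on the client. The scoring function is user-defined and defaults to cosine similarity. Candidates are scored as they stream in and only the best `k` are kept, so nothing is stored or indexed.

```python
import heapq
import math
from typing import Callable, Iterable, List, Sequence, Tuple

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def user_side_search(
    query: Vector,
    fetch_candidates: Callable[[], Iterable[Tuple[str, Vector]]],
    score: Callable[[Vector, Vector], float] = cosine,
    k: int = 3,
) -> List[Tuple[float, str]]:
    """Rank streamed candidates with a user-defined score, keeping only the top k."""
    top: List[Tuple[float, str]] = []  # min-heap holding the best k seen so far
    for url, features in fetch_candidates():
        s = score(query, features)
        if len(top) < k:
            heapq.heappush(top, (s, url))
        elif s > top[0][0]:
            # New candidate beats the current worst of the top k.
            heapq.heapreplace(top, (s, url))
    return sorted(top, reverse=True)  # best first


def fake_fetch() -> Iterable[Tuple[str, Vector]]:
    """Hypothetical stand-in for a remote API; yields (url, feature vector) pairs."""
    yield "img/a", [1.0, 0.0]
    yield "img/b", [0.0, 1.0]
    yield "img/c", [0.9, 0.1]


results = user_side_search([1.0, 0.0], fake_fetch, k=2)
for s, url in results:
    print(f"{url}: {s:.3f}")
```

Because the scoring function is an ordinary argument, a user could swap in any preference model without touching the service being searched, which is the property the user-side framework relies on.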

Cited By

Sato R (2022). Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem. Journal of Natural Language Processing 29:4 (1297-1301). DOI: 10.5715/jnlp.29.1297. Online publication date: 2022.
https://doi.org/10.5715/jnlp.29.1297

Sato R, Al Hasan M, Xiong L (2022). Towards Principled User-side Recommender Systems. Proceedings of the 31st ACM International Conference on Information & Knowledge Management (1757-1766). DOI: 10.1145/3511808.3557476. Online publication date: 17-Oct-2022.
https://dl.acm.org/doi/10.1145/3511808.3557476

Index Terms

CLEAR: A Fully User-side Image Search System
1. Information systems
  1. Information retrieval
    1. Users and interactive retrieval
  2. World Wide Web
    1. Web searching and information discovery

Information & Contributors

Information

Published In

CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

October 2022

5274 pages

ISBN:9781450392365

DOI:10.1145/3511808

General Chairs: Mohammad Al Hasan (Indiana University Purdue University, Indianapolis, USA), Li Xiong (Emory University, Atlanta, USA)

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Short-paper

Funding Sources

Japan Society for the Promotion of Science

Conference

CIKM '22

CIKM '22: The 31st ACM International Conference on Information and Knowledge Management

October 17 - 21, 2022

Atlanta, GA, USA

Acceptance Rates

CIKM '22 Paper Acceptance Rate 621 of 2,257 submissions, 28%;

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%
