[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Simrank++: query rewriting through link analysis of the click graph

Published: 01 August 2008 Publication History

Abstract

We focus on the problem of query rewriting for sponsored search. We base rewrites on a historical click graph that records the ads that have been clicked on in response to past user queries. Given a query q, we first consider Simrank [7] as a way to identify queries similar to q, i.e., queries whose ads a user may be interested in. We argue that Simrank fails to properly identify query similarities in our application, and we present two enhanced versions of Simrank: one that exploits weights on click graph edges and another that exploits "evidence." We experimentally evaluate our new schemes against Simrank, using actual click graphs and queries from Yahoo!, and using a variety of metrics. Our results show that the enhanced methods can yield more and better query rewrites.

References

[1]
Reid Andersen, Fan Chung, and Kevin Lang. Local graph partitioning using pagerank vectors. In FOCS '06.
[2]
I. Antonellis, H. Garcia-Molina, and C. Chang. Simrank++: Query rewriting through link analysis of the click graph. In Technical Report, url: http://dbpubs.stanford.edu/pub/2007--32, 2007.
[3]
Doug Beeferman and Adam Berger. Agglomerative clustering of a search engine query log. In KDD '00.
[4]
Nick Craswell and Martin Szummer. Random walks on the click graph. In Proc. SIGIR '07.
[5]
Jeffrey Dean and Sanjay Ghemawat. MapReduce: Simplified data processing on large clusters. In OSDI 2004.
[6]
S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. 1990.
[7]
Glen Jeh and Jennifer Widom. Simrank: a measure of structural-context similarity. In KDD '02.
[8]
Rosie Jones and Daniel C. Fain. Query word deletion prediction. In SIGIR '03.
[9]
Rosie Jones, Benjamin Rey, Omid Madani, and Wiley Greiner. Generating query substitutions. In WWW '07.
[10]
Christos H. Papadimitriou, Hisao Tamaki, Prabhakar Raghavan, and Santosh Vempala. Latent semantic indexing: a probabilistic analysis. In PODS '98.
[11]
M. Regelson and D. Fain. Predicting click-through rate using keyword clusters. In Proc. 2nd Workshop on Sponsored Search Auctions.
[12]
Matthew Richardson, Ewa Dominowska, and Robert Ragno. Predicting clicks: Estimating the click-through rate for new ads. In WWW '07.
[13]
Ian Ruthven. Re-examining the potential effectiveness of interactive query expansion. In SIGIR '03, pages 213--220.
[14]
A. Sinclair. Algorithms for random generation and counting: A markov chain approach. In Birkhauser, Boston-Basel-Berlin, 1993.
[15]
Egidio Terra and Charles L. A. Clarke. Scoring missing terms in information retrieval tasks. In CIKM '04.
[16]
Ji-Rong Wen, Jian-Yun Nie, and Hong-Jiang Zhang. Query clustering using user logs. ACM Trans. Inf. Syst., 2002.
[17]
Wei Vivian Zhang, Xiaofei He, Benjamin Rey, and Rosie Jones. Query rewriting using active learning for sponsored search. In SIGIR '07.
[18]
Wei Vivian Zhang and Rosie Jones. Comparing click logs and editorial labels for training query rewriting. In Query Log Analysis Workshop, WWW '07.

Cited By

View all
  • (2023)Billion-Scale Bipartite Graph Embedding: A Global-Local Induced ApproachProceedings of the VLDB Endowment10.14778/3626292.362630017:2(175-183)Online publication date: 1-Oct-2023
  • (2023)Securing Public Clouds using Dynamic Communication GraphsProceedings of the 22nd ACM Workshop on Hot Topics in Networks10.1145/3626111.3628198(272-279)Online publication date: 28-Nov-2023
  • (2022)RETE: Retrieval-Enhanced Temporal Event Forecasting on Unified Query Product Evolutionary GraphProceedings of the ACM Web Conference 202210.1145/3485447.3511974(462-472)Online publication date: 25-Apr-2022
  • Show More Cited By

Index Terms

  1. Simrank++: query rewriting through link analysis of the click graph

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 1, Issue 1
    August 2008
    1216 pages

    Publisher

    VLDB Endowment

    Publication History

    Published: 01 August 2008
    Published in PVLDB Volume 1, Issue 1

    Author Tags

    1. click graph
    2. link analysis
    3. similarity metric
    4. sponsored search

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 19 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Billion-Scale Bipartite Graph Embedding: A Global-Local Induced ApproachProceedings of the VLDB Endowment10.14778/3626292.362630017:2(175-183)Online publication date: 1-Oct-2023
    • (2023)Securing Public Clouds using Dynamic Communication GraphsProceedings of the 22nd ACM Workshop on Hot Topics in Networks10.1145/3626111.3628198(272-279)Online publication date: 28-Nov-2023
    • (2022)RETE: Retrieval-Enhanced Temporal Event Forecasting on Unified Query Product Evolutionary GraphProceedings of the ACM Web Conference 202210.1145/3485447.3511974(462-472)Online publication date: 25-Apr-2022
    • (2022)A survey of structural representation learning for social networksNeurocomputing10.1016/j.neucom.2022.04.128496:C(56-71)Online publication date: 28-Jul-2022
    • (2021)Comprehensively Computing Link-based Similarities by Building A Random Surfer GraphProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482329(2578-2587)Online publication date: 26-Oct-2021
    • (2021)SMADProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482183(3543-3547)Online publication date: 26-Oct-2021
    • (2021)Structural Generalizability: The Case of Similarity SearchProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3457316(326-338)Online publication date: 9-Jun-2021
    • (2021)ExactSim: benchmarking single-source SimRank algorithms with high-precision ground truthsThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-021-00672-730:6(989-1015)Online publication date: 5-Jun-2021
    • (2021)Efficient structural node similarity computation on billion-scale graphsThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-021-00654-930:3(471-493)Online publication date: 23-Feb-2021
    • (2020)Realtime index-free single source SimRank processing on web-scale graphsProceedings of the VLDB Endowment10.14778/3384345.338434713:7(966-980)Online publication date: 26-Mar-2020
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media