[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

An analysis of web proxy logs with query distribution pattern approach for search engines

Published: 01 January 2012 Publication History

Abstract

This study presents an analysis of users' queries directed at different search engines to investigate trends and suggest better search engine capabilities. The query distribution among search engines that includes spawning of queries, number of terms per query and query lengths is discussed to highlight the principal factors affecting a user's choice of search engines and evaluate the reasons of varying the length of queries. The results could be used to develop long to short term business plans for search engine service providers to determine whether or not to opt for more focused topic specific search offerings to gain better market share. Highlights We analyse user queries to highlight the main factors affecting search engine choice. We evaluate the reasons of varying the length of queries by different search engines. Results can assist whether to offer topic specific search to gain market share. Results could be used to develop strategies for designing advanced search engine. Results could also be used to develop search engine service provider business.

References

[1]
A. Patel, N. Schmidt, Application of structured document parsing to focused Web crawling, Computer Standards and Interfaces Journal, 32 (November 2010) x-y.
[2]
A. Patel, An adaptive updating topic specific Web search system using T-graph, Journal of Computer Science, 6 (2010) 450-456.
[3]
A. Patel, M.J. Khan, Evaluation of service management algorithms in a distributed Web search system, Computer Standards & Interfaces, 29 (February 2007) 152-160.
[4]
C. Silverstein, M. Henzinger, H. Marais, M. Moricz, Analysis of a Very Large AltaVista Query Log, Digital SRC Technical Note 1998014. ftp://gatekeeper.research.compaq.com/pub/DEC/SRC/technical-notes/SRC-1998-014.pdf
[5]
Y. Zhang, A. Spink, B.J. Jasen, Time series analysis of a Web search engine transaction log, Information Processing & Management, 45 (2009) 230-245.
[6]
A. Spink, J.L. Xu, Selected results from a large study of Web Searching: the Excite study, in: Information Research, Vol. 6 No. 1, 2000. http://informationr.net/ir/6-1/paper90.html
[7]
Analog, The most popular log file analyser in the world. http://www.analog.cx
[8]
Experian Hitwise, Top websites & search engine analysis. http://www.hitwise.com
[9]
SEO Consultants Directory, Top Ten Search Engine Top 10 SEs. http://www.seoconsultants.com/search-engines
[10]
V. Bhatiasevi, Y. Chairavutthi, The battle for World Wide Web dominance: in search of network externalities, in: International Business Management, 5, Medwell Journals, 2011.
[11]
C. Marcus, Web 1.0, Web 2.0, Web 3.0 and Web 4.0 explained. http://www.marcuscake.com/key-concepts/internet-evolution
[12]
M. Chen, J. Dal Busco, K. Garrett, A. Sinha, A Search Engine Usage. http://courses.ischool.berkeley.edu/i271a/f00/SearchEngine/appendix.htm
[13]
R.W. White, S.T. Dumais, Characterizing and predicting search engine switching behaviour, in: CIKM'09 Proceeding of the 18th ACM conference on Information and Knowledge Management, 2009, pp. 87-96.
[14]
M. Zhaoli, G. Jiong, L. Guijun, Competition and adoption of search engine software, International Journal of u- and e-Service, Science and Technology, 2 (2009).
[15]
A. Spink, S. Ozmutlu, H.C. Ozmutlu, B.J. Jansen, US versus European Web Searching Trends, ACM SIGIR Forum, Vol. 36 No. 2. http://www.acm.org/sigir/forum/F2002/spink.pdf
[16]
H. Hananzita, K. Kiran, Malaysian Web search engines: a critical analysis, in: Malaysian Journal of Library & Information Science, Vol.11, no.1, July 2006, pp. 103-122. http://eprints.um.edu.my/282/1/web_search_engines_kiran.pdf
[17]
M. Chau, X. Fang, C.C. Yang, Web searching in Chinese: a study of a search engine in Hong Kong, Journal of the American Society for Information Science and Technology, 58 (2007) 1044-1054.
[18]
H.T. Pu, S.-L. Chuang, C. Yang, Subject categorization of query terms for exploring Web users' Search interests, Journal of the American Society for Information Science and Technology, 53 (2002) 617-630.
[19]
B. Cory, J. Rosie, R. Moira, The linguistic structure of English Web-search queries, in: Proceeding of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008, pp. 1021-1030.
[20]
L. Na, A. Patel, R. Latih, C. Wills, Z. Shukur, R. Mulla, A study of mashup as a software application development technique with examples from an end-user programming perspective, Journal of Computer Science, 6 (November, 2010) 1406-1415.

Cited By

View all
  • (2021)Mining Domain Terminologies Using Search Engine's Query LogACM Transactions on Asian and Low-Resource Language Information Processing10.1145/346232720:6(1-32)Online publication date: 12-Aug-2021
  • (2019)Based on The Document-Link and Time-Clue Relationships Between Blog Posts to Improve the Performance of Google Blog SearchInternational Journal on Semantic Web & Information Systems10.4018/IJSWIS.201901010315:1(52-75)Online publication date: 1-Jan-2019
  • (2018)Characterising Dataset Search QueriesCompanion Proceedings of the The Web Conference 201810.1145/3184558.3191597(1485-1488)Online publication date: 23-Apr-2018
  • Show More Cited By
  1. An analysis of web proxy logs with query distribution pattern approach for search engines

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Computer Standards & Interfaces
    Computer Standards & Interfaces  Volume 34, Issue 1
    January, 2012
    230 pages

    Publisher

    Elsevier Science Publishers B. V.

    Netherlands

    Publication History

    Published: 01 January 2012

    Author Tags

    1. Distributed search engines
    2. Proxy server logs
    3. Query analysis
    4. Search engines
    5. Web search services

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 01 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Mining Domain Terminologies Using Search Engine's Query LogACM Transactions on Asian and Low-Resource Language Information Processing10.1145/346232720:6(1-32)Online publication date: 12-Aug-2021
    • (2019)Based on The Document-Link and Time-Clue Relationships Between Blog Posts to Improve the Performance of Google Blog SearchInternational Journal on Semantic Web & Information Systems10.4018/IJSWIS.201901010315:1(52-75)Online publication date: 1-Jan-2019
    • (2018)Characterising Dataset Search QueriesCompanion Proceedings of the The Web Conference 201810.1145/3184558.3191597(1485-1488)Online publication date: 23-Apr-2018
    • (2017)New technique to deal with verbose queries in social book searchProceedings of the International Conference on Web Intelligence10.1145/3106426.3106481(799-806)Online publication date: 23-Aug-2017
    • (2017)The Trials and Tribulations of Working with Structured DataProceedings of the 2017 CHI Conference on Human Factors in Computing Systems10.1145/3025453.3025838(1277-1289)Online publication date: 2-May-2017
    • (2015)Towards a feature-rich data set for personalized access to long-tail contentProceedings of the 30th Annual ACM Symposium on Applied Computing10.1145/2695664.2695671(1031-1038)Online publication date: 13-Apr-2015
    • (2015)An automatic methodology to evaluate personalized information retrieval systemsUser Modeling and User-Adapted Interaction10.1007/s11257-014-9148-925:1(1-37)Online publication date: 1-Mar-2015
    • (2014)Towards building a scholarly big data platformProceedings of the 14th ACM/IEEE-CS Joint Conference on Digital Libraries10.5555/2740769.2740789(117-126)Online publication date: 8-Sep-2014
    • (2013)Web mining based extraction of problem solution ideasExpert Systems with Applications: An International Journal10.1016/j.eswa.2013.01.01340:10(3961-3969)Online publication date: 1-Aug-2013
    • (2012)Extending BM25 with multiple query operatorsProceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval10.1145/2348283.2348406(921-930)Online publication date: 12-Aug-2012

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media