[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

How to Improve Your Search Engine Ranking: Myths and Reality

Published: 01 March 2014 Publication History

Abstract

Search engines have greatly influenced the way people access information on the Internet, as such engines provide the preferred entry point to billions of pages on the Web. Therefore, highly ranked Web pages generally have higher visibility to people and pushing the ranking higher has become the top priority for Web masters. As a matter of fact, Search Engine Optimization (SEO) has became a sizeable business that attempts to improve their clients’ ranking. Still, the lack of ways to validate SEO’s methods has created numerous myths and fallacies associated with ranking algorithms.
In this article, we focus on two ranking algorithms, Google’s and Bing’s, and design, implement, and evaluate a ranking system to systematically validate assumptions others have made about these popular ranking algorithms. We demonstrate that linear learning models, coupled with a recursive partitioning ranking scheme, are capable of predicting ranking results with high accuracy. As an example, we manage to correctly predict 7 out of the top 10 pages for 78% of evaluated keywords. Moreover, for content-only ranking, our system can correctly predict 9 or more pages out of the top 10 ones for 77% of search terms. We show how our ranking system can be used to reveal the relative importance of ranking features in a search engine’s ranking function, provide guidelines for SEOs and Web masters to optimize their Web pages, validate or disprove new ranking features, and evaluate search engine ranking results for possible ranking bias.

References

[1]
AccuraCast. 2007. Google algorithm’s top ranking factors. http://www.accuracast.com/seo-weekly/ranking-factors.php.
[2]
Alexa. 2014. Alexa, the web information company. http://www.alexa.com/.
[3]
Anderson, S. 2007. Google seo test google prefers valid HTML and CSS. http://www.hobo-web.co.uk/seo-blog/index.php/official-google-prefers-valid-html-css/.
[4]
Aubuchon, V. 2010. Google ranking factors. http://www.vaughns-1-pagers.com/internet/google-ranking-factors.htm.
[5]
Benczur, A. A., Csalogany, K., Sarlos, T., and Uher, M. 2005. Spamrank--Fully automatic link spam detection. In Proceedings of the 1st International Workshop on Adversarial Information Retrieval on the Web (AIRWeb’05). 25--38.
[6]
Blog1. 2013. How google works: Why does crappy website rank higher than mine? http://www.trafficgenerationcafe.com/how-google-works-relevance/.
[7]
Blog2. 2013. My actual blog post ranks lower than pages associated with it. http://productforums.google.com/forum/#!topic/webmasters/QOv273CK07I.
[8]
Cheng, J., Wang, Z., and Pollastri, G. 2008. A neural network approach to ordinal regression. In Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN’08). IEEE World Congress on Computational Intelligence, 1279--1284.
[9]
Cho, J. and Roy, S. 2004. Impact of search engines on page popularity. In Proceedings of the 13th International Conference on World Wide Web (WWW’04). ACM Press, New York, 20--29.
[10]
CPLEX. 2014. Ilog cplex: High-performance software for mathematical programming and optimization. http://www.ilog.com/products/cplex/.
[11]
Davison, B. D. 2000. Recognizing nepotistic links on the web. In Proceedings of the AAAI Workshop on Artificial Intelligence for Web Search.
[12]
FANN. 2014. Fast artificial neural network library. http://leenissen.dk/fann.
[13]
Friedman, J. 2001. Greedy function approximation: A gradient boosting machine. Annals Statist. 29, 5, 1182--1232.
[14]
Friedman, J. 2003. Statistical gradient boosting. Comput. Statist. Data Anal. 38, 4, 367--378.
[15]
Gift, N. 2007. RSYNC version 3 alpha out - O’reilly onlamp blog. http://www.oreillynet.com/onlamp/blog/2007/10/rsync_version_3_alpha_out.html.
[16]
Google. 2014a. Pagerank on Google toolbar. http://www.google.com/support/toolbar/bin/answer.py?hl=en&answer=79837.
[17]
Google. 2014b. Google trends. http://www.google.com/trends.
[18]
Google. 2014c. Google webmaster tools. http://www.google.com/webmasters/.
[19]
Grimmelmann, J. 2013. What to do about Google? Comm. ACM 56, 9, 28--30.
[20]
Gyongyi, Z., Berkhin, P., Garcia-Molina, H., and Pedersen, J. 2006. Link spam detection based on mass estimation. In Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB’06). 439--450.
[21]
Hopkins, L. 2012. Online reputation management: Why the first page of Google matters so much. http://www.leehopkins.net/2012/08/30/online-reputation-management-why-the-first-page-of-google-matters-so-much/.
[22]
HTML Tidy. 2014. HTML tidy library project. http://tidy.sourceforge.net/.
[23]
Joachims, T. 1999. Making large-scale SVM learning practical. In Advances in Kernel Methods---Support Vector Learning, B. Scholkopf, C. J. C. Burges, and A. J. Smola Eds., MIT Press, Cambridge, MA, 169--184.
[24]
Joachims, T. 2009. SVM-rank support vector machine. http://www.cs.cornell.edu/People/tj/svm_light/svm_rank.html.
[25]
Joachims, T., Granka, L., Pang, B., Hembrooke, H., and Gay, G. 2005. Accurately interpreting click-through data as implicit feedback. In Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’05). 154--161.
[26]
Kontopoulos, G. 2007. Top Google ranking factors. http://www.squidoo.com/topGoogleRankingFactors.
[27]
Li, L. 2008. Google’s top search engine ranking factors. http://lornali.com/online-marketing/seo/googles-top-search-engine-ranking-factors.
[28]
Marshall, B. 2009. Top 10 most important Google ranking factors. http://blogs.myspace.com/index.cfm?fuseaction=blog.view&friendId=21196&blogId=493022330.
[29]
Mitchell, T. 1997. Machine Learning. McGraw-Hill.
[30]
Moran, M. and Hunt, B. 2005. Search Engine Marketing, Inc.: Driving Search Traffic to Your Company’s Web Site. Prentice Hall PTR, Upper Saddle River, NJ.
[31]
Palaniswamy, A. 2005. Iptables dependency: Why we got there and how we got out. http://www.zimbrablog.com/blog/archives/2005/11/iptables-dependency-why-we-got-there-and-howwe-got-out.html.
[32]
Panda, B., Herbach, J. S., Basu, S., and Bayardo, R. J. 2009. Planet: Massively parallel learning of tree ensembles with mapreduce. In Proceedings of the 35th International Conference on Very Large Data Bases (VLDB’09).
[33]
Patel, N. 2006. A breakdown of Google’s ranking factors. http://www.pronetadvertising.com/articles/a-breakdown-of-googles-ranking-factors.html.
[34]
PHP HTML Parser. 2014. PHP simple HTML dom parser. http://simplehtmldom.sourceforge.net.
[35]
RankNet. 2009. RankNet: How Bing works. http://neotracks.blogspot.com/2009/06/ranknethow-bing-works.html.
[36]
RSS Pieces. 2007. HTML validation: The hidden key to seo. http://www.rsspieces.com/html-validation-the-hidden-key-to-seo.
[37]
SEOmoz. 2007. Search engine ranking factors. http://www.seomoz.org/article/search-ranking-factors.
[38]
SeoPros. 2014. Seopros.org. http://www.seopros.org/.
[39]
Seth, S. 2010. Yahoo! Transitions organic search back-end to Microsoft platform. http://www.ysearchblog.com/2010/08/24/yahoo-transitions-organic-search-back-end-to-microsoft-platform/.
[40]
Su, A.-J., Hu, Y. C., Kuzmanovic, A., and Koh, C.-K. 2010. How to improve your Google ranking: Myths and reality. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence (WI’10).
[41]
TOPSEOs. 2014. Topseos.com. http://www.topseos.com/.
[42]
Vapnik, N. V. 2000. The Nature of Statistical Learning Theory. Springer.
[43]
Wu, B. and Davison, B. D. 2005. Identifying link farm spam pages. In Proceedings of the Special Interest Tracks and Posters of the 14th International Conference on World Wide Web (WWW’05). 820--829.

Cited By

View all
  • (2024)Effects of using multi-category web pages on rank estimation of Google search engine results pageWeb Intelligence10.3233/WEB-230239(1-17)Online publication date: 19-Sep-2024
  • (2024)Exploring the impact of SEO-based ranking factors for voice queries through machine learningArtificial Intelligence Review10.1007/s10462-024-10780-957:6Online publication date: 16-May-2024
  • (2023)SEO Through the E-Costumer JourneyBig Data Marketing Strategies for Superior Customer Experience10.4018/978-1-6684-6454-0.ch007(165-193)Online publication date: 24-Feb-2023
  • Show More Cited By

Index Terms

  1. How to Improve Your Search Engine Ranking: Myths and Reality

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on the Web
    ACM Transactions on the Web  Volume 8, Issue 2
    March 2014
    226 pages
    ISSN:1559-1131
    EISSN:1559-114X
    DOI:10.1145/2600093
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 March 2014
    Accepted: 01 October 2013
    Revised: 01 September 2013
    Received: 01 February 2011
    Published in TWEB Volume 8, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Search engine
    2. learning
    3. ranking algorithm
    4. search engine optimization

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)78
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Effects of using multi-category web pages on rank estimation of Google search engine results pageWeb Intelligence10.3233/WEB-230239(1-17)Online publication date: 19-Sep-2024
    • (2024)Exploring the impact of SEO-based ranking factors for voice queries through machine learningArtificial Intelligence Review10.1007/s10462-024-10780-957:6Online publication date: 16-May-2024
    • (2023)SEO Through the E-Costumer JourneyBig Data Marketing Strategies for Superior Customer Experience10.4018/978-1-6684-6454-0.ch007(165-193)Online publication date: 24-Feb-2023
    • (2023)Mini-Map Design Features as a Navigation Aid in the Virtual Geographical Space Based on Video GamesISPRS International Journal of Geo-Information10.3390/ijgi1202005812:2(58)Online publication date: 8-Feb-2023
    • (2023)Exploring the relationship between YouTube video optimisation practices and video rankings for online marketing: a machine learning approachJournal of Business Analytics10.1080/2573234X.2023.2292536(1-16)Online publication date: 12-Dec-2023
    • (2022)Enhancing adaptation of digital entrepreneurship to consumers’ online purchasing in the COVID-19 pandemic eraTechnology Analysis & Strategic Management10.1080/09537325.2022.211005736:8(1749-1762)Online publication date: 8-Aug-2022
    • (2022)Public awareness and attitudes towards search engine optimizationBehaviour & Information Technology10.1080/0144929X.2022.205650742:8(1025-1044)Online publication date: 6-Apr-2022
    • (2021)Análisis de la implementación Schema.org en el repositorio RODERIC e impacto en el posicionamiento en Google y Google ScholarRevista Española de Documentación Científica10.3989/redc.2021.3.179744:3(e300)Online publication date: 1-Jul-2021
    • (2021)Using Machine Learning for Web Page Classification in Search Engine OptimizationFuture Internet10.3390/fi1301000913:1(9)Online publication date: 2-Jan-2021
    • (2021)The influence of search engine optimization on Google's resultsProceedings of the 13th ACM Web Science Conference 202110.1145/3447535.3462479(12-20)Online publication date: 21-Jun-2021
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media