Abstract
In the presence of a vast amount of user generated content evolving around entities such as people, locations, products, events, etc., it seems that documentoriented retrieval is rather old-fashioned. Imagine an HIV-relevant search task that with the goal of finding drugs that may interfere with HIV protease inhibitors. Retrieving an exhaustive list of explicit results (i.e., drugs that may interfere with HIV protease inhibitors) can be crucial for people suffering from HIV, whose health depends on the unmediated effect of protease inhibitors. Moreover it might be desirable to have the drugs in the result list ranked by their probability of interfering with protease inhibitors. In order to automatically retrieve such an exhaustive list of ranked answers, there are two subtasks that have to be addressed: (1) knowledge about drugs that stand in an interference relationship to protease inhibitors needs to be extracted from various web pages and appropriately combined, (2) the drugs need to be ranked by their probability of interfering with protease inhibitors. Neither of these tasks can be addressed by state-of-the-art search engines. Expecting the user to manually inspect retrieved documents to construct an exhaustive list of answers is simply unrealistic. As a matter of fact, major players in the search engine industry have recognized these issues and are attempting to shift the focus towards knowledge retrieval. For example, in 2010, Google acquired Metaweb, the company behind Freebase, one of the largest knowledge bases with explicit facts about real-world entities. In 2011, Google’s search group was restructured and renamed into “knowledge group” [6]. Another example is Microsoft’s Bing, which has undergone similar changes in recent years. By the end of 2009 Bing was returning Wolfram Alpha results to entity-related and scholarly queries [8], and by the end 2010 Bing announced the new “health search experience” with the focus “on further enabling people to get relevant information and make better decisions” [7].
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Kasneci, G., Van Gael, J., Herbrich, R., Graepel, T.: Bayesian Knowledge Corroboration with Logical Rules and User Feedback. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds.) ECML PKDD 2010, Part II. LNCS, vol. 6322, pp. 1–18. Springer, Heidelberg (2010)
Kasneci, G., Van Gael, J., Stern, D.H., Graepel, T.: CoBayes: Bayesian Knowledge Corroboration with Assessors of Unknown Areas of Expertise. In: International Conference on Web Search and Web Data Mining (WSDM 2011), pp. 465–474. ACM (2011)
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G.: DBpedia: A Nucleus for a Web of Open Data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ISWC/ASWC 2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007)
Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: A Large Ontology from Wikipedia and WordNet. J. Web Sem. 6(3), 203–217 (2008)
Infer.NET, http://research.microsoft.com/en-us/um/cambridge/projects/infernet/
Google Dissolves Search Group, http://techcrunch.com/2011/05/03/google-dissolves-search-group-internally-now-called-knowledge
Bringing Knowledge into Health, http://www.bing.com/community/site_blogs/b/search/archive/2010/01/12/bringing-knowledge-into-health-search.aspx
Bing, Wolfram Alpha Agree on Licensing Deal, http://www.zdnet.com/news/bing-wolfram-alpha-agree-on-licensing-deal/333870
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kasneci, G. (2012). Reasoning about Knowledge from the Web. In: Grossniklaus, M., Wimmer, M. (eds) Current Trends in Web Engineering. ICWE 2012. Lecture Notes in Computer Science, vol 7703. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35623-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-35623-0_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35622-3
Online ISBN: 978-3-642-35623-0
eBook Packages: Computer ScienceComputer Science (R0)