Abstract
We focus on replacing human processing web resources by automated processing. On an experimental system we identify uncertainty issues making this process difficult for automated processing and try to minimize human intervention. In particular we focus on uncertainty issues in a Web content mining system and a user preference mining system. We conclude with possible future development heading to an extension of OWL with uncertainty features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Baumgartner, R., Flesca, S., Gottlob, G.: Visual Web Information Extraction. In: VLDB Conference (2001)
Bednárek, D., Obdržálek, D., Yaghob, J., Zavoral, F.: Data Integration Using DataPile Structure. In: Proceedings of the 9th East-European Conference on Advances in Databases and Information Systems, ADBIS 2005, Tallinn, pp. 178–188 (2005) ISBN 9985-59-545-9
Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. In: Scientific American Magazine (May 2001)
Chang, C.-H., Lui, S.-L.: IEPAD: Information extraction based on pattern discovery. In: WWW-10 (2001)
Liu, B., Grossman, R., Zhai, Y.: Mining Data Records in Web Pages. In: Procs. SIGKDD 2003, Washington, DC, USA, August 24-27 (2003)
Eckhardt, A., Pokorný, J., Vojtáš, P.: A system recommending top-k objects for multiple users preferences. In: 2007 IEEE Conference on Fuzzy Systems, pp. 1101–1106. IEEE, Los Alamitos (2007)
Eckhardt, A., Horváth, T., Vojtáš, P.: PHASES: A User Profile Learning Approach for Web Search. In: WI 2007 Web Intelligence Conference, Fremont, CA (November 2007) (accepted)
Eckhardt, A., Horváth, T., Vojtáš, P.: Learning different user profile annotated rules for fuzzy preference top-k querying. In: SUM 2007 Scalable Uncertainty Management Conference, Washington DC Area (October 2007) (accepted)
Embley, D.W., Campbell, D.M., Smith, R.D., Liddle, S.W.: Ontology-Based Extraction and Structuring of Information from Data-Rich Unstructured Documents. In: CIKM 1998, pp. 52–59 (1998)
Fagin, R., Lotem, A., Naor, M.: Optimal Aggregation Algorithms for Middleware. In: Proc. 20th ACM Symposium on Principles of Database Systems, pp. 102–113 (2001)
Galamboš, L.: Dynamization in IR Systems. In: Klopotek, M.A. (ed.) Proc. IIPWM 2004 - Intelligent Information Processing And Web Mining, pp. 297–310. Springer, Heidelberg (2004)
Galamboš, L.: Semi-automatic stemmer evaluation, ibid., 209–218
Gursky, P., Horvath, T., Novotny, R., Vanekova, V., Vojtas, P.: UPRE: User preference based search system. In: IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006), pp. 841–844. IEEE, Los Alamitos (2006)
Horvath, T., Vojtas, P.: Ordinal Classification with Monotonicity Constraints. In: Perner, P. (ed.) ICDM 2006. LNCS, vol. 4065, pp. 217–225. Springer, Heidelberg (2006)
Kushmerick, N.: Wrapper induction: efficiency and expressiveness. Artificial Intelligence 118, 15–68 (2000)
Muslea, I., Minton, S., Knoblock, C.: A hierarchical approach to wrapper induction. In: Conf. on Autonomous Agents (1999)
Potharst, R., Feelders, A.J.: Classification trees for problems with monotonicity constraints. In: ACM SIGKDD Explorations Newsletter archive, vol. 4(1), pp. 1–10. ACM Press, New York (2002)
Turtle, H.R., Croft, W.B.: Uncertainty in Information Retrieval Systems. In: Proc. Second Workshop Uncertainty Management and Information Systems: From Needs to Solutions, Catalina, Calif., 1993 as quoted in S. Parsons. Current Approaches to Handling Imperfect Information in Data and Knowledge Bases. IEEE TKDE, vol. 8(3), pp. 353–372 (1996)
Vojtáš, P.: \(\mathcal{E}\mathcal{L}\) description logic with aggregation of user preference concepts. In: Duží, M., et al. (eds.) Information modeling and Knowledge Bases XVIII, pp. 154–165. IOS Press, Amsterdam (2007)
Yaghob, J., Zavoral, F.: Semantic Web Infrastructure using DataPile. In: Butz, C.J., et al. (eds.) Proc. 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, pp. 630–633. IEEE, Los Alamitos (2006)
JRex – The Java Browser Component, http://jrex.mozdev.org/index.html
org.w3c.dom.Document on, http://www.w3.org
Charter of W3C Uncertainty Reasoning for the World Wide Web Incubator Group, http://www.w3.org/2005/Incubator/urw3/charter
Wiki of W3C Uncertainty Reasoning for the World Wide Web XG Search, http://www.w3.org/2005/Incubator/urw3/wiki/FrontPage
Smarty: Template Engine (April 2, 2008), http://smarty.php.net
DiffUtil (March 25, 2008), http://www.gnu.org/software/diffutils/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Eckhardt, A., Horváth, T., Maruščák, D., Novotný, R., Vojtáš, P. (2008). Uncertainty Issues and Algorithms in Automating Process Connecting Web and User. In: da Costa, P.C.G., et al. Uncertainty Reasoning for the Semantic Web I. URSW URSW URSW 2006 2007 2005. Lecture Notes in Computer Science(), vol 5327. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89765-1_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-89765-1_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89764-4
Online ISBN: 978-3-540-89765-1
eBook Packages: Computer ScienceComputer Science (R0)