Abstract
We present a set covering algorithm and a compositional algorithm to describe sequences of www pages visits in click-stream data. The set covering algorithm utilizes the approach of rule specialization like the well known CN2 algorithm, the compositional algorithm is based on our original KEX algorithm, however both algorithms deal with sequences of events (visited pages) instead of sets of attributevalue pairs. The learned rules can be used to predict next page to be viewed by a user or to describe the most typical paths of www pages visitors and the dependencies among the www pages. We have successfully used both algorithms on real data from an internet shop and we mined useful information from the data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Berka, P., Ivánek, J. (1994) Automated knowledge acquisition for PROSPECTOR-like expert systems. In. (Bergadano, de Raedt eds.) Proc. ECML’94, Springer 1994, 339–342
Bruha, I., Kočkov, S. (1994) A support for decision making: Cost-sensitive learning system. Artificial Intelligence in Medicine, 6, 67–82
Clark, P., Niblett, T. (1989) The CN2 induction algorithm. Machine Learning, 3, 261–283
Cooley, R., Tan, P. N., Srivastava, J. (1999) Discovery of interesting usage patterns from web data. Tech.Rep. TR 99-022, Univ. of Minnesota
Kaufman, K. A., Michalski, R. S. (1999) Learning from inconsistent and noisy data: The AQ18 approach. In: Proc 11th Int. Symposium on Methodologies for Intelligent Systems
Kosala, R., Blockeel, H. (2000) Web Mining Research: A Survey. SIGKDD Explorations, Vol. 2 Issue 1
Spiliopoulou, M., Faulstich, L. (1999) WUM: A tool for web utilization analysis. In Proc. EDBT Workshop WebDB’98, Springer LNCS 1590
Srivastava, J., Cooley, R., Deshpande, M., Tan, P. N. (2000) Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data. SIGKDD Explorations, Vol. 1 Issue 2
Zaiane, O., Han, J. (1998) WebML: Querying the World-Wide Web for resources and knowledge. In: Workshop on Web Information and Data Management WIDM’98, Bethesda, 9–12
Zaine, O., Xin, M., Han, J. (1998) Discovering web access patterns and trends by applying OLAP and data mining technology on web logs. In: Advances in Digital Libraries
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Berka, P., Laš, V., Kočka, T. (2005). Rule Induction for Click-Stream Analysis: Set Covering and Compositional Approach. In: Kłopotek, M.A., Wierzchoń, S.T., Trojanowski, K. (eds) Intelligent Information Processing and Web Mining. Advances in Soft Computing, vol 31. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-32392-9_2
Download citation
DOI: https://doi.org/10.1007/3-540-32392-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25056-2
Online ISBN: 978-3-540-32392-1
eBook Packages: EngineeringEngineering (R0)