Abstract
A neural based multi-agent system for automatic HTML pages retrieval is presented. The system is based on the EαNet architecture, a neural network having good generalization capabilities and able to learn the activation function of its hidden units. The starting hypothesis is that the HTML pages are stored in networked repositories. The system goal is to retrieve documents satisfying a user query and belonging to a given class (i.e. documents containing the word “football” and talking about “Sports”). The system is composed by three interacting agents: the EαNet Neural Classifier Mobile Agent, the Query Agent, and the Locator Agent. The whole system was successfully implemented exploiting the Jade platform features and facilities. The preliminary experimental results show a good classification rate: in the best case a classification error of 9.98% is reached.
This research has been partially funded by Engineering Ingegneria Informatica SpA within the Teschet project.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Gaglio, S., Pilato, G., Sorbello, F., Vassallo, G.: Using the Hermite regression formula to design a neural architecture with automatic learning of the ‘hidden’ activation functions. In: Lamma, E., Mello, P. (eds.) AI*IA 1999. LNCS (LNAI), vol. 1792, pp. 226–237. Springer, Heidelberg (2000)
Pilato, G., Sorbello, F., Vassallo, G.: An innovative way to measure the quality of a neural network without the use of the test set. International Journal of Artificial Computational Intelligence 5(1), 31–36 (2001)
Cirasa, A., Pilato, G., Sorbello, F., Vassallo, G.: An enhanced version of the αNet architecture: EαNet. In: Proc. of AI*IA Workshop of Robotics Parma, Italy (1999)
Cirasa, A., Pilato, G., Sorbello, F., Vassallo, G.: EαNet: A Neural Solution for Web Pages Classification. In: Proc. of 4th World MultiConference on Systemics, Cybernetics and Informatics - SCI 2000, Orlando, Florida, U.S.A., July 23-26 (2000)
Glitho, R.H., Olougouna, E., Pierre, S.: Mobile Agents and Their Use for Information Retrieval: a brief overview and an elaborate case study. IEEE Network, 34–41 (2002)
Brewington, B., Gray, R., Moizumi, K., Kotz, D., Cybenko, G., Rus, D.: Mobile agents in distributed information retrieval Intelligent Information Agents, pp. 1–32. Springer, Heidelberg (1999)
Kosala, R., Blockeel, H.: Web Mining research: a survey. ACM SIGKDD Explorations 2(1), 1–15 (2000)
Ng, Y.-K., Tang, J., Goodrich, M.: A Binary-Categorization Approach for Classifyng Multiple-Record Web Documents Using Application Ontologies and a Probabilistic Model. In: Proceedings of Seventh International Conference on Database Systems for Advanced Applications (2001)
Liang, J., Doermann, D., Ma, M., Guo, J.K.: Page Classification Through Logical Labelling. In: Proceedings of 16th International Conference on Pattern Recognition, vol. 3 (2002)
Soonthornphisaj, N., Kijsirikul, B.: The Effects of Different Feature Sets on the Web Page Categorization Problem Using the Iterative Cross-Training Algorithm. In: Proceeding of the International Conference on Enterprise and Information System (ICEIS), Setubal, Portugal (July 2001)
Pal, S.K., Talwar, V., Mitra, P.: Web Mining in Soft Computing Framework: Relevance, State of the Art and Future Direction. IEEE Transaction on Neural Networks 13(9) (September 2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pilato, G., Vitabile, S., Vassallo, G., Conti, V., Sorbello, F. (2003). A Concurrent Neural Classifier for HTML Documents Retrieval. In: Apolloni, B., Marinaro, M., Tagliaferri, R. (eds) Neural Nets. WIRN 2003. Lecture Notes in Computer Science, vol 2859. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45216-4_24
Download citation
DOI: https://doi.org/10.1007/978-3-540-45216-4_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20227-1
Online ISBN: 978-3-540-45216-4
eBook Packages: Springer Book Archive