Abstract
As the volume of information on the network grows daily, people engaged in commerce increasingly need a commerce-oriented search engine. The first step in building such a search engine is to collect commercial information efficiently from the Internet. This paper introduces a method for filtering commerce-oriented information from the Internet. With this method, the spider decides which direction to crawl by judging whether a hyperlink is relevant to commercial affairs. In the experiments, we used word-filtering technology to optimize the program and a thread pool to improve performance.
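The idea summarized above can be illustrated with a minimal sketch. It is not the authors' implementation: the keyword list `COMMERCE_WORDS`, the functions `is_commercial` and `crawl`, the `threshold` parameter, and the in-memory `TOY_WEB` graph are all assumptions made for illustration, and plain substring matching stands in for whatever word-filtering technique the paper actually uses. The sketch shows a spider that follows a hyperlink only when its anchor text or URL looks commercial, and that fetches each level of the frontier through a thread pool.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical vocabulary; the abstract does not list the words the authors filter on.
COMMERCE_WORDS = {"price", "trade", "supplier", "market", "product", "wholesale"}


def is_commercial(anchor_text, url, threshold=1):
    """Judge a hyperlink by counting commerce words in its anchor text and URL.

    Substring matching is used, so "product" also matches "products".
    """
    haystack = (anchor_text + " " + url).lower()
    hits = sum(1 for word in COMMERCE_WORDS if word in haystack)
    return hits >= threshold


def crawl(seed, fetch_links, max_workers=4, max_pages=100):
    """Level-by-level crawl that only follows links judged commercial.

    fetch_links(url) must return a list of (anchor_text, url) pairs. It is
    injected by the caller so that this sketch needs no network access.
    """
    visited = {seed}
    frontier = [seed]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        while frontier and len(visited) < max_pages:
            # Fetch every page of the current level in parallel.
            pages = list(pool.map(fetch_links, frontier))
            next_frontier = []
            for links in pages:
                for anchor_text, url in links:
                    if len(visited) >= max_pages:
                        break
                    if url not in visited and is_commercial(anchor_text, url):
                        visited.add(url)
                        next_frontier.append(url)
            frontier = next_frontier
    return visited


# Toy link graph standing in for the real Web (assumed data).
TOY_WEB = {
    "http://example.com/": [
        ("Product prices", "http://example.com/products"),
        ("Sports news", "http://example.com/sports"),
    ],
    "http://example.com/products": [
        ("Wholesale supplier list", "http://example.com/suppliers"),
    ],
    "http://example.com/sports": [
        ("Market report", "http://example.com/market"),
    ],
    "http://example.com/suppliers": [],
}

if __name__ == "__main__":
    found = crawl("http://example.com/", lambda url: TOY_WEB.get(url, []))
    print(sorted(found))
```

In this toy graph the "Sports news" link is rejected, so the page behind it is never fetched and the "Market report" link it contains is never discovered. That is the trade-off of steering the spider by link relevance: fewer irrelevant pages are downloaded, at the cost of missing commercial pages reachable only through non-commercial ones.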
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, M., Fei, Y. (2008). Filter Technology of Commerce-Oriented Network Information. In: Li, H., Liu, T., Ma, WY., Sakai, T., Wong, KF., Zhou, G. (eds) Information Retrieval Technology. AIRS 2008. Lecture Notes in Computer Science, vol 4993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68636-1_55
DOI: https://doi.org/10.1007/978-3-540-68636-1_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68633-0
Online ISBN: 978-3-540-68636-1
eBook Packages: Computer Science (R0)