Information retrieval is concerned with selecting documents from a collection that will be of interest to a user with a stated information need or query. Research aimed at improving the performance of retrieval systems, that is, selecting those documents most likely to match the user's information need, remains an area of considerable theoretical and practical importance.
This dissertation describes a new formal retrieval model that uses probabilistic inference networks to represent documents and information needs. Retrieval is viewed as an evidential reasoning process in which multiple sources of evidence about document and query content are combined to estimate the probability that a given document matches a query. This model generalizes several current retrieval models and provides a framework within which disparate information retrieval research results can be integrated.
To test the effectiveness of the inference network model, a retrieval system based on the model was implemented. Two test collections were built and used to compare retrieval performance with that of conventional retrieval models. The inference network model gives substantial improvements in retrieval performance with computational costs that are comparable to those associated with conventional retrieval models and which are feasible for large collections.
Cited By
- Chebil W, Soualmia L, Omri M and Darmoni S (2016). Indexing biomedical documents with a possibilistic network, Journal of the Association for Information Science and Technology, 67:4, (928-941), Online publication date: 1-Apr-2016.
- Rodríguez J, Gayo J and Ordoñez de Pablos P (2012). An Extensible Framework to Sort out Nodes in Graph-Based Structures Powered by the Spreading Activation Technique, International Journal of Knowledge Society Research, 3:4, (57-71), Online publication date: 1-Oct-2012.
- Alvarez J, Polo L, Jimenez W, Abella P and Labra J Application of the spreading activation technique for recommending concepts of well-known ontologies in medical systems Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine, (626-635)
- Choi D, Kim T, Min M and Lee J An approach to use query-related web context on document ranking Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication, (1-7)
- Yan R and Hauptmann A (2007). A review of text and image retrieval approaches for broadcast news video, Information Retrieval, 10:4-5, (445-484), Online publication date: 1-Oct-2007.
- Arcoverde J and Das Graças Volpe Nunes M NLP-driven constructive learning for filtering an IR document stream Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval, (74-82)
- Suomela S and Kekäläinen J Ontology as a search-tool Proceedings of the 27th European conference on Advances in Information Retrieval Research, (315-329)
- Conrad J, Guo X and Schriber C Online duplicate document detection Proceedings of the twelfth international conference on Information and knowledge management, (443-452)
- Conrad J and Claussen J Client-system collaboration for legal corpus selection in an online production environment Proceedings of the 9th international conference on Artificial intelligence and law, (262-273)
- Kekäläinen J and Järvelin K User-oriented evaluation methods for information retrieval Exploring artificial intelligence in the new millennium, (355-379)
- Liu X and Croft W Passage retrieval based on language models Proceedings of the eleventh international conference on Information and knowledge management, (375-382)
- Conrad J, Guo X, Jackson P and Meziou M Database selection using actual physical and acquired logical collection resources in a massive domain-specific operational environment Proceedings of the 28th international conference on Very Large Data Bases, (71-82)
- Graves A and Lalmas M Video retrieval using an MPEG-7 based inference network Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, (339-346)
- Järvelin K and Kekäläinen J IR evaluation methods for retrieving highly relevant documents Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, (41-48)
- Mills T, Pye D, Hollinghurst N and Wood K AT&TV Content-Based Multimedia Information Access - Volume 2, (1135-1144)
- Lawrie D and Rus D A self-organized file cabinet Proceedings of the eighth international conference on Information and knowledge management, (499-506)
- Rolker C and Kramer R Quality of service transferred to information retrieval Proceedings of the eighth international conference on Information and knowledge management, (399-404)
- Aslam J, Pelekhov K and Rus D A practical clustering algorithm for static and dynamic information organization Proceedings of the tenth annual ACM-SIAM symposium on Discrete algorithms, (51-60)
- Aslam J, Pelekhov K and Rus D Static and dynamic information organization with star clusters Proceedings of the seventh international conference on Information and knowledge management, (208-217)
- Kekäläinen J and Järvelin K The impact of query structure and query expansion on retrieval performance Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, (130-137)
- Xu J and Callan J Effective retrieval with distributed collections Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, (112-120)
- de Campos L, Fernández J and Huete J Query expansion in information retrieval systems using a Bayesian network-based thesaurus Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, (53-60)
- Rus D and Allan J (1998). Structural Queries in Electronic Corpora, Multimedia Tools and Applications, 6:2, (153-169), Online publication date: 1-Mar-1998.
- Pyreddy P and Croft W TINTIN Proceedings of the second ACM international conference on Digital libraries, (193-200)
- Mills T, Moody K and Rodden K Cobra Computer-Assisted Information Searching on Internet, (425-449)
- Allan J Incremental relevance feedback for information filtering Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, (270-278)
- Singhal A, Buckley C and Mitra M Pivoted document length normalization Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, (21-29)
- Allan J Relevance feedback with too much data Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, (337-343)
- Jing Y and Croft W An association thesaurus for information retrieval Intelligent Multimedia Information Retrieval Systems and Management - Volume 1, (146-160)
- Gey F Inferring probability of relevance using the method of logistic regression Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, (222-231)
- Turtle H Natural language vs. Boolean query evaluation Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, (212-220)
- Fujii H and Croft W A comparison of indexing techniques for Japanese text retrieval Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, (237-246)
- Croft W, Smith L and Turtle H A loosely-coupled integration of a text retrieval system and an object-oriented database system Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, (223-232)
- Croft W, Turtle H and Lewis D The use of phrases and structured queries in information retrieval Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval, (32-45)
- Lewis D Data extraction as text categorization Proceedings of the 3rd conference on Message understanding, (245-255)
Index Terms
- Inference networks for document retrieval
Recommendations
Document expansion for image retrieval
RIAO '10: Adaptivity, Personalization and Fusion of Heterogeneous InformationSuccessful information retrieval requires effective matching between the user's search request and the contents of relevant documents. Often the request entered by a user may not use the same topic relevant terms as the authors' of these documents. One ...
Cross-language spoken document retrieval using HMM-based retrieval model with multi-scale fusion
Cross-language spoken document retrieval (CL-SDR) is the technology that facilitates automatic retrieval of relevant information from a collection of spoken documents in a language that is different from that used in the queries. Information sources ...
Non-relevance Feedback for Document Retrieval
KAM '09: Proceedings of the 2009 Second International Symposium on Knowledge Acquisition and Modeling - Volume 02We need to find documents that relate to human interesting from a large data set of documents. The relevance feedback method needs a set of relevant and non-relevant documents to work usefully. However, the initial retrieved documents, which are ...