Document Server@UHasselt >
Research >
Research publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/17947

Title: A new method for information retrieval, based on the theory of relative concentration
Authors: Egghe, Leo
Issue Date: 1990
Publisher: ACM
Citation: Vidick, Jean-Luc (Ed.). Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval, p. 469-493
Abstract: This paper introduces a new method for information retrieval of documents that are represented by a vector. The novelty of the algorithm lies in the fact that no (generalized) p-norms are used as a matching function between the query and the document (as is done e.g. by Salton and others) but a function that measures the relative dispersion of the terms between a document and a query. This function originates from an earlier paper of the author where a good measure of relative concentration was introduced, used in informetrics to measure the degree of specialization of a journal w.r.t. the entire subject. This new information retrieval algorithm is shown to have many desirable properties (in the sense of the new Cater-Kraft wish list) including those of the original cosine-matching function of Salton. In addition the property of the cosine-matching function that, if one only uses weights 0 to 1, one is reduced to Boolean IR, is refined in the sense that one takes into consideration the broadness or specialization of a document and a query. Our new matching function satisfies these additional properties.
URI: http://hdl.handle.net/1942/17947
DOI: 10.1145/96749.98254
ISBN: 0-89791-408-2
Category: C1
Type: Proceedings Paper
Appears in Collections: Research publications

Files in This Item:

There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.