www.uhasselt.be
DSpace

Document Server@UHasselt >
Research >
Research publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/11317

Title: Good Properties of Similarity Measures and Their Complementarity
Authors: EGGHE, Leo
Issue Date: 2010
Publisher: JOHN WILEY & SONS INC
Citation: JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 61(10). p. 2151-2160
Abstract: Similarity measures, such as the ones of Jaccard, Dice, or Cosine, measure the similarity between two vectors. A good property for similarity measures would be that, if we add a constant vector to both vectors, then the similarity must increase. We show that Dice and Jaccard satisfy this property while Cosine and both overlap measures do not. Adding a constant vector is called, in Lorenz concentration theory, "nominal increase" and we show that the stronger "transfer principle" is not a required good property for similarity measures. Another good property is that, when we have two vectors and if we add one of these vectors to both vectors, then the similarity must increase. Now Dice, Jaccard, Cosine, and one of the overlap measures satisfy this property, while the other overlap measure does not. Also a variant of this latter property is studied.
Notes: [Egghe, Leo] Univ Hasselt UHasselt, B-3590 Diepenbeek, Belgium. [Egghe, Leo] Univ Antwerp UA, B-2000 Antwerp, Belgium.
URI: http://hdl.handle.net/1942/11317
DOI: 10.1002/asi.21380
ISI #: 000282778400015
ISSN: 1532-2882
Category: A1
Type: Journal Contribution
Validation: ecoom, 2011
Appears in Collections: Research publications

Files in This Item:

Description SizeFormat
Published version181.24 kBAdobe PDF
Peer-reviewed author version499.67 kBAdobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.