Knowing an Object by the Company It Keeps: A Domain-Agnostic Scheme for Similarity Discovery

被引：4

作者：

Gornerup, Olof ^{[1
]}

Gillblad, Daniel ^{[1
]}

Vasiloudis, Theodore ^{[1
]}

机构：

[1] Swedish Inst Comp Sci, SE-16429 Kista, Sweden

来源：

2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2015年

关键词：

DATABASE;

D O I：

10.1109/ICDM.2015.85

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Appropriately defining and then efficiently calculating similarities from large data sets are often essential in data mining, both for building tractable representations and for gaining understanding of data and generating processes. Here we rely on the premise that given a set of objects and their correlations, each object is characterized by its context, i.e. its correlations to the other objects, and that the similarity between two objects therefore can be expressed in terms of the similarity between their respective contexts. Resting on this principle, we propose a data-driven and highly scalable approach for discovering similarities from large data sets by representing objects and their relations as a correlation graph that is transformed to a similarity graph. Together these graphs can express rich structural properties among objects. Specifically, we show that concepts - representations of abstract ideas and notions - are constituted by groups of similar objects that can be identified by clustering the objects in the similarity graph. These principles and methods are applicable in a wide range of domains, and will here be demonstrated for three distinct types of objects: codons, artists and words, where the numbers of objects and correlations range from small to very large.

引用

页码：121 / 130

页数：10

共 4 条

[1] Domain-agnostic discovery of similarities and concepts at scale
Gornerup, Olof
Gillblad, Daniel
Vasiloudis, Theodore
KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (02) : 531 - 560
[2] Domain-agnostic discovery of similarities and concepts at scale
Olof Görnerup
Daniel Gillblad
Theodore Vasiloudis
Knowledge and Information Systems, 2017, 51 : 531 - 560
[3] labelCloud: A Lightweight Labeling Tool for Domain-Agnostic 3D Object Detection in Point Clouds
Sager C.
Zschech P.
Kühl N.
Computer-Aided Design and Applications, 2022, 19 (06) : 1191 - 1206
[4] Image Cosegmentation Using Shape Similarity and Object Discovery Scheme
Xu, Haiping
Wang, Meiqing
Chen, Fei
Lai, Choi-Hong
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (10)

← 1 →