Domain-agnostic discovery of similarities and concepts at scale

被引:1
|
作者
Gornerup, Olof [1 ]
Gillblad, Daniel [1 ]
Vasiloudis, Theodore [1 ]
机构
[1] Swedish Inst Comp Sci SICS, S-16429 Kista, Sweden
关键词
Similarity discovery; Concept mining; Distributional semantics; Graph processing; NETWORKS; DATABASE;
D O I
10.1007/s10115-016-0984-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Appropriately defining and efficiently calculating similarities from large data sets are often essential in data mining, both for gaining understanding of data and generating processes and for building tractable representations. Given a set of objects and their correlations, we here rely on the premise that each object is characterized by its context, i.e., its correlations to the other objects. The similarity between two objects can then be expressed in terms of the similarity between their contexts. In this way, similarity pertains to the general notion that objects are similar if they are exchangeable in the data. We propose a scalable approach for calculating all relevant similarities among objects by relating them in a correlation graph that is transformed to a similarity graph. These graphs can express rich structural properties among objects. Specifically, we show that concepts-abstractions of objects-are constituted by groups of similar objects that can be discovered by clustering the objects in the similarity graph. These principles and methods are applicable in a wide range of fields and will be demonstrated here in three domains: computational linguistics, music, and molecular biology, where the numbers of objects and correlations range from small to very large.
引用
收藏
页码:531 / 560
页数:30
相关论文
共 50 条
  • [1] Domain-agnostic discovery of similarities and concepts at scale
    Olof Görnerup
    Daniel Gillblad
    Theodore Vasiloudis
    Knowledge and Information Systems, 2017, 51 : 531 - 560
  • [2] Knowing an Object by the Company It Keeps: A Domain-Agnostic Scheme for Similarity Discovery
    Gornerup, Olof
    Gillblad, Daniel
    Vasiloudis, Theodore
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 121 - 130
  • [3] Towards Domain-Agnostic Contrastive Learning
    Verma, Vikas
    Minh-Thang Luong
    Kawaguchi, Kenji
    Hieu Pham
    Le, Quoc, V
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7544 - 7554
  • [4] Towards Domain-agnostic Depth Completion
    Xu, Guangkai
    Yin, Wei
    Zhang, Jianming
    Wang, Oliver
    Niklaus, Simon
    Chen, Simon
    Bian, Jia-Wang
    MACHINE INTELLIGENCE RESEARCH, 2024, 21 (04) : 652 - 669
  • [5] Medical domain knowledge in domain-agnostic generative AI
    Jakob Nikolas Kather
    Narmin Ghaffari Laleh
    Sebastian Foersch
    Daniel Truhn
    npj Digital Medicine, 5
  • [6] Medical domain knowledge in domain-agnostic generative AI
    Kather, Jakob Nikolas
    Ghaffari Laleh, Narmin
    Foersch, Sebastian
    Truhn, Daniel
    NPJ DIGITAL MEDICINE, 2022, 5 (01)
  • [7] DOMAIN-AGNOSTIC DOMAIN ADAPTION FOR BUILDING FOOTPRINT EXTRACTION
    Zhang, Fahong
    Shi, Yilei
    Zhu, Xiao Xiang
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 318 - 321
  • [8] Fully Unsupervised Domain-Agnostic Image Retrieval
    Zheng, Ziqiang
    Ren, Hao
    Wu, Yang
    Zhang, Weichuan
    Lu, Hong
    Yang, Yang
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5077 - 5090
  • [9] Domain-Agnostic Prior for Transfer Semantic Segmentation
    Huo, Xinyue
    Xie, Lingxi
    Hu, Hengtong
    Zhou, Wengang
    Li, Houqiang
    Tian, Qi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 7065 - 7075
  • [10] Domain-Agnostic Representation of Side-Channels
    Spence, Aaron
    Bangay, Shaun
    ENTROPY, 2024, 26 (08)