Generalized k-medians clustering for strings

被引:0
|
作者
Martínez-Hinarejos, CD [1 ]
Juan, A [1 ]
Casacuberta, F [1 ]
机构
[1] Univ Politecn Valencia, Inst Tecnol Informat, Dept Sistemes Informat & Computacio, Valencia 46022, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering methods are used in pattern recognition to obtain natural groups from a data set in the framework Of unsupervised learning as well as for obtaining clusters of data from a known class. In sets of strings, the concept of set median string can be extended to the (set) k-medians problem. The solution of the k-medians problem can be viewed as a clustering method, where each cluster is generated by each of the k strings of that solution. A concept which is related to set median string is the (generalized) median string, which is an NP-Hard problem. However, different algorithms have been proposed to find approximations to the (generalized) median string. We propose extending the (generalized) median string problem to k strings, resulting in the generalized k-medians problem, which can also be viewed as a clustering technique. This new technique is applied to a corpus of chromosomes represented by strings and compared to the conventional k-medians technique.
引用
收藏
页码:502 / 509
页数:8
相关论文
共 50 条
  • [21] An Approximation Algorithm for the Continuous k-Medians Problem in a Convex Polygon
    Carlsson, John Gunnar
    Jia, Fan
    Li, Ying
    INFORMS JOURNAL ON COMPUTING, 2014, 26 (02) : 280 - 289
  • [22] Near-optimal Algorithms for Explainable k-Medians and k-Means
    Makarychev, Konstantin
    Shan, Liren
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [23] 基于海明距离的量子k-medians算法
    辛娟娟
    魏贺
    戚晗
    拱长青
    计算机仿真, 2024, 41 (01) : 409 - 414
  • [24] 基于K-Medians的学习质量评价方法研究
    冯广
    陈卓
    罗时强
    邱凯星
    伍文燕
    中国教育信息化, 2022, (04) : 80 - 86
  • [25] Efficient computation of k-medians over data streams under memory constraints
    Chong, ZH
    Yu, JX
    Zhang, ZJ
    Lin, XM
    Wang, W
    Zhou, AY
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2006, 21 (02) : 284 - 296
  • [26] Efficient Computation of k-Medians over Data Streams Under Memory Constraints
    Zhi-Hong Chong
    Jeffrey Xu Yu
    Zhen-Jie Zhang
    Xue-Min Lin
    Wei Wang
    Ao-Ying Zhou
    Journal of Computer Science and Technology, 2006, 21 : 284 - 296
  • [27] A Location Method of Partial Discharge Based on Truncated Singular Value Decomposition and K-Medians
    Ning S.
    He Y.
    Liu Q.
    Sui Y.
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2022, 37 (13): : 3441 - 3452
  • [29] 基于K-Medians谱聚类的无功电压分区方法研究
    宋新甫
    张述铭
    张欢
    赵志强
    王新刚
    电工技术, 2019, (09) : 43 - 45+49
  • [30] Generalized medians
    Calvo, T
    Mesiar, R
    FUZZY SETS AND SYSTEMS, 2001, 124 (01) : 59 - 64