A New Samples Selecting Method based on K Nearest Neighbors

被引:0
|
作者
Yang, Kai [1 ]
Cai, Yi [1 ]
Cai, Zhiwei [1 ]
Tan, Xingwei [1 ]
Xie, Haoran [2 ]
Wong, Tak Lam [2 ]
Chan, Wai Hong [2 ]
机构
[1] South China Univ Technol, Sch Software, Guangzhou, Guangdong, Peoples R China
[2] Educ Univ Hong Kong, Dept Math & Informat Technol, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Short text classification uses a supervised learning process, and it needs a huge amount of labeled data for training. This process consumes a lot of human resources. In traditional supervised learning problems, active learning can reduce the amount of samples that need to be labeled manually. It achieves this goal by selecting the most representative samples to represent the whole training set. Uncertainty sampling is the most popular way in active learning, but it has poor performance when it is affected by outliers. In our paper, we propose a new sampling method for training sets containing short text, which is denoted as Top-K Representative (TKR). However, the optimization process of TKR is a N-P hard problem. To solve this problem, a new algorithm, based on the greedy algorithm, is proposed to obtain the approximating results. The experiments show that our proposed sampling method performs better than the state-of-the-art methods.
引用
收藏
页码:457 / 462
页数:6
相关论文
共 50 条
  • [21] An improved method for coherent structure identification based on mutual K-nearest neighbors
    Wei, Zeming
    Zhang, Jiazhong
    Jia, Ruidong
    Gao, Jingsheng
    JOURNAL OF TURBULENCE, 2022, 23 (11-12): : 655 - 673
  • [22] A new adaptation method based on adaptability under k-nearest neighbors for case adaptation in case-based design
    Qi, Jin
    Hu, Jie
    Peng, Yinghong
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (07) : 6485 - 6502
  • [23] A new approach for increasing K-nearest neighbors performance
    Aamer, Youssef
    Benkaouz, Yahya
    Ouzzif, Mohammed
    Bouragba, Khalid
    2020 8TH INTERNATIONAL CONFERENCE ON WIRELESS NETWORKS AND MOBILE COMMUNICATIONS (WINCOM 2020), 2020, : 35 - 39
  • [24] A new k-nearest neighbors classifier for functional data
    Zhu, Tianming
    Zhang, Jin-ting
    STATISTICS AND ITS INTERFACE, 2022, 15 (02) : 247 - 260
  • [25] A New Version of the Dendritic Cell Immune Algorithm Based on the K-Nearest Neighbors
    Ben Ali, Kaouther
    Chelly, Zeineb
    Elouedi, Zied
    NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 688 - 695
  • [26] A Novel K Nearest Neighbors Classifier Based on Nonparametric Separability
    Kuo, Bor-Chen
    Ho, Hsin-Hua
    Sheu, Tian-Wei
    Shih, Shu-Chuan
    2006 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, 2006, : 2738 - +
  • [27] Human Sleep Scoring Based on K-Nearest Neighbors
    Qureshi, Shahnawaz
    Karrila, Seppo
    Vanichayobon, Sirirut
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2018, 26 (06) : 2802 - +
  • [28] Robustness Certification of k -Nearest Neighbors
    Fassina, Nicolo
    Ranzato, Francesco
    Zanella, Marco
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 110 - 119
  • [29] K-Nearest Neighbors Hashing
    He, Xiangyu
    Wang, Peisong
    Cheng, Jian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2834 - 2843
  • [30] Skeletonization Based on K-Nearest-Neighbors on Binary Image
    Ren, Yi
    Zhang, Min
    Zhou, Hongyu
    Liu, Ji
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 243 - 254