Feature Filtering Spectral Clustering Method Based on High Dimensional Online Clustering Method

被引:0
|
作者
Feng, Zizhou [1 ]
Gu, Yujian [1 ]
Yang, Bin [2 ]
Chen, Baitong [3 ]
Bao, Wenzheng [1 ]
机构
[1] Xuzhou Univ Technol, Sch Informat Engn, Xuzhou 221000, Jiangsu, Peoples R China
[2] Zaozhuang Univ, Sch Informat Sci & Engn, Zaozhuang 277160, Peoples R China
[3] Xuzhou 1 Peoples Hosp, Xuzhou 221000, Jiangsu, Peoples R China
关键词
Golgi appratus; Malonylation; SMOTE; Protein; DNA METHYLATION; N-6-ADENINE; SITES;
D O I
10.1007/978-3-030-97124-3_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Golgi is an important eukaryotic organelle. Golgi plays a key role in protein synthesis in eukaryotic cells, and its dysfunction will lead to various genetic and neurodegenerative diseases. In order to better develop drugs to treat diseases, one of the key problems is to identify the protein category of Golgi apparatus. In the past, the physical and chemical properties of Golgi proteins have often been used as feature extraction methods, but more accurate sub-Golgi protein identification is still challenged by existing methods. In this paper, we use the tape-bert model to extract the features of Golgi body. To create a balanced dataset from an unbalanced Golgi dataset, we used the SMOTE oversampling method. In addition, we screened out the important eigenvalues of 300 dimensions to identify the types of Golgi proteins. In 10-fold cross validation and independent test set test, the accuracy rate reached 90.6% and 95.31%.
引用
收藏
页码:157 / 164
页数:8
相关论文
共 50 条
  • [1] Spectral Clustering Method for High Dimensional Data based on K-SVD
    Wu Sen
    Shao Xiaochen
    Song Rui
    [J]. 2015 INTERNATIONAL CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCES (LISS), 2015,
  • [2] Stratified feature sampling method for ensemble clustering of high dimensional data
    Jing, Liping
    Tian, Kuang
    Huang, Joshua Z.
    [J]. PATTERN RECOGNITION, 2015, 48 (11) : 3688 - 3702
  • [3] An Unknown Protocol Clustering Analysis Method Based on Spectral Clustering
    Ni, Lulin
    Shi, Yu
    Luo, Jie
    Ji, Qingbing
    [J]. 2021 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2021), 2021, : 445 - 449
  • [4] A LINGUISTIC FEATURE BASED TEXT CLUSTERING METHOD
    Shi, Kansheng
    Li, Lemin
    He, Jie
    Liu, Haitao
    Zhang, Naitong
    Song, Wentao
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS, 2011, : 108 - 112
  • [5] A feature group weighting method for subspace clustering of high-dimensional data
    Chen, Xiaojun
    Ye, Yunming
    Xu, Xiaofei
    Huang, Joshua Zhexue
    [J]. PATTERN RECOGNITION, 2012, 45 (01) : 434 - 446
  • [6] Combined Method for Effective Clustering based on Parallel SOM and Spectral Clustering
    Vojacek, Lukas
    Martinovic, Jan
    Slaninova, Katerina
    Drazdilova, Pavla
    Dvorsky, Jiri
    [J]. DATESO 2011: DATABASES, TEXTS, SPECIFICATIONS, OBJECTS, 2011, 706 : 120 - 131
  • [7] A Spectral Clustering Algorithm Based on Hierarchical Method
    Chen, Xiwei
    Liu, Li
    Luo, Dashi
    Xu, Guandong
    Lu, Yonggang
    Liu, Ming
    Gao, Rongmin
    [J]. AGENTS AND DATA MINING INTERACTION (ADMI 2013), 2014, 8316 : 111 - 123
  • [8] An anchor-based spectral clustering method
    Qin Zhang
    Guo-qiang Zhong
    Jun-yu Dong
    [J]. Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 1385 - 1396
  • [9] An anchor-based spectral clustering method
    Zhang, Qin
    Zhong, Guo-qiang
    Dong, Jun-yu
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (11) : 1385 - 1396
  • [10] Improved spectral clustering based on Nystrom method
    Zhan, Qiang
    Mao, Yu
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (19) : 20149 - 20165