Accurate and efficient protein embedding using multi-teacher distillation learning

Cited by: 0

Authors
Shang, Jiayu [1 ]
Peng, Cheng [2 ]
Ji, Yongxin [2 ]
Guan, Jiaojiao [2 ]
Cai, Dehan [2 ]
Tang, Xubo [2 ]
Sun, Yanni [2 ]
Affiliations
[1] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
DOI: 10.1093/bioinformatics/btae567
Chinese Library Classification: Q5 [Biochemistry]
Subject Classification Codes: 071010; 081704
Abstract

Motivation: Protein embedding, which represents proteins as numerical vectors, is a crucial step in various learning-based protein annotation/classification problems, including gene ontology prediction, protein-protein interaction prediction, and protein structure prediction. However, existing protein embedding methods are often computationally expensive due to their large number of parameters, which can reach millions or even billions. The growing availability of large-scale protein datasets and the need for efficient analysis tools have created a pressing demand for efficient protein embedding methods.

Results: We propose a novel protein embedding approach based on multi-teacher distillation learning, which leverages the knowledge of multiple pre-trained protein embedding models to learn a compact and informative representation of proteins. Our method achieves performance comparable to state-of-the-art methods while significantly reducing computational costs and resource requirements. Specifically, our approach reduces computational time by ~70% and maintains accuracy within ±1.5% of the original large models. This makes our method well-suited for large-scale protein analysis and enables the bioinformatics community to perform protein embedding tasks more efficiently.

Availability and implementation: The source code of MTDP is available via https://github.com/KennthShang/MTDP
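The core idea the abstract describes can be sketched in a few lines: embeddings from several large pre-trained "teacher" models are mapped into one compact space, and a small "student" model is fit to reproduce the combined teacher signal. The sketch below is illustrative only and is not the MTDP implementation; the teacher models, projection matrices, sequence encoding, and the closed-form ridge-regression student (standing in for a small network trained by gradient descent) are all hypothetical simplifications.

```python
import numpy as np

rng = np.random.default_rng(0)

# Emulate two large "teacher" embedding models as fixed random linear maps
# from a flattened one-hot amino-acid encoding to embedding spaces of
# different sizes (hypothetical stand-ins for real protein language models).
AA = "ACDEFGHIKLMNPQRSTVWY"

def one_hot(seq, max_len=32):
    x = np.zeros((max_len, len(AA)))
    for i, a in enumerate(seq[:max_len]):
        x[i, AA.index(a)] = 1.0
    return x.ravel()  # fixed-size feature vector per protein

teacher_dims = [64, 48]
teachers = [rng.normal(size=(32 * len(AA), d)) for d in teacher_dims]

# Project each teacher's output into a shared, compact student space,
# then average: this combined signal is the distillation target.
student_dim = 16
projections = [rng.normal(size=(d, student_dim)) / np.sqrt(d) for d in teacher_dims]

seqs = ["MKTAYIAKQR", "GAVLIMCFYW", "STNQDEKRHP", "MKVLAAGITG"]
X = np.stack([one_hot(s) for s in seqs])                       # (n, features)
T = np.mean([X @ W @ P for W, P in zip(teachers, projections)], axis=0)

# "Student" = one linear layer fit in closed form (ridge least squares),
# minimizing the mean-squared distillation loss to the teacher targets.
lam = 1e-3
S = np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ T)
student_emb = X @ S                                            # (n, student_dim)

mse = float(np.mean((student_emb - T) ** 2))
print(f"distillation MSE on training sequences: {mse:.2e}")
```

The student here has far fewer parameters than the teachers combined, which is the source of the speed-up the abstract reports: at inference time only the small student is run, while the teachers are needed only once, during training.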
Pages: 5