Multilingual translation for zero-shot biomedical classification using BioTranslator

被引:4
|
作者
Xu, Hanwen [1 ]
Woicik, Addie [1 ]
Poon, Hoifung [2 ]
Altman, Russ B. [3 ,4 ,5 ]
Wang, Sheng [1 ]
机构
[1] Univ Washington, Sch Comp Sci & Engn, Seattle, WA USA
[2] Microsoft Res, Redmond, WA USA
[3] Stanford Univ, Dept Bioengn, Stanford, CA USA
[4] Stanford Univ, Dept Genet, Stanford, CA USA
[5] Chan Zuckerberg Biohub, San Francisco, CA USA
关键词
MEDICAL LANGUAGE SYSTEM; DRUG-SENSITIVITY; ONTOLOGY; PATHWAY; INTEGRATION; GENECARDS; LANDSCAPE; DISCOVERY; GENOMICS; GRAPH;
D O I
10.1038/s41467-023-36476-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Existing annotation paradigms rely on controlled vocabularies, where each data instance is classified into one term from a predefined set of controlled vocabularies. This paradigm restricts the analysis to concepts that are known and well-characterized. Here, we present the novel multilingual translation method BioTranslator to address this problem. BioTranslator takes a user-written textual description of a new concept and then translates this description to a non-text biological data instance. The key idea of BioTranslator is to develop a multilingual translation framework, where multiple modalities of biological data are all translated to text. We demonstrate how BioTranslator enables the identification of novel cell types using only a textual description and how BioTranslator can be further generalized to protein function prediction and drug target identification. Our tool frees scientists from limiting their analyses within predefined controlled vocabularies, enabling them to interact with biological data using free text.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Cost Effective Annotation Framework Using Zero-Shot Text Classification
    Kasthuriarachchy, Buddhika
    Chetty, Madhu
    Shatte, Adrian
    Walls, Darren
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [42] Improving Zero-Shot Translation by Disentangling Positional Information
    Liu, Danni
    Niehues, Jan
    Cross, James
    Guzman, Francisco
    Li, Xian
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1259 - 1273
  • [43] Towards Zero-Shot Multilingual Transfer for Code-Switched Responses
    Wu, Ting-Wei
    Zhao, Changsheng
    Chang, Ernie
    Shi, Yangyang
    Chuang, Pierce
    Chandra, Vikas
    Juang, Biing
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 7551 - 7563
  • [44] Zero-shot Pose Estimation Using Image Translation to Maintain Object Pose
    Fujita, Kohei
    Tasaki, Tsuyoshi
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2023, 143 (12) : 1113 - 1122
  • [45] Enhanced VAEGAN: a zero-shot image classification method
    Ding, Bo
    Fan, Yufei
    He, Yongjun
    Zhao, Jing
    [J]. APPLIED INTELLIGENCE, 2023, 53 (08) : 9235 - 9246
  • [46] Learning Autoencoder of Attribute Constraint for Zero-Shot Classification
    Wang, Kun
    Wu, Songsong
    Gao, Guangwei
    Zhou, Quan
    Jing, Xiao-Yuan
    [J]. PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 605 - 610
  • [47] Zero-Shot Audio Classification Via Semantic Embeddings
    Xie, Huang
    Virtanen, Tuomas
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1233 - 1242
  • [48] Zero-shot image classification based on factor space
    Guan, Shijie
    Guan, Qixue
    Yin, Anqi
    [J]. International Journal of Web Engineering and Technology, 2021, 16 (01) : 1 - 29
  • [49] Unified benchmark for zero-shot Turkish text classification
    celik, Emrecan
    Dalyan, Tugba
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [50] Zero-shot Relation Classification from Side Information
    Gong, Jiaying
    Eldardiry, Hoda
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 576 - 585