Back-Translation for Discovering Distant Protein Homologies

被引:0
|
作者
Girdea, Marta [1 ]
Noe, Laurent [1 ]
Kucherov, Gregory [1 ]
机构
[1] Univ Lille 1, INRIA Lille Nord Europe, LIFL, CNRS, F-59655 Villeneuve Dascq, France
来源
关键词
SEQUENCE; FRAME; MODEL; GENE;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of substitutions are additionally involved in the divergence, the homology detection becomes difficult even at the DNA level. To cope with this situation, we propose a novel method to infer distant homology relations of two proteins, that accounts for frameshift and point mutations that may have affected the coding sequences. We design a dynamic programming alignment algorithm over memory-efficient graph representations of the complete set of putative DNA sequences of each protein, with the goal of determining the two putative DNA sequences which have the best scoring alignment under a powerful scoring system designed to reflect the most probable evolutionary process. This allows us to uncover evolutionary information that is not captured by traditional alignment methods, which is confirmed by biologically significant examples.
引用
收藏
页码:108 / 120
页数:13
相关论文
共 50 条
  • [41] Comparison of Grammatical Error Correction Using Back-Translation Models
    Koyama, Aomi
    Hotate, Kengo
    Kaneko, Masahiro
    Komachi, Mamoru
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 126 - 135
  • [42] Data Augmentation via Back-translation for Aspect Term Extraction
    Xu, Qingting
    Hong, Yu
    Chen, Jiaxiang
    Yao, Jianmin
    Zhou, Guodong
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [43] MODULAR, FULLY-ABSTRACT COMPILATION BY APPROXIMATE BACK-TRANSLATION
    Devriese, Dominique
    Patrignani, Marco
    Piessens, Frank
    Keuchel, Steven
    [J]. LOGICAL METHODS IN COMPUTER SCIENCE, 2017, 13 (04)
  • [44] Validity of back-translation as a quality assessment tool: empirical evidence
    Tyupa, Sergiy
    Wild, Diane
    [J]. QUALITY OF LIFE RESEARCH, 2016, 25 : 170 - 170
  • [45] Video-guided machine translation via dual-level back-translation
    Chen, Shiyu
    Zeng, Yawen
    Cao, Da
    Lu, Shaofei
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 245
  • [46] DUAL BACK TRANSLATION VERSUS SINGLE BACK-TRANSLATION METHODOLOGY WHEN TRANSLATING PATIENT REPORTED OUTCOMES (PRO)
    Talbert, M.
    Brandt, B. A.
    McKown, S.
    Gawlicki, M. C.
    Heinzman, A.
    Polltiz, A.
    [J]. VALUE IN HEALTH, 2013, 16 (07) : A596 - A596
  • [47] Revisiting Back-Translation for Low-Resource Machine Translation Between Chinese and Vietnamese
    Li, Hongzheng
    Sha, Jiu
    Shi, Can
    [J]. IEEE ACCESS, 2020, 8 (08) : 119931 - 119939
  • [48] Zooming of states and parameters using a lumping approach including back-translation
    Sunnaker, Mikael
    Schmidt, Henning
    Jirstrand, Mats
    Cedersund, Gunnar
    [J]. BMC SYSTEMS BIOLOGY, 2010, 4
  • [49] Sensory profiling in animal models of neuropathic pain: a call for back-translation
    Rice, Andrew S. C.
    Finnerup, Nanna B.
    Kemp, Harriet I.
    Currie, Gillian L.
    Baron, Ralf
    [J]. PAIN, 2018, 159 (05) : 819 - 824
  • [50] A Study on the Back-Translation of LinYutang's My Country and My People
    付雪梅
    [J]. 海外英语, 2016, (17) : 128 - 129