Domain Adaptation with Logistic Regression for the Task of Splice Site Prediction

被引:2
|
作者
Herndon, Nic [1 ]
Caragea, Doina [1 ]
机构
[1] Kansas State Univ, Comp & Informat Sci, Manhattan, KS 66506 USA
基金
美国国家科学基金会;
关键词
Domain adaptation; Logistic regression; Splice site prediction; Imbalanced data; BAYES;
D O I
10.1007/978-3-319-19048-8_11
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Supervised classifiers are highly dependent on abundant labeled training data. Alternatives for addressing the lack of labeled data include: labeling data (but this is costly and time consuming); training classifiers with abundant data from another domain (however, the classification accuracy usually decreases as the distance between domains increases); or complementing the limited labeled data with abundant unlabeled data from the same domain and learning semi-supervised classifiers (but the unlabeled data can mislead the classifier). A better alternative is to use both the abundant labeled data from a source domain and the limited labeled data from the target domain to train classifiers in a domain adaptation setting. We propose such a classifier, based on logistic regression, and evaluate it for the task of splice site prediction a difficult and essential step in gene prediction. Our classifier achieved high accuracy, with highest areas under the precision-recall curve between 50.83% and 82.61%.
引用
收藏
页码:125 / 137
页数:13
相关论文
共 50 条
  • [1] A Study of Domain Adaptation Classifiers Derived From Logistic Regression for the Task of Splice Site Prediction
    Herndon, Nic
    Caragea, Doina
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2016, 15 (02) : 77 - 85
  • [2] Empirical Study of Domain Adaptation Algorithms on the Task of Splice Site Prediction
    Herndon, Nic
    Caragea, Doina
    BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, BIOSTEC 2014, 2015, 511 : 195 - 211
  • [3] Empirical Study of Domain Adaptation with Naive Bayes on the Task of Splice Site Prediction
    Herndon, Nic
    Caragea, Doina
    BIOINFORMATICS 2014: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOINFORMATICS MODELS, METHODS AND ALGORITHMS, 2014, : 57 - 67
  • [4] An Evaluation of Self-training Styles for Domain Adaptation on the Task of Splice Site Prediction
    Herndon, Nic
    Caragea, Doina
    PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, : 1042 - 1047
  • [5] Unsupervised Domain Adaptation with Robust Deep Logistic Regression
    Wu, Guangbin
    Chen, Weishan
    Zuo, Wangmeng
    Zhang, David
    IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), 2018, 10749 : 199 - 211
  • [6] Logistic regression projection-based feature representation for visual domain adaptation
    Hamidreza Hosseinzadeh
    Zahra Einalou
    Signal, Image and Video Processing, 2020, 14 : 1115 - 1123
  • [7] Logistic regression projection-based feature representation for visual domain adaptation
    Hosseinzadeh, Hamidreza
    Einalou, Zahra
    SIGNAL IMAGE AND VIDEO PROCESSING, 2020, 14 (06) : 1115 - 1123
  • [8] LAceP: Lysine Acetylation Site Prediction Using Logistic Regression Classifiers
    Hou, Ting
    Zheng, Guangyong
    Zhang, Pingyu
    Jia, Jia
    Li, Jing
    Xie, Lu
    Wei, Chaochun
    Li, Yixue
    PLOS ONE, 2014, 9 (02):
  • [9] Unsupervised Adversarial Domain Adaptation Regression for Rate of Penetration Prediction
    Jiang, Jun
    Guo, Zhenzhen
    SPE JOURNAL, 2023, 28 (05): : 2604 - 2618
  • [10] Domain Adaptation in Regression
    Cortes, Corinna
    Mohri, Mehryar
    ALGORITHMIC LEARNING THEORY, 2011, 6925 : 308 - 323