The GENEREG Corpus for Gene Expression Regulation Events-An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability

被引:0
|
作者
Buyko, Ekaterina [1 ]
Beisswanger, Elena [1 ]
Hahn, Udo [1 ]
机构
[1] Univ Jena, Jena Univ Language & Informat Engn JULIE Lab, D-07743 Jena, Germany
关键词
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Despite the large variety of corpora in the biomedical domain their annotations differ in many respects, e.g., the coverage of different, highly specialized knowledge domains, varying degrees of granularity of the targeted relations, the specificity of linguistic grounding of relations and named entities referred to in the documents, etc. We here introduce GENEREG (Gene Regulation Corpus), the result of an annotation campaign led by the Jena University Language & Information Engineering (JULIE) Lab. The GENEREG corpus consists of 314 abstracts dealing with the regulation of gene expression in the model organism E. coli. Our emphasis in this paper is on the compatibility and, thus, linkage, of the GENEREG corpus with the alternative GENIA event corpus and with several in-domain and out-of-domain lexical resources, e.g., the SPECIALIST LEXICON, FRAMENET, and WORDNET. The links we established from the GENEREG corpus to these external resources will help improve the performance of the automatic relation extraction engine JREX trained and evaluated on GENEREG.
引用
下载
收藏
页码:2662 / 2666
页数:5
相关论文
共 7 条
  • [1] Learning from noisy out-of-domain corpus using dataless classification
    Jin, Yiping
    Wanvarie, Dittaya
    Le, Phu T., V
    NATURAL LANGUAGE ENGINEERING, 2022, 28 (01) : 39 - 69
  • [2] Corpus specificity in LSA and word2vec: the role of out-of-domain documents
    Altszyler, Edgar
    Sigman, Mariano
    Slezak, Diego Fernandez
    REPRESENTATION LEARNING FOR NLP, 2018, : 1 - 10
  • [3] PRAP, a prolactin receptor associated protein: Its gene expression and regulation in the corpus luteum
    Duan, WR
    Parmer, TG
    Albarracin, CT
    Zhong, L
    Gibori, G
    ENDOCRINOLOGY, 1997, 138 (08) : 3216 - 3221
  • [4] Characterization of DER1-like domain family, member 1 (DERL1) and its expression in bovine corpus luteum
    Ndiaye, Kalidou
    Lussier, Jacques
    Pate, Joy
    BIOLOGY OF REPRODUCTION, 2008, : 289 - 289
  • [5] Divergence of the vertebrate sp1A/ryanodine receptor domain and SOCS box-containing (Spsb) gene family and its expression and regulation within the mouse brain
    Kleiber, Morgan L.
    Singh, Shiva M.
    GENOMICS, 2009, 93 (04) : 358 - 366
  • [6] Characterization of fibronectin type III domain-containing protein 5 (FNDC5) gene in chickens: Cloning, tissue expression, and regulation of its expression in the muscle by fasting and cold exposure
    Li, Xin
    Fang, Wenqian
    Hu, Yuanyuan
    Wang, Yajun
    Li, Juan
    GENE, 2015, 570 (02) : 221 - 229
  • [7] How the sequestration of a protein interferes with its mechanism of action:: Example of a new family of proteins characterized by a particular cysteine-rich carboxy-terminal domain involved in gene expression regulation
    Thébault, S
    Mesnard, JM
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2001, 2 (02) : 155 - 167