The GENEREG Corpus for Gene Expression Regulation Events-An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability

被引：0

作者：

Buyko, Ekaterina ^{[1
]}

Beisswanger, Elena ^{[1
]}

Hahn, Udo ^{[1
]}

机构：

[1] Univ Jena, Jena Univ Language & Informat Engn JULIE Lab, D-07743 Jena, Germany

来源：

LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2010年

关键词：

D O I：

暂无

中图分类号：

H [语言、文字];

学科分类号：

05 ;

摘要：

Despite the large variety of corpora in the biomedical domain their annotations differ in many respects, e.g., the coverage of different, highly specialized knowledge domains, varying degrees of granularity of the targeted relations, the specificity of linguistic grounding of relations and named entities referred to in the documents, etc. We here introduce GENEREG (Gene Regulation Corpus), the result of an annotation campaign led by the Jena University Language & Information Engineering (JULIE) Lab. The GENEREG corpus consists of 314 abstracts dealing with the regulation of gene expression in the model organism E. coli. Our emphasis in this paper is on the compatibility and, thus, linkage, of the GENEREG corpus with the alternative GENIA event corpus and with several in-domain and out-of-domain lexical resources, e.g., the SPECIALIST LEXICON, FRAMENET, and WORDNET. The links we established from the GENEREG corpus to these external resources will help improve the performance of the automatic relation extraction engine JREX trained and evaluated on GENEREG.

引用

下载

页码：2662 / 2666

页数：5

共 7 条

[1] Learning from noisy out-of-domain corpus using dataless classification
Jin, Yiping
Wanvarie, Dittaya
Le, Phu T., V
NATURAL LANGUAGE ENGINEERING, 2022, 28 (01) : 39 - 69
[2] Corpus specificity in LSA and word2vec: the role of out-of-domain documents
Altszyler, Edgar
Sigman, Mariano
Slezak, Diego Fernandez
REPRESENTATION LEARNING FOR NLP, 2018, : 1 - 10
[3] PRAP, a prolactin receptor associated protein: Its gene expression and regulation in the corpus luteum
Duan, WR
Parmer, TG
Albarracin, CT
Zhong, L
Gibori, G
ENDOCRINOLOGY, 1997, 138 (08) : 3216 - 3221
[4] Characterization of DER1-like domain family, member 1 (DERL1) and its expression in bovine corpus luteum
Ndiaye, Kalidou
Lussier, Jacques
Pate, Joy
BIOLOGY OF REPRODUCTION, 2008, : 289 - 289
[5] Divergence of the vertebrate sp1A/ryanodine receptor domain and SOCS box-containing (Spsb) gene family and its expression and regulation within the mouse brain
Kleiber, Morgan L.
Singh, Shiva M.
GENOMICS, 2009, 93 (04) : 358 - 366
[6] Characterization of fibronectin type III domain-containing protein 5 (FNDC5) gene in chickens: Cloning, tissue expression, and regulation of its expression in the muscle by fasting and cold exposure
Li, Xin
Fang, Wenqian
Hu, Yuanyuan
Wang, Yajun
Li, Juan
GENE, 2015, 570 (02) : 221 - 229
[7] How the sequestration of a protein interferes with its mechanism of action:: Example of a new family of proteins characterized by a particular cysteine-rich carboxy-terminal domain involved in gene expression regulation
Thébault, S
Mesnard, JM
CURRENT PROTEIN & PEPTIDE SCIENCE, 2001, 2 (02) : 155 - 167

← 1 →