Martini: using literature keywords to compare gene sets

Cited by: 23
Authors
Soldatos, Theodoros G. [1 ]
O'Donoghue, Sean I. [1 ]
Satagopam, Venkata P. [1 ]
Jensen, Lars J. [1 ]
Brown, Nigel P. [1 ]
Barbosa-Silva, Adriano [1 ]
Schneider, Reinhard [1 ]
Affiliations
[1] European Mol Biol Lab, D-69117 Heidelberg, Germany
Keywords
EXPRESSION PROFILES; MICROARRAY DATA; CELL-CYCLE; TOOL; INFORMATION; DATABASE;
DOI
10.1093/nar/gkp876
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Life scientists are often interested in comparing two gene sets to gain insight into differences between two distinct, but related, phenotypes or conditions. Several tools have been developed for comparing gene sets, most of which find Gene Ontology (GO) terms that are significantly over-represented in one gene set. However, such tools often return GO terms that are too generic or too few to be informative. Here, we present Martini, an easy-to-use tool for comparing gene sets. Martini is based, not on GO, but on keywords extracted from Medline abstracts; Martini also supports a much wider range of species than comparable tools. To evaluate Martini we created a benchmark based on the human cell cycle, and we tested several comparable tools (CoPub, FatiGO, Marmite and ProfCom). Martini had the best benchmark performance, delivering a more detailed and accurate description of function. Martini also gave best or equal performance with three other datasets (related to Arabidopsis, melanoma and ovarian cancer), suggesting that Martini represents an advance in the automated comparison of gene sets. In agreement with previous studies, our results further suggest that literature-derived keywords are a richer source of gene-function information than GO annotations. Martini is freely available at http://martini.embl.de.
Pages: 26 - 38
Number of pages: 13
Related papers
50 records in total
  • [1] Caipirini: using gene sets to rank literature
    Soldatos, Theodoros G.
    O'Donoghue, Sean I.
    Satagopam, Venkata P.
    Barbosa-Silva, Adriano
    Pavlopoulos, Georgios A.
    Wanderley-Nogueira, Ana Carolina
    Soares-Cavalcanti, Nina Mota
    Schneider, Reinhard
    BIODATA MINING, 2012, 5
  • [2] Caipirini: using gene sets to rank literature
    Theodoros G Soldatos
    Seán I O'Donoghue
    Venkata P Satagopam
    Adriano Barbosa-Silva
    Georgios A Pavlopoulos
    Ana Carolina Wanderley-Nogueira
    Nina Mota Soares-Cavalcanti
    Reinhard Schneider
    BioData Mining, 5
  • [3] Historical keywords - Gene
    Ricard, P
    LANCET, 2005, 366 (9481) : 197 - 197
  • [4] Keywords for Children's Literature
    Marshall, Elizabeth
    LION AND THE UNICORN, 2012, 36 (01) : 75 - 78
  • [5] The use of keywords in archaeornithology literature
    Dirrigl, Frank J.
    White, Justin
    INTERNATIONAL JOURNAL OF OSTEOARCHAEOLOGY, 2023, 33 (04) : 787 - 797
  • [6] Keywords for Children's Literature
    Kertzer, Adrienne
    LION AND THE UNICORN, 2022, 46 (01) : 119 - +
  • [7] Keywords for Children's Literature
    Garavini, Melissa
    INTERNATIONAL RESEARCH IN CHILDRENS LITERATURE, 2012, 5 (02) : 223 - 224
  • [8] Keywords on Children's Literature
    Zhang Shengzhen
    CHILDRENS LITERATURE ASSOCIATION QUARTERLY, 2021, 46 (02) : 216 - 219
  • [9] Functional gene clustering via gene annotation sentences, MeSH and GO keywords from biomedical literature
    Natarajan, Jeyakumar
    Ganapathy, Jawahar
    BIOINFORMATION, 2007, 2 (05) : 185 - 193
  • [10] GOseek: A Gene Ontology Search Engine using Enhanced Keywords
    Taha, Kamal
    2013 35TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2013 : 1502 - 1505