Wikipedia Chemical Structure Explorer: substructure and similarity searching of molecules from Wikipedia

Cited by: 25
Authors
Ertl, Peter [1]
Patiny, Luc [2]
Sander, Thomas [3]
Rufener, Christian [3]
Zasso, Michael [2]
Affiliations
[1] Novartis Inst BioMed Res, CH-4056 Basel, Switzerland
[2] Ecole Polytech Fed Lausanne, Inst Chem Sci & Engn ISIC, CH-1015 Lausanne, Switzerland
[3] Actelion Pharmaceut Ltd, CH-4123 Allschwil, Switzerland
Source
Journal of Cheminformatics, 7
Keywords
Wikipedia; SMILES; Substructure search; Similarity search; Chemical database; JavaScript; Visualization
DOI
10.1186/s13321-015-0061-y
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Background: Wikipedia, the world's largest and most popular encyclopedia, is an indispensable source of chemistry information. Among other content, it contains entries for over 15,000 chemicals, including metabolites, drugs, agrochemicals and industrial chemicals. To provide easy access to this wealth of information, we decided to develop a substructure and similarity search tool for chemical structures referenced in Wikipedia. Results: We extracted chemical structures from entries in Wikipedia and implemented a web system allowing structure and similarity searching on these data. The whole search and visualization system is written in JavaScript and can therefore run locally within a web page, without requiring a central server. The Wikipedia Chemical Structure Explorer is accessible online at www.cheminfo.org/wikipedia and is also available as an open source project on GitHub for local installation. Conclusions: The web-based Wikipedia Chemical Structure Explorer provides a useful resource for research and for chemical education, giving both researchers and students easy, user-friendly chemistry searching and identification of relevant information in Wikipedia. The tool can also help to improve the quality of chemical entries in Wikipedia by providing potential contributors with a regularly updated list of entries with problematic structures. Finally, this search system is a good example of how modern web technology can be applied in the field of cheminformatics.
Pages: 7
Related papers
29 in total
  • [1] Wikipedia Chemical Structure Explorer: substructure and similarity searching of molecules from Wikipedia
    Peter Ertl
    Luc Patiny
    Thomas Sander
    Christian Rufener
    Michaël Zasso
    [J]. Journal of Cheminformatics, 7
  • [2] Creating a Phrase Similarity Graph From Wikipedia
    Stanchev, Lubomir
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2014, : 68 - 75
  • [3] SEARCHING AND COMPUTING FOR VOCABULARIES WITH SEMANTIC CORRELATIONS FROM CHINESE WIKIPEDIA
    Li, Yun
    Huang, Kaiyan
    Ren, Fuji
    Zhong, Yixin
    [J]. CIICT 2008: PROCEEDINGS OF CHINA-IRELAND INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATIONS TECHNOLOGIES 2008, 2008, : 58 - +
  • [4] A bilingual dictionary extracted from the Wikipedia link structure
    Erdmann, Maike
    Nakayama, Kotaro
    Hara, Takahiro
    Nishio, Shojiro
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 686 - 689
  • [5] Extracting central places from the link structure in Wikipedia
    Kessler, Carsten
    [J]. TRANSACTIONS IN GIS, 2017, 21 (03) : 488 - 502
  • [6] Wikipedia bi-linear link (WBLM) model: A new approach for measuring semantic similarity and relatedness between linguistic concepts using Wikipedia link structure
    Hussain, Muhammad Jawad
    Bai, Heming
    Jiang, Yuncheng
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (02)
  • [7] Linguistically Informed Mining Lexical Semantic Relations from Wikipedia Structure
    Piasecki, Maciej
    Indyka-Piasecka, Agnieszka
    Kurc, Roman
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2011, PT I, 2011, 6591 : 297 - 306
  • [8] HYPERSTRUCTURE MODEL FOR CHEMICAL-STRUCTURE HANDLING - TECHNIQUES FOR SUBSTRUCTURE SEARCHING
    BROWN, RD
    DOWNS, GM
    JONES, G
    WILLETT, P
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (01) : 47 - 53
  • [9] MOLECULAR SUBSTRUCTURE SIMILARITY SEARCHING - EFFICIENT RETRIEVAL IN 2-DIMENSIONAL STRUCTURE DATABASES
    HAGADONE, TR
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (05) : 515 - 521
  • [10] Bioturbo Similarity Searching: Combining Chemical and Biological Similarity To Discover Structurally Diverse Bioactive Molecules
    Wassermann, Anne Mai
    Lounkine, Eugen
    Glick, Meir
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (03) : 692 - 703