Resource Capability Discovery and Description Management System for Bioinformatics Data and Service Integration - An Experiment with Gene Regulatory Networks

Citations: 0
Authors
Ahmed, Emdad [1]
Affiliations
[1] Wayne State Univ, Dept Comp Sci, Integrat Informat Lab, Detroit, MI 48202 USA
Keywords
Table structure; Table modeling; Web mining; Ontology generation; Semantic Web; Intelligent Wrapper; Web Information Extraction; HTML forms; Web Data Integration; extraction ontology; Hidden Web; Web Automation;
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Traditional legacy HTML-based web sites and pages can be thought of as web services, because dynamic web pages can take user input arguments via web forms and respond to user queries. The ability of agents and services to automatically locate and interact with unknown partners is a goal for Web-based data integration systems. This "serendipitous interoperability" is hindered by the lack of an explicit means of describing what web pages are able to do, what input they take in order to do it, and what output they produce, that is, their capabilities [1]. The tremendous success of the WWW is offset by the effort needed to search for and find relevant information. For tabular structures embedded in HTML documents, typical keyword or link-analysis based search fails. The next phase envisioned for the WWW is automatic ad hoc interaction between intelligent agents, web services, databases and semantic web enabled applications. A large amount of the information available on the Web is formatted in HTML tables, which are mainly presentation oriented and are not suited for database applications. As a result, capturing the information in HTML tables semantically and integrating the relevant information is a challenge. We envision another layer of web abstraction in which a user can query table-like structures inside a web document. Our prototype application is based on WebFusion and an ad hoc query language, BioFlow [2], [3], [4], [5], [6], a software agent that can simulate a person interacting with web search forms and extract information from the resulting pages by means of an API. We need to develop a framework that is able to query web search forms and web page tables in an SQL-like way. In this context we also report a Java-based implementation for integrating the Flybase and AlignACE sites.
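The idea of querying a web page table "in a SQL way" can be illustrated with a minimal sketch. It is not the WebFusion or BioFlow implementation, which this record does not describe, and it uses Python rather than the Java reported in the abstract. The class `TableExtractor`, the function `html_table_to_sql`, the table name `page_table`, and the sample page with its gene names, motifs, and scores are all hypothetical and invented for illustration. The sketch assumes that the first row of the table holds the column names.

```python
import sqlite3
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect every <tr> row of an HTML page as a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows
        self._row = None   # row currently being read, if any
        self._cell = None  # text fragments of the cell being read, if any

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text outside a cell (for example whitespace between tags) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def html_table_to_sql(html, table_name="page_table"):
    """Load an HTML table into an in-memory SQLite table (hypothetical helper).

    Assumption: the first row contains the column names.
    """
    parser = TableExtractor()
    parser.feed(html)
    header, *body = parser.rows
    conn = sqlite3.connect(":memory:")
    columns = ", ".join(f'"{name}" TEXT' for name in header)
    conn.execute(f'CREATE TABLE "{table_name}" ({columns})')
    marks = ", ".join("?" for _ in header)
    conn.executemany(f'INSERT INTO "{table_name}" VALUES ({marks})', body)
    return conn


# Made-up result page; the values are placeholders, not real data.
SAMPLE_PAGE = """
<table>
  <tr><th>gene</th><th>motif</th><th>score</th></tr>
  <tr><td>geneA</td><td>TTAATG</td><td>12.5</td></tr>
  <tr><td>geneB</td><td>GCGTAA</td><td>7.1</td></tr>
</table>
"""

conn = html_table_to_sql(SAMPLE_PAGE)
hits = conn.execute(
    "SELECT gene, motif FROM page_table WHERE CAST(score AS REAL) > 10"
).fetchall()
print(hits)
```

All cells are stored as text, so numeric comparisons need an explicit `CAST`. A fuller system would also have to submit the search form and infer column types, which this sketch leaves out.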
Pages: 120 - 125
Number of pages: 6
Related papers
37 in total
  • [1] Bioinformatics Web Data and Service Integration - An Experiment with Gene Regulatory Networks
    Ahmed, Emdad
    [J]. PROCEEDINGS OF ICECE 2008, VOLS 1 AND 2, 2008, : 70 - 75
  • [2] Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration
    Rex T Nelson
    Shulamit Avraham
    Randy C Shoemaker
    Gregory D May
    Doreen Ware
    Damian DG Gessler
    [J]. BioData Mining, 3
  • [3] Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration
    Nelson, Rex T.
    Avraham, Shulamit
    Shoemaker, Randy C.
    May, Gregory D.
    Ware, Doreen
    Gessler, Damian D. G.
    [J]. BIODATA MINING, 2010, 3
  • [4] Towards precise reconstruction of gene regulatory networks by data integration
    Liu, Zhi-Ping
    [J]. QUANTITATIVE BIOLOGY, 2018, 6 (02) : 113 - 128
  • [5] Towards precise reconstruction of gene regulatory networks by data integration
    Zhi-Ping Liu
    [J]. Quantitative Biology, 2018, 6 (02) : 113 - 128
  • [6] Data integration for inferring context-specific gene regulatory networks
    Baur, Brittany
    Shin, Junha
    Zhang, Shilu
    Roy, Sushmita
    [J]. CURRENT OPINION IN SYSTEMS BIOLOGY, 2020, 23 : 38 - 46
  • [7] Integration of Steady-State and Temporal Gene Expression Data for the Inference of Gene Regulatory Networks
    Wang, Yi Kan
    Hurley, Daniel G.
    Schnell, Santiago
    Print, Cristin G.
    Crampin, Edmund J.
    [J]. PLOS ONE, 2013, 8 (08):
  • [8] SYSTEM INTEGRATION WITH MULTISCALE NETWORKS (SIMON): A MODULAR FRAMEWORK FOR RESOURCE MANAGEMENT MODELS
    Hughes, Marisa
    Kelbaugh, Michael
    Campbell, Victoria
    Reilly, Elizabeth
    Agarwala, Susama
    Wilt, Miller
    Badger, Andrew
    Fuller, Evan
    Ponzo, Dillon
    Arevalo, Ximena Calderon
    Fiallos, Alex
    Fozo, Lydia
    Jones, Jalen
    [J]. 2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 656 - 667
  • [9] Enhancing gene regulatory networks inference through hub-based data integration
    Naseri, Atefeh
    Sharghi, Mehran
    Hasheminejad, Seyed Mohammad Hossein
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2021, 95
  • [10] Dynamic integration of multiple data mining techniques in a knowledge discovery management system
    Puuronen, S
    Terziyan, V
    Katasonov, A
    Tsymbal, A
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY, 1999, 3695 : 128 - 139