Extracting Room Prices from Web Tables - an Ontology-Aware Approach

Cited by: 0
Authors
Buttinger, Christina [1]
Feilmayr, Christina [1]
Guttenbrunner, Michael [1]
Parzer, Stefan [1]
Proell, Birgit [1]
Affiliations
[1] Johannes Kepler Univ Linz, Inst Applicat Oriented Knowledge Proc, A-4040 Linz, Austria
Keywords
Ontology-based Information Extraction; Table Information Extraction; Price Table Pattern; Tourism Price Ontology; Ontology-aware Price Annotation
DOI
10.1007/978-3-211-99407-8_19
Chinese Library Classification number
TP39 [Computer Applications]
Discipline classification code
081203; 0835
Abstract
The growing amount of semi-structured and unstructured data on tourism Web sites with heterogeneous designs requires information extraction (IE) mechanisms, for instance to create tourism portals. In order to build semantic eTourism environments, the acquisition of room prices is of particular interest. Room prices and related information often appear in tabular structures, which still challenge Web information extraction techniques. In this paper, we begin by identifying various price table patterns, which are characterized by the position of a number of features that determine a room price. We then describe an extended ontology model for tourism prices. Finally, we present TAINEX, a plug-in for functional and structural analysis and data interpretation of price tables, which extends the existing prototype TourIE, a rule-/ontology-based information extraction system for Web sites with heterogeneous designs.
Pages: 223-234
Number of pages: 12