Information Extraction from Heterogenous Web Sites Using Additional Search of Related Contents Based on a User's Instantiated Example

被引:0
|
作者
Mitsui, Yuki [1 ]
Oka, Hironori [2 ]
Akiyoshi, Masanori [1 ]
Komoda, Norihisa [1 ]
机构
[1] Osaka Univ, 2-1 Yamadaoka Suita, Osaka, Japan
[2] Codetoys, Osaka, Japan
关键词
Information Extraction; Additional Search; User's Instantiated Example;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, since the growth of the Internet, World Wide Web has become significant infrastructure in various fields such as business, commerce, education and so on. Accordingly, a user has gathered information by using the Internet. However due to the flood of Web pages, it becomes difficult for a user to collect desirable information. Advanced Web search engines may provide solution to some extent, it is still up to a user to summarize or extract meaningful information from such retrieval results. Based on this viewpoints, we addressed a generation method of table-style data from heterogeneous Webpages that reflects a user's intention. However if original pages have less information, our system may not extract sufficient information. To improve this problem, we address a method that searches related page contents automatically. We apply this method to shopping sites and the experimental result shows it improves recall rate.
引用
收藏
页码:593 / +
页数:3
相关论文
共 15 条
  • [1] Information Extraction from Heterogeneous Web Sites Using Clue Complement Process Based on a User's Instantiated Example
    Shimada, Junya
    Oka, Hironori
    Akiyoshi, Masanori
    Komoda, Norihisa
    [J]. DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2010, 79 : 585 - +
  • [2] Automatic extraction of user’s search intention from web search logs
    Kinam Park
    Hyesung Jee
    Taemin Lee
    Soonyoung Jung
    Heuiseok Lim
    [J]. Multimedia Tools and Applications, 2012, 61 : 145 - 162
  • [3] Automatic extraction of user's search intention from web search logs
    Park, Kinam
    Jee, Hyesung
    Lee, Taemin
    Jung, Soonyoung
    Lim, Heuiseok
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 61 (01) : 145 - 162
  • [4] Information extraction from personal computer specifications on the Web using a user's request
    Shimada, K
    Fukumoto, A
    Endo, T
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (08) : 1386 - 1395
  • [5] Analysis of menu priorities for web sites of tourism information from the user's standpoint
    Kwon, YG
    Byun, SN
    [J]. ERGONOMICS AND SAFETY FOR GLOBAL BUSINESS QUALITY AND PRODUCTIVITY, 2000, : 429 - 432
  • [6] A Template-Based Information Extraction from Web Sites with Unstable Markup
    Kolchin, Maxim
    Kozlov, Fedor
    [J]. SEMANTIC WEB EVALUATION CHALLENGE, 2014, 475 : 89 - 94
  • [7] Learning knowledge bases for information extraction from multiple text based web sites
    Gao, XY
    Zhang, MJ
    [J]. IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 119 - 125
  • [8] Analysis of User's Behaviour Based on Search Intentions for Information Retrieval Using Search Engines
    Kori, Shogo
    Zhu, Yanjun
    Yamaguchi, Koichi
    Takiguchi, Satoru
    Takama, Yasufumi
    [J]. 2015 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2015, : 64 - 70
  • [9] SNS Retrieval Based on User Profile Estimation Using Transfer Learning from Web Search
    Kataoka, Daisuke
    Tajima, Keishi
    [J]. 2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 278 - 285
  • [10] Information retrieval from the World Wide Web: a user-focused approach based on individual experience with search engines
    Liaw, SS
    Huang, HM
    [J]. COMPUTERS IN HUMAN BEHAVIOR, 2006, 22 (03) : 501 - 517