共 50 条
An information update method towards internal search engine
被引:0
|作者:
Bian, Zhifan
[1
]
Li, Yukun
Yue, Tinghai
Lei, Pengfei
Zhao, Dexin
Xiao, Yingyuan
机构:
[1] Tianjin Univ Technol, Key Lab Intelligence Comp & Novel Software Techno, Tianjin 300384, Peoples R China
关键词:
internal search engine;
crawling strategies;
update method;
D O I:
10.1109/WISA.2015.69
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
To enterprises or other organizations, how to efficiently manage unstructured and semi-structured data on the web becomes an important problem. Internal search engine is well-used to deal with it, but how to efficiently find the latest updates of web sources is still a research issue. In this paper, we proposed a graph-based method to efficiently locate the updated information of an organization's web resources, which is based on modeling an organization's information resources with a graph and marking each web page with a parameter "update cycle" that represents the possibility of a web page to be updated and is taken as a factor to tune the algorithm of update identification. By this method, the latest updated information can be located in time. The experiments' results show the effectiveness of our method.
引用
收藏
页码:211 / 216
页数:6
相关论文