Toward Full-text Searching Middleware over Hierarchical Documents

被引:0
|
作者
Ma, Kun [1 ]
Yang, Bo [1 ]
Abraham, Ajith [2 ,3 ]
机构
[1] Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan, Peoples R China
[2] Sci Network Innovat & Res Excellence, Machine Intelligence Res Labs, Seattle, WA USA
[3] VSB Tech Univ Ostrava, IT4Innovat, Ostrava, Czech Republic
关键词
Full-text searching; middleware; hierarchical documents; NoSQL; MODEL TRANSFORMATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently, full-text searching can benefit from the emerging NoSQL databases and traditional indexing tools in the big data era. However, there are some drawbacks of current solutions. On one hand, the indexing documents lack of the hierarchy. On the other hand, big data have become the bottleneck of full-text searching. In the context of big data, we design a full-text searching middleware over hierarchical documents. We discuss the architecture of this middleware in detail. In addition, we propose a structure-independent hierarchical document model to present the hierarchical document. Moreover, the transformation engine is designed to translate the rich files into models. The core log event listener is responsible for capturing the changed documents and push them to the indexing storage at the same time. The experimental results show that our middleware is more advantageous than RDBMS with indexes and RDBMS with Lucene solutions.
引用
收藏
页码:194 / 198
页数:5
相关论文
共 50 条