Text Mining: Design of Interactive Search Engine Based Regular Expressions of Online Automobile Advertisements

被引:1
|
作者
Jalal, Ahmed Adeeb [1 ]
机构
[1] Al Iraqia Univ, Coll Engn, Comp Engn Dept, Baghdad, Iraq
来源
关键词
Information Extraction; Information Retrieval; Natural Language Processing; Text Mining; Web Crawler;
D O I
10.3991/ijep.v10i3.12419
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Technology world has greatly evolved over the past decades, which led to inflated data volume. This progress of technology in the digital form generated scattered texts across millions of web pages. Unstructured texts contain a vast amount of textual data. Discover of useful and interesting relations from unstructured texts requires more processing by computers. Therefore, text mining and information extraction have become an exciting research field to get structured and valuable information. This paper focuses on text preprocessing of automotive advertisements domains to configure a structured database. The structured database was created by extract the information over unstructured automotive advertisements, which is an area of natural language processing. Information extraction deals with finding factual information in text using learning regular expressions. We manually craft rule-based specific approaches to extract structured information from unstructured web pages. Structured information will be provided by user-friendly search engine designed for topic-specific knowledge. Consequently, this information that extracted from these advertisements uses to perform a structured search over certain interesting attributes. Thus, the tuples are assigned a probability and indexed to support the efficiency of extraction and exploration via user queries.
引用
收藏
页码:35 / 48
页数:14
相关论文
共 50 条
  • [1] Lightweight Search Engine Based on Text-Mining
    Liu, Chao
    Yin, Shi Qun
    Sun, Meng Meng
    Gao, Sheng
    [J]. FUZZY SYSTEM AND DATA MINING, 2016, 281 : 264 - 270
  • [2] A Study of Online Transaction Platform Based on Interactive Search Engine
    Li, Qifang
    Yang, Ting
    [J]. 2009 IEEE 16TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1 AND 2, PROCEEDINGS, 2009, : 627 - +
  • [3] Interactive Text Graph Mining with a Prolog-based Dialog Engine
    Tarau, Paul
    Blanco, Eduardo
    [J]. PRACTICAL ASPECTS OF DECLARATIVE LANGUAGES (PADL 2020), 2020, 12007 : 3 - 19
  • [4] Interactive Text Graph Mining with a Prolog-Based Dialog Engine
    Tarau, Paul
    Blanco, Eduardo
    [J]. THEORY AND PRACTICE OF LOGIC PROGRAMMING, 2021, 21 (02) : 244 - 263
  • [5] Design and simulation of FPGA engine for regular expressions matching based on PFA
    Jing, Mao-Hua
    Jiang, Bin
    Xin, Yang
    Yang, Yi-Xian
    [J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2015, 38 (06): : 69 - 73
  • [6] Generating Better Search Engine Text Advertisements with Deep Reinforcement Learning
    Hughes, J. Weston
    Chang, Keng-hao
    Zhang, Ruofei
    [J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 2269 - 2277
  • [7] System Design of Cloud Search Engine Based on Rich Text Content
    Chan, Hao-peng
    Xu, Liang
    Liu, Hui-hui
    Zhang, Run-tian
    Sangaiah, Arun Kumar
    [J]. MOBILE NETWORKS & APPLICATIONS, 2021, 26 (01): : 459 - 472
  • [8] System Design of Cloud Search Engine Based on Rich Text Content
    Hao-peng Chan
    Liang Xu
    Hui-hui Liu
    Run-tian Zhang
    Arun Kumar Sangaiah
    [J]. Mobile Networks and Applications, 2021, 26 : 459 - 472
  • [9] Mondou: Interface with text data mining for Web search engine
    Kawano, H
    Hasegawa, T
    [J]. PROCEEDINGS OF THE THIRTY-FIRST HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL V: MODELING TECHNOLOGIES AND INTELLIGENT SYSTEMS TRACK, 1998, : 275 - 283
  • [10] CROSS-MEDIA RELEVANCE MINING FOR EVALUATING TEXT-BASED IMAGE SEARCH ENGINE
    Xu, Zhongwen
    Yang, Yi
    Kassim, Ashraf
    Yan, Shuicheng
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014,