Combining Web Data Extraction and Data Mining Techniques to Discover Knowledge

被引:0
|
作者
Bouldoukian, Nathalie A. [1 ]
机构
[1] Holy Spirit Univ Kaslik USEK, Dept Comp Sci, Jounieh, Lebanon
关键词
Web Data Extraction; Data Mining; Clustering; Classification; Data processing;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The design and implementation of a support system for Knowledge Discovery is the challenge of many researchers. As Data Mining is the main key step in Knowledge Discovery process in Databases (KDD), it is necessary to find a new methodology that combines web data extraction playing the role of data collection from the web and data mining techniques on the extracted categorical data in order to discover knowledge. The main contribution of this research is proposing a methodology to apply the clustering notion on categorical web data and to use the clustering results as part of the input for the classification conducted on another set of data. Data mining and relative data processing are conducted by developing intelligent tools. The performance of the algorithms used in our methodology is demonstrated with the clustered job postings dataset and classified job searchers dataset by using the three measures accuracy, recall and precision for the clustering algorithm and the error of classification for the classification technique. The results show that our proposed approach of combination ends up with good results in Knowledge Discovery from the web.
引用
收藏
页码:170 / 175
页数:6
相关论文
共 50 条
  • [1] Knowledge Obtention Combining Information Extraction Techniques with Linked Data
    Luis Garrido, Angel
    Blazquez, Pilar
    Buey, Maria G.
    Ilarri, Sergio
    [J]. WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 643 - 648
  • [2] The use of data mining techniques to discover knowledge from animal and food data: Examples related to the cattle industry
    Garcia, Ana Belen
    [J]. TRENDS IN FOOD SCIENCE & TECHNOLOGY, 2013, 29 (02) : 151 - 157
  • [3] Data Mining Algorithms for Knowledge Extraction from Web Log Files
    El Alami, Anass Abdelhamid
    Ezzikouri, Hanane
    Erritali, Mohammed
    [J]. ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2019): VOL 1 - ADVANCED INTELLIGENT SYSTEMS FOR EDUCATION AND INTELLIGENT LEARNING SYSTEM, 2020, 1102 : 118 - 128
  • [4] Web Data Mining Trends and Techniques
    Patil, Ujwala Manoj
    Patil, J. B.
    [J]. PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012, : 961 - 965
  • [5] Data Mining: Web Data Mining Techniques, Tools and Algorithms: An Overview
    Mughal, Muhammd Jawad Hamid
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (06) : 208 - 215
  • [6] Combining different data mining techniques to improve data analysis
    Greco, S
    Masciari, E
    Pontieri, L
    [J]. FLEXIBLE QUERY ANSWERING SYSTEMS: RECENT ADVANCES, 2001, : 455 - 464
  • [7] Web Data Extraction Techniques: A Review
    Kamanwar, N. V.
    Kale, S. G.
    [J]. 2016 WORLD CONFERENCE ON FUTURISTIC TRENDS IN RESEARCH AND INNOVATION FOR SOCIAL WELFARE (STARTUP CONCLAVE), 2016,
  • [8] A Comparison of Web Data Extraction Techniques
    Salah, Mosa
    Al Okush, Basem
    Al Rifaee, Mustafa
    [J]. 2019 IEEE JORDAN INTERNATIONAL JOINT CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION TECHNOLOGY (JEEIT), 2019, : 785 - 789
  • [9] Analysis of Data Extraction and Data Cleaning in Web Usage Mining
    Srivastava, Mitali
    Garg, Rakhi
    Mishra, P. K.
    [J]. ICARCSET'15: PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ADVANCED RESEARCH IN COMPUTER SCIENCE ENGINEERING & TECHNOLOGY (ICARCSET - 2015), 2015,
  • [10] Implementation of data mining techniques in web of things
    Vihari, G.
    Prasad, N.
    Satyanarayan, K.
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER VISION AND MACHINE LEARNING, 2019, 1228