MINING FOR RELEVANT TERMS FROM LOG FILES

被引:0
|
作者
Saneifar, Hassan [1 ,2 ]
Bonniol, Stephane [2 ]
Laurent, Anne [1 ]
Poncelet, Pascal [1 ]
Roche, Mathieu [1 ]
机构
[1] Univ Montpellier 2, CNRS, LIRMM, 161 Rue Ada, F-34392 Montpellier 5, France
[2] Sain IP Technol, F-34960 Montpellier, France
关键词
Natural language processing; Information retrieval; Terminology extraction; Terminology ranking; Log files;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Information extracted from log files of computing systems can be considered one of the important resources of information systems. In the case of Integrated Circuit design, log files generated by design tools are not exhaustively exploited. The logs of this domain are multi-source, multi-format, and have a heterogeneous and evolving structure. Moreover, they usually do not respect the grammar and the structures of natural language though they are written in English. According to features of such textual data, applying the classical methods of information extraction is not an easy task, more particularly for terminology extraction. We have previously introduced EXTERLOG approach to extract the terminology from such log files. In this paper, we introduce a new developed version of EXTERLOG guided by Web. We score the extracted terms by a Web and context based measure. We favor the more relevant terms of domain and emphasize the precision by filtering terms based on their scores. The experiments show that EXTERLOG is well-adapted terminology extraction approach from log files.
引用
收藏
页码:77 / +
页数:2
相关论文
共 50 条
  • [21] Formal Analysis of Log Files
    Barringer, Howard
    Groce, Alex
    Havelund, Klaus
    Smith, Margaret
    JOURNAL OF AEROSPACE COMPUTING INFORMATION AND COMMUNICATION, 2010, 7 (11): : 365 - 390
  • [22] A Survey on Predicting User Behavior Based on Web Server Log Files in a Web Usage Mining
    Anitha, V.
    Devi, P. Isakki
    2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
  • [23] AGGREGATION OF IMPLICIT FEEDBACKS FROM SEARCH ENGINE LOG FILES
    Veilumuthu, Ashok
    Ramachandran, Parthasarathy
    KDIR 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2010, : 269 - 274
  • [24] Discovering implicit feedbacks from search engine log files
    Veilumuthu, Ashok
    Ramachandran, Parthasarathy
    DISCOVERY SCIENCE, PROCEEDINGS, 2007, 4755 : 231 - +
  • [25] Performance Evaluation of Data Mining Frameworks in Hadoop Cluster Using Virtual Campus Log Files
    Xhafa, Fatos
    Ramirez, Daniel
    Garcia, Daniel
    Caballe, Santi
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS IEEE INCOS 2015, 2015, : 217 - 222
  • [26] An Extensive Comparison of Systems for Entity Extraction from Log Files
    Chhabra, Anubhav
    Branco, Paula
    Jourdan, Guy-Vincent
    Viktor, Herna L.
    FOUNDATIONS AND PRACTICE OF SECURITY, FPS 2021, 2022, 13291 : 376 - 392
  • [27] On the provenance extraction techniques from large scale log files
    Tufek, Alper
    Aktas, Mehmet S.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (15):
  • [28] A bestiary of log files: Log file format in distributed systems
    Edwards, HK
    Bauer, MA
    SERP'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING RESEARCH AND PRACTICE, VOLS 1 AND 2, 2003, : 286 - 292
  • [29] The laborious way from data mining to web log mining
    Spiliopoulou, M
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 1999, 14 (02): : 113 - 126
  • [30] Automated QA using log files
    Hernandez, V.
    Abella, R.
    RADIOTHERAPY AND ONCOLOGY, 2016, 119 : S286 - S286