MINING FOR RELEVANT TERMS FROM LOG FILES

被引:0
|
作者
Saneifar, Hassan [1 ,2 ]
Bonniol, Stephane [2 ]
Laurent, Anne [1 ]
Poncelet, Pascal [1 ]
Roche, Mathieu [1 ]
机构
[1] Univ Montpellier 2, CNRS, LIRMM, 161 Rue Ada, F-34392 Montpellier 5, France
[2] Sain IP Technol, F-34960 Montpellier, France
关键词
Natural language processing; Information retrieval; Terminology extraction; Terminology ranking; Log files;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Information extracted from log files of computing systems can be considered one of the important resources of information systems. In the case of Integrated Circuit design, log files generated by design tools are not exhaustively exploited. The logs of this domain are multi-source, multi-format, and have a heterogeneous and evolving structure. Moreover, they usually do not respect the grammar and the structures of natural language though they are written in English. According to features of such textual data, applying the classical methods of information extraction is not an easy task, more particularly for terminology extraction. We have previously introduced EXTERLOG approach to extract the terminology from such log files. In this paper, we introduce a new developed version of EXTERLOG guided by Web. We score the extracted terms by a Web and context based measure. We favor the more relevant terms of domain and emphasize the precision by filtering terms based on their scores. The experiments show that EXTERLOG is well-adapted terminology extraction approach from log files.
引用
收藏
页码:77 / +
页数:2
相关论文
共 50 条
  • [41] Obtaining subject data from log files using deep log analysis: case study OhioLINK
    Huntington, Paul
    Nicholas, David
    Jamali, Hamid R.
    Watkinson, Anthony
    JOURNAL OF INFORMATION SCIENCE, 2006, 32 (04) : 299 - 308
  • [42] Identifying Anomaly Detection Patterns from Log Files: A Dynamic Approach
    Cavallaro, Claudia
    Ronchieri, Elisabetta
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT II, 2021, 12950 : 517 - 532
  • [43] Discovering Learners Behaviour Patterns From Log Files Using LSA
    Milat, Iness Nedji
    Seridi, Hassina
    Moudjari, Abdelkader
    INTERNATIONAL JOURNAL OF DISTANCE EDUCATION TECHNOLOGIES, 2020, 18 (02) : 90 - 113
  • [44] Mining User Profiles from Query Log
    Peng, Minlong
    Zhao, Jun
    Zhang, Qi
    Gui, Tao
    Huang, Xuanjing
    Fu, Jinlan
    INFORMATION RETRIEVAL (CCIR 2019), 2019, 11772 : 3 - 15
  • [45] Cross-system validation of engagement prediction from log files
    Cocea, Mihaela
    Weibelzahl, Stephan
    CREATING NEW LEARNING EXPERIENCES ON A GLOBAL SCALE, PROCEEDINGS, 2007, 4753 : 14 - +
  • [46] From Terminology Extraction to Terminology Validation: An Approach Adapted to Log Files
    Saneifar, Hassan
    Bonniol, Stephane
    Poncelet, Pascal
    Roche, Mathieu
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2015, 21 (04) : 604 - 635
  • [47] Mining Criminal Networks from Chat Log
    Iqbal, Farkhund
    Fung, Benjamin C. M.
    Debbabi, Mourad
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 332 - 337
  • [48] Online discovery of relevant terms from Internet
    Ji, DH
    Yang, LP
    Yu, N
    Li, T
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 327 - 332
  • [49] SEARCHING UNINDEXED AND NONUNIFORMLY GENERATED FILES IN LOG LOG N TIME
    WILLARD, DE
    SIAM JOURNAL ON COMPUTING, 1985, 14 (04) : 1013 - 1029
  • [50] Towards microaggregation of log files for Web usage mining in B2C e-commerce
    Navarro-Arribas, Guillermo
    Torra, Vicenc
    2009 ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, 2009, : 380 - 385