A TEXT FEATURE WORD EXTRACTION METHOD APPLIED TO ENTERPRISE COMPETITIVE INTELLIGENCE SYSTEM

被引:0
|
作者
Zhang, Zhiwei [1 ]
Zhang, Haining [1 ]
Zhu, Guangliang [1 ]
机构
[1] Suzhou Univ, Sch Informat & Engn, Suzhou, Peoples R China
来源
UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE | 2023年 / 85卷 / 04期
关键词
feature word extraction; information extraction; part-of-speech tagging; text classification; competitive intelligence; natural language processing;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To acquire industry feature words, professionals need to collect and analyze the feature word sets from industry sites according to their experience and finally merge the feature word sets at different sites to form industry feature word sets. This method is characterized by a large workload, the difficulty in ensuring the accuracy of text classification, and the necessity to adjust feature words by repeating the above process. To solve the above problems, a universal feature word extraction scheme and a system framework were first proposed in this study based on the actual requirements of the enterprise competitive intelligence system. Then, the key problems involved in the process of feature word extraction were elaborated in detail. Finally, the traditional feature word weight was improved on basis of predecessors' results, and the Sogou lexicon was introduced to correct the word frequency and part of speech. A good classification effect was achieved through experiments with the KNN classifier, which was verified using the vector space model (VSM), and a high average F1 value was acquired.
引用
收藏
页码:221 / 234
页数:14
相关论文
共 50 条
  • [31] Word Sense Disambiguation Studio: A Flexible System for WSD Feature Extraction
    Agre, Gennady
    Petrov, Daniel
    Keskinova, Simona
    INFORMATION, 2019, 10 (03)
  • [32] A Novel Method To Summarize and Retrieve Text Documents Using Text Feature Extraction Based on Ontology
    Patil, Aradhana R.
    Manjrekar, Amrita A.
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 1256 - 1260
  • [33] The Application of Web Data Mining Technique in Competitive Intelligence System of Enterprise based on XML
    Chen Ting
    Niu Xiao
    Yang Weiping
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 2, PROCEEDINGS, 2009, : 396 - 399
  • [34] A Feature-Enhanced Smoothing Method for LDA Model Applied to Text Classification
    Liu, DongXin
    Xu, WeiRan
    Hu, JiaNi
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 17 - 23
  • [35] A statistical domain terminology extraction method based on word length and grammatical feature
    Liu L.
    Xiao Y.
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2017, 38 (09): : 1437 - 1443
  • [36] FREGEX: A Feature Extraction Method for Biomedical Text Classification using Regular Expressions
    Flores, Christopher A.
    Figueroa, Rosa L.
    Pezoa, Jorge E.
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 6085 - 6088
  • [37] Apriori and N-gram Based Chinese Text Feature Extraction Method
    王晔
    黄上腾
    Journal of Shanghai Jiaotong University, 2004, (04) : 11 - 14
  • [38] Research of Enterprise Competitive Intelligence Collection System Based On Cross-Language Information Retrieval
    Li Xiang-jun
    Si Hong-na
    Yao Li-wen
    Yao Xiao-kun
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, VOL I, 2009, : 601 - 604
  • [39] Color Feature Extraction Method in the Video Surveillance System
    Zhang, Li
    Yu, Haibin
    He, Zhiwei
    ADVANCED RESEARCH ON INDUSTRY, INFORMATION SYSTEMS AND MATERIAL ENGINEERING, PTS 1-7, 2011, 204-210 : 178 - +
  • [40] Fast Feature Extraction Method for Faults Detection System
    Wang, Hongmin
    Zhu, Xiaohui
    Niu, Xiangyong
    Xue, Ping
    ADVANCED HYBRID INFORMATION PROCESSING, 2018, 219 : 297 - 306