Keyphrases Concentrated Area Identification from Academic Articles as Feature of Keyphrase Extraction: A New Unsupervised Approach

Times cited: 0
Authors
Miah, Mohammad Badrul Alam [1, 2]
Awang, Suryanti [3 ]
Azad, Md Saiful [4 ]
Rahman, Md Mustafizur [5 ]
Affiliations
[1] Univ Malaysia Pahang, Pekan, Malaysia
[2] Mawlana Bhashani Sci & Technol Univ, Informat & Commun Technol, Tangail, Bangladesh
[3] Univ Malaysia Pahang, Fac Comp, Ctr Data Sci & Artificial Intelligence, Data Sci Ctr, Soft Comp & Intelligent Syst, Pekan, Malaysia
[4] Green Univ Bangladesh, Comp Sci & Engn, Dhaka, Bangladesh
[5] Univ Malaysia Pahang, Fac Engn, Dept Mech Engn, Gambang, Kuantan, Malaysia
Keywords
Keyphrase concentrated area; KCA identification; feature extraction; data processing; keyphrase extraction; curve fitting
DOI
Not available
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Extracting high-quality keywords and summarising documents at a high level has become more difficult because of technological advancements and the exponential growth of textual data and digital sources. Both tasks rely on features for keyphrase extraction, which are becoming increasingly popular. This study proposes a new unsupervised keyphrase concentrated area (KCA) identification approach as a feature for keyphrase extraction. The approach is corpus-, domain- and language-independent, does not depend on document length, and can be used by both supervised and unsupervised techniques. The proposed system has three phases: data pre-processing, data processing, and KCA identification. Various text pre-processing methods are applied to the acquired datasets, and the pre-processed data is then passed to the data processing step. Statistical approaches, curve plotting, and curve fitting are applied in the KCA identification step. The proposed system is tested and evaluated on benchmark datasets collected from various sources. To demonstrate the effectiveness, merits, and significance of the proposed approach, we compared it with other proposed techniques. The experimental results on eleven datasets show that the proposed approach effectively identifies the KCA in articles and significantly enhances current keyphrase extraction methods across various text sizes, languages, and domains.
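The idea of a keyphrase concentrated area can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not specify the statistical approaches or the curve-fitting model, so a smallest-window search over relative keyphrase positions stands in for them here. The function names `keyphrase_positions` and `concentrated_area` and the `coverage` parameter are hypothetical, introduced only for illustration.

```python
import math
import re


def keyphrase_positions(text, keyphrases):
    """Return sorted relative positions (0..1) of every keyphrase occurrence.

    Assumption: simple lowercase alphanumeric tokenisation replaces the
    paper's (unspecified) pre-processing steps.
    """
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    n = len(tokens)
    positions = []
    for phrase in keyphrases:
        words = re.findall(r"[a-z0-9]+", phrase.lower())
        k = len(words)
        if k == 0:
            continue
        # Slide over the token stream and record where the phrase starts.
        for i in range(n - k + 1):
            if tokens[i:i + k] == words:
                positions.append(i / n)
    return sorted(positions)


def concentrated_area(positions, coverage=0.8):
    """Return the narrowest (start, end) interval holding at least
    `coverage` of the sorted positions, or None if there are none.

    Assumption: this window search is a stand-in for the curve plotting
    and curve fitting mentioned in the abstract.
    """
    if not positions:
        return None
    m = len(positions)
    need = max(1, math.ceil(coverage * m))
    best = None
    for i in range(m - need + 1):
        lo, hi = positions[i], positions[i + need - 1]
        if best is None or hi - lo < best[1] - best[0]:
            best = (lo, hi)
    return best
```

Given the text of an article and its annotated keyphrases, `concentrated_area(keyphrase_positions(text, keyphrases))` returns the fraction of the document, as a start and end offset, in which most keyphrase occurrences fall. An extraction method could then weight candidates inside that interval more heavily.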
Pages: 788-796
Page count: 9