Keyphrases Concentrated Area Identification from Academic Articles as Feature of Keyphrase Extraction: A New Unsupervised Approach

Cited by: 0
Authors
Miah, Mohammad Badrul Alam [1, 2]
Awang, Suryanti [3 ]
Azad, Md Saiful [4 ]
Rahman, Md Mustafizur [5 ]
Affiliations
[1] Univ Malaysia Pahang, Pekan, Malaysia
[2] Mawlana Bhashani Sci & Technol Univ, Informat & Commun Technol, Tangail, Bangladesh
[3] Univ Malaysia Pahang, Fac Comp, Ctr Data Sci & Artificial Intelligence, Data Sci Ctr, Soft Comp & Intelligent Syst, Pekan, Malaysia
[4] Green Univ Bangladesh, Comp Sci & Engn, Dhaka, Bangladesh
[5] Univ Malaysia Pahang, Fac Engn, Dept Mech Engn, Gambang, Kuantan, Malaysia
Keywords
Keyphrase concentrated area; KCA identification; feature extraction; data processing; keyphrase extraction; curve fitting
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Extracting high-quality keywords and summarising documents at a high level has become more difficult because of technological advances and the exponential growth of textual data and digital sources. Both tasks depend on features for keyphrase extraction, which are therefore attracting growing attention. This study proposes a new unsupervised keyphrase concentrated area (KCA) identification approach as a feature for keyphrase extraction. The approach is corpus-, domain-, and language-independent, does not depend on document length, and can be used by both supervised and unsupervised techniques. The proposed system has three phases: data pre-processing, data processing, and KCA identification. Various text pre-processing methods are applied to the acquired datasets, and the pre-processed data is then passed to the data processing step. In the KCA identification step, statistical approaches, curve plotting, and a curve fitting technique are applied. The proposed system is tested and evaluated on benchmark datasets collected from various sources. To demonstrate the effectiveness, merits, and significance of the approach, we compared it with other proposed techniques. Experimental results on eleven (11) datasets show that the approach effectively recognizes the KCA in articles and significantly enhances current keyphrase extraction methods across various text sizes, languages, and domains.
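The abstract names the pipeline stages but gives no formulas, so the following is only a minimal sketch of how "curve plotting and curve fitting" might locate a concentrated area; it is not the authors' method. It assumes a pre-tokenized, lowercased document, a known list of gold keyphrases, equal-width position bins, and a least-squares polynomial fit. The function names `keyphrase_positions` and `concentrated_area` and the parameters `bins` and `degree` are hypothetical.

```python
import numpy as np


def keyphrase_positions(tokens, keyphrases):
    """Return relative start positions (0 to 1) of each keyphrase occurrence.

    Assumes `tokens` is already lowercased (hypothetical pre-processing).
    """
    positions = []
    n = len(tokens)
    for phrase in keyphrases:
        phrase_tokens = phrase.lower().split()
        k = len(phrase_tokens)
        for i in range(n - k + 1):
            if tokens[i:i + k] == phrase_tokens:
                positions.append(i / n)
    return positions


def concentrated_area(positions, bins=10, degree=3):
    """Fit a polynomial to the binned position counts and return the
    (start, end) of the bin where the fitted curve is highest.

    The choice of histogram plus polynomial fit is an assumption made
    for illustration only.
    """
    counts, edges = np.histogram(positions, bins=bins, range=(0.0, 1.0))
    centers = (edges[:-1] + edges[1:]) / 2
    coeffs = np.polyfit(centers, counts, deg=degree)
    fitted = np.polyval(coeffs, centers)
    peak = int(np.argmax(fitted))
    return float(edges[peak]), float(edges[peak + 1])
```

Because positions are normalised by document length, a sketch of this kind does not depend on how long the document is, which is in the spirit of the "document length-free" property claimed in the abstract.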
Pages: 788-796
Number of pages: 9