Topic segmentation using word-level semantic relatedness functions

被引:3
|
作者
Ercan, Gonenc [1 ]
Cicekli, Ilyas [2 ]
机构
[1] Hacettepe Univ, Inst Informat, PO 06800, Ankara, Turkey
[2] Hacettepe Univ, Dept Comp Engn, Ankara, Turkey
关键词
Lexical cohesion; semantic relatedness; topic segmentation; TEXT; SIMILARITY; MODELS;
D O I
10.1177/0165551515602460
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic relatedness deals with the problem of measuring how much two words are related to each other. While there is a large body of research for developing new measures, the use of semantic relatedness (SR) measures in topic segmentation has not been explored. In this research the performance of different SR measures is evaluated in the topic segmentation problem. To this end, two topic segmentation algorithms that use the difference in SR of words are introduced. Our results indicate that using an SR measure trained with a general domain corpora achieves better results than topic segmentation algorithms using Wordnet or simple word repetition. Furthermore, when compared with computationally more complex algorithms performing global analysis, our local analysis, enhanced with general domain lexical semantic information, achieves comparable results.
引用
收藏
页码:597 / 608
页数:12
相关论文
共 50 条
  • [2] Transition-Based Neural Word Segmentation Using Word-Level Features
    Zhang, Meishan
    Zhang, Yue
    Fu, Guohong
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 63 : 923 - 953
  • [3] Sentence-Level Semantic Textual Similarity Using Word-Level Semantics
    Shajalal, Md
    Aono, Masaki
    [J]. 2018 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2018, : 113 - 116
  • [4] Infants' use of rhythmic cues in word-level segmentation
    Childers, JB
    Echols, CH
    [J]. PROCEEDINGS OF THE 20TH ANNUAL BOSTON UNIVERSITY CONFERENCE ON LANGUAGE DEVELOPMENT, VOLS 1 AND 2, 1996, : 167 - 176
  • [5] Mathematical framework for representing discrete functions as word-level polynomials
    Pradhan, DK
    Askar, S
    Ciesielski, M
    [J]. EIGHTH IEEE INTERNATIONAL HIGH-LEVEL DESIGN VALIDATION AND TEST WORKSHOP, PROCEEDINGS, 2003, : 135 - 139
  • [6] Word-level Chinese named entity recognition based on segmentation digraph
    Gao, H
    Huang, D
    Yang, YS
    [J]. PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 380 - 383
  • [7] Cascaded Segmentation-Detection Networks for Word-Level Text Spotting
    Qin, Siyang
    Manduchi, Roberto
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1275 - 1282
  • [8] Word sense disambiguation using semantic relatedness measurement
    YANG Che-Yu (Department of Computer Science and Information Engineering
    [J]. Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2006, (10) : 1609 - 1625
  • [9] Using measures of semantic relatedness for word sense disambiguation
    Patwardhan, S
    Banerjee, S
    Pedersen, T
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 241 - 257
  • [10] Word sense disambiguation using semantic relatedness measurement
    Yang C.-Y.
    [J]. Journal of Zhejiang University-SCIENCE A, 2006, 7 (10): : 1609 - 1625