Topic segmentation using word-level semantic relatedness functions

被引:3
|
作者
Ercan, Gonenc [1 ]
Cicekli, Ilyas [2 ]
机构
[1] Hacettepe Univ, Inst Informat, PO 06800, Ankara, Turkey
[2] Hacettepe Univ, Dept Comp Engn, Ankara, Turkey
关键词
Lexical cohesion; semantic relatedness; topic segmentation; TEXT; SIMILARITY; MODELS;
D O I
10.1177/0165551515602460
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic relatedness deals with the problem of measuring how much two words are related to each other. While there is a large body of research for developing new measures, the use of semantic relatedness (SR) measures in topic segmentation has not been explored. In this research the performance of different SR measures is evaluated in the topic segmentation problem. To this end, two topic segmentation algorithms that use the difference in SR of words are introduced. Our results indicate that using an SR measure trained with a general domain corpora achieves better results than topic segmentation algorithms using Wordnet or simple word repetition. Furthermore, when compared with computationally more complex algorithms performing global analysis, our local analysis, enhanced with general domain lexical semantic information, achieves comparable results.
引用
收藏
页码:597 / 608
页数:12
相关论文
共 50 条
  • [31] Word-level Speech Recognition with a Letter to Word Encoder
    Collobert, Ronan
    Hannun, Awni
    Synnaeve, Gabriel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [32] Word-Level Script Identification Using Texture Based Features
    Singh, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    INTERNATIONAL JOURNAL OF SYSTEM DYNAMICS APPLICATIONS, 2015, 4 (02) : 74 - 94
  • [33] WARP: Word-level Adversarial ReProgramming
    Hambardzumyan, Karen
    Khachatrian, Hrant
    May, Jonathan
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4921 - 4933
  • [34] Word-Level Symbolic Trajectory Evaluation
    Chakraborty, Supratik
    Khasidashvili, Zurab
    Seger, Carl-Johan H.
    Gajavelly, Rajkumar
    Haldankar, Tanmay
    Chhatani, Dinesh
    Mistry, Rakesh
    COMPUTER AIDED VERIFICATION, CAV 2015, PT II, 2015, 9207 : 128 - 143
  • [35] A word-level graph manipulation package
    Höreth S.
    International Journal on Software Tools for Technology Transfer, 2001, 3 (02) : 182 - 192
  • [36] The Phonetics of Paiwan Word-Level Prosody
    Chen, Chun-Mei
    LANGUAGE AND LINGUISTICS, 2009, 10 (03) : 593 - 625
  • [37] Word-level optical font recognition using typographical features
    Kim, SH
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2004, 18 (04) : 541 - 561
  • [38] Word-level Speech Recognition with a Letter to Word Encoder
    Collobert, Ronan
    Hannun, Awni
    Synnaeve, Gabriel
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [39] On Using Control Signals for Word-Level Identification in A Gate-Level Netlist
    Tashjian, Edward
    Davoodi, Azadeh
    2015 52ND ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2015,
  • [40] WORD-LEVEL RECOGNITION OF CURSIVE SCRIPT
    FARAG, RFH
    IEEE TRANSACTIONS ON COMPUTERS, 1979, 28 (02) : 172 - 175