Abstract or Full-text in Topic Modeling?

被引:0
|
作者
Tekin, Yasar [1 ]
Cosar, Ahmet [2 ]
机构
[1] Orta Dogu Tekn Univ, Bilim & Teknol Politikasi Calismalari, Ankara, Turkey
[2] Orta Dogu Tekn Univ, Bilgisayar Muhendisligi, Ankara, Turkey
来源
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2022年
关键词
topic modeling; LDA; parameter optimization; abstract; full-text; DIFFERENTIAL EVOLUTION;
D O I
10.1109/SIU55565.2022.9864707
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Topic modeling is a text mining technique used for automatic extraction of topics addressed in document collections. Although there are different topic models proposed by researchers, the most preferred one is Latent Dirichlet Allocation (LDA). Despite such widespread use, uncertainties about LDA have not been fully resolved yet. In this study, the effect of using abstracts or full-text articles on LDA model parameters is investigated. For this purpose, LDA parameters are optimized on abstracts and full-texts of articles published in two different scientific journals and the results obtained are compared with each other.
引用
收藏
页数:4
相关论文
共 50 条