Mix multiple features to evaluate the content and the linguistic quality of text summaries

被引:3
|
作者
Ellouze S. [1 ]
Jaoua M. [1 ]
Belguith L.H. [1 ]
机构
[1] Faculty of Economics and Management of Sfax, ANLP research Group, MIRACL Laboratory, University of Sfax, Road of the Airport Km 4, Sfax
关键词
Content; Linguistic quality; Machine learning; Regression or classification; Summary evaluation;
D O I
10.20532/cit.2017.1003398
中图分类号
学科分类号
摘要
In this article, we propose a method of text summary's content and linguistic quality evaluation that is based on a machine learning approach. This method operates by combining multiple features to build predictive models that evaluate the content and the linguistic quality of new summaries (unseen) constructed from the same source documents as the summaries used in the training and the validation of models. To obtain the best model, many single and ensemble learning classifiers are tested. Using the constructed models, we have achieved a good performance in predicting the content and the linguistic quality scores. In order to evaluate the summarization systems, we calculated the system score as the average of the score of summaries that are built from the same system. Then, we evaluated the correlation of the system score with the manual system score. The obtained correlation indicates that the system score outperforms the baseline scores.
引用
收藏
页码:149 / 166
页数:17
相关论文
共 50 条
  • [1] Linguistic Summaries and Their Quality
    Hudec, Miroslav
    APPLICATION OF FUZZY LOGIC FOR MANAGERIAL DECISION MAKING PROCESSES: LATEST RESEARCH AND CASE STUDIES, 2017, : 99 - 101
  • [2] A Crowdsourcing Approach to Evaluate the Quality of Query-based Extractive Text Summaries
    Iskender, Neslihan
    Gabryszak, Aleksandra
    Polzehl, Tim
    Hennig, Leonhard
    Moeller, Sebastian
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2019,
  • [3] Unsupervised Video Summaries Using Multiple Features and Image Quality
    Hu, Tongling
    Li, Zechao
    Su, Weiyang
    Mu, Xing
    Tang, Jinhui
    2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, : 117 - 120
  • [4] Improving text recall with multiple summaries
    van der Meij, Hans
    van der Meij, Jan
    BRITISH JOURNAL OF EDUCATIONAL PSYCHOLOGY, 2012, 82 (02) : 257 - 269
  • [5] On some quality criteria of bipolar linguistic summaries
    Dziedzic, Mateusz
    Kacprzyk, Janusz
    Zadrozny, Slawomir
    2013 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2013, : 643 - 646
  • [6] EFFECTS OF TEXT MANIPULATIONS ON QUALITY OF WRITTEN SUMMARIES
    GARNER, R
    MCCALEB, JL
    CONTEMPORARY EDUCATIONAL PSYCHOLOGY, 1985, 10 (02) : 139 - 149
  • [7] Linguistic Features of Copywriting and Rewriting in the Field of Text Content for Corporate Websites: Semantic Aspect
    Seidekhanov, Sanatbek
    Dossanova, Albina
    JOURNAL OF PSYCHOLINGUISTIC RESEARCH, 2024, 53 (01)
  • [8] Linguistic Features of Copywriting and Rewriting in the Field of Text Content for Corporate Websites: Semantic Aspect
    Sanatbek Seidekhanov
    Albina Dossanova
    Journal of Psycholinguistic Research, 2024, 53
  • [9] Perceived Text Quality and Readability in Extractive and Abstractive Summaries
    Monsen, Julius
    Rennes, Evelina
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 305 - 312
  • [10] Text Genres and Registers: The Computation of Linguistic Features
    Pan, Fan
    Tao, Guoxiao
    SCIENTOMETRICS, 2017, 113 (03) : 1815 - 1818