Models for Arabic Document Quality Assessment

被引:1
|
作者
Yahya, Adnan [1 ]
Ahmad, Afnan [1 ]
Assaf, Alaa [1 ]
Khater, Rawan [1 ]
Salhi, Ali [1 ]
机构
[1] Birzeit Univ, Elect & Comp Engn Dept, Birzeit, Palestine
关键词
Document quality assessment; Arabic Wikipedia; Arabic information retrieval;
D O I
10.1007/978-3-030-61146-0_24
中图分类号
F [经济];
学科分类号
02 ;
摘要
Digital content has been increasing rapidly. This content can be generated, accessed and used by anyone and thus the need for quality assessment of web content before usage becomes an important issue. Devising methods to assess the quality of Arabic digital content is the focus of this paper. Our work was partially based on Wikipedia articles annotated into featured and good according to quality guidelines of Wikipedia. Our analysis was directed at finding features that can serve as best quality indicators. Using the defined features, we trained a high accuracy quality assessment model using machine-learning algorithms. Our work went beyond the Wikipedia documents to build a general model that can assess the quality of Arabic documents that lack Wikipedia metadata with acceptable accuracy. The model was trained and built using features from documents we collected from Arabic online news sites and blogs, and annotated in collaboration with university students.
引用
收藏
页码:297 / 310
页数:14
相关论文
共 50 条
  • [41] Localization Quality Assessment for More Reliable E-Commerce Applications in Arabic
    Omar, Abdulfattah
    Altohami, Waheed M. A.
    Ethelb, Hamza
    Hamidi, Bahramuddin
    [J]. EDUCATION RESEARCH INTERNATIONAL, 2022, 2022
  • [42] Arabic Translation and Validation of Olfactory-Specific Quality of Life Assessment Questionnaire
    Alsayid, Hoda
    Alnakhli, Sarah
    Marzouki, Hani Z.
    Varshney, Rickull
    Zawawi, Faisal
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2021, 13 (06)
  • [43] Dyslexia assessment in Arabic
    Elbeheri, Gad
    Everatt, John
    Reid, Gavin
    Al Mannai, Haya
    [J]. JOURNAL OF RESEARCH IN SPECIAL EDUCATIONAL NEEDS, 2006, 6 (03): : 143 - 152
  • [44] Identification of key textural attributes of Arabic bread and their application to the assessment of storage and quality
    Toufeili, I
    Chammas, H
    Shadarevian, S
    [J]. JOURNAL OF TEXTURE STUDIES, 1998, 29 (01) : 57 - 66
  • [45] Assessment of the Arabic patient-centered online information about orthodontic pain: A quality and readability assessment
    Alassaf, Muath Saad
    Hamadallah, Hatem Hazzaa
    Almuzaini, Abdulrahman
    Aloufi, Aseel M.
    Al-Turki, Khalid N.
    Khoshhal, Ahmed S.
    Alsulaimani, Mahmoud A.
    Eshky, Rawah
    [J]. PLOS ONE, 2024, 19 (05):
  • [46] Quality of Adaptation: User Cognitive Models in Adaptation Quality Assessment
    Lopez-Jaquero, Victor
    Montero, Francisco
    Gonzalez, Pascual
    [J]. COMPUTER-AIDED DESIGN OF USER INTERFACES VI, 2009, : 265 - 275
  • [47] Blind quality assessment metric and degradation classification for degraded document images
    Shahkolaei, Atena
    Beghdadi, Azeddine
    Cheriet, Mohamed
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 76 : 11 - 21
  • [48] Investigating the possibilities of document cameras for quality assessment of foodstuffs by measuring of color
    Baycheva, Stanka
    Zlatev, Zlatin
    Dimitrova, Antoaneta
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON VIRTUAL LEARNING, 2016, : 204 - 208
  • [49] Crime Type Document Classification from Arabic Corpus
    Alruily, Meshrif
    Ayesh, Aladdin
    Zedan, Hussein
    [J]. 2009 SECOND INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2009), 2009, : 153 - 159
  • [50] ARABIC DOCUMENT SUMMARIZATION USING FA FUZZY ONTOLOGY
    Atlam, El-Sayed
    El-Barbary, Omnia
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (04): : 1351 - 1367