An Empirical Study on Document Similarity Comparison Evaluation Between Machine Learning Techniques and Human Experts

被引:0
|
作者
Jang, Won-Jung [1 ]
机构
[1] Catholic Kwandong Univ, 25601 502,Mary Hall,24,Beomil Ro 576, Gangneung Si 25601, Gangwon Do, South Korea
来源
TEHNICKI VJESNIK-TECHNICAL GAZETTE | 2024年 / 31卷 / 05期
关键词
ANN model; count-based model; document similarity; ensemble learning model; machine learning; FREQUENCY; MODEL;
D O I
10.17559/TV-20231011001013
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Current machine-learning training focuses solely on accuracy. In this study, the weights of other dimensions were examined rather than measuring only the accuracy of machine learning. By comparatively analyzing the decision-making of machine learning and humans in various fields, this study examines how well organizational vision is propagated to lower levels of the organization. Also, the results evaluated by humans and machine learning models were comparatively analyzed from multiple perspectives. As numerical representation methods of words, count-based models (Bag of Words, TF-IDF),- IDF ), artificial neural network (ANN) models (Word2Vec, GloVe), and a vision propagation measurement (VPMS) model combining two methods were used to calculate the similarity between documents, which are comparatively analyzed with the actual results measured by an expert group. The findings of this study can be used as an evaluation metric for how effectively the vision of the upper organization is being disseminated to the lower-level organizations. Additionally, it could be utilized in developing algorithms such as customer segmentation for target marketing using text data. The study makes two key contributions- (i) providing an extensive empirical comparison of document similarity analysis by different ML techniques versus human experts, and (ii) proposing a new VPMS model that outperforms existing methods.
引用
收藏
页码:1668 / 1679
页数:12
相关论文
共 50 条
  • [1] An empirical comparison of machine learning techniques for chant classification
    Kokkinidis, K.
    Mastoras, T.
    Tsagaris, A.
    Fotaris, P.
    [J]. 2018 7TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2018,
  • [2] An Empirical Evaluation of Machine Learning Techniques for Crop Prediction
    Mariammal, G.
    Suruliandi, A.
    Raja, S. P.
    Poongothai, E.
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04): : 96 - 104
  • [3] An empirical study of machine learning techniques for affect recognition in human–robot interaction
    Pramila Rani
    Changchun Liu
    Nilanjan Sarkar
    Eric Vanman
    [J]. Pattern Analysis and Applications, 2006, 9 : 58 - 69
  • [4] An empirical comparison of machine learning techniques for dam behaviour modelling
    Salazar, F.
    Toledo, M. A.
    Onate, E.
    Moran, R.
    [J]. STRUCTURAL SAFETY, 2015, 56 : 9 - 17
  • [5] An empirical study of machine learning techniques for affect recognition in human-robot interaction
    Liu, CC
    Rani, P
    Sarkar, N
    [J]. 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, Vols 1-4, 2005, : 2451 - 2456
  • [6] An empirical study of machine learning techniques for affect recognition in human-robot interaction
    Rani, Pramila
    Liu, Changchun
    Sarkar, Nilanjan
    Vanman, Eric
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2006, 9 (01) : 58 - 69
  • [7] An Empirical Comparison of Individual Machine Learning Techniques in Signature and Fingerprint Classification
    Abreu, Marjory
    Fairhurst, Michael
    [J]. BIOMETRICS AND IDENTITY MANAGEMENT, 2008, 5372 : 130 - 139
  • [8] An Empirical Evaluation of Set Similarity Join Techniques
    Mann, Willi
    Augsten, Nikolaus
    Bouros, Panagiotis
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (09): : 636 - 647
  • [9] An Empirical Evaluation of Machine Learning Techniques for Chronic Kidney Disease Prophecy
    Khan, Bilal
    Naseem, Rashid
    Muhammad, Fazal
    Abbas, Ghulam
    Kim, Sunghwan
    [J]. IEEE ACCESS, 2020, 8 : 55012 - 55022
  • [10] An Empirical Evaluation of Document Embeddings and Similarity Metrics for Scientific Articles
    Gomez, Joaquin
    Vazquez, Pere-Pau
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (11):