Measuring Innovation in Mauritius' ICT Sector Using Unsupervised Machine Learning: A Web Mining and Topic Modeling Approach

被引:1
|
作者
Boehmecke-Schwafert, Moritz [1 ]
Doerries, Colin [1 ]
机构
[1] Tech Univ Berlin, Chair Innovat Econ, Dept Econ & Management, Berlin, Germany
关键词
Innovation; Indicators; Developing countries; Natural language processing; Emerging countries; ICT sector; Topic modeling; Web mining; O30; O33; C81; C88; PERFORMANCE; QUALITY; GROWTH; FIRMS;
D O I
10.1007/s13132-023-01587-0
中图分类号
F [经济];
学科分类号
02 ;
摘要
Measuring innovation accurately and efficiently is crucial for policymakers to encourage innovation activity. However, the established indicator landscape lacks timeliness and accuracy. In this study, we focus on the country of Mauritius that is transforming its economy towards the information and communication technology (ICT) sector. We seek to extend the knowledge base on innovation activity and the status quo of innovation in Mauritius by applying an unsupervised machine learning approach. Building on previous work on new experimental innovation indicators, we combine recent advances in web mining and topic modeling and address the following research questions: What are potential areas of innovation activity in the ICT sector of Mauritius? Furthermore, do web mining and topic modeling provide sufficient indicators to understand innovation activities in emerging countries? To answer these questions, we apply the natural language processing (NLP) technique of Latent Dirichlet Allocation (LDA) to ICT companies' website text data. We then generate topic models from the scraped text data. As a result, we derive seven categories that describe the innovation activities of ICT firms in Mauritius. Albeit the model approach fulfills the requirements for innovation indicators as suggested in the Oslo Manual, it needs to be combined with additional metrics for innovation, for example, with traditional indicators such as patents, to unfold its potential. Furthermore, our approach carries methodological implications and is intended to be reproduced in similar contexts of scarce or unavailable data or where traditional metrics have demonstrated insufficient explanatory power.
引用
收藏
页码:1 / 34
页数:34
相关论文
共 50 条
  • [1] Mining FDA drug labels using an unsupervised learning technique - topic modeling
    Bisgin, Halil
    Liu, Zhichao
    Fang, Hong
    Xu, Xiaowei
    Tong, Weida
    BMC BIOINFORMATICS, 2011, 12
  • [2] Mining FDA drug labels using an unsupervised learning technique - topic modeling
    Halil Bisgin
    Zhichao Liu
    Hong Fang
    Xiaowei Xu
    Weida Tong
    BMC Bioinformatics, 12
  • [3] A machine learning approach to Web mining
    Esposito, F
    Malerba, D
    Di Pace, L
    Leo, P
    AI(ASTERISK)IA 99: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2000, 1792 : 190 - 201
  • [4] Opinion Mining on Food Services using Topic Modeling and Machine Learning Algorithms
    Akila, R.
    Revathi, S.
    Shreedevi, G.
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 1071 - 1076
  • [5] Machine learning in finance: A topic modeling approach
    Aziz, Saqib
    Dowling, Michael
    Hammami, Helmi
    Piepenbrink, Anke
    EUROPEAN FINANCIAL MANAGEMENT, 2022, 28 (03) : 744 - 770
  • [6] Unsupervised Concept Hierarchy Learning: A Topic Modeling Guided Approach
    Anoop, V. S.
    Asharaf, S.
    Deepak, P.
    TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 386 - 394
  • [7] Exploring Accounting Research Topic Evolution: An Unsupervised Machine Learning Approach
    Cao, June
    Gu, Zhanzhong
    Hasan, Iftekhar
    JOURNAL OF INTERNATIONAL ACCOUNTING RESEARCH, 2023, 22 (03) : 1 - 30
  • [8] Mining Contentious Documents Using an Unsupervised Topic Model Based Approach
    Trabelsi, Amine
    Zaiane, Osmar R.
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 550 - 559
  • [9] Corporate governance and innovation: a predictive modeling approach using machine learning
    de Pilla, Leonardo Henrique Lima
    Silveira, Elaine Barbosa Couto
    Caldieraro, Fabio
    Peci, Alketa
    Aggarwal, Ishani
    R & D MANAGEMENT, 2025, 55 (02) : 385 - 404
  • [10] Introduction to the JASIST special topic section on web retrieval and mining:: A machine learning perspective
    Chen, H
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2003, 54 (07): : 621 - 624