Information Filtering Method for Twitter Streaming Data Using Human-in-the-Loop Machine Learning

被引:1
|
作者
Suzuki, Yu [1 ]
Nakamura, Satoshi [1 ]
机构
[1] Nara Inst Sci & Technol, 8916-5 Takayama, Nara 6300192, Japan
关键词
D O I
10.1007/978-3-319-98812-2_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are a massive amount of texts on social media. However, only a small portion of these texts is informative for a specific purpose. If we accurately filter the texts in the streams, we can obtain useful information in real time. In a keyword-based approach, filters are constructed using keywords, but selecting the appropriate keywords to include is often difficult. In this work, we propose a method for filtering texts that are related to specific topics using both crowdsourcing and machine learning based text classification method. In our approach, we construct a text classifier using FastText and then annotate whether the tweets are related to the topics using crowdsourcing. In this step, we consider two strategies, optimistic and pessimistic approach, for selecting tweets which should be assessed. Then, we reconstruct the text classifier using the annotated texts and classify them again. We assume that if we continue instigating this loop, the accuracy of the classifier will improve, and we will obtain useful information without having to specify keywords. Experimental results demonstrated that our proposed system is effective for filtering social media streams. Moreover, we confirmed that the pessimistic approach is better than the optimistic approach.
引用
收藏
页码:167 / 175
页数:9
相关论文
共 50 条
  • [1] A survey of human-in-the-loop for machine learning
    Wu, Xingjiao
    Xiao, Luwei
    Sun, Yixuan
    Zhang, Junhang
    Ma, Tianlong
    He, Liang
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 135 : 364 - 381
  • [2] Human-in-the-loop Applied Machine Learning
    Brodley, Carla E.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1 - 1
  • [3] Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach
    Mosqueira-Rey, Eduardo
    Hernandez-Pereira, Elena
    Bobes-Bascaran, Jose
    Alonso-Rios, David
    Perez-Sanchez, Alberto
    Fernandez-Leal, Angel
    Moret-Bonillo, Vicente
    Vidal-Insua, Yolanda
    Vazquez-Rivera, Francisca
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (05): : 2597 - 2616
  • [4] HELIX: Accelerating Human-in-the-loop Machine Learning
    Xin, Doris
    Ma, Litian
    Liu, Jialin
    Macke, Stephen
    Song, Shuchen
    Parameswaran, Aditya
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (12): : 1958 - 1961
  • [5] Human-in-the-loop machine learning: a state of the art
    Mosqueira-Rey, Eduardo
    Hernandez-Pereira, Elena
    Alonso-Rios, David
    Bobes-Bascaran, Jose
    Fernandez-Leal, Angel
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (04) : 3005 - 3054
  • [6] Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach
    Eduardo Mosqueira-Rey
    Elena Hernández-Pereira
    José Bobes-Bascarán
    David Alonso-Ríos
    Alberto Pérez-Sánchez
    Ángel Fernández-Leal
    Vicente Moret-Bonillo
    Yolanda Vidal-Ínsua
    Francisca Vázquez-Rivera
    [J]. Neural Computing and Applications, 2024, 36 : 2597 - 2616
  • [7] Human-in-the-loop machine learning: a state of the art
    Eduardo Mosqueira-Rey
    Elena Hernández-Pereira
    David Alonso-Ríos
    José Bobes-Bascarán
    Ángel Fernández-Leal
    [J]. Artificial Intelligence Review, 2023, 56 : 3005 - 3054
  • [8] Using Segmentation to Improve Machine Learning Performance in Human-in-the-Loop Systems
    Carneiro, Davide
    Carvalho, Mariana
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2023, 543 : 413 - 428
  • [9] Human-in-the-Loop Machine Learning for the Treatment of Pancreatic Cancer
    Mosqueira-Rey, Eduardo
    Perez-Sanchez, Alberto
    Hernandez-Pereira, Elena
    Alonso-Rios, David
    Bobes-Bascaran, Jose
    Fernandez-Leal, Angel
    Moret-Bonillo, Vicente
    Vidal-Insua, Yolanda
    Vazquez-Rivera, Francisca
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [10] Human-in-the-loop machine learning with applications for population health
    Long Chen
    Jiangtao Wang
    Bin Guo
    Liming Chen
    [J]. CCF Transactions on Pervasive Computing and Interaction, 2023, 5 : 1 - 12