Entity-level stream classification: exploiting entity similarity to label the future observations referring to an entity

被引:16
|
作者
Unnikrishnan, Vishnu [1 ]
Beyer, Christian [1 ]
Matuszyk, Pawel [1 ]
Niemann, Uli [1 ]
Pryss, Ruediger [2 ]
Schlee, Winfried [3 ]
Ntoutsi, Eirini [4 ]
Spiliopoulou, Myra [1 ]
机构
[1] Otto von Guericke Univ, Magdeburg, Germany
[2] Univ Ulm, Ulm, Germany
[3] Univ Hosp Regensburg, Regensburg, Germany
[4] Leibniz Univ Hannover, Hannover, Germany
关键词
Stream classification; kNN; Entity similarity; TIME-SERIES; MODEL;
D O I
10.1007/s41060-019-00177-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stream classification algorithms traditionally treat arriving instances as independent. However, in many applications, the arriving examples may depend on the "entity" that generated them, e.g. in product reviews or in the interactions of users with an application server. In this study, we investigate the potential of this dependency by partitioning the original stream of instances/"observations" into entity-centric substreams and by incorporating entity-specific information into the learning model. We propose a k-nearest-neighbour-inspired stream classification approach, in which the label of an arriving observation is predicted by exploiting knowledge on the observations belonging to this entity and to entities similar to it. For the computation of entity similarity, we consider knowledge about the observations and knowledge about the entity, potentially from a domain/feature space different from that in which predictions are made. To distinguish between cases where this knowledge transfer is beneficial for stream classification and cases where the knowledge on the entities does not contribute to classifying the observations, we also propose a heuristic approach based on random sampling of substreams using k Random Entities (kRE). Our learning scenario is not fully supervised: after acquiring labels for the initial m observations of each entity, we assume that no additional labels arrive and attempt to predict the labels of near-future and far-future observations from that initial seed. We report on our findings from three datasets.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] Entity-Level Stream Classification: Exploiting Entity Similarity to Label the Future Observations Referring to an Entity
    Unnikrishnan, Vishnu
    Beyer, Christian
    Matuszyk, Pawel
    Niemann, Uli
    Pryss, Ruediger
    Schlee, Winfried
    Ntoutsi, Eirini
    Spiliopoulou, Myra
    [J]. 2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 246 - 255
  • [2] Entity-level stream classification: exploiting entity similarity to label the future observations referring to an entity
    Vishnu Unnikrishnan
    Christian Beyer
    Pawel Matuszyk
    Uli Niemann
    Rüdiger Pryss
    Winfried Schlee
    Eirini Ntoutsi
    Myra Spiliopoulou
    [J]. International Journal of Data Science and Analytics, 2020, 9 : 1 - 15
  • [3] Entity-Sensitive Attention and Fusion Network for Entity-Level Multimodal Sentiment Classification
    Yu, Jianfei
    Jiang, Jing
    Xia, Rui
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 429 - 439
  • [4] Exploiting Entity Information for Stream Classification over a Stream of Reviews
    Beyer, Christian
    Unnikrishnan, Vishnu
    Niemann, Uli
    Matuszyk, Pawel
    Ntoutsi, Eirini
    Spiliopoulou, Myra
    [J]. SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 564 - 573
  • [5] Semantic Fingerprinting: A Novel Method for Entity-Level Content Classification
    Govind
    Alec, Celine
    Spaniol, Marc
    [J]. WEB ENGINEERING, ICWE 2018, 2018, 10845 : 277 - 285
  • [6] ECO: Entity-level Captioning in Context
    Cho, Hyunsouk
    Hwang, Seung-won
    [J]. PROCEEDINGS OF THE 2016 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING ASONAM 2016, 2016, : 750 - 751
  • [7] Entity-level simulation of urban operations
    Nash, DA
    Pratt, DR
    Kendall, TM
    [J]. Proceedings of the HPCMP, Users Group Conference 2005, 2005, : 428 - 432
  • [8] Entity-Level Sentiment Analysis of Issue Comments
    Ding, Jin
    Sun, Hailong
    Wang, Xu
    Liu, Xudong
    [J]. 2018 IEEE/ACM 3RD INTERNATIONAL WORKSHOP ON EMOTION AWARENESS IN SOFTWARE ENGINEERING (SEMOTION), 2018, : 7 - 13
  • [9] Named entity recognition of agricultural based entity-level masking BERT and BiLSTM-CRF
    Wei, Zijun
    Song, Ling
    Hu, Xiaochun
    Chen, Ningjiang
    [J]. Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (15): : 195 - 203
  • [10] Entity-level Factual Consistency of Abstractive Text Summarization
    Nan, Feng
    Nallapati, Ramesh
    Wang, Zhiguo
    dos Santos, Cicero Nogueira
    Zhu, Henghui
    Zhang, Dejiao
    McKeown, Kathleen
    Xiang, Bing
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2727 - 2733