Supervised Word Mover's Distance

被引:0
|
作者
Huang, Gao [1 ]
Guo, Chuan [1 ]
Kusner, Matt J. [2 ]
Sun, Yu [1 ]
Weinberger, Kilian Q. [1 ]
Sha, Fei [3 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Univ Warwick, Alan Turing Inst, Coventry, W Midlands, England
[3] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, a new document metric called the word mover's distance (WMD) has been proposed with unprecedented results on k NN-based document classification. The WMD elevates high-quality word embeddings to a document metric by formulating the distance between two documents as an optimal transport problem between the embedded words. However, the document distances are entirely unsupervised and lack a mechanism to incorporate supervision when available. In this paper we propose an efficient technique to learn a supervised metric, which we call the Supervised-WMD (S-WMD) metric. The supervised training minimizes the stochastic leave-one-out nearest neighbor classification error on a per-document level by updating an affine transformation of the underlying word embedding space and a word-imporance weight vector. As the gradient of the original WMD distance would result in an inefficient nested optimization problem, we provide an arbitrarily close approximation that results in a practical and efficient update rule. We evaluate S-WMD on eight real-world text classification tasks on which it consistently outperforms almost all of our 26 competitive baselines.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Word Mover's Distance for Affect Detection
    Alshahrani, Mohammed
    Samothrakis, Spyridon
    Fasli, Maria
    [J]. 2017 INTERNATIONAL CONFERENCE ON THE FRONTIERS AND ADVANCES IN DATA SCIENCE (FADS), 2017, : 22 - 27
  • [2] Binary Encoded Word Mover's Distance
    Johnson, Christian
    [J]. PROCEEDINGS OF THE 7TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP, 2022, : 167 - 172
  • [3] Boosting Tricks for Word Mover's Distance
    Skianis, Konstantinos
    Malliaros, Fragkiskos D.
    Tziortziotis, Nikolaos
    Vazirgiannis, Michalis
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 761 - 772
  • [4] Re-evaluating Word Mover's Distance
    Sato, Ryoma
    Yamada, Makoto
    Kashima, Hisashi
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19231 - 19249
  • [5] Supervised Mover's Distance: A Simple Model for Sentence Comparison
    Srivastava, Muktabh Mayank
    [J]. ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE (AINL 2018), 2018, 930 : 61 - 66
  • [6] Word Mover's Distance for Agglomerative Short Text Clustering
    Franciscus, Nigel
    Ren, Xuguang
    Wang, Junhu
    Stantic, Bela
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT I, 2019, 11431 : 128 - 139
  • [7] Topic Modeling on User Stories using Word Mover's Distance
    Guelle, Kim Julian
    Ford, Nicholas
    Ebel, Patrick
    Brokhausen, Florian
    Vogelsang, Andreas
    [J]. 2020 IEEE SEVENTH INTERNATIONAL WORKSHOP ON ARTIFICIAL INTELLIGENCE FOR REQUIREMENTS ENGINEERING (AIRE 2020), 2020, : 52 - 60
  • [8] Improve Word Mover's Distance with Part-of-Speech Tagging
    Chen, Xiaojun
    Bai, Li
    Wang, Dakui
    Shi, Jinqiao
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3722 - 3728
  • [9] Network Log Analysis based on the Topic Word Mover's Distance
    Chen, Renai
    Gao, Qing
    Ji, Weiliang
    Long, Fei
    Ling, Qiang
    [J]. PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 4082 - 4086
  • [10] Supervised Earth Mover's Distance Learning and Its Computer Vision Applications
    Wang, Fan
    Guibas, Leonidas J.
    [J]. COMPUTER VISION - ECCV 2012, PT I, 2012, 7572 : 442 - 455