Unsupervised Deep Structured Semantic Models for Commonsense Reasoning

被引:0
|
作者
Wang, Shuohang [1 ]
Zhang, Sheng [2 ]
Shen, Yelong [4 ]
Liu, Xiaodong [3 ]
Liu, Jingjing [3 ]
Gao, Jianfeng [3 ]
Jiang, Jing [1 ]
机构
[1] Singapore Management Univ, Singapore, Singapore
[2] Johns Hopkins Univ, Baltimore, MD 21218 USA
[3] Microsoft, Redmond, WA USA
[4] Tencent AI Lab, Bellevue, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Commonsense reasoning is fundamental to natural language understanding. While traditional methods rely heavily on human-crafted features and knowledge bases, we explore learning commonsense knowledge from a large amount of raw text via unsupervised learning. We propose two neural network models based on the Deep Structured Semantic Models (DSSM) framework to tackle two classic commonsense reasoning tasks, Winograd Schema challenges (WSC) and Pronoun Disambiguation (PDP). Evaluation shows that the proposed models effectively capture contextual information in the sentence and co-reference information between pronouns and nouns, and achieve significant improvement over previous state-of-the-art approaches.
引用
收藏
页码:882 / 891
页数:10
相关论文
共 50 条
  • [1] PrefaceSpecial issue on commonsense reasoning for the semantic web
    Frank van Harmelen
    Andreas Herzig
    Pascal Hitzler
    Guilin Qi
    [J]. Annals of Mathematics and Artificial Intelligence, 2010, 58 : 1 - 2
  • [2] Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning
    Branco, Ruben
    Branco, Antonio
    Silva, Joao
    Rodrigues, Joao
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1504 - 1521
  • [3] Special issue on commonsense reasoning for the semantic web Preface
    van Harmelen, Frank
    Herzig, Andreas
    Hitzler, Pascal
    Qi, Guilin
    [J]. ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2010, 58 (1-2) : 1 - 2
  • [4] Exploring Context with Deep Structured Models for Semantic Segmentation
    Lin, Guosheng
    Shen, Chunhua
    van den Hengel, Anton
    Reid, Ian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (06) : 1352 - 1366
  • [5] A Semantic-based Method for Unsupervised Commonsense Question Answering
    Niu, Yilin
    Huang, Fei
    Liang, Jiaming
    Chen, Wenkai
    Zhu, Xiaoyan
    Huang, Minlie
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3037 - 3049
  • [6] Psycholinguistic Diagnosis of Language Models' Commonsense Reasoning
    Cong, Yan
    [J]. PROCEEDINGS OF THE FIRST WORKSHOP ON COMMONSENSE REPRESENTATION AND REASONING (CSRR 2022), 2022, : 17 - 22
  • [7] EXPLAGRAPHS: An Explanation Graph Generation Task for Structured Commonsense Reasoning
    Saha, Swarnadeep
    Yadav, Prateek
    Bauer, Lisa
    Bansal, Mohit
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7716 - 7740
  • [8] Learning Commonsense Knowledge Models for Semantic Analytics
    Hu Shangfeng
    Kanagasabai, Rajaraman
    [J]. 2016 IEEE TENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2016, : 399 - 402
  • [9] Commonsense Reasoning to Guide Deep Learning for Scene Understanding
    Sridharan, Mohan
    Mota, Tiago
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4760 - 4764
  • [10] Unsupervised semantic deep hashing
    Jin, Sheng
    Yao, Hongxun
    Sun, Xiaoshuai
    Zhou, Shangchen
    [J]. NEUROCOMPUTING, 2019, 351 (19-25) : 19 - 25