Unsupervised Open-domain Keyphrase Generation

Cited by: 0
Authors
Do, Lam Thanh [1,3]
Akash, Pritom Saha [2]
Chang, Kevin Chen-Chuan [2,3]
Affiliations
[1] Hanoi Univ Sci & Technol, Hanoi, Vietnam
[2] Univ Illinois, Champaign, IL USA
[3] Cazoodle Inc, Champaign, IL USA
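Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract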
In this work, we study the problem of unsupervised open-domain keyphrase generation, where the objective is a keyphrase generation model that can be built without human-labeled data and that performs consistently across domains. To solve this problem, we propose a seq2seq model that consists of two modules, a phraseness module and an informativeness module, both of which can be built in an unsupervised and open-domain fashion. The phraseness module generates phrases, while the informativeness module guides the generation towards those that represent the core concepts of the text. We thoroughly evaluate our proposed method using eight benchmark datasets from different domains. Results on in-domain datasets show that our approach achieves state-of-the-art results compared with existing unsupervised models, and overall narrows the gap between supervised and unsupervised methods down to about 16%. Furthermore, we demonstrate that our model performs consistently across domains, as it overall surpasses the baselines on out-of-domain datasets.
Pages: 10614-10627
Number of pages: 14