Unsupervised Open-domain Keyphrase Generation

被引：0

作者：

Lam Thanh Do ^{[1
,3
]}

Akash, Pritom Saha ^{[2
]}

Chang, Kevin Chen-Chuan ^{[2
,3
]}

机构：

[1] Hanoi Univ Sci & Technol, Hanoi, Vietnam

[2] Univ Illinois, Champaign, IL USA

[3] Cazoodle Inc, Champaign, IL USA

来源：

PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we study the problem of unsupervised open-domain keyphrase generation, where the objective is a keyphrase generation model that can be built without using human-labeled data and can perform consistently across domains. To solve this problem, we propose a seq2seq model that consists of two modules, namely phraseness and informativeness module, both of which can be built in an unsupervised and open-domain fashion. The phraseness module generates phrases, while the informativeness module guides the generation towards those that represent the core concepts of the text. We thoroughly evaluate our proposed method using eight benchmark datasets from different domains. Results on in-domain datasets show that our approach achieves stateof-the-art results compared with existing unsupervised models, and overall narrows the gap between supervised and unsupervised methods down to about 16%. Furthermore, we demonstrate that our model performs consistently across domains, as it overall surpasses the baselines on out-of-domain datasets.

引用

页码：10614 / 10627

页数：14

共 50 条

[1] Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction
Wang, Yansen
Fan, Zhen
Rose, Carolyn P.
[J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1790 - 1800
[2] Unsupervised Deep Keyphrase Generation
Shen, Xianjie
Wang, Yinghan
Meng, Rui
Shang, Jingbo
[J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11303 - 11311
[3] Adversarial Evaluation for Open-Domain Dialogue Generation
Bruni, Elia
Fernandez, Raquel
[J]. 18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), 2017, : 284 - 288
[4] RUBER: An Unsupervised Method for Automatic Evaluation of Open-Domain Dialog Systems
Tao, Chongyang
Mou, Lili
Zhao, Dongyan
Yan, Rui
[J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 722 - 729
[5] DART: Open-Domain Structured Data Record to Text Generation
Nan, Linyong
Radev, Dragomir
Zhang, Rui
Rau, Amrit
Sivaprasad, Abhinand
Hsieh, Chiachun
Tang, Xiangru
Vyas, Aadit
Verma, Neha
Krishna, Pranav
Liu, Yangxiaokang
Irwanto, Nadia
Pan, Jessica
Rahman, Faiaz
Zaidi, Ahmad
Mutuma, Mutethia
Tarabar, Yasin
Gupta, Ankit
Yu, Tao
Tan, Yi Chern
Lin, Xi Victoria
Xiong, Caiming
Socher, Richard
Rajani, Nazneen Fatema
[J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 432 - 447
[6] Promoting convergence and efficacy of open-domain question answering via unsupervised clustering
Liu, Shuoyan
Han, Qiuchi
[J]. ELECTRONICS LETTERS, 2024, 60 (16)
[7] Execution-Based Evaluation for Open-Domain Code Generation
Wang, Zhiruo
Zhou, Shuyan
Fried, Daniel
Neubig, Graham
[J]. arXiv, 2022,
[8] Open-domain clarification question generation without question examples
White, Julia
Poesia, Gabriel
Hawkins, Robert
Sadigh, Dorsa
Goodman, Noah
[J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 563 - 570
[9] A Randomized Link Transformer for Diverse Open-Domain Dialogue Generation
Lee, Jing Yang
Lee, Kong Aik
Gan, Woon Seng
[J]. PROCEEDINGS OF THE 4TH WORKSHOP ON NLP FOR CONVERSATIONAL AI, 2022, : 1 - 11
[10] Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation
Pang, Bo
Nijkamp, Erik
Han, Wenjuan
Zhou, Linqi
Liu, Yixian
Tu, Kewei
[J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3619 - 3629

← 1 2 3 4 5 →