TamilATIS: Dataset for Task-Oriented Dialog in Tamil

被引:0
|
作者
Ramaneswaran, S. [1 ]
Vijay, Sanchit [1 ]
Srinivasan, Kathiravan [1 ]
机构
[1] Vellore Inst Technol, Vellore, Tamil Nadu, India
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Task-Oriented Dialogue (TOD) systems allow users to accomplish tasks by giving directions to the system using natural language utterances. With the widespread adoption of conversational agents and chat platforms, TOD has become mainstream in NLP research today. However, developing TOD systems require massive amounts of data, and there has been limited work done for TOD in low-resource languages like Tamil. Towards this objective, we introduce TamilATIS - a TOD dataset for Tamil which contains 4874 utterances. We present a detailed account of the entire data collection and data annotation process. We train state-of-the-art NLU models and report their performances. The Joint BERT model with XLMRoberta as utterance encoder achieved the highest score with an intent accuracy of 96.26% and slot F1 of 94.01%.
引用
收藏
页码:25 / 32
页数:8
相关论文
共 50 条
  • [41] Semantic Parsing in Task-Oriented Dialog with Recursive Insertion-Based Encoder
    Mansimov, Elman
    Zhang, Yi
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11067 - 11075
  • [42] PRAL: A Tailored Pre-Training Model for Task-Oriented Dialog Generation
    Gu, Jing
    Wu, Qingyang
    Wu, Chongruo
    Shi, Weiyan
    Yu, Zhou
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 305 - 313
  • [43] An adaptable task-oriented dialog system for stand-alone embedded devices
    Long Duong
    Vu Cong Duy Hoang
    Tuyen Quang Pham
    Hong, Yu-Heng
    Dovgalecs, Vladislavs
    Bashkansky, Guy
    Black, Jason
    Bleeker, Andrew
    Le Huitouze, Serge
    Johnson, Mark
    [J]. PROCEEDINGS OF THE 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, (ACL 2019), 2019, : 49 - 57
  • [44] An End-to-End Neural Dialog State Tracking for Task-Oriented Dialogs
    Kim, A-Yeong
    Kim, Tae-Hyeong
    Song, Hyun-Je
    Park, Seong-Bae
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [45] RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems
    Peng, Baolin
    Li, Chunyuan
    Zhang, Zhu
    Zhu, Chenguang
    Li, Jinchao
    Gao, Jianfeng
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4418 - 4429
  • [46] Modeling the impact of out-of-schema questions in task-oriented dialog systems
    Meem, Jannat Ara
    Rashid, Muhammad Shihab
    Hristidis, Vagelis
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 2466 - 2494
  • [47] An Efficient Framework for Development of Task-Oriented Dialog Systems in a Smart Home Environment
    Park, Youngmin
    Kang, Sangwoo
    Seo, Jungyun
    [J]. SENSORS, 2018, 18 (05)
  • [48] Multiuser, multimodal sensemaking cognitive immersive environment with a task-oriented dialog system
    Briggs, Shannon
    Chabot, Sam
    Sanders, Abraham
    Peveler, Matthew
    Strzalkowski, Tomek
    Braasch, Jonas
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR HOMELAND SECURITY (HST), 2022,
  • [49] Building a Task-oriented Dialog System for languages with no training data: the Case for Basque
    Lopez de Lacalle, Maddalen
    Saralegi, Xabier
    San Vicente, Inaki
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2796 - 2802
  • [50] TASK-ORIENTED ARCHITECTURES
    BISIANI, R
    MAUERSBERG, H
    REDDY, R
    [J]. PROCEEDINGS OF THE IEEE, 1983, 71 (07) : 885 - 898