A Case Study on Start-up of Dataset Construction: In Case of Recipe Named Entity Corpus

被引:0
|
作者
Yamakata, Yoko [1 ]
Tajima, Keishi [1 ]
Mori, Shinsuke [2 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
[2] Kyoto Univ, Acad Ctr Comp & Media Studies, Kyoto, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we report our experience in constructing a cooking recipe text corpus. We describe problems we found and explain how we managed them. One of the problems we faced in the construction of our recipe corpus is the difficulty of establishing a clear, stable, and complete guideline instructing annotators how to annotate. During the annotation, we found many unexpected cases for which the pre-defined guideline is not clear enough, and even cases for which the pre-defined guideline provides no guidance at all. As a result, we needed to update the guideline twice during the annotation, and also needed to revise annotations we have done before the updates. During that process, we have several trade-offs, and it is not easy to decide when and how often we should revise the annotations. It is even unclear whether we should revise them or should instead use the human resource for annotating more data. We show an experiment, whose result suggests that we should revise the old annotations. Another problem we had is the management of versions of the guideline, sets of annotations corresponding to them, and communication between participants.
引用
收藏
页码:3564 / 3567
页数:4
相关论文
共 50 条
  • [1] Statistical dataset evaluation: A case study on named entity recognition
    Wang, Chengwen
    Dong, Qingxiu
    Wang, Xiaochen
    Sui, Zhifang
    [J]. NATURAL LANGUAGE PROCESSING, 2024,
  • [2] Comparative Study on Start-up Business Incubator Construction Case Study on Incubators in Tianjin
    Zhe, Li
    [J]. PROCEEDINGS OF THE 2013 CONFERENCE ON EDUCATION TECHNOLOGY AND MANAGEMENT SCIENCE (ICETMS 2013), 2013, : 1056 - 1059
  • [3] Start-up's ecosystem: a case study on DevX
    Srivastava, Sachin Kumar
    Khosla, Rekha P.
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE AND LEARNING, 2023, 16 (01) : 97 - 110
  • [4] CONTRIBUTIONS OF SUSTAINABLE START-UP ECOSYSTEM TO DYNAMICS OF START-UP COMPANIES: THE CASE OF LITHUANIA
    Lauzikas, Mindaugas
    Tindale, Hailee
    Bilota, Augustinas
    Bielousovaite, Dovile
    [J]. ENTREPRENEURSHIP AND SUSTAINABILITY ISSUES, 2015, 3 (01): : 8 - 24
  • [5] Agile methodology selection criteria: IT start-up case study
    Micic, Lj
    [J]. INNOVATIVE IDEAS IN SCIENCE 2016, 2017, 200
  • [6] Pitching for Finance for a Business Start-Up: A Case Study of IviewCameras
    Molian, David
    [J]. INTERNATIONAL REVIEW OF ENTREPRENEURSHIP, 2007, 5 : 193 - 208
  • [7] Start-up of New Modern Facility (Case)
    Hietava-Lorenzi, Maija
    [J]. 41ST R3 -NORDIC SYMPOSIUM: CLEANROOM TECHNOLOGY, CONTAMINATION CONTROL AND CLEANING, 2010, 266 : 28 - 30
  • [8] Start-up Companies - the Case of Kosice Region
    Dzupka, Peter
    Klasova, Slavka
    Vajda, Viliam
    [J]. CENTRAL EUROPEAN CONFERENCE IN FINANCE AND ECONOMICS (CEFE2015), 2015, : 135 - 142
  • [9] Innovation development in biopharmaceutical start-up firms: An Italian case study
    Nosella, A.
    Petroni, G.
    Verbano, C.
    [J]. JOURNAL OF ENGINEERING AND TECHNOLOGY MANAGEMENT, 2006, 23 (03) : 202 - 220
  • [10] Management controls, heterarchy and innovation: a case study of a start-up company
    Taylor, David
    King, Robyn
    Smith, David
    [J]. ACCOUNTING AUDITING & ACCOUNTABILITY JOURNAL, 2019, 32 (06): : 1636 - 1661