A data decomposition-based hierarchical classification method for multi-label classification of contractual obligations for the purpose of their governance

被引:0
|
作者
Singh, Amrita [1 ]
Rose Anish, Preethu [1 ]
Verma, Aparna [2 ]
Venkatesan, Sivanthy [3 ]
V, V. [4 ]
Ghaisas, Smita [1 ]
机构
[1] Tata Consultancy Serv TCS Res, Data & Decis Sci DDS Dept, Pune, Maharashtra, India
[2] Tata Consultancy Serv TCS, Delivery Excellence Grp DEG, Bangalore, Karnataka, India
[3] Tata Consultancy Serv TCS, Delivery Excellence Grp DEG, Chennai, Tamil Nadu, India
[4] Tata Consultancy Serv TCS, Ultimatix Dept, Chennai, Tamil Nadu, India
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Contract governance; Automated extraction; Multi-label classification; Fine-grained governance model;
D O I
10.1038/s41598-024-63648-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Contract governance ensures that the agreed outcomes between customers and vendors are fulfilled. Information Technology (IT) outsourcing organizations enter thousands of contractual relationships each month leading to a high volume of business-critical contractual text that must be reviewed and deciphered for effective governance. The key to effective governance of contracts is a model that facilitates assigning ownership of the obligations to the right departments in an organization and allocating their accountability to the right stakeholders. For this, the contractual obligations need to be identified and classified so that details such as actions to be taken by departments in an organization and their ownership as per a given clause are brought out for the purpose of governance. In this paper, we present our work on automated extraction and classification of obligations present in Software Engineering (SE) contracts for the purpose of contracts governance. We propose a novel data decomposition-based hierarchical classification method for a multi-label classification of contractual obligations. We conducted experiments for a fine-grained automated classification of more than 55,000 statements from 50 large real-life SE contract documents received from a large vendor organization into 152 governance-specific classes. The results indicate that the proposed method can bring about a 7-8% improvement in accuracies when compared to state-of-the-art classification baselines such as BERT, RoBERTa, and generative models such as GPT-2.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] An evolutionary decomposition-based multi-objective feature selection for multi-label classification
    Bidgoli, Azam Asilian
    Ebrahimpour-Komleh, Hossein
    Rahnamayan, Shahryar
    PEERJ COMPUTER SCIENCE, 2020, 2020 (03) : 1 - 32
  • [2] A deep neural network based hierarchical multi-label classification method
    Feng, Shou
    Zhao, Chunhui
    Fu, Ping
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2020, 91 (02):
  • [3] ReliefF for Hierarchical Multi-label Classification
    Slavkov, Ivica
    Karcheska, Jana
    Kocev, Dragi
    Kalajdziski, Slobodan
    Dzeroski, Saso
    NEW FRONTIERS IN MINING COMPLEX PATTERNS, NFMCP 2013, 2014, 8399 : 148 - 161
  • [4] Hierarchical Multi-Label Classification Networks
    Wehrmann, Jonatas
    Cerri, Ricardo
    Barros, Rodrigo C.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [5] The importance of the label hierarchy in hierarchical multi-label classification
    Jurica Levatić
    Dragi Kocev
    Sašo Džeroski
    Journal of Intelligent Information Systems, 2015, 45 : 247 - 271
  • [6] The importance of the label hierarchy in hierarchical multi-label classification
    Levatic, Jurica
    Kocev, Dragi
    Dzeroski, Saso
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2015, 45 (02) : 247 - 271
  • [7] Label Correction Strategy on Hierarchical Multi-Label Classification
    Ananpiriyakul, Thanawut
    Poomsirivilai, Piyapan
    Vateekul, Peerapon
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014, 2014, 8556 : 213 - 227
  • [8] Web Genre Classification via Hierarchical Multi-label Classification
    Madjarov, Gjorgji
    Vidulin, Vedrana
    Dimitrovski, Ivica
    Kocev, Dragi
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 : 9 - 17
  • [9] Active learning for hierarchical multi-label classification
    Nakano, Felipe Kenji
    Cerri, Ricardo
    Vens, Celine
    DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (05) : 1496 - 1530
  • [10] Feature Selection for Hierarchical Multi-label Classification
    da Silva, Luan V. M.
    Cerri, Ricardo
    ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 196 - 208