On the combination of graph data for assessing thin-file borrowers' creditworthiness

被引:9
|
作者
Munoz-Cancino, Ricardo [1 ]
Bravo, Cristian [2 ]
Rios, Sebastian A. [1 ]
Grana, Manuel [3 ]
机构
[1] Univ Chile, Business Intelligence Res Ctr CEINE, Ind Engn Dept, Beauchef 851, Santiago 8370456, Chile
[2] Univ Western Ontario, Dept Stat & Actuarial Sci, 1151 Richmond St, London, ON N6A 5B7, Canada
[3] Univ Basque Country, Computat Intelligence Grp, San Sebastian 20018, Spain
基金
加拿大自然科学与工程研究理事会;
关键词
Credit scoring; Machine learning; Social network analysis; Network data; Graph neural networks; FEATURE-SELECTION; CREDIT; PREDICTION; MODEL;
D O I
10.1016/j.eswa.2022.118809
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Thin-file borrowers are customers for whom a creditworthiness assessment is uncertain due to their lack of credit history. To address missing credit information, many researchers have used borrowers' social interactions as an alternative data source. Exploiting social networking data has traditionally been achieved by hand-crafted feature engineering, but lately, graph neural networks have emerged as a promising alternative. Here we introduce an information-processing framework to improve credit scoring models by blending several methods of graph representation learning: feature engineering, graph embeddings, and graph neural networks. In this approach, we aggregate the methods' outputs to be fed to a gradient boosting classifier to produce a final creditworthiness score. We have validated this framework over a unique multi-source dataset that characterizes the relationships, interactions, and credit history for the entire population of a Latin American country, applying it to credit risk models, application, and behavior. It also allows us to study both individuals and companies. Our results show that the methods of graph representation learning should be used as complements; they should not be seen as self-sufficient methods, as it is currently done. We improve the creditworthiness assessment performance in terms of the measures of Area Under the ROC Curve (AUC) and Kolmogorov- Smirnov (KS), outperforming traditional methods of exploiting social interaction data. In the area of corporate lending, where the potential gain is much higher, our results confirm that the evaluation of a thin-file company cannot solely consider the company's own characteristics. The business ecosystem in which these companies interact with their owners, suppliers, customers, and other companies provides novel knowledge that enables financial institutions to enhance their creditworthiness assessment. Our results let us know when and on which population to use graph data and the expected effects on performance. They also show the enormous value of graph data on the credit scoring problem for thin-file borrowers, mainly to help companies with thin or no credit history to enter the financial system.
引用
收藏
页数:15
相关论文
共 34 条
  • [31] Preliminary safety data of the PRODIGE 81-FFCD 2101-TRIPLET-HCC trial assessing the triple combination atezolizumab-bevacizumab-ipilimumab in patients (pts) treated in systemic therapy for hepatocellular carcinoma (HCC)
    Merle, Philippe
    Peron, Jean-Marie
    Le Malicot, Karine
    Guarssifi, Meriem
    Aparicio, Thomas
    Bourgeois, Vincent
    Assenat, Eric
    Blanc, Jean-Frederic
    Bouattour, Mohamed
    Touchefeu, Yann
    Akouz, Faiza Khemissa
    Nguyen-Khac, Eric
    Bronowicki, Jean-Pierre
    Girot, Paul
    Lam, You Heng
    Bolliet, Marion
    Adhoute, Xavier
    Edeline, Julien
    Nault, Jean-Charles
    Phelip, Jean Marc
    JOURNAL OF CLINICAL ONCOLOGY, 2024, 42 (16)
  • [32] Diagnostic yields of a combination use of multi-guided devices in the basis of the obtained data from virtual bronchoscopic navigation LungPoint system using thin video-bronchoscope for small (≤30 mm) peripheral pulmonary lesions
    Misawa, Masafumi
    Takai, Motohisa
    Munakata, Yu
    Kobayashi, Haruki
    Nakashima, Kei
    Katsurada, Naoko
    Makino, Hideki
    Kaneko, Norihiro
    Aoshima, Masahiro
    EUROPEAN RESPIRATORY JOURNAL, 2013, 42
  • [33] Clinical data from the DeCidE1 trial: Assessing the first combination of DPX-Survivac, low dose cyclophosphamide (CPA), and epacadostat (INCB024360) in subjects with stage IIc-IV recurrent epithelial ovarian cancer
    Dorigo, Oliver
    Tanyi, Janos Laszlo
    Strauss, James
    Oza, Amit M.
    Pejovic, Tanja
    Ghatage, Prafull
    Villella, Jeannine A.
    Fiset, Stephan
    MacDonald, Lisa Diana
    Leopold, Lance
    Rosu, Gabriela Nicola
    JOURNAL OF CLINICAL ONCOLOGY, 2018, 36 (15)
  • [34] A COMBINATION USE OF MULTI-GUIDED DEVICES IN THE BASIS OF THE OBTAINED DATA FROM VIRTUAL BRONCHOSCOPIC NAVIGATION LUNGPOINT SYSTEM USING THIN VIDEO-BRONCHOSCOPE FOR DIAGNOSING SMALL (≤ 30 MM) PERIPHERAL PULMONARY LESIONS SUSPECTED LUNG CANCER
    Misawa, Masafumi
    Takai, Motohisa
    Munakata, Yu
    Kobayashi, Haruki
    Nakashima, Kei
    Katsurada, Naoko
    Makino, Hideki
    Kaneko, Norihiro
    Aoshima, Masahiro
    JOURNAL OF THORACIC ONCOLOGY, 2013, 8 : S949 - S949