Optimizing non-decomposable measures with deep networks

被引:0
|
作者
Amartya Sanyal
Pawan Kumar
Purushottam Kar
Sanjay Chawla
Fabrizio Sebastiani
机构
[1] The University of Oxford,
[2] The Alan Turing Institute,undefined
[3] Indian Institute of Technology Kanpur,undefined
[4] Qatar Computing Research Institute,undefined
[5] Istituto di Scienza e Tecnologia dell’Informazione,undefined
来源
Machine Learning | 2018年 / 107卷
关键词
Optimization; Deep learning; F-measure; Task-specific training;
D O I
暂无
中图分类号
学科分类号
摘要
We present a class of algorithms capable of directly training deep neural networks with respect to popular families of task-specific performance measures for binary classification such as the F-measure, QMean and the Kullback–Leibler divergence that are structured and non-decomposable. Our goal is to address tasks such as label-imbalanced learning and quantification. Our techniques present a departure from standard deep learning techniques that typically use squared or cross-entropy loss functions (that are decomposable) to train neural networks. We demonstrate that directly training with task-specific loss functions yields faster and more stable convergence across problems and datasets. Our proposed algorithms and implementations offer several advantages including (i) the use of fewer training samples to achieve a desired level of convergence, (ii) a substantial reduction in training time, (iii) a seamless integration of our implementation into existing symbolic gradient frameworks, and (iv) assurance of convergence to first order stationary points. It is noteworthy that the algorithms achieve this, especially point (iv), despite being asked to optimize complex objective functions. We implement our techniques on a variety of deep architectures including multi-layer perceptrons and recurrent neural networks and show that on a variety of benchmark and real data sets, our algorithms outperform traditional approaches to training deep networks, as well as popular techniques used to handle label imbalance.
引用
收藏
页码:1597 / 1620
页数:23
相关论文
共 50 条
  • [41] Dual decomposable inequality measures
    Ebert, U
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 1999, 32 (01): : 234 - 246
  • [42] On Decomposable Measures Induced by Metrics
    Qiu, Dong
    Zhang, Weiquan
    JOURNAL OF APPLIED MATHEMATICS, 2012,
  • [43] A CLASS OF DECOMPOSABLE POVERTY MEASURES
    FOSTER, J
    GREER, J
    THORBECKE, E
    ECONOMETRICA, 1984, 52 (03) : 761 - 766
  • [44] Decomposable measures and nonlinear equations
    Pap, E
    FUZZY SETS AND SYSTEMS, 1997, 92 (02) : 205 - 221
  • [45] Decomposable Signed Fuzzy Measures
    Mihailovic, Bijana
    Pap, Endre
    NEW DIMENSIONS IN FUZZY LOGIC AND RELATED TECHNOLOGIES, VOL I, PROCEEDINGS, 2007, : 265 - +
  • [46] DECOMPOSABLE INCOME INEQUALITY MEASURES
    BOURGUIGNON, F
    ECONOMETRICA, 1979, 47 (04) : 901 - 920
  • [47] Decomposable measures and information measures for intuitionistic fuzzy sets
    Ban, AI
    Gal, SG
    FUZZY SETS AND SYSTEMS, 2001, 123 (01) : 103 - 117
  • [48] Visualizing Deep Networks by Optimizing with Integrated Gradients
    Qi, Zhongang
    Khorram, Saeed
    Li Fuxin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11890 - 11898
  • [49] DECOMPOSABLE MEASURES OF ECONOMIC-INSTABILITY
    BRODSKY, DA
    OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 1980, 42 (04) : 361 - 374
  • [50] SAMPLING VARIANCE AND DECOMPOSABLE INEQUALITY MEASURES
    COWELL, FA
    JOURNAL OF ECONOMETRICS, 1989, 42 (01) : 27 - 41