Nonextensive Information Theoretic Kernels on Measures

Authors
Martins, Andre F. T. [1 ,2 ]
Smith, Noah A. [1 ]
Xing, Eric P. [1 ]
Aguiar, Pedro M. Q. [3 ]
Figueiredo, Mario A. T. [2 ]
Affiliations
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Inst Super Tecn, Inst Telecomunicacoes, Lisbon, Portugal
[3] Inst Super Tecn, Inst Sistemas & Robot, Lisbon, Portugal
Keywords
positive definite kernels; nonextensive information theory; Tsallis entropy; Jensen-Shannon divergence; string kernels; DIVERGENCE MEASURE; CLASSIFICATION; INEQUALITIES;
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Positive definite kernels on probability measures have recently been applied to classification problems involving text, images, and other types of structured data. Some of these kernels are related to classic information theoretic quantities, such as (Shannon's) mutual information and the Jensen-Shannon (JS) divergence. Meanwhile, there have been recent advances in nonextensive generalizations of Shannon's information theory. This paper bridges these two trends by introducing nonextensive information theoretic kernels on probability measures, based on new JS-type divergences. These new divergences result from extending the two building blocks of the classical JS divergence: convexity and Shannon's entropy. The notion of convexity is extended to the wider concept of q-convexity, for which we prove a Jensen q-inequality. Based on this inequality, we introduce Jensen-Tsallis (JT) q-differences, a nonextensive generalization of the JS divergence, and define a k-th order JT q-difference between stochastic processes. We then define a new family of nonextensive mutual information kernels, which allow weights to be assigned to their arguments, and which includes the Boolean, JS, and linear kernels as particular cases. Nonextensive string kernels are also defined that generalize the p-spectrum kernel. We illustrate the performance of these kernels on text categorization tasks, in which documents are modeled both as bags of words and as sequences of characters.
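The abstract names the Tsallis entropy and the Jensen-Tsallis q-difference without giving formulas. The sketch below uses the commonly stated forms, S_q(p) = (1 - sum_i p_i^q) / (q - 1) and T_q(p_1, ..., p_m) = S_q(sum_j w_j p_j) - sum_j w_j^q S_q(p_j); both formulas are assumptions not taken from this record, and the function names are hypothetical. It covers only finite probability distributions, not the kernels defined in the paper.

```python
import math


def tsallis_entropy(p, q):
    """Tsallis entropy of a finite distribution p (assumed formula).

    At q = 1 this falls back to the Shannon entropy in nats.
    """
    if abs(q - 1.0) < 1e-12:
        return -sum(x * math.log(x) for x in p if x > 0)
    return (1.0 - sum(x ** q for x in p if x > 0)) / (q - 1.0)


def jensen_tsallis_q_difference(dists, weights, q):
    """Entropy of the weighted mixture minus the w**q-weighted entropies.

    dists   -- list of distributions over the same finite alphabet
    weights -- nonnegative mixture weights summing to 1
    """
    n = len(dists[0])
    mixture = [sum(w * d[i] for w, d in zip(weights, dists)) for i in range(n)]
    weighted = sum((w ** q) * tsallis_entropy(d, q) for w, d in zip(weights, dists))
    return tsallis_entropy(mixture, q) - weighted


if __name__ == "__main__":
    p1, p2 = [1.0, 0.0], [0.0, 1.0]
    # With q = 1 the value is the Jensen-Shannon divergence of p1 and p2.
    print(jensen_tsallis_q_difference([p1, p2], [0.5, 0.5], 1.0))
    print(jensen_tsallis_q_difference([p1, p2], [0.5, 0.5], 2.0))
```

Under these assumed definitions, setting q = 1 recovers the JS divergence, consistent with the abstract's description of the JT q-difference as a nonextensive generalization of it.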
Pages: 935-975
Page count: 41
Related papers
50 in total
  • [1] Nonextensive information theoretic kernels on measures
    Martins, André F.T.
    Smith, Noah A.
    Xing, Eric P.
    Aguiar, Pedro M.Q.
    Figueiredo, Mário A.T.
    Journal of Machine Learning Research, 2009, 10 : 935 - 975
  • [2] Information theoretic learning with adaptive kernels
    Singh, Abhishek
    Principe, Jose C.
    SIGNAL PROCESSING, 2011, 91 (02) : 203 - 213
  • [3] Nonextensive information-theoretic measure for image edge detection
    Ben Hamza, A.
    JOURNAL OF ELECTRONIC IMAGING, 2006, 15 (01)
  • [4] Information Theoretic Measures and Their Applications
    Rosso, Osvaldo A.
    Montani, Fernando
    ENTROPY, 2020, 22 (12)
  • [5] On Measures of Information Theoretic Security
    Liu, Shuiyin
    Hong, Yi
    Viterbo, Emanuele
    2014 IEEE INFORMATION THEORY WORKSHOP (ITW), 2014, : 309 - 310
  • [6] Combining information theoretic kernels with generative embeddings for classification
    Bicego, Manuele
    Ulas, Aydin
    Castellani, Umberto
    Perina, Alessandro
    Murino, Vittorio
    Martins, Andre F. T.
    Aguiar, Pedro M. Q.
    Figueiredo, Mario A. T.
    NEUROCOMPUTING, 2013, 101 : 161 - 169
  • [7] Information theoretic measures for the maturity of ecosystems
    Wilhelm, T
    Brüggemann, R
    INTEGRATIVE SYSTEMS APPROACHES TO NATURAL AND SOCIAL DYNAMICS, 2001, : 263 - 273
  • [8] Information theoretic measures in Makarov potential
    Nath, Debraj
    Roy, Amlan K. K.
    EUROPEAN PHYSICAL JOURNAL PLUS, 2023, 138 (05)
  • [9] Information theoretic distance measures in phylogenomics
    Hanus, Pavol
    Dingel, Janis
    Zech, Juergen
    Hagenauer, Joachim
    Mueller, Jakob C.
    2007 INFORMATION THEORY AND APPLICATIONS WORKSHOP, 2007, : 421 - +
  • [10] Information theoretic measures for power analysis
    Marculescu, D
    Marculescu, R
    Pedram, M
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1996, 15 (06) : 599 - 610