Habitat modeling with single- and multi-target trees and ensembles

被引:7
|
作者
Kocev, Dragi [1 ]
Dzeroski, Saso [1 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Ljubljana 1000, Slovenia
关键词
Habitat modeling; Predictive clustering trees; Ensemble models; Multi-target prediction; RIVER WATER-QUALITY; CHEMICAL-PARAMETERS; CLASSIFICATION; COMMUNITIES; PREDICTION;
D O I
10.1016/j.ecoinf.2013.06.003
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Habitat modeling studies the influence of abiotic factors on the abundance of a given taxonomic group of organisms. In this work, we investigate the effect of environmental conditions on communities of organisms in three different ecosystems. Namely, we consider the diatom community in Lake Prespa, Macedonia, the Collembola community in the soils of Denmark and 14 organisms living in Slovenian rivers. The data for these case studies consist of physical and chemical properties of the environment as well as the relative abundances or presence of the organisms under investigation. The multi-species data are analyzed by constructing habitat models for each species separately (single-target decision trees) or by constructing a single habitat model for all the species (multi-target predictive clustering trees). Typically, habitat models are constructed for each species individually and thus do not exploit the interactions between/among species. While approaches for building a single habitat model of a group of organisms exist, they typically construct models that are not readily interpretable and, thus,, are seldom used by the research community. In this work, we explore in detail the construction of interpretable models of both types. Furthermore, we construct ensembles of decision trees and ensembles of predictive clustering trees to increase the predictive performance of the models. The key outcomes of the interpretation and discussion of the obtained models for each case study are as follows. First, we show that multi-target predictive clustering trees are a very useful method for the analysis of multi-species data and that they are more efficient and produce more concise models than single-target decision trees. The obtained multi-target habitat models are readily interpretable and identify the environmental conditions that influence the composition and structure of a given community of organisms. Second, we conclude that the temperature and magnesium are the most important factors influencing the complete diatom community in Lake Prespa, while the nitrates and the temperature influence more the most abundant species. Third, the biological oxygen demand is the most influential factor for the abundance of river dwelling species, while the river community structure is mostly influenced by the NO2 concentration. Finally, the structure of the community of soil microarthropods is mostly influenced by the soil type and the crop history. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:79 / 92
页数:14
相关论文
共 50 条
  • [1] Using single- and multi-target regression trees and ensembles to model a compound index of vegetation condition
    Kocev, Dragi
    Dzeroski, Saso
    White, Matt D.
    Newell, Graeme R.
    Griffioen, Peter
    ECOLOGICAL MODELLING, 2009, 220 (08) : 1159 - 1168
  • [2] Ensembles of Extremely Randomized Trees for Multi-target Regression
    Kocev, Dragi
    Ceci, Michelangelo
    DISCOVERY SCIENCE, DS 2015, 2015, 9356 : 86 - 100
  • [3] Multi-Target Regression with Rule Ensembles
    Aho, Timo
    Zenko, Bernard
    Dzeroski, Saso
    Elomaa, Tapio
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 2367 - 2407
  • [4] Rule Ensembles for Multi-Target Regression
    Aho, Timo
    Zenko, Bernard
    Dzeroski, Saso
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 21 - +
  • [5] Epigenetic-based therapy: From single- to multi-target approaches
    Benedetti, Rosaria
    Conte, Mariarosaria
    Iside, Concetta
    Altucci, Lucia
    INTERNATIONAL JOURNAL OF BIOCHEMISTRY & CELL BIOLOGY, 2015, 69 : 121 - 131
  • [6] Using Single- and Multi-target Regression to Estimate Biophysical Parameters of Crops
    Sahin, Z. Meltem
    Erten, Esra
    Musaoglu, Nebiye
    Canbay, D. Ekin
    Kaya, Gulsen
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [7] Ensembles for multi-target regression with random output selections
    Martin Breskvar
    Dragi Kocev
    Sašo Džeroski
    Machine Learning, 2018, 107 : 1673 - 1709
  • [8] Ensembles for multi-target regression with random output selections
    Breskvar, Martin
    Kocev, Dragi
    Dzeroski, Saso
    MACHINE LEARNING, 2018, 107 (11) : 1673 - 1709
  • [9] Stepwise induction of multi-target model trees
    Appice, Annalisa
    Dzeroski, Saso
    MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 502 - +
  • [10] From single- to multi-target drugs in cancer therapy: When aspecificity becomes an advantage
    Petrelli, A.
    Giordano, S.
    CURRENT MEDICINAL CHEMISTRY, 2008, 15 (05) : 422 - 432