The changing landscape of text mining: a review of approaches for ecology and evolution

被引:1
|
作者
Farrell, Maxwell J. [1 ,2 ,3 ]
Le Guillarme, Nicolas [4 ]
Brierley, Liam [3 ,5 ]
Hunter, Bronwen [6 ]
Scheepens, Daan [7 ]
Willoughby, Anna [8 ]
Yates, Andrew [9 ]
Mideo, Nicole [1 ]
机构
[1] Univ Toronto, Dept Ecol & Evolutionary Biol, Toronto, ON, Canada
[2] Univ Glasgow, Sch Biodivers One Hlth & Vet Med, Glasgow, Scotland
[3] Univ Glasgow, Ctr Virus Res, MRC, Glasgow, Scotland
[4] Univ Grenoble Alpes, CNRS, LECA, Lab Ecol Alpine, Grenoble, France
[5] Univ Liverpool, Dept Hlth Data Sci, Liverpool, England
[6] Univ Sussex, Sch Life Sci, Brighton, England
[7] UCL, Div Biosci, London, England
[8] Univ Georgia, Odum Sch Ecol, Athens, GA USA
[9] Univ Amsterdam, Informat Inst, Amsterdam, Netherlands
基金
加拿大自然科学与工程研究理事会;
关键词
Natural Language Processing; large language models; deep learning; literature synthesis; Information Extraction; database construction; ONTOLOGIES; KNOWLEDGE;
D O I
10.1098/rspb.2024.0423
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In ecology and evolutionary biology, the synthesis and modelling of data from published literature are commonly used to generate insights and test theories across systems. However, the tasks of searching, screening, and extracting data from literature are often arduous. Researchers may manually process hundreds to thousands of articles for systematic reviews, meta-analyses, and compiling synthetic datasets. As relevant articles expand to tens or hundreds of thousands, computer-based approaches can increase the efficiency, transparency and reproducibility of literature-based research. Methods available for text mining are rapidly changing owing to developments in machine learning-based language models. We review the growing landscape of approaches, mapping them onto three broad paradigms (frequency-based approaches, traditional Natural Language Processing and deep learning-based language models). This serves as an entry point to learn foundational and cutting-edge concepts, vocabularies, and methods to foster integration of these tools into ecological and evolutionary research. We cover approaches for modelling ecological texts, generating training data, developing custom models and interacting with large language models and discuss challenges and possible solutions to implementing these methods in ecology and evolution.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Past and future uses of text mining in ecology and evolution
    Farrell, Maxwell J.
    Brierley, Liam
    Willoughby, Anna
    Yates, Andrew
    Mideo, Nicole
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2022, 289 (1975)
  • [2] Behavioural approaches to landscape ecology
    Sutherland, WJ
    [J]. AVIAN LANDSCAPE ECOLOGY: PURE AND APPLIED ISSUES IN THE LARGE-SCALE ECOLOGY OF BIRDS, 2002, : 112 - 117
  • [3] Ecology of Phragmites populations in the changing landscape
    Cizková, H
    Brix, H
    Herben, T
    [J]. FOLIA GEOBOTANICA, 2000, 35 (04) : 351 - 351
  • [4] Ecology ofPhragmites populations in the changing landscape
    Hana Čížková
    Hans Brix
    Tomáš Herben
    [J]. Folia Geobotanica, 2000, 35 : 351 - 351
  • [5] Ecology -: Giant pandas in a changing landscape
    Loucks, CJ
    Lü, Z
    Dinerstein, E
    Wang, H
    Olson, DM
    Zhu, CQ
    Wang, DJ
    [J]. SCIENCE, 2001, 294 (5546) : 1465 - 1465
  • [6] A Review on Text Mining
    Zhang, Yu
    Chen, Mengdong
    Liu, Lianzhong
    [J]. PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE, 2015, : 681 - 685
  • [7] Editorial: Landscape ecology in a changing globalized environment
    Francesco di Castri
    [J]. Landscape Ecology, 1997, 12 : 3 - 5
  • [8] Editorial: Landscape ecology in a changing globalized environment
    diCastri, F
    [J]. LANDSCAPE ECOLOGY, 1997, 12 (01) : 3 - 5
  • [9] Molecular Approaches to Ecology and Evolution
    Jeremy B Searle
    [J]. Heredity, 1999, 82 (5) : 585 - 585
  • [10] Selection criteria for text mining approaches
    Hashimi, Hussein
    Hafez, Alaaeldin
    Mathkour, Hassan
    [J]. COMPUTERS IN HUMAN BEHAVIOR, 2015, 51 : 729 - 733