An architecture for language processing for scientific texts

被引:0
|
作者
Copestake, Ann [1 ]
Corbett, Peter [1 ]
Murray-Rust, Peter [1 ]
Rupp, C. J. [1 ]
Siddharthan, Advaith [1 ]
Teufel, Simone [1 ]
Waldron, Ben [1 ]
机构
[1] Univ Cambridge, Comp Lab, Cambridge CB2 1TN, England
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We describe the architecture for language processing adopted on the eScience project 'Extracting the Science from Scientific Publications' (nicknamed SciBorg). In this approach, papers from different sources are first processed to give a common XML format (SciXML). Language processing modules operate on the SciXML in an architecture that allows for (partially) parallel deep and shallow processing and for a flexible combination of domain-independent and domain-dependent techniques. Robust Minimal Recursion Semantics (RMRS) acts both as a language for representing the output of processing and as an integration language for combining different modules. Language processing produces RMRS markup represented as standoff annotation on the original SciXML. Information extraction (IE) of various types is defined as operating on RMRss. Rhetorical analysis of the texts also partially depends on IE-like patterns and supports novel methods of information access.
引用
收藏
页码:614 / 621
页数:8
相关论文
共 50 条
  • [1] NOVICE STRATEGIES FOR PROCESSING SCIENTIFIC TEXTS
    DEELUCAS, D
    LARKIN, JH
    [J]. DISCOURSE PROCESSES, 1986, 9 (03) : 329 - 354
  • [2] Requirements for a System Architecture for the Analysis of Scientific Texts
    Baumgart, Matthias
    Roschke, Christian
    Vodel, Matthias
    Ritter, Marc
    [J]. PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 4, 2023, 465 : 551 - 567
  • [3] Natural Language Processing for Historical Texts
    Rosmorduc, Serge
    [J]. TRAITEMENT AUTOMATIQUE DES LANGUES, 2012, 53 (03): : 155 - 157
  • [4] Natural Language Processing for Historical Texts
    Romary, Laurent
    [J]. COMPUTATIONAL LINGUISTICS, 2014, 40 (01) : 231 - 233
  • [5] Natural language processing of mathematical texts in mArachna
    Blanke, Marie
    Jeschke, Sabina
    Natho, Nicole
    Seiler, Ruedi
    Wilke, Marc
    [J]. ADVANCES AND INNOVATIONS IN SYSTEMS, COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2007, : 301 - 305
  • [6] Transferring scientific Texts into Plain Language Feasibility Considerations
    Rauh, Bernhard
    Kuppel, Sebastian
    [J]. ZEITSCHRIFT FUR PADAGOGIK, 2023, 69 (03): : 333 - 349
  • [7] Importance of information processing skills on the comprehension of scientific texts
    Sanjose, Vicente
    Fernandez, Juan-Jose
    Vidal-Abarca, Eduardo
    [J]. INFANCIA Y APRENDIZAJE, 2010, 33 (04): : 529 - 541
  • [8] Recollections + The language and style of Cinquecento texts on art and architecture
    Nencioni, G
    [J]. LINGUA E STILE, 1997, 32 (01) : 5 - 9
  • [9] Compositionality in a Parallel Architecture for Language Processing
    Baggio, Giosue
    [J]. COGNITIVE SCIENCE, 2021, 45 (05)
  • [10] A FUNCTIONAL LANGUAGE AND MODULAR ARCHITECTURE FOR SCIENTIFIC COMPUTING
    YOUNG, MF
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1985, 201 : 305 - 318