A taxonomy generation tool for semantic visual analysis of large corpus of documents

被引:5
|
作者
Carrion, Belen [1 ]
Onorati, Teresa [1 ]
Diaz, Paloma [1 ]
Triga, Vasiliki [2 ]
机构
[1] Univ Carlos III Madrid, Dept Comp Sci, Leganes, Spain
[2] Cyprus Univ Technol, Dept Commun & Internet Studies, Limassol, Cyprus
基金
欧盟地平线“2020”;
关键词
Knowledge modelling; Semantic visualization; Taxonomy development process; Big data; SOCIAL NETWORKS; CONTEXT;
D O I
10.1007/s11042-019-07880-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Taxonomies are semantic resources that help to categorize and add meaning to data. In a hyperconnected world where information is generated at a rate that exceeds human capacities to process and make sense of it, such semantic resources can help to access relevant information more efficiently by extracting knowledge from large and unstructured data sets. Taxonomies are related to specific domains of knowledge in which they identify relevant topics. However, they have to be validated by experts to guarantee that its terms and relations are meaningful. In this paper, we introduce a semiautomatic taxonomy generation tool for supporting domain experts in building taxonomies that are then used to automatically create semantic visualizations of data. Our proposal combines automatic techniques to extract, sort and categorize terms, and empowers domain experts to take part at any stage of the process by providing a visual edition tool. We tested the tool's usability in two use cases from different domains and languages. Results show that all the functionalities are easy to use and interact with. Lessons learned from this experience will guide the design of a utility evaluation involving domain experts interested in data analysis and knowledge modeling.
引用
收藏
页码:32919 / 32937
页数:19
相关论文
共 50 条
  • [1] A taxonomy generation tool for semantic visual analysis of large corpus of documents
    Belen Carrion
    Teresa Onorati
    Paloma Díaz
    Vasiliki Triga
    Multimedia Tools and Applications, 2019, 78 : 32919 - 32937
  • [2] An annotation tool for semantic documents
    Eriksson, Henrik
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4519 : 759 - 768
  • [3] Semantic analysis of web documents for the generation of optimal content
    Mavridis, Themistoklis
    Symeonidis, Andreas L.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 35 : 114 - 130
  • [4] Summary generation approaches based on semantic analysis for news documents
    Kogilavani, S. V.
    Kanimozhiselvi, C. S.
    Malliga, S.
    JOURNAL OF INFORMATION SCIENCE, 2016, 42 (04) : 465 - 476
  • [5] A Taxonomy based Semantic Similarity of Documents using the Cosine Measure
    Madylova, Ainura
    Oguducu, Sule Guenduez
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 129 - 134
  • [6] Towards a Corpus of Requirements Documents Enriched with Semantic Frame Annotations
    Alhoshan, Waad
    Batista-Navarro, Riza
    Zhao, Liping
    2018 IEEE 26TH INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2018), 2018, : 428 - 431
  • [7] VarifocalReader - In-Depth Visual Analysis of Large Text Documents
    Koch, Steffen
    John, Markus
    Woerner, Michael
    Mueller, Andreas
    Ertl, Thomas
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (12) : 1723 - 1732
  • [8] Analysis of the Mutual Relevance of Topical Corpus Documents in the Problem of Assessing the Proximity of Text to the Semantic Standard
    D. V. Mikhaylov
    G. M. Emelyanov
    Pattern Recognition and Image Analysis, 2021, 31 : 588 - 594
  • [9] Analysis of the Mutual Relevance of Topical Corpus Documents in the Problem of Assessing the Proximity of Text to the Semantic Standard
    Mikhaylov, D., V
    Emelyanov, G. M.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2021, 31 (03) : 588 - 594
  • [10] Dynamic Generation of Semantic Documents for Web Resources
    Ashish, Saxena
    Gore, M. M.
    COMPUTER NETWORKS AND INTELLIGENT COMPUTING, 2011, 157 : 132 - 140