Semantic Fingerprinting: A Novel Method for Entity-Level Content Classification

被引:1
|
作者
Govind [1 ]
Alec, Celine [1 ]
Spaniol, Marc [1 ]
机构
[1] Univ Caen Normandie, Dept Comp Sci, Campus Cote de Nacre, F-14032 Caen, France
来源
WEB ENGINEERING, ICWE 2018 | 2018年 / 10845卷
关键词
Entity-level web analytics; Semantically-enriched web content classification; Web semantics; WORDNET;
D O I
10.1007/978-3-319-91662-0_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the constantly growing Web, there is a need for automatically analyzing, interpreting and organizing contents. A particular need is given by the management ofWeb contents with respect to classification systems, e.g. based on ontologies in the LOD (Linked Open Data) cloud. Research in deep learning recently has shown great progress in classifying data based on large volumes of training data. However, "targeted" and fine-grained information systems require classification methods based on a relatively small number of "representative" samples. For that purpose, we present an approach that allows a semantic exploitation of Web contents and - at the same time - computationally efficient processing based on "Semantic Fingerprinting". To this end, we raise Web contents to the entity-level and exploit entity-related information that allows "distillation" and fine-grained classification of the Web content by its "semantic fingerprint". In experimental results on Web contents classified in Wikipedia, we show the superiority of our approach against state-of-the-art methods.
引用
收藏
页码:277 / 285
页数:9
相关论文
共 50 条
  • [41] Entity-level Cross-modal Learning Improves Multi-modal Machine Translation
    Huang, Xin
    Zhang, Jiajun
    Zong, Chengqing
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1067 - 1080
  • [42] Using Character-Level and Entity-Level Representations to Enhance Bidirectional Encoder Representation From Transformers-Based Clinical Semantic Textual Similarity Model: ClinicalSTS Modeling Study
    Xiong, Ying
    Chen, Shuai
    Chen, Qingcai
    Yan, Jun
    Tang, Buzhou
    [J]. JMIR MEDICAL INFORMATICS, 2020, 8 (12)
  • [43] An End-to-end Model for Entity-level Relation Extraction using Multi-instance Learning
    Eberts, Markus
    Ulges, Adrian
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 3650 - 3660
  • [44] KNSC: A NOVEL LOCAL CLASSIFICATION METHOD FOR MULTIMEDIA SEMANTIC ANALYSIS
    Tao, Kun
    Lin, Shouxun
    Zhang, Yongdong
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 402 - 405
  • [45] Understanding the determinants of the magnitude of entity-level risk and account-level risk key audit matters: The case of the United Kingdom
    Sierra-Garcia, Laura
    Gambetta, Nicolas
    Garcia-Benau, Maria A.
    Orta-Perez, Manuel
    [J]. BRITISH ACCOUNTING REVIEW, 2019, 51 (03): : 227 - 240
  • [46] NOVEL METHOD FOR DEOXYNUCLEOTIDE FINGERPRINTING
    KOROBKO, VG
    GRACHEV, SA
    PETROV, NA
    [J]. BIOORGANICHESKAYA KHIMIYA, 1977, 3 (10): : 1423 - 1426
  • [47] Semantic Video Entity Linking based on Visual Content and Metadata
    Li, Yuncheng
    Yang, Xitong
    Luo, Jiebo
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4615 - 4623
  • [48] Multi-level text classification method based on latent semantic analysis
    Shi, Hongxia
    Wei, Guiyi
    Pan, Yun
    [J]. ICEIS 2007: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS: SOFTWARE AGENTS AND INTERNET COMPUTING, 2007, : 320 - +
  • [49] Implementing multiresolution models and families of models: from entity-level simulation to desktop stochastic models and "repro" models
    McEver, J
    Davis, PK
    Bigelow, J
    [J]. ENABLING TECHNOLOGY FOR SIMULATION SCIENCE IV, 2000, 4026 : 16 - 25
  • [50] METHOD OF SEMANTIC CLASSIFICATION DESIGN
    BAKOV, AA
    BUKHALEVA, EI
    ZDOROV, IP
    [J]. NAUCHNO-TEKHNICHESKAYA INFORMATSIYA SERIYA 2-INFORMATSIONNYE PROTSESSY I SISTEMY, 1977, (10): : 11 - 14