A hybrid approach to finding relevant social media content for complex domain specific information needs

Cited by: 4
Authors
Cameron, Delroy [1 ]
Sheth, Amit P. [1 ]
Jaykumar, Nishita [1 ]
Thirunarayan, Krishnaprasad [1 ]
Anand, Gaurish [1 ]
Smith, Gary A. [1 ]
Affiliations
[1] Wright State Univ, Ohio Ctr Excellence Knowledge Enabled Comp Knoesi, Dayton, OH 45435 USA
Source
JOURNAL OF WEB SEMANTICS | 2014 / Vol. 29
Keywords
Semantic search; Domain specific information retrieval; Complex information needs; Ontology; Background knowledge; Context-free grammar; Knowledge-aware search; RETRIEVAL; SYSTEM; WEB;
DOI
10.1016/j.websem.2014.11.002
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
While contemporary semantic search systems offer to improve classical keyword-based search, they are not always adequate for complex domain specific information needs. The domain of prescription drug abuse, for example, requires knowledge of both ontological concepts and "intelligible constructs" not typically modeled in ontologies. These intelligible constructs convey essential information that includes notions of intensity, frequency, interval, dosage, and sentiments, which could be important to the holistic needs of the information seeker. In this paper, we present a hybrid approach to domain specific information retrieval (or knowledge-aware search) that integrates ontology-driven query interpretation with synonym-based query expansion and domain specific rules to facilitate search. Our framework is based on a context-free grammar (CFG) that defines the query language of constructs interpretable by the search system. The grammar provides two levels of semantic interpretation: (1) a top-level CFG that facilitates retrieval of diverse textual patterns, which belong to broad templates, and (2) a low-level CFG that enables interpretation of specific expressions that belong to such patterns. These low-level expressions occur as concepts from four different categories of data: (1) ontological concepts, (2) concepts in lexicons (such as emotions and sentiments), (3) concepts in lexicons with only partial ontology representation, called lexico-ontology concepts (such as side effects and routes of administration (ROA)), and (4) domain specific expressions (such as date, time, interval, frequency, and dosage) derived solely through rules. Our approach is embodied in a novel Semantic Web platform called PREDOSE, which provides search support for complex domain specific information needs in prescription drug abuse epidemiology.
When applied to a corpus of over 1 million drug abuse-related web forum posts, our search framework proved effective in retrieving relevant documents when compared with three existing search systems. Published by Elsevier B.V.
Pages: 39 - 52
Number of pages: 14
Related papers
50 in total
  • [41] Social Media and Internet Use Patterns by Adolescents With Complex Communication Needs
    Bosse, Ingo
    Renner, Gregor
    Wilkens, Leevke
    LANGUAGE SPEECH AND HEARING SERVICES IN SCHOOLS, 2020, 51 (04) : 1024 - 1036
  • [42] Pharmaceutical Companies and Their Drugs on Social Media: A Content Analysis of Drug Information on Popular Social Media Sites
    Tyrawski, Jennifer
    DeAndrea, David C.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2015, 17 (06) : e130
  • [43] Health information seeking on social media: the diversification approach
    Rosenberg, Dennis
    Mano, Rita
    Mesch, Gustavo S.
    EQUALITY DIVERSITY AND INCLUSION, 2023, 42 (03) : 364 - 381
  • [44] Information quality in healthcare social media – an architectural approach
    Lopez D.M.
    Blobel B.
    Gonzalez C.
    Health and Technology, 2016, 6 (1) : 17 - 25
  • [45] Information Diffusion in Halal Food Social Media: A Social Network Approach
    Mostafa, Mohamed M.
    JOURNAL OF INTERNATIONAL CONSUMER MARKETING, 2021, 33 (04) : 471 - 491
  • [46] Social Convos: A New Approach to Modeling Information Diffusion in Social Media
    Katsios, Gregorios
    Sa, Ning
    Strzalkowski, Tomek
    ADVANCES IN ARTIFICIAL INTELLIGENCE, SOFTWARE AND SYSTEMS ENGINEERING, 2020, 965 : 25 - 36
  • [47] Social media elements, media content, and well-being: a communication approach
    Hall, Jeffrey A.
    COMMUNICATION THEORY, 2024, 35 (01) : 1 - 13
  • [48] A Hybrid Approach to the Maximum Clique Problem in the Domain of Information Management
    Alexander, Demidovskij
    Eduard, Babkin
    Babkina, Tatiana
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON FUZZY AND NEURO COMPUTING (FANCCO - 2015), 2015, 415 : 323 - 336
  • [49] On the Information Content of Coarse Data with Respect to the Particle Size Distribution of Complex Granular Media: Rationale Approach and Testing
    Garcia-Gutierrez, Carlos
    Angel Martin, Miguel
    Pachepsky, Yakov
    ENTROPY, 2019, 21 (06)
  • [50] Using Content Analysis and Machine Learning to Identify COVID-19 Information Relevant to Low-income Households on Social Media
    Khanal, Sarthak
    Refati, Rus
    Glandt, Kyle
    Caragea, Doina
    Xu, Sifan
    Chen, Chien-fei
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021 : 1522 - 1531