Unsupervised domain-agnostic identification of product names in social media posts

Authors
Pogrebnyakov, Nicolai [1]
Affiliations
[1] Copenhagen Business School, Frederiksberg, Denmark
Keywords
named entity recognition; social media; product names; Facebook; entity extraction
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Product name recognition is a significant practical problem, spurred by the growing availability of platforms for discussing products, such as social media and the product review functions of online marketplaces. Customers, product manufacturers and online marketplaces may want to identify product names in unstructured text in order to extract insights about a product, such as the sentiment surrounding it. Much extant research on product name identification has been domain-specific (e.g., identifying mobile phone models) and has used supervised or semi-supervised methods. With massive numbers of new products released to the market every year, such methods may require retraining on updated labeled data to stay relevant, and may transfer poorly across domains. This research addresses this challenge and develops a domain-agnostic, unsupervised algorithm for identifying product names, based on Facebook posts. The algorithm consists of two general steps: (a) candidate product name identification using an off-the-shelf pretrained conditional random fields (CRF) model, part-of-speech tagging and a set of simple patterns; and (b) filtering of candidate names to remove spurious entries using clustering and word embeddings generated from the data.
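The two-step structure described in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the author's implementation: a single regular expression stands in for the CRF model, part-of-speech tagging and pattern set of step (a), and a nearest-neighbor cosine-similarity check over bag-of-words context vectors stands in for the clustering over learned word embeddings of step (b). The posts, the pattern, the function names and the 0.5 threshold are all hypothetical choices made for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical toy posts; the Facebook data used in the paper is not reproduced here.
POSTS = [
    "just bought the Galaxy S9 and the camera is great",
    "my Galaxy S9 battery lasts all day",
    "thinking about the Pixel 3 because the camera is great",
    "the Pixel 3 battery lasts all day too",
    "happy birthday to Aunt May hope the party goes well",
]

# Step (a) stand-in (assumption): a capitalized word followed by one or more
# tokens that are either capitalized words or model-like tokens with digits.
CANDIDATE = re.compile(r"[A-Z][a-z]+(?: (?:[A-Z][a-z]+|[A-Z]?\d+[A-Za-z]*))+")


def find_candidates(posts):
    """Collect every distinct pattern match across all posts, sorted."""
    return sorted({m.group(0) for post in posts for m in CANDIDATE.finditer(post)})


def context_vector(name, posts):
    """Count the words that co-occur with `name` in the posts mentioning it."""
    counts = Counter()
    for post in posts:
        if name in post:
            counts.update(re.findall(r"\w+", post.replace(name, " ").lower()))
    return counts


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[w] * v[w] for w in u if w in v)
    norm = math.sqrt(sum(x * x for x in u.values())) * math.sqrt(
        sum(x * x for x in v.values())
    )
    return dot / norm if norm else 0.0


def filter_candidates(candidates, posts, threshold=0.5):
    """Step (b) stand-in (assumption): keep a candidate only if its context
    resembles that of at least one other candidate."""
    vectors = {c: context_vector(c, posts) for c in candidates}
    kept = []
    for c in candidates:
        best = max(
            (cosine(vectors[c], vectors[o]) for o in candidates if o != c),
            default=0.0,
        )
        if best >= threshold:
            kept.append(c)
    return kept


candidates = find_candidates(POSTS)
product_names = filter_candidates(candidates, POSTS)
print(product_names)
```

On these toy posts the pattern proposes "Aunt May", "Galaxy S9" and "Pixel 3". The filter then drops "Aunt May" because its surrounding words share little with the contexts of the other two candidates. A realistic pipeline would need far more data and a proper embedding model for the context comparison to be meaningful.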
Pages: 3711-3716
Number of pages: 6