Unsupervised domain-agnostic identification of product names in social media posts

被引:0
|
作者
Pogrebnyakov, Nicolai [1 ]
机构
[1] Copenhagen Business Sch, Frederiksberg, Denmark
关键词
named entity recognition; social media; product names; Facebook; ENTITY EXTRACTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Product name recognition is a significant practical problem, spurred by the greater availability of platforms for discussing products such as social media and product review functionalities of online marketplaces. Customers, product manufacturers and online marketplaces may want to identify product names in unstructured text to extract important insights, such as sentiment, surrounding a product. Much extant research on product name identification has been domain-specific (e.g., identifying mobile phone models) and used supervised or semi-supervised methods. With massive numbers of new products released to the market every year such methods may require retraining on updated labeled data to stay relevant, and may transfer poorly across domains. This research addresses this challenge and develops a domain-agnostic, unsupervised algorithm for identifying product names based on Facebook posts. The algorithm consists of two general steps: (a) candidate product name identification using an off-the-shelf pretrained conditional random fields (CRF) model, part-of-speech tagging and a set of simple patterns; and (b) filtering of candidate names to remove spurious entries using clustering and word embeddings generated from the data.
引用
收藏
页码:3711 / 3716
页数:6
相关论文
共 28 条
  • [21] Examining the Public Messaging on 'Loneliness' over Social Media: An Unsupervised Machine Learning Analysis of Twitter Posts over the Past Decade
    Ng, Qin Xiang
    Lee, Dawn Yi Xin
    Yau, Chun En
    Lim, Yu Liang
    Ng, Clara Xinyi
    Liew, Tau Ming
    HEALTHCARE, 2023, 11 (10)
  • [22] Identification of time-evolving product opportunities via social media mining
    Choi, Jaewoong
    Oh, Seunghyun
    Yoon, Janghyeok
    Lee, Jae-Min
    Coh, Byoung-Youl
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2020, 156
  • [23] Content and Style of Firm-Generated Posts on Social Media: A Study of User Engagement on Hedonic and Utilitarian Product Pages on Facebook
    Schanke, Scott
    Ray, Gautam
    Adomavicius, Gediminas
    Wagle, Mihir
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2024, 25 (02): : 341 - 361
  • [24] Examining Public Messaging on Influenza Vaccine over Social Media: Unsupervised Deep Learning of 235,261 Twitter Posts from 2017 to 2023
    Ng, Qin Xiang
    Ng, Clara Xinyi
    Ong, Clarence
    Lee, Dawn Yi Xin
    Liew, Tau Ming
    VACCINES, 2023, 11 (10)
  • [25] The Influence of Fact-Checking Is Disputed! The Role of Party Identification in Processing and Sharing Fact-Checked Social Media Posts
    Oeldorf-Hirsch, Anne
    Schmierbach, Mike
    Appelman, Alyssa
    Boyle, Michael P.
    AMERICAN BEHAVIORAL SCIENTIST, 2024, 68 (10) : 1345 - 1365
  • [26] ADAPTATION OF DOMAIN-SPECIFIC TRANSFORMER MODELS WITH TEXT OVERSAMPLING FOR SENTIMENT ANALYSIS OF SOCIAL MEDIA POSTS ON COVID-19 VACCINE
    Bansal, Anmol
    Choudhry, Arjun
    Sharma, Anubhav
    Susan, Seba
    COMPUTER SCIENCE-AGH, 2023, 24 (02): : 167 - 186
  • [27] Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts
    Md Abul Bashar
    Richi Nayak
    Khanh Luong
    Thirunavukarasu Balasubramaniam
    Social Network Analysis and Mining, 2021, 11
  • [28] Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts
    Abul Bashar, Md
    Nayak, Richi
    Luong, Khanh
    Balasubramaniam, Thirunavukarasu
    SOCIAL NETWORK ANALYSIS AND MINING, 2021, 11 (01)