Unsupervised Extraction of Popular Product Attributes from E-Commerce Web Sites by Considering Customer Reviews

被引:30
|
作者
Bing, Lidong [1 ]
Wong, Tak-Lam [2 ]
Lam, Wai [3 ,4 ]
机构
[1] Carnegie Mellon Univ, Machine Learning Dept, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
[2] Hong Kong Inst Educ, Dept Math & Informat Technol, 10 Lo Ping Rd, Tai Po, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Lab High Confidence Software Technol, Minist Educ, CUHK Sub Lab, Hong Kong, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Shatin, Hong Kong, Peoples R China
关键词
Information extraction; conditional random fields; product attribute; customer reviews;
D O I
10.1145/2857054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We develop an unsupervised learning framework for extracting popular product attributes from product description pages originated from different E-commerce Web sites. Unlike existing information extraction methods that do not consider the popularity of product attributes, our proposed framework is able to not only detect popular product features from a collection of customer reviews but also map these popular features to the related product attributes. One novelty of our framework is that it can bridge the vocabulary gap between the text in product description pages and the text in customer reviews. Technically, we develop a discriminative graphical model based on hidden Conditional Random Fields. As an unsupervised model, our framework can be easily applied to a variety of new domains and Web sites without the need of labeling training samples. Extensive experiments have been conducted to demonstrate the effectiveness and robustness of our framework.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Automatic Extraction of Product Information from Multiple e-Commerce Web Sites
    Nasti, Samiah Jan
    Asger, M.
    Butt, Muheet Ahmad
    [J]. PROCEEDINGS OF RECENT INNOVATIONS IN COMPUTING, ICRIC 2019, 2020, 597 : 739 - 747
  • [2] Promote Product Reviews of High Quality on E-commerce Sites
    Huang, Shen
    Shen, Dan
    Feng, Wei
    Baudin, Catherine
    Zhang, Yongzheng
    [J]. PACIFIC ASIA JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2010, 2 (03): : 51 - 71
  • [3] E-commerce Product's Trust Prediction Based on Customer Reviews
    Kargirwar, Hrutuja
    Bhagavatula, Praveen
    Konde, Shrutika
    Chaudhari, Paresh
    Dhamde, Vipul
    Sakarkar, Gopal
    Correa, Juan C.
    [J]. THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 375 - 383
  • [4] Towards Product Attributes Extraction in Indonesian e-Commerce Platform
    Rif'at, Muhammad
    Mahendra, Rahmad
    Budi, Indra
    Wibowo, Haryo Akbarianto
    [J]. COMPUTACION Y SISTEMAS, 2018, 22 (04): : 1367 - 1375
  • [5] Customer-centered rules for design of e-commerce Web sites
    Fang, XW
    Salvendy, G
    [J]. COMMUNICATIONS OF THE ACM, 2003, 46 (12) : 332 - 336
  • [6] The effects of usability and web design attributes on user preference for e-commerce web sites
    Lee, Sangwon
    Koubek, Richard J.
    [J]. COMPUTERS IN INDUSTRY, 2010, 61 (04) : 329 - 341
  • [7] The impact of logistics factors on customer reviews in E-commerce
    Zhang, Qingtian
    Huang, Youfang
    Yan, Wei
    Wang, Yu
    [J]. International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (07): : 201 - 212
  • [8] Leveraging Customer Reviews for E-commerce Query Generation
    Lien, Yen-Chieh
    Zhang, Rongting
    Harper, F. Maxwell
    Murdock, Vanessa
    Lee, Chia-Jung
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PT II, 2022, 13186 : 190 - 198
  • [9] Sentiment Analysis in Customer Reviews for Product Recommendation in E-commerce Using Machine Learning
    Panduro-Ramirez, Jeidy
    [J]. 2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [10] Product Phrase Extraction from e-Commerce Pages
    Vovk, Artem
    Tochilkin, Dmitrii
    Narayana, Pradyumna
    Sone, Kazoo
    Basu, Sugato
    [J]. COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2019 ), 2019, : 393 - 397