Discovering latent commercial networks from online financial news articles

被引:19
|
作者
Xia, Yunqing [1 ]
Su, Weifeng [2 ]
Lau, Raymond Y. K. [3 ]
Liu, Yi [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] United Int Coll, Div Sci & Technol, Zhuhai 519085, Peoples R China
[3] City Univ Hong Kong, Dept Informat Syst, Hong Kong, Hong Kong, Peoples R China
关键词
commercial entity recognition; commercial relation identification; commercial networks; text mining; natural language processing; FEATURE SPACE THEORY; BUSINESS;
D O I
10.1080/17517575.2011.621093
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unlike most online social networks where explicit links among individual users are defined, the relations among commercial entities (e.g. firms) may not be explicitly declared in commercial Web sites. One main contribution of this article is the development of a novel computational model for the discovery of the latent relations among commercial entities from online financial news. More specifically, a CRF model which can exploit both structural and contextual features is applied to commercial entity recognition. In addition, a point-wise mutual information (PMI)-based unsupervised learning method is developed for commercial relation identification. To evaluate the effectiveness of the proposed computational methods, a prototype system called CoNet has been developed. Based on the financial news articles crawled from Google finance, the CoNet system achieves average F-scores of 0.681 and 0.754 in commercial entity recognition and commercial relation identification, respectively. Our experimental results confirm that the proposed shallow natural language processing methods are effective for the discovery of latent commercial networks from online financial news.
引用
收藏
页码:303 / 331
页数:29
相关论文
共 50 条
  • [21] Discovering Homophily in Online Social Networks
    De Salve, Andrea
    Guidi, Barbara
    Ricci, Laura
    Mori, Paolo
    MOBILE NETWORKS & APPLICATIONS, 2018, 23 (06): : 1715 - 1726
  • [22] Discovering Homophily in Online Social Networks
    Andrea De Salve
    Barbara Guidi
    Laura Ricci
    Paolo Mori
    Mobile Networks and Applications, 2018, 23 : 1715 - 1726
  • [23] Automated Classification of Criminal and Violent Activities in Thailand from Online News Articles
    Thaipisutikul, Tipajin
    Tuarob, Suppawong
    Pongpaichet, Siripen
    Amornvatcharapong, Amornsri
    Shih, Timothy K.
    2021 13TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST-2021), 2021, : 170 - 175
  • [24] On detecting business event from the headlines and leads of massive online news articles
    Qian, Yu
    Deng, Xiongwen
    Ye, Qiongwei
    Ma, Baojun
    Yuan, Hua
    INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (06)
  • [25] Classification of Indonesian News Articles based on Latent Dirichlet Allocation
    Kusumaningrum, Retno
    Adhy, Satriyo
    Wiedjayanto, M. Ihsan Aji
    Suryono
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2016,
  • [26] ContextWalk: Embedding Networks with Context Information Extracted from News Articles
    Chen, Chaoran
    Schoenfeld, Mirco
    Pfeffer, Juergen
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT II, 2021, 12924 : 100 - 114
  • [27] An Approach for Predicting the Popularity of Online Security News Articles
    Kong, Junli
    Wang, Baocang
    Liu, Caiyun
    Wu, Gaofei
    2018 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2018,
  • [28] Online Near-Duplicate Detection of News Articles
    Rodier, Simon
    Carter, Dave
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1242 - 1249
  • [29] An approach for predicting the popularity of online security news articles
    Kong, Junli
    Wang, Baocang
    Liu, Caiyun
    Wu, Gaofei
    2018 IEEE Conference on Communications and Network Security, CNS 2018, 2018,
  • [30] Understanding Reader Backtracking Behavior in Online News Articles
    Smadja, Uzi
    Grusky, Max
    Artzi, Yoav
    Naaman, Mor
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3237 - 3243