Discovering latent commercial networks from online financial news articles

被引:19
|
作者
Xia, Yunqing [1 ]
Su, Weifeng [2 ]
Lau, Raymond Y. K. [3 ]
Liu, Yi [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] United Int Coll, Div Sci & Technol, Zhuhai 519085, Peoples R China
[3] City Univ Hong Kong, Dept Informat Syst, Hong Kong, Hong Kong, Peoples R China
关键词
commercial entity recognition; commercial relation identification; commercial networks; text mining; natural language processing; FEATURE SPACE THEORY; BUSINESS;
D O I
10.1080/17517575.2011.621093
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unlike most online social networks where explicit links among individual users are defined, the relations among commercial entities (e.g. firms) may not be explicitly declared in commercial Web sites. One main contribution of this article is the development of a novel computational model for the discovery of the latent relations among commercial entities from online financial news. More specifically, a CRF model which can exploit both structural and contextual features is applied to commercial entity recognition. In addition, a point-wise mutual information (PMI)-based unsupervised learning method is developed for commercial relation identification. To evaluate the effectiveness of the proposed computational methods, a prototype system called CoNet has been developed. Based on the financial news articles crawled from Google finance, the CoNet system achieves average F-scores of 0.681 and 0.754 in commercial entity recognition and commercial relation identification, respectively. Our experimental results confirm that the proposed shallow natural language processing methods are effective for the discovery of latent commercial networks from online financial news.
引用
收藏
页码:303 / 331
页数:29
相关论文
共 50 条
  • [1] A Framework for Discovering Bursty Events and Their Relationships from Online News Articles
    Madani, Pooria
    ADVANCES IN ARTIFICIAL INTELLIGENCE (AI 2015), 2015, 9091 : 352 - 355
  • [2] Discovering Themes from Online News Articles on the 2018 Mayon Eruption
    Austero, Lea D.
    Sy, Christian Y.
    Canon, Mary Joy P.
    2018 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2018), 2018, : 242 - 245
  • [3] Clustering sentences for discovering events in news articles
    Naughton, Martina
    Kushmerick, Nicholas
    Carthy, Joe
    ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936 : 535 - 538
  • [4] Unsupervised latent event representation learning and storyline extraction from news articles based on neural networks
    Si, Jiasheng
    Guo, Linsen
    Zhou, Deyu
    INTELLIGENT DATA ANALYSIS, 2021, 25 (03) : 589 - 603
  • [5] Evaluating sentiment in financial news articles
    Schumaker, Robert P.
    Zhang, Yulei
    Huang, Chun-Neng
    Chen, Hsinchun
    DECISION SUPPORT SYSTEMS, 2012, 53 (03) : 458 - 464
  • [6] Temporal Summarization for Online News Articles
    Kumar, Kishore M.
    Kumar, Sanampudi Suresh
    2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 1376 - 1379
  • [7] Financial activity mining from online multilingual news
    Cheung, PS
    Huang, RZ
    Lam, W
    ITCC 2004: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 1, PROCEEDINGS, 2004, : 267 - 271
  • [8] Discovering event episodes from sequences of online news articles: A time-adjoining frequent itemset-based clustering method
    Lee, Yen-Hsien
    Hu, Paul Jen-Hwa
    Zhu, Hongquan
    Chen, Hsin-Wei
    INFORMATION & MANAGEMENT, 2020, 57 (07)
  • [9] Deep learning for stock market prediction from financial news articles
    Vargas, Manuel R.
    de Lima, Beatriz S. L. P.
    Evsukoff, Alexandre G.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA), 2017, : 60 - 65
  • [10] Semantic Data Mining of Financial News Articles
    Vavpetic, Anze
    Novak, Petra Kralj
    Grcar, Miha
    Mozetic, Igor
    Lavrac, Nada
    DISCOVERY SCIENCE, 2013, 8140 : 294 - 307