Explaining Financial Uncertainty through Specialized Word Embeddings

被引:5
|
作者
Theil, Christoph Kilian [1 ]
Štajner, Sanja [1 ]
Stuckenschmidt, Heiner [2 ]
机构
[1] University of Mannheim, Germany
[2] Symanto Research Germany, Nürnberg, Germany
来源
关键词
Embeddings;
D O I
10.1145/3343039
中图分类号
学科分类号
摘要
The detection of vague, speculative, or otherwise uncertain language has been performed in the encyclopedic, political, and scientific domains yet left relatively untouched in finance. However, the latter benefits from public sources of big financial data that can be linked with extracted measures of linguistic uncertainty as a mean of extrinsic model validation. Doing so further helps in understanding how the linguistic uncertainty of financial disclosures might induce financial uncertainty to the market. To explore this field, we use term weighting methods to detect linguistic uncertainty in a large dataset of financial disclosures. As a baseline, we use an existing dictionary of financial uncertainty triggers; furthermore, we retrieve related terms in specialized word embedding models to automatically expand this dictionary. Apart from an industry-agnostic expansion, we create expansions incorporating industry-specific jargon. In a set of cross-sectional event study regressions, we show that the such enriched dictionary explains a significantly larger share of future volatility, a common financial uncertainty measure, than before. Furthermore, we show that - different to the plain dictionary - our embedding models are well suited to explain future analyst forecast uncertainty. Notably, our results indicate that enriching the dictionary with industry-specific vocabulary explains a significantly larger share of financial uncertainty than an industry-agnostic expansion. © 2020 ACM.
引用
收藏
相关论文
共 50 条
  • [1] Word Embeddings-Based Uncertainty Detection in Financial Disclosures
    Theil, Christoph Kilian
    Stajner, Sanja
    Stuckenschmidt, Heiner
    [J]. ECONOMICS AND NATURAL LANGUAGE PROCESSING (ECONLP 2018), 2018, : 32 - 37
  • [2] Explaining Topical Distances Using Word Embeddings
    Witt, Nils
    Seifert, Christin
    Granitzer, Michael
    [J]. 2016 27TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2016, : 212 - 217
  • [3] Explaining Word Embeddings via Disentangled Representation
    Liao, Keng-Te
    Lee, Cheng-Syuan
    Huang, Zhong-Yu
    Lin, Shou-de
    [J]. 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 720 - 725
  • [4] The Impact of Specialized Corpora for Word Embeddings in Natural Langage Understanding
    Neuraz, Antoine
    Rance, Bastien
    Garcelon, Nicolas
    Llanos, Leonardo Campillos
    Burgun, Anita
    Rosset, Sophie
    [J]. DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 : 432 - 436
  • [5] Towards Understanding Word Embeddings: Automatically Explaining Similarity of Terms
    Zhang, Yating
    Jatowt, Adam
    Tanaka, Katsumi
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 823 - 832
  • [6] Comparing Word Embeddings through Visualisation
    Santos, Pedro
    Datia, Nuno
    Pato, Matilde
    Sobral, Jose
    [J]. 2022 26TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2022, : 91 - 97
  • [7] Multilingual Financial Word Embeddings for Arabic, English and French
    Zmandar, Nadhem
    El-Haj, Mahmoud
    Rayson, Paul
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4584 - 4589
  • [8] MoRTy: Unsupervised Learning of Task-specialized Word Embeddings by Autoencoding
    Rethmeier, Nils
    Plank, Barbara
    [J]. 4TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2019), 2019, : 49 - 54
  • [9] Comparing general and specialized word embeddings for biomedical named entity recognition
    Ramos-Vargas, Rigo E.
    Roman-Godinez, Israel
    Torres-Ramos, Sulema
    [J]. PEERJ COMPUTER SCIENCE, 2021, 7 : 1 - 22
  • [10] Measuring associational thinking through word embeddings
    Carlos Periñán-Pascual
    [J]. Artificial Intelligence Review, 2022, 55 : 2065 - 2102