Fad or future? Automated analysis of financial text and its implications for corporate reporting

被引:46
|
作者
Lewis, Craig [1 ]
Young, Steven [2 ]
机构
[1] Vanderbilt Univ, Owen Grad Sch Management, Ave South, Nashville, TN 37203 USA
[2] Univ Lancaster, Management Sch, Lancaster, England
基金
英国经济与社会研究理事会;
关键词
textual analysis; natural language processing; automated analysis; corporate reporting;
D O I
10.1080/00014788.2019.1611730
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
This paper describes the current state of natural language processing (NLP) as it applies to corporate reporting. We document dramatic increases in the quantity of verbal content that is an integral part of company reporting packages, as well as the evolution of text analytic approaches being employed to analyse this content. We provide intuitive descriptions of the leading analytic approaches applied in the academic accounting and finance literatures. This discussion includes key word searches and counts, attribute dictionaries, naive Bayesian classification, cosine similarity, and latent Dirichlet allocation. We also discuss how increasing interest in NLP processing of the corporate reporting package could and should influence financial reporting regulation and note that textual analysis is currently more of an afterthought, if it is even considered. Opportunities for improving the usefulness of NLP processing are discussed, as well as possible impediments.
引用
收藏
页码:587 / 615
页数:29
相关论文
共 50 条