A Structured Analysis of Unstructured Big Data by Leveraging Cloud Computing

被引:107
|
作者
Liu, Xiao [1 ]
Singh, Param Vir [2 ]
Srinivasan, Kannan [3 ]
机构
[1] NYU, Stern Sch Business, 550 1St Ave, New York, NY 10012 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA 15213 USA
关键词
big data; cloud computing; text mining; user generated content; Twitter; Google Trends; WORD-OF-MOUTH; USER-GENERATED CONTENT; BOX-OFFICE; PANEL-DATA; GMM ESTIMATION; DYNAMICS; REVIEWS; CHATTER; IMPACT; MATTER;
D O I
10.1287/mksc.2015.0972
中图分类号
F [经济];
学科分类号
02 ;
摘要
Accurate forecasting of sales/consumption is particularly important for marketing because this information can be used to adjust marketing budget allocations and overall marketing strategies. Recently, online social platforms have produced an unparalleled amount of data on consumer behavior. However, two challenges have limited the use of these data in obtaining meaningful business marketing insights. First, the data are typically in an unstructured format, such as texts, images, audio, and video. Second, the sheer volume of the data makes standard analysis procedures computationally unworkable. In this study, we combine methods from cloud computing, machine learning, and text mining to illustrate how online platform content, such as Twitter, can be effectively used for forecasting. We conduct our analysis on a significant volume of nearly two billion Tweets and 400 billion Wikipedia pages. Our main findings emphasize that, by contrast to basic surface-level measures such as the volume of or sentiments in Tweets, the information content of Tweets and their timeliness significantly improve forecasting accuracy. Our method endogenously summarizes the information in Tweets. The advantage of our method is that the classification of the Tweets is based on what is in the Tweets rather than preconceived topics that may not be relevant. We also find that, by contrast to Twitter, other online data (e.g., Google Trends, Wikipedia views, IMDB reviews, and Huffington Post news) are very weak predictors of TV show demand because users tweet about TV shows before, during, and after a TV show, whereas Google searches, Wikipedia views, IMDB reviews, and news posts typically lag behind the show.
引用
收藏
页码:363 / 388
页数:26
相关论文
共 50 条
  • [1] Application Of Cloud Computing In Biomedicine Big Data Analysis Cloud Computing In Big Data
    Yang, Tianyi
    Zhao, Yang
    [J]. 2017 INTERNATIONAL CONFERENCE ON ALGORITHMS, METHODOLOGY, MODELS AND APPLICATIONS IN EMERGING TECHNOLOGIES (ICAMMAET), 2017,
  • [2] Cloud Computing for Big Data Analysis
    Marozzo, Fabrizio
    Belcastro, Loris
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [3] Structured and Unstructured Big Data Analytics
    Misluu, Suyash
    Misra, Anuranjan
    [J]. 2017 INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN COMPUTER, ELECTRICAL, ELECTRONICS AND COMMUNICATION (CTCEEC), 2017, : 740 - 746
  • [4] Associated Index for Big Structured and Unstructured Data
    Zhu, Chunying
    Li, Qingzhong
    Kong, Lanju
    Wang, Xiangwei
    Hong, Xiaoguang
    [J]. WEB-AGE INFORMATION MANAGEMENT (WAIM 2015), 2015, 9098 : 567 - 570
  • [5] Cloud Computing and Big Data
    Hsu, Ching-Hsien
    Tang, Chunming
    Esteves, Rui M.
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2014, 15 (06): : 995 - 997
  • [6] Big data and cloud computing
    Shrestha, Rasu B.
    [J]. APPLIED RADIOLOGY, 2014, 43 (03) : 32 - 34
  • [7] Application of Big Data Analysis and Cloud Computing Technology
    Guo, Dajun
    [J]. INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2024, 16 (01)
  • [8] Actionable Knowledge As A Service (AKAAS): Leveraging big data analytics in cloud computing environments
    Depeige A.
    Doyencourt D.
    [J]. Journal of Big Data, 2015, 2 (01)
  • [9] Transformative computing in security, big data analysis, and cloud computing applications
    Ogiela, Lidia
    Leu, Fang-Yie
    Fiore, Ugo
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (23):
  • [10] Mining Local Specialties for Travelers by Leveraging Structured and Unstructured Data
    Jiang, Kai
    Liu, Like
    Xiao, Rong
    Yu, Nenghai
    [J]. ADVANCES IN MULTIMEDIA, 2012, 2012