Investigating the Statistical Properties of User-Generated Documents

Authors
Inches, Giacomo [1]
Carman, Mark James [2]
Crestani, Fabio [1]
Affiliations
[1] Univ Lugano, Fac Informat, Lugano, Switzerland
[2] Monash Univ, Fac Informat Technol, Melbourne, Australia
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The importance of the Internet as a communication medium is reflected in the large number of documents generated every day by users of the various online services. In this work we aim to analyze the properties of these online user-generated documents for several established Internet services (Kongregate, Twitter, Myspace and Slashdot) and to compare them with a consolidated collection of standard information retrieval documents (from the Wall Street Journal, Associated Press and Financial Times, as part of the TREC ad-hoc collection). We investigate features such as document similarity, term burstiness, emoticons and Part-Of-Speech analysis, highlighting the applicability and limits of traditional content analysis and indexing techniques used in information retrieval when applied to the new online user-generated documents.
Pages: 198-+
Page count: 3