Searching, Translating and Classifying Information in Cyberspace

被引:0
|
作者
Savoy, Jacques [1 ]
Dolamic, Ljiljana [1 ]
Zubaryeva, Olena [1 ]
机构
[1] Univ Neuchatel, Dept Comp Sci, CH-2000 Neuchatel, Switzerland
关键词
Search technology; web; machine translation; automatic text classification; machine learning; natural language processing (NLP); WEB;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we describe current search technologies available on the web, explain underlying difficulties and show their limits, related to either current technologies or to the intrinsic properties of all natural languages. We then analyze the effectiveness of freely available machine translation services and demonstrate that under certain conditions these translation systems can operate at the same performance levels as manual translators. Searching for factual information with commercial search engines also allows the retrieval of facts, user comments and opinions on target items. In the third part we explain how the principle machine learning strategies are able to classify short passages of text extracted from the blogosphere as factual or opinionated and then classify their polarity (positive, negative or mixed).
引用
收藏
页码:62 / 75
页数:14
相关论文
共 50 条
  • [31] Auditory browsing for acquisition of information in Cyberspace
    Oki, N
    Teramoto, K
    Okada, K
    Matsushita, Y
    [J]. PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, : 510 - 515
  • [32] Urban planning, information technology, and cyberspace
    Shiode, N
    [J]. JOURNAL OF URBAN TECHNOLOGY, 2000, 7 (02) : 105 - 126
  • [33] INFORMATION INSECURITY: AN ASSESSMENT OF THE ROMANIAN CYBERSPACE
    Sechel, Sergiu
    [J]. INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY, IE 2016: EDUCATION, RESEARCH & BUSINESS TECHNOLOGIES, 2016, : 315 - 320
  • [34] Model and application of cyberspace information system
    Wang, Jilong
    Zhuang, Shuying
    Miao, Congcong
    An, Changqing
    [J]. Tongxin Xuebao/Journal on Communications, 2020, 41 (02): : 74 - 83
  • [35] Consumers and cyberspace: Inequitable distribution of information
    Simon, A
    [J]. CONSUMER INTERESTS ANNUAL, VOL 42, 1996, 42 : 265 - 266
  • [36] The Mechanism of Tendentious Information Dissemination in Cyberspace
    School of Information Communication, National University of Defense Technology, Wuhan
    430000, China
    不详
    450000, China
    不详
    450000, China
    [J]. Appl. Sci., 2024, 20
  • [37] Information, place, and cyberspace: Issues in accessibility
    Mayer, HJ
    [J]. JOURNAL OF URBAN TECHNOLOGY, 2002, 9 (01) : 128 - 130
  • [38] Searching by Similarity and Classifying Images on a Very Large Scale
    Amato, Giuseppe
    Savino, Pasquale
    [J]. SISAP 2009: 2009 SECOND INTERNATIONAL WORKSHOP ON SIMILARITY SEARCH AND APPLICATIONS, PROCEEDINGS, 2009, : 149 - 150
  • [39] Identifying, Classifying and Searching Graphic Symbols in the NOTAE System
    Boccuzzi, Maria
    Catarci, Tiziana
    Deodati, Luca
    Fantoli, Andrea
    Ghignoli, Antonella
    Leotta, Francesco
    Mecella, Massimo
    Monte, Anna
    Sietis, Nina
    [J]. DIGITAL LIBRARIES: THE ERA OF BIG DATA AND DATA SCIENCE, IRCDL 2020, 2020, 1177 : 111 - 122
  • [40] Using a hierarchical Thesaurus for classifying and searching software libraries
    Liao, HC
    Chen, MF
    Wang, FJ
    Dai, JC
    [J]. COMPSAC 97 : TWENTY-FIRST ANNUAL INTERNATIONAL COMPUTER SOFTWARE & APPLICATIONS CONFERENCE, 1997, : 210 - 216