The Unreasonable Effectiveness of Data

被引:810
|
作者
Halevy, Alon
Norvig, Peter
Pereira, Fernando
机构
关键词
D O I
10.1109/MIS.2009.36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural language processing problems are solved by the use of unreasonable effectiveness of data. The biggest successes in natural-language-related machine learning is statistical speech recognition and statistical machine translation. The first lesson of Web-scale learning is to use available large-scale data rather than hoping for annotated data that is not available. The statistical language models used in speech recognition and machine translation consist of a huge database of probabilities of short sequences of consecutive words. Natural language processing require choosing a representation language, encoding a model in that language, and performing inference on the model. Semantic interpretation deals with imprecise, ambiguous natural languages, and service interoperability deals with making data precise enough so that the programs operating on the data functions effectively.
引用
收藏
页码:8 / 12
页数:5
相关论文
共 50 条
  • [1] Untidy Data: The Unreasonable Effectiveness of Tables
    Bartram, Lyn
    Correll, Michael
    Tory, Melanie
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (01) : 686 - 696
  • [2] The Unreasonable Effectiveness, and Difficulty, of Data in Healthcare
    Lee, Peter
    [J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 3 - 3
  • [3] Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
    Sun, Chen
    Shrivastava, Abhinav
    Singh, Saurabh
    Gupta, Abhinav
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 843 - 852
  • [4] The unreasonable effectiveness of ...
    Kadanoff, LP
    [J]. PHYSICS TODAY, 2000, 53 (11) : 11 - 12
  • [5] UNREASONABLE EFFECTIVENESS OF MATHEMATICS
    HAMMING, RW
    [J]. AMERICAN MATHEMATICAL MONTHLY, 1980, 87 (02): : 81 - 90
  • [6] UNREASONABLE EFFECTIVENESS OF SIMULATION
    不详
    [J]. SIMULATION, 1979, 33 (01) : R7 - R9
  • [7] The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition
    Krause, Jonathan
    Sapp, Benjamin
    Howard, Andrew
    Zhou, Howard
    Toshev, Alexander
    Duerig, Tom
    Philbin, James
    Li Fei-Fei
    [J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 301 - 320
  • [8] The Unreasonable Effectiveness of Martingales
    Peres, Yuval
    [J]. PROCEEDINGS OF THE TWENTIETH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2009, : 997 - 1000
  • [9] ON THE UNREASONABLE EFFECTIVENESS OF DIAGRAMS
    Damour, Thibault
    [J]. REVUE DE SYNTHESE, 2017, 138 (1-4): : 231 - 260
  • [10] THE UNREASONABLE EFFECTIVENESS OF COMPUTER PHYSICS
    DREITLEIN, J
    [J]. FOUNDATIONS OF PHYSICS, 1993, 23 (06) : 923 - 930