Tradeoffs in Accuracy and Efficiency in Supervised Learning Methods

被引：38

作者：

Collingwood, Loren ^{[1
]}

Wilkerson, John ^{[1
]}

机构：

[1] Univ Washington, Dept Polit Sci, Box 353530,101 Gowen Hall, Seattle, WA 98195 USA

来源：

JOURNAL OF INFORMATION TECHNOLOGY & POLITICS | 2012年 / 9卷 / 03期

关键词：

Machine learning; supervised learning; text classification;

D O I：

10.1080/19331681.2012.669191

中图分类号：

G2 [信息与知识传播];

学科分类号：

05 ; 0503 ;

摘要：

Words are an increasingly important source of data for social science research. Automated classification methodologies hold the promise of substantially lowering the costs of analyzing large amounts of text. In this article, we consider a number of questions of interest to prospective users of supervised learning methods, which are used to automatically classify events based on a pre-existing classification system. Although information scientists devote considerable attention to assessing the performance of different supervised learning algorithms and feature representations, the questions asked are often less directly relevant to the more practical concerns of social scientists. The first question prospective social science users are likely to ask is, How well do such methods work? The second is, How much human labeling effort is required? The third is, How do we assess whether virgin cases have been automatically classified with sufficient accuracy? We address these questions in the context of a particular dataset-the Congressional Bills Project-which includes more than 400,000 bill titles that humans have classified into 20 policy topics. This corpus offers an unusual opportunity to assess the performance of different algorithms, the impact of sample size, and the benefits of ensemble learning as a means for estimating classification accuracy.

引用

页码：298 / 318

页数：21

共 50 条

[31] A Survey of Semi-Supervised Learning Methods
Pise, Nitin N.
Kulkarni, Parag
2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, VOLS 1 AND 2, PROCEEDINGS, 2008, : 593 - +
[32] Almonds classification using supervised learning methods
Halac, Delila
Sokic, Emir
Turajlic, Emir
2017 XXVI INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND AUTOMATION TECHNOLOGIES (ICAT), 2017,
[33] Supervised Learning Methods Application to Sentiment Analysis
Altares Lopez, Sergio
Cuadrado-Gallego, Juan J.
IDEAS '19: PROCEEDINGS OF THE 23RD INTERNATIONAL DATABASE APPLICATIONS & ENGINEERING SYMPOSIUM (IDEAS 2019), 2019, : 145 - 150
[34] The impact of learning on perceptual decisions and its implication for speed-accuracy tradeoffs
Mendonca, Andre G.
Drugowitsch, Jan
Vicente, M. Ines
DeWitt, Eric E. J.
Pouget, Alexandre
Mainen, Zachary F.
NATURE COMMUNICATIONS, 2020, 11 (01)
[35] The impact of learning on perceptual decisions and its implication for speed-accuracy tradeoffs
André G. Mendonça
Jan Drugowitsch
M. Inês Vicente
Eric E. J. DeWitt
Alexandre Pouget
Zachary F. Mainen
Nature Communications, 11
[36] Comparison of the Accuracy of Ground Reaction Force Component Estimation between Supervised Machine Learning and Deep Learning Methods Using Pressure Insoles
Kammoun, Amal
Ravier, Philippe
Buttelli, Olivier
SENSORS, 2024, 24 (16)
[37] Self-Supervised Learning Improves Accuracy and Data Efficiency for IMU-Based Ground Reaction Force Estimation
Tan, Tian
Shull, Peter B.
Hicks, Jenifer L.
Uhlrich, Scott D.
Chaudhari, Akshay S.
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2024, 71 (07) : 2095 - 2104
[38] Accuracy and efficiency improvements in synthetic eddy methods
Skillen, A.
Revell, A.
Craft, T.
INTERNATIONAL JOURNAL OF HEAT AND FLUID FLOW, 2016, 62 : 386 - 394
[39] Efficiency and Accuracy of Wildland Weed Mapping Methods
Christensen, Stephanie D.
Ransom, Corey V.
Edvarchuk, Kimberly A.
Rasmussen, V. Philip
INVASIVE PLANT SCIENCE AND MANAGEMENT, 2011, 4 (04) : 458 - 466
[40] ON THE EFFICIENCY AND ACCURACY OF INTERPOLATION METHODS FOR SPECTRAL CODES
van Hinsberg, M. A. T.
Boonkkamp, J. H. M. Ten Thije
Toschi, F.
Clercx, H. J. H.
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2012, 34 (04): : B479 - B498

← 1 2 3 4 5 →