Identifying Useful Features to Detect Off-Topic Essays in Automated ScoringWithout Using Topic-Specific Training Essays

被引:0
|
作者
Chen, Jing [1 ]
Zhang, Mo [1 ]
机构
[1] Educ Testing Serv, Princeton, NJ 08541 USA
来源
关键词
Automated essay scoring; Feature selection; Off-topic essay detection;
D O I
10.1007/978-3-319-38759-8_24
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
E-rater (R) is the automated scoring engine used at ETS to score the writing quality of essays. A pre-screening filtering system is embedded in e-rater to detect and exclude essays that are not suitable to be scored by e-rater. The pre-screening filtering system is composed of a set of advisory flags, each of which marks some unusualness of the essay (e.g. repetition of words and sentences, restatement of the prompt). This study examined the effectiveness of an advisory flag in the filtering system that detected off-topic essays. The detection of off-topic essays usually requires topic specific training essays to train the engine in order to identify essays that are very different from the other essays of the same topic. The advisory flag used here is designed to detect off-topic essays without using topic-specific training essays because topic-specific training essays may not available in real test settings. To enhance the capability of this off-topic advisory flag, we identified a set of essay features that are potentially useful in distinguishing off-topic essays that do not require topic specific training essays. These features include essay length, the number of word types (exclude non-content-bearing words), the number of word tokens, the similarity of an essay to training essays, essay organization, and the variety of sentences in an essay.
引用
收藏
页码:315 / 326
页数:12
相关论文
共 6 条
  • [1] Identifying off-topic student essays without topic-specific training data
    Higgins, D.
    Burstein, J.
    Attali, Y.
    [J]. Natural Language Engineering, 2006, 12 (02) : 145 - 159
  • [2] Advanced Capabilities for Evaluating Student Writing: Detecting Off-Topic Essays Without Topic-Specific Training
    Burstein, Jill
    Higgins, Derrick
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION: SUPPORTING LEARNING THROUGH INTELLIGENT AND SOCIALLY INFORMED TECHNOLOGY, 2005, 125 : 112 - 119
  • [3] Identifying the Topic-Specific Influential Users using SLM
    Shalaby, May
    Rafea, Ahmed
    [J]. 2015 FIRST INTERNATIONAL CONFERENCE ON ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2015): ADVANCES IN ARABIC COMPUTATIONAL LINGUISTICS, 2015, : 118 - 123
  • [4] On Using User Query Sequence to Detect Off-Topic Search
    Platt, Alana
    Goharian, Nazli
    Mengle, Saket S. R.
    [J]. APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 882 - 883
  • [5] Prominent Users Detection during Specific Events by Learning On-and Off-topic Features of User Activities
    Bizid, Imen
    Nayef, Nibal
    Boursier, Patrice
    Faiz, Sami
    Morcos, Jacques
    [J]. PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, : 500 - 503
  • [6] Course-specific Search Engines: Semi-automated Methods for Identifying High Quality Topic-specific Corpora
    Guha, Neel
    Wytock, Matt
    [J]. PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 1247 - 1252