To classify large-scale text corpora, one common approach is using hierarchical text classification and classifying text documents in a top-down manner. Classification methods using top-down approach can scale well and cope with changes to the category trees. However, all these methods suffer from a common problem: a high level of misclassification document has unrecoverable. We define an virtual subclass for each non-leaf category to help the rejected documents go back to ancestor category, thus improving the overall performance. Our experiments using Support Vector Machine (SVM) classifiers on the 20newsgroup collection have shown that they all could reduce blocking and improve the classification accuracy. Our experiments have also shown that the virtual category method delivered the best performance. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of ICAE2011.
机构:
Boston Univ, Div Mat Sci & Engn, Boston, MA 02215 USA
Boston Univ, Dept Phys, Boston, MA 02215 USA
Boston Univ, Dept Elect & Comp Engn, Boston, MA 02215 USABoston Univ, Div Mat Sci & Engn, Boston, MA 02215 USA
Imboden, Matthias
Bishop, David
论文数: 0引用数: 0
h-index: 0
机构:
Boston Univ, Div Mat Sci & Engn, Boston, MA 02215 USA
Boston Univ, Dept Phys, Boston, MA 02215 USA
Boston Univ, Dept Elect & Comp Engn, Boston, MA 02215 USABoston Univ, Div Mat Sci & Engn, Boston, MA 02215 USA
机构:
Univ Wisconsin, Dept Chem, Madison, WI 53706 USAUniv Wisconsin, Dept Chem, Madison, WI 53706 USA
Melby, Jake A.
Roberts, David S.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, Dept Chem, Madison, WI 53706 USAUniv Wisconsin, Dept Chem, Madison, WI 53706 USA
Roberts, David S.
Larson, Eli J.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, Dept Chem, Madison, WI 53706 USAUniv Wisconsin, Dept Chem, Madison, WI 53706 USA
Larson, Eli J.
Brown, Kyle A.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, Dept Chem, Madison, WI 53706 USA
Univ Wisconsin, Dept Surg, Madison, WI 53705 USAUniv Wisconsin, Dept Chem, Madison, WI 53706 USA
Brown, Kyle A.
Bayne, Elizabeth F.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, Dept Chem, Madison, WI 53706 USAUniv Wisconsin, Dept Chem, Madison, WI 53706 USA
Bayne, Elizabeth F.
Jin, Song
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, Dept Chem, Madison, WI 53706 USAUniv Wisconsin, Dept Chem, Madison, WI 53706 USA
Jin, Song
Ge, Ying
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, Dept Chem, Madison, WI 53706 USA
Univ Wisconsin, Dept Cell & Regenerat Biol, Madison, WI 53705 USA
Univ Wisconsin, Human Prote Program, Madison, WI 53705 USAUniv Wisconsin, Dept Chem, Madison, WI 53706 USA
机构:
MRC Cognit & Brain Sci Unit, Cambridge CB2 7EF, EnglandMRC Cognit & Brain Sci Unit, Cambridge CB2 7EF, England
Sohoglu, Ediz
Peelle, Jonathan E.
论文数: 0引用数: 0
h-index: 0
机构:
MRC Cognit & Brain Sci Unit, Cambridge CB2 7EF, England
Washington Univ, Dept Otolaryngol, St Louis, MO 63130 USAMRC Cognit & Brain Sci Unit, Cambridge CB2 7EF, England
Peelle, Jonathan E.
Carlyon, Robert P.
论文数: 0引用数: 0
h-index: 0
机构:
MRC Cognit & Brain Sci Unit, Cambridge CB2 7EF, EnglandMRC Cognit & Brain Sci Unit, Cambridge CB2 7EF, England
Carlyon, Robert P.
Davis, Matthew H.
论文数: 0引用数: 0
h-index: 0
机构:
MRC Cognit & Brain Sci Unit, Cambridge CB2 7EF, EnglandMRC Cognit & Brain Sci Unit, Cambridge CB2 7EF, England