본문 바로가기
  • Home

Improving the Performance of SVM Text Categorization with Inter-document Similarities

  • Journal of the Korean Society for Information Management
  • Abbr : JKOSIM
  • 2005, 22(3), pp.261~287
  • DOI : 10.3743/KOSIM.2005.22.3.261
  • Publisher : 한국정보관리학회
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : August 20, 2005
  • Accepted : September 6, 2005
  • Published : September 30, 2005

Lee, Jae Yun 1

1경기대학교

Accredited

ABSTRACT

The purpose of this paper is to explore the ways to improve the performance of SVM(Support Vector Machines) text classifier using inter-document similarit ies. SVMs are powerful machine technique for automatic document classification. In this paper text categorization via SVMs aproach based on feature representation with document vectors is suggested. In this appr oach, document vectors instead stead of term weights are used as feature values. Experiments show that SVM clasifier with do cument vector features can improve the document classification performance. For the sake o f run-time efficiency, two methods are developed: One is to select document vector feature s, and the other is to use category centroid vector features instead. Experiments on these two methods show that we the performance of conventional methods with index term features.

Citation status

* References for papers published after 2023 are currently being built.