A Study on Improving the Performance of Document Classification Using the Context of Terms

  • Journal of the Korean Society for Information Management
  • Abbr : JKOSIM
  • 2012, 29(2), pp.205~224
  • DOI : 10.3743/KOSIM.2012.29.2.205
  • Publisher : Korean Society for Information Management
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : May 30, 2012
  • Accepted : June 26, 2012
  • Published : June 30, 2012

Song, Sung Jeon 1; Young-Mee Chung 1

1 Yonsei University

Accredited

ABSTRACT

One limitation of the bag-of-words (BOW) method is that each term is recognized only by its surface form, so the representation fails to capture the term's meaning or thematic background. To overcome this limitation, a separate profile was defined for each term in each thematic category, based on the term's contextual characteristics in that category. A term was then used as a classification feature according to its meaning or thematic background, which was determined by comparing the contexts recorded in those profiles with the term's occurrences in an actual document. The experiment was conducted in three phases: term weighting, ensemble classifier implementation, and feature selection. Classification performance improved in all three phases, with the ensemble classifier achieving the highest performance score. The results also showed that the proposed method was effective in reducing the performance bias caused by the total number of training documents.
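
The idea of per-category term profiles can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how context is measured, so the fixed word window, the cosine similarity, the `term@category` feature naming, and all function names below are assumptions made for illustration only.

```python
from collections import Counter, defaultdict
from math import sqrt


def contexts(tokens, window=2):
    """Yield (term, Counter of neighbouring tokens) for every token position.

    The fixed-size window is an assumption; the abstract does not define context.
    """
    for i, term in enumerate(tokens):
        left = tokens[max(0, i - window):i]
        right = tokens[i + 1:i + 1 + window]
        yield term, Counter(left + right)


def build_profiles(labeled_docs, window=2):
    """Build profiles[term][category] = counts of context words from training docs."""
    profiles = defaultdict(lambda: defaultdict(Counter))
    for tokens, category in labeled_docs:
        for term, ctx in contexts(tokens, window):
            profiles[term][category].update(ctx)
    return profiles


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (assumed measure)."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def contextual_features(tokens, profiles, window=2):
    """Replace each known term with a 'term@category' feature.

    The category is the one whose profile best matches the term's context in
    this document. Terms without a profile fall back to plain BOW features.
    """
    feats = Counter()
    for term, ctx in contexts(tokens, window):
        cats = profiles.get(term)
        if not cats:
            feats[term] += 1
            continue
        best = max(cats, key=lambda c: cosine(ctx, cats[c]))
        feats[f"{term}@{best}"] += 1
    return feats


# Hypothetical toy training data: (tokens, thematic category).
train = [
    ("the bank approved the loan".split(), "finance"),
    ("the river bank was muddy".split(), "nature"),
]
profiles = build_profiles(train)
features = contextual_features("muddy river bank".split(), profiles)
```

In this toy setup the same surface form `bank` yields a different feature depending on its neighbours, which is the contrast with plain BOW that the abstract describes. The resulting feature counts could then feed any standard term weighting scheme or classifier.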
