본문 바로가기
  • Home

A Semantic-Based Feature Expansion Approach for Improving the Effectiveness of Text Categorization by Using WordNet

  • Journal of the Korean Society for Information Management
  • Abbr : JKOSIM
  • 2009, 26(3), pp.261~278
  • DOI : 10.3743/KOSIM.2009.26.3.261
  • Publisher : 한국정보관리학회
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : August 16, 2009
  • Accepted : August 28, 2009
  • Published : September 30, 2009

Eunkyung Chung 1

1이화여자대학교

Accredited

ABSTRACT

Identifying optimal feature sets in Text Categorization(TC) is crucial in terms of improving the effectiveness. In this study, experiments on feature expansion were conducted using author provided keyword sets and article titles from typical scientific journal articles. The tool used for expanding feature sets is WordNet, a lexical database for English words. Given a data set and a lexical tool, this study presented that feature expansion with synonymous relationship was significantly effective on improving the results of TC. The experiment results pointed out that when expanding feature sets with synonyms using on classifier names, the effectiveness of TC was considerably improved regardless of word sense disambiguation.

Citation status

* References for papers published after 2023 are currently being built.