본문 바로가기
  • Home

Hierarchic Document Clustering in OPAC

  • Journal of the Korean Society for Information Management
  • Abbr : JKOSIM
  • 2004, 21(1), pp.93~118
  • DOI : 10.3743/KOSIM.2004.21.1.093
  • Publisher : 한국정보관리학회
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : February 16, 2004
  • Accepted : March 10, 2004
  • Published : March 30, 2004

Jung Soon Ro 1

1한남대학교

Accredited

ABSTRACT

This study is to develop a hiararchic clustering model for document classification and browsing in OPAC systems. Two automatic indexing techniques (with and without controlled terms), two term weighting methods (based on term frequency and binary weight), five similarity coefficients (Dice, Jaccard, Pearson, Cosine, and Squared Euclidean), and three hierarchic clustering algorithms (Between Average Linkage, Within Average Linkage, and Complete Linkage method) were tested on the document collection of 175 books and theses on library and information science. The best document clusters resulted from the Between Average Linkage or Complete Linkage method with Jaccard or Dice coefficient on the automatic indexing with controlled terms in binary vector. The clusters from Between Average Linkage with Jaccard has more likely decimal classification structure.

Citation status

* References for papers published after 2023 are currently being built.