An Experimental Study on Topic Distillation Using Web Site Structure

  • Journal of the Korean Society for Information Management
  • Abbr : JKOSIM
  • 2007, 24(3), pp.201~218
  • DOI : 10.3743/KOSIM.2007.24.3.201
  • Publisher : Korean Society for Information Management
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : August 15, 2007
  • Accepted : September 10, 2007
  • Published : September 30, 2007

Jee-Suk Lee 1, Young-Mee Chung 2

1 NHN Corp.
2 Yonsei University

Accredited

ABSTRACT

This study proposes a topic distillation algorithm that ranks the relevant sites selected from retrieved web pages, and evaluates its performance. The algorithm calculates the topic score of a site using the site's hierarchical structure. The TREC .GOV test collection and a set of TREC-2004 queries for the topic distillation task were used in the experiment. The results showed that the algorithm returned at least two relevant sites among the top ten retrieval results. An in-depth analysis of the list of relevant sites provided by TREC-2004 revealed that the definition of topic distillation had not been strictly applied in selecting relevant sites. When the retrieved sites and sub-sites were re-evaluated against a revised list of relevant sites, the performance of the proposed algorithm improved significantly.
