
Application of Machine Learning Techniques for Resolving Korean Author Names

  • Journal of the Korean Society for Information Management
  • Abbr : JKOSIM
  • 2008, 25(3), pp.27~39
  • DOI : 10.3743/KOSIM.2008.25.3.027
  • Publisher : Korean Society for Information Management (한국정보관리학회)
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : July 2, 2008
  • Accepted : July 23, 2008
  • Published : September 30, 2008

In-Su Kang 1

1 Kyungsung University

Accredited

ABSTRACT

In bibliographic data, authors are identified by personal names, which makes it difficult to pin down a particular author because many different authors share the same name. Resolving same-name author instances into distinct individuals is called author resolution. It consists of two steps: calculating similarities between author instances, and then clustering same-name author instances into groups, one per person. Author similarities are computed from the similarities of author-related bibliographic features such as coauthors, paper titles, and publication information, using either supervised or unsupervised methods. Supervised approaches employ machine learning techniques to learn the author similarity function automatically from author-resolved training samples. So far, however, only a few machine learning methods have been investigated for author resolution. This paper provides a comparative evaluation of a variety of recent high-performing machine learning techniques on author disambiguation, and compares several methods of processing author disambiguation features such as coauthors and paper titles.
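
The two-step process described in the abstract (pairwise similarity, then clustering) can be illustrated with a minimal sketch. It is not the paper's method: the paper evaluates supervised machine learning techniques, whereas this example uses a hand-set, unsupervised similarity for brevity. The record fields (`coauthors`, `title`), the feature weights, the threshold, and all function names are hypothetical choices made for illustration only.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard overlap of two collections; 0.0 when both are empty."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def author_similarity(r1, r2, w_coauthor=0.6, w_title=0.4):
    """Step 1: combine feature similarities into one score.

    The weights are illustrative assumptions; a supervised approach
    would learn this function from author-resolved training samples.
    """
    coauthor_sim = jaccard(r1["coauthors"], r2["coauthors"])
    title_sim = jaccard(r1["title"].lower().split(),
                        r2["title"].lower().split())
    return w_coauthor * coauthor_sim + w_title * title_sim


def cluster_same_name(records, threshold=0.3):
    """Step 2: single-link clustering of same-name instances via union-find."""
    parent = list(range(len(records)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # Merge any pair whose similarity reaches the (assumed) threshold.
    for i, j in combinations(range(len(records)), 2):
        if author_similarity(records[i], records[j]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(records)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Three made-up records that all carry the same author name.
records = [
    {"coauthors": ["Lee S", "Park J"], "title": "Korean author name disambiguation"},
    {"coauthors": ["Lee S"], "title": "Clustering author name instances"},
    {"coauthors": ["Choi M"], "title": "Protein folding simulation"},
]
clusters = cluster_same_name(records)
print(clusters)  # [[0, 1], [2]]
```

Here the first two records share a coauthor and two title words, so they are grouped as one person, while the third record shares nothing with either and remains a separate individual. Single-link merging is only one possible clustering choice; it is used here because it needs no external library.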
