KONG-DB: Korean Novel Geo-name DB & Search and Visualization System Using Dictionary from the Web

  • Journal of the Korean Society for Information Management
  • Abbr : JKOSIM
  • 2016, 33(3), pp.321~343
  • DOI : 10.3743/KOSIM.2016.33.3.321
  • Publisher : Korean Society for Information Management
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : August 23, 2016
  • Accepted : September 13, 2016
  • Published : September 30, 2016

Park, Sung Hee 1

1 Hannam University

Accredited

ABSTRACT

This study designed a semi-automatic, web-based pilot system that 1) builds a Korean novel geo-name database, 2) keeps the database scalable by updating it through automatic geo-name extraction, and 3) retrieves the usage of an old geo-name and visualizes it on a map. Extracting novel geo-names, which are now obsolete, is particularly difficult because obtaining a corpus to use as training data is burdensome. To build the training corpus, an admin tool consisting of an HTML crawler and parser written in Python crawled geo-names and their usages from a vocabulary dictionary for the Korean New Novel, collecting enough data to train a named entity tagger that can extract even novel geo-names that do not appear in the training corpus. With the training corpus and the automatic extraction tool, the geo-name database was made scalable. The system can also visualize geo-names on a map. The study designed and implemented the prototype and empirically verified the validity of the pilot system. Finally, items for improvement are discussed.
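
As a rough illustration of the corpus-building step described above, the following minimal Python sketch parses dictionary entries out of HTML and turns each usage sentence into token-level training labels. It is not the paper's admin tool: the `<dt>`/`<dd>` markup, the names `EntryParser`, `parse_entries` and `bio_tags`, and the `B-LOC`/`O` label scheme are all assumptions made for the example, and the whitespace tokenization with prefix matching is a deliberate simplification that only handles single-token geo-names.

```python
from html.parser import HTMLParser


class EntryParser(HTMLParser):
    """Collect (geo_name, usage) pairs from <dt>/<dd> pairs.

    The <dt>/<dd> layout is a hypothetical markup, not the real dictionary's.
    """

    def __init__(self):
        super().__init__()
        self.entries = []    # finished (geo_name, usage) pairs
        self._field = None   # "dt", "dd", or None while outside both
        self._name = ""
        self._usage = ""

    def handle_starttag(self, tag, attrs):
        if tag == "dt":
            self._field, self._name = "dt", ""
        elif tag == "dd":
            self._field, self._usage = "dd", ""

    def handle_data(self, data):
        if self._field == "dt":
            self._name += data
        elif self._field == "dd":
            self._usage += data

    def handle_endtag(self, tag):
        # An entry is complete once its usage (<dd>) closes.
        if tag == "dd" and self._field == "dd":
            self.entries.append((self._name.strip(), self._usage.strip()))
        if tag in ("dt", "dd"):
            self._field = None


def parse_entries(html):
    """Return the list of (geo_name, usage) pairs found in an HTML string."""
    parser = EntryParser()
    parser.feed(html)
    parser.close()
    return parser.entries


def bio_tags(geo_name, usage):
    """Split usage on whitespace and label tokens starting with geo_name as B-LOC.

    Prefix matching lets a token carry a trailing particle or punctuation mark.
    """
    return [(tok, "B-LOC" if tok.startswith(geo_name) else "O")
            for tok in usage.split()]
```

Calling `parse_entries` on a crawled page and passing each pair to `bio_tags` yields labeled token sequences of the kind a named entity tagger is typically trained on; a real pipeline would need a proper Korean tokenizer and multi-token span handling.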

This paper was written with support from the National Research Foundation of Korea.