본문 바로가기
  • Home

Research on Minimizing Access to RDF Triple Store for Efficiency in Constructing Massive Bibliographic Linked Data

  • Journal of Korean Library and Information Science Society
  • Abbr : JKLISS
  • 2017, 48(3), pp.233-257
  • DOI : 10.16981/kliss.48.3.201709.233
  • Publisher : Korean Library And Information Science Society
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : August 20, 2017
  • Accepted : September 19, 2017

Lee Moon-Ho 1 Sung-Pil Choi 1

1경기대학교

Accredited

ABSTRACT

In this paper, we propose an effective method to convert and construct the MEDLINE, the world's largest biomedical bibliographic database, into linked data. To do this, we first derive the appropriate RDF schema by analyzing the MEDLINE record structure in detail, and convert each record into a valid RDF file in the derived schema. We apply the dual batch registration method to streamline the subject URI duplication checking procedure when merging all RDF files in the converted record unit and storing it in a single RDF triple storage. By applying this method, the number of RDF triple storage accesses for the subject URI duplication is reduced from 26,597,850 to 2,400, compared with the sequential configuration of linked data in units of RDF files. Therefore, it is expected that the result of this study will provide an important opportunity to eliminate the inefficiency in converting large volume bibliographic record sets into linked data, and to secure promptness and timeliness.

Citation status

* References for papers published after 2023 are currently being built.