Applicability Evaluation and Feature Optimization of the OpenAlex Global Author Disambiguation Model for Korean Scholarly Data

  • Journal of the Korean Biblia Society for Library and Information Science
  • 2026, 37(1), pp.387~410
  • DOI : 10.14699/kbiblia.2026.37.1.387
  • Publisher : Journal of the Korean Biblia Society for Library and Information Science
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : February 20, 2026
  • Accepted : March 2, 2026
  • Published : March 30, 2026

Hyeong-Sang Jeong 1, Seung-Jin Kwak 2

1 Department of Library and Information Science, Chungnam National University
2 Chungnam National University

Accredited

ABSTRACT

Author Name Disambiguation (AND) is a critical task in scholarly information systems; however, the applicability of the English-centric OpenAlex model to the Korean academic ecosystem has yet to be fully validated. This study evaluates OpenAlex's performance on 54,049 papers (2023-2024) from KISTI's OCEAN database and optimizes seven features tailored to Korean linguistic characteristics. Stepwise experiments show that the F1-score improved from 0.852 (v1-1) to 0.860 (v2-2), and that after ground-truth refinement the model reached an accuracy of 0.930 and an F1-score of 0.931. Cross-validation against ORCID yielded an F1-score of 0.892, supporting the model's reliability. We also propose an optimization process that combines incremental processing with manual verification to manage large-scale data efficiently. Finally, the study validates a pipeline that clusters 183,105 author records into 109,205 unique identifiers, demonstrating its practical feasibility and scalability for Korean scholarly metadata.
