본문 바로가기
  • Home

The analysis of the nested structure of named entities in Korea

  • Korean Semantics
  • 2022, 76(), pp.66-101
  • Publisher : The Society Of Korean Semantics
  • Research Area : Humanities > Korean Language and Literature
  • Received : May 11, 2022
  • Accepted : June 13, 2022
  • Published : June 30, 2022

youngsook song 1 Hyun Jo You 2 Cheong, Yu-Nam 3

1경희대학교
2서울대학교
3중앙대학교 인문콘텐츠연구소

Accredited

ABSTRACT

This paper analyzes the hierarchical structures of named entities in the NIKL Named Entity Corpus, which is annotated with 553,830 flat named entity tags. This study will be a base for developing a method to build a Korean nested named entity corpus. The flat version of named entity recognition identifies mentions as linear spans. The nested named entity approach analyzes the hierarchical internal structure of named entities which may consist of smaller component named entities. We extracted candidate mentions for the nested named entity analysis from the NIKL Named Entity Corpus and classified them into three categories: serial named entities, complex named entities, and phrases with a named entity head. These candidates were reviewed manually to be selected as the target of nested named entity analysis. Finally, we discussed the span and the internal structure of named entities and proposed principles and guidelines for the construction of the Korean nested named entity corpus

Citation status

* References for papers published after 2023 are currently being built.