This paper analyzes the hierarchical structures of named entities in the NIKL Named Entity Corpus, which is annotated with 553,830 flat named entity tags. This study will be a base for developing a method to build a Korean nested named entity corpus. The flat version of named entity recognition identifies mentions as linear spans.
The nested named entity approach analyzes the hierarchical internal structure of named entities which may consist of smaller component named entities. We extracted candidate mentions for the nested named entity analysis from the NIKL Named Entity Corpus and classified them into three categories: serial named entities, complex named entities, and phrases with a named entity head. These candidates were reviewed manually to be selected as the target of nested named entity analysis. Finally, we discussed the span and the internal structure of named entities and proposed principles and guidelines for the construction of the Korean nested named entity corpus
@article{ART002853584}, author={youngsook song and You Hyun-Jo and Cheong, Yunam}, title={The analysis of the nested structure of named entities in Korea}, journal={Korean Semantics}, issn={1226-7198}, year={2022}, volume={76}, pages={66-101}
TY - JOUR AU - youngsook song AU - You Hyun-Jo AU - Cheong, Yunam TI - The analysis of the nested structure of named entities in Korea JO - Korean Semantics PY - 2022 VL - 76 IS - null PB - The Society Of Korean Semantics SP - 66 EP - 101 SN - 1226-7198 AB - This paper analyzes the hierarchical structures of named entities in the NIKL Named Entity Corpus, which is annotated with 553,830 flat named entity tags. This study will be a base for developing a method to build a Korean nested named entity corpus. The flat version of named entity recognition identifies mentions as linear spans.
The nested named entity approach analyzes the hierarchical internal structure of named entities which may consist of smaller component named entities. We extracted candidate mentions for the nested named entity analysis from the NIKL Named Entity Corpus and classified them into three categories: serial named entities, complex named entities, and phrases with a named entity head. These candidates were reviewed manually to be selected as the target of nested named entity analysis. Finally, we discussed the span and the internal structure of named entities and proposed principles and guidelines for the construction of the Korean nested named entity corpus KW - named entity;named entity recognition;nested named entity;complex named entity;information extraction;named entity annotation;named entity boundary detection;longest named entity;shortest named entity;natural language process DO - UR - ER -
youngsook song, You Hyun-Jo and Cheong, Yunam. (2022). The analysis of the nested structure of named entities in Korea. Korean Semantics, 76, 66-101.
youngsook song, You Hyun-Jo and Cheong, Yunam. 2022, "The analysis of the nested structure of named entities in Korea", Korean Semantics, vol.76, pp.66-101.
youngsook song, You Hyun-Jo, Cheong, Yunam "The analysis of the nested structure of named entities in Korea" Korean Semantics 76 pp.66-101 (2022) : 66.
youngsook song, You Hyun-Jo, Cheong, Yunam. The analysis of the nested structure of named entities in Korea. 2022; 76 66-101.
youngsook song, You Hyun-Jo and Cheong, Yunam. "The analysis of the nested structure of named entities in Korea" Korean Semantics 76(2022) : 66-101.
youngsook song; You Hyun-Jo; Cheong, Yunam. The analysis of the nested structure of named entities in Korea. Korean Semantics, 76, 66-101.
youngsook song; You Hyun-Jo; Cheong, Yunam. The analysis of the nested structure of named entities in Korea. Korean Semantics. 2022; 76 66-101.
youngsook song, You Hyun-Jo, Cheong, Yunam. The analysis of the nested structure of named entities in Korea. 2022; 76 66-101.
youngsook song, You Hyun-Jo and Cheong, Yunam. "The analysis of the nested structure of named entities in Korea" Korean Semantics 76(2022) : 66-101.