@article{ART002580168},
author={In-Su Kang},
title={Evaluation of English Term Extraction based on Inner/Outer Term Statistics},
journal={Journal of The Korea Society of Computer and Information},
issn={1598-849X},
year={2020},
volume={25},
number={4},
pages={141-148},
doi={10.9708/jksci.2020.25.04.141}
TY - JOUR
AU - In-Su Kang
TI - Evaluation of English Term Extraction based on Inner/Outer Term Statistics
JO - Journal of The Korea Society of Computer and Information
PY - 2020
VL - 25
IS - 4
PB - The Korean Society Of Computer And Information
SP - 141
EP - 148
SN - 1598-849X
AB - Automatic term extraction is to recognize domain-specific terms given a collection of domain-specific text. Previous term extraction methods operate effectively in unsupervised manners which include extracting candidate terms, and assigning importance scores to candidate terms. Regarding the calculation of term importance scores, the study focuses on utilizing sets of inner and outer terms of a candidate term. For a candidate term, its inner terms are shorter terms which belong to the candidate term as components, and its outer terms are longer terms which include the candidate term as their component.
This work presents various functions that compute, for a candidate term, term strength from either set of its inner or outer terms. In addition, a scoring method of a term importance is devised based on C-value score and the term strength values obtained from the sets of inner and outer terms.
Experimental evaluations using GENIA and ACL RD-TEC 2.0 datasets compare and analyze the effectiveness of the proposed term extraction methods for English. The proposed method performed better than the baseline method by up to 1% and 3% respectively for GENIA and ACL datasets.
KW - Term extraction;Inner term set;Outer term set;Term importance score;Domain term
DO - 10.9708/jksci.2020.25.04.141
ER -
In-Su Kang. (2020). Evaluation of English Term Extraction based on Inner/Outer Term Statistics. Journal of The Korea Society of Computer and Information, 25(4), 141-148.
In-Su Kang. 2020, "Evaluation of English Term Extraction based on Inner/Outer Term Statistics", Journal of The Korea Society of Computer and Information, vol.25, no.4 pp.141-148. Available from: doi:10.9708/jksci.2020.25.04.141
In-Su Kang "Evaluation of English Term Extraction based on Inner/Outer Term Statistics" Journal of The Korea Society of Computer and Information 25.4 pp.141-148 (2020) : 141.
In-Su Kang. Evaluation of English Term Extraction based on Inner/Outer Term Statistics. 2020; 25(4), 141-148. Available from: doi:10.9708/jksci.2020.25.04.141
In-Su Kang. "Evaluation of English Term Extraction based on Inner/Outer Term Statistics" Journal of The Korea Society of Computer and Information 25, no.4 (2020) : 141-148.doi: 10.9708/jksci.2020.25.04.141
In-Su Kang. Evaluation of English Term Extraction based on Inner/Outer Term Statistics. Journal of The Korea Society of Computer and Information, 25(4), 141-148. doi: 10.9708/jksci.2020.25.04.141
In-Su Kang. Evaluation of English Term Extraction based on Inner/Outer Term Statistics. Journal of The Korea Society of Computer and Information. 2020; 25(4) 141-148. doi: 10.9708/jksci.2020.25.04.141
In-Su Kang. Evaluation of English Term Extraction based on Inner/Outer Term Statistics. 2020; 25(4), 141-148. Available from: doi:10.9708/jksci.2020.25.04.141
In-Su Kang. "Evaluation of English Term Extraction based on Inner/Outer Term Statistics" Journal of The Korea Society of Computer and Information 25, no.4 (2020) : 141-148.doi: 10.9708/jksci.2020.25.04.141