@article{ART002116784},
author={Kim, Pan Jun},
title={An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning},
journal={Journal of the Korean Society for Information Management},
issn={1013-0799},
year={2016},
volume={33},
number={2},
pages={33-59},
doi={10.3743/KOSIM.2016.33.2.033}
TY - JOUR
AU - Kim, Pan Jun
TI - An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning
JO - Journal of the Korean Society for Information Management
PY - 2016
VL - 33
IS - 2
PB - 한국정보관리학회
SP - 33
EP - 59
SN - 1013-0799
AB - This study examined the factors affecting the performance of automatic classification for the domestic conference papers based on machine learning techniques. In particular, In view of the classification performance that assigning automatically the class labels to the papers in Proceedings of the Conference of Korean Society for Information Management using Rocchio algorithm, I investigated the characteristics of the key factors (classifier formation methods, training set size, weighting schemes, label assigning methods) through the diversified experiments. Consequently, It is more effective that apply proper parameters (β, λ) and training set size (more than 5 years) according to the classification environments and properties of the document set. and If the performance is equivalent, I discovered that the use of the more simple methods (single weighting schemes) is very efficient. Also, because the classification of domestic papers is corresponding with multi-label classification which assigning more than one label to an article, it is necessary to develop the optimum classification model based on the characteristics of the key factors in consideration of this environment.
KW - automatic classification;text categorization;performance factors;conference paper;rocchio algorithm;multi-label classification;machine learning
DO - 10.3743/KOSIM.2016.33.2.033
ER -
Kim, Pan Jun. (2016). An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning. Journal of the Korean Society for Information Management, 33(2), 33-59.
Kim, Pan Jun. 2016, "An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning", Journal of the Korean Society for Information Management, vol.33, no.2 pp.33-59. Available from: doi:10.3743/KOSIM.2016.33.2.033
Kim, Pan Jun "An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning" Journal of the Korean Society for Information Management 33.2 pp.33-59 (2016) : 33.
Kim, Pan Jun. An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning. 2016; 33(2), 33-59. Available from: doi:10.3743/KOSIM.2016.33.2.033
Kim, Pan Jun. "An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning" Journal of the Korean Society for Information Management 33, no.2 (2016) : 33-59.doi: 10.3743/KOSIM.2016.33.2.033
Kim, Pan Jun. An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning. Journal of the Korean Society for Information Management, 33(2), 33-59. doi: 10.3743/KOSIM.2016.33.2.033
Kim, Pan Jun. An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning. Journal of the Korean Society for Information Management. 2016; 33(2) 33-59. doi: 10.3743/KOSIM.2016.33.2.033
Kim, Pan Jun. An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning. 2016; 33(2), 33-59. Available from: doi:10.3743/KOSIM.2016.33.2.033
Kim, Pan Jun. "An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning" Journal of the Korean Society for Information Management 33, no.2 (2016) : 33-59.doi: 10.3743/KOSIM.2016.33.2.033