@article{ART002952537},
author={Byung-Il Yun and Dahye Kim and Young-Jin Kim and Medard Edmund Mswahili and Young-Seob Jeong},
title={Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study},
journal={Journal of The Korea Society of Computer and Information},
issn={1598-849X},
year={2023},
volume={28},
number={4},
pages={21-29},
doi={10.9708/jksci.2023.28.04.021}
TY - JOUR
AU - Byung-Il Yun
AU - Dahye Kim
AU - Young-Jin Kim
AU - Medard Edmund Mswahili
AU - Young-Seob Jeong
TI - Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study
JO - Journal of The Korea Society of Computer and Information
PY - 2023
VL - 28
IS - 4
PB - The Korean Society Of Computer And Information
SP - 21
EP - 29
SN - 1598-849X
AB - In this paper, we propose an AI-based system for automatically classifying industry and occupation codes in the population census. The accurate classification of industry and occupation codes is crucial for informing policy decisions, allocating resources, and conducting research. However, this task has traditionally been performed by human coders, which is time-consuming, resource-intensive, and prone to errors. Our system represents a significant improvement over the existing rule-based system used by the statistics agency, which relies on user-entered data for code classification. In this paper, we trained and evaluated several models, and developed an ensemble model that achieved an 86.76% match accuracy in industry and 81.84% in occupation, outperforming the best individual model. Additionally, we propose process improvement work based on the classification probability results of the model. Our proposed method utilizes an ensemble model that combines transfer learning techniques with pre-trained models. In this paper, we demonstrate the potential for AI-based systems to improve the accuracy and efficiency of population census data classification. By automating this process with AI, we can achieve more accurate and consistent results while reducing the workload on agency staff.
KW - Natural language process;Classification;Population survey;Ensemble model;Industrial and Occupation code
DO - 10.9708/jksci.2023.28.04.021
ER -
Byung-Il Yun, Dahye Kim, Young-Jin Kim, Medard Edmund Mswahili and Young-Seob Jeong. (2023). Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study. Journal of The Korea Society of Computer and Information, 28(4), 21-29.
Byung-Il Yun, Dahye Kim, Young-Jin Kim, Medard Edmund Mswahili and Young-Seob Jeong. 2023, "Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study", Journal of The Korea Society of Computer and Information, vol.28, no.4 pp.21-29. Available from: doi:10.9708/jksci.2023.28.04.021
Byung-Il Yun, Dahye Kim, Young-Jin Kim, Medard Edmund Mswahili, Young-Seob Jeong "Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study" Journal of The Korea Society of Computer and Information 28.4 pp.21-29 (2023) : 21.
Byung-Il Yun, Dahye Kim, Young-Jin Kim, Medard Edmund Mswahili, Young-Seob Jeong. Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study. 2023; 28(4), 21-29. Available from: doi:10.9708/jksci.2023.28.04.021
Byung-Il Yun, Dahye Kim, Young-Jin Kim, Medard Edmund Mswahili and Young-Seob Jeong. "Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study" Journal of The Korea Society of Computer and Information 28, no.4 (2023) : 21-29.doi: 10.9708/jksci.2023.28.04.021
Byung-Il Yun; Dahye Kim; Young-Jin Kim; Medard Edmund Mswahili; Young-Seob Jeong. Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study. Journal of The Korea Society of Computer and Information, 28(4), 21-29. doi: 10.9708/jksci.2023.28.04.021
Byung-Il Yun; Dahye Kim; Young-Jin Kim; Medard Edmund Mswahili; Young-Seob Jeong. Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study. Journal of The Korea Society of Computer and Information. 2023; 28(4) 21-29. doi: 10.9708/jksci.2023.28.04.021
Byung-Il Yun, Dahye Kim, Young-Jin Kim, Medard Edmund Mswahili, Young-Seob Jeong. Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study. 2023; 28(4), 21-29. Available from: doi:10.9708/jksci.2023.28.04.021
Byung-Il Yun, Dahye Kim, Young-Jin Kim, Medard Edmund Mswahili and Young-Seob Jeong. "Improving the Classification of Population and Housing Census with AI: An Industry and Job Code Study" Journal of The Korea Society of Computer and Information 28, no.4 (2023) : 21-29.doi: 10.9708/jksci.2023.28.04.021