@article{ART002359115},
author={JeeHee Yuk and Min Song},
title={A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning},
journal={Journal of the Korean Society for Information Management},
issn={1013-0799},
year={2018},
volume={35},
number={2},
pages={63-88},
doi={10.3743/KOSIM.2018.35.2.063}
TY - JOUR
AU - JeeHee Yuk
AU - Min Song
TI - A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning
JO - Journal of the Korean Society for Information Management
PY - 2018
VL - 35
IS - 2
PB - 한국정보관리학회
SP - 63
EP - 88
SN - 1013-0799
AB - This research evaluated differences of classification performance for feature selection methods using LDA topic model and Doc2Vec which is based on word embedding using deep learning, feature corpus sizes and classification algorithms. In addition to find the feature corpus with high performance of classification, an experiment was conducted using feature corpus was composed differently according to the location of the document and by adjusting the size of the feature corpus. Conclusionally, in the experiments using deep learning evaluate training frequency and specifically considered information for context inference. This study constructed biomedical document dataset, Disease-35083 which consisted biomedical scholarly documents provided by PMC and categorized by the disease category. Throughout the study this research verifies which type and size of feature corpus produces the highest performance and, also suggests some feature corpus which carry an extensibility to specific feature by displaying efficiency during the training time. Additionally, this research compares the differences between deep learning and existing method and suggests an appropriate method by classification environment.
KW - document classification;feature selection;text categorization;topic model;deep learning;LDA;Doc2Vec;text mining
DO - 10.3743/KOSIM.2018.35.2.063
ER -
JeeHee Yuk and Min Song. (2018). A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning. Journal of the Korean Society for Information Management, 35(2), 63-88.
JeeHee Yuk and Min Song. 2018, "A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning", Journal of the Korean Society for Information Management, vol.35, no.2 pp.63-88. Available from: doi:10.3743/KOSIM.2018.35.2.063
JeeHee Yuk, Min Song "A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning" Journal of the Korean Society for Information Management 35.2 pp.63-88 (2018) : 63.
JeeHee Yuk, Min Song. A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning. 2018; 35(2), 63-88. Available from: doi:10.3743/KOSIM.2018.35.2.063
JeeHee Yuk and Min Song. "A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning" Journal of the Korean Society for Information Management 35, no.2 (2018) : 63-88.doi: 10.3743/KOSIM.2018.35.2.063
JeeHee Yuk; Min Song. A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning. Journal of the Korean Society for Information Management, 35(2), 63-88. doi: 10.3743/KOSIM.2018.35.2.063
JeeHee Yuk; Min Song. A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning. Journal of the Korean Society for Information Management. 2018; 35(2) 63-88. doi: 10.3743/KOSIM.2018.35.2.063
JeeHee Yuk, Min Song. A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning. 2018; 35(2), 63-88. Available from: doi:10.3743/KOSIM.2018.35.2.063
JeeHee Yuk and Min Song. "A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning" Journal of the Korean Society for Information Management 35, no.2 (2018) : 63-88.doi: 10.3743/KOSIM.2018.35.2.063