@article{ART002615858,
author={PARK HO YEON and Kyoung-jae Kim},
title={Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques},
journal={Journal of The Korea Society of Computer and Information},
issn={1598-849X},
year={2020},
volume={25},
number={8},
pages={181--188},
doi={10.9708/jksci.2020.25.08.181}
}
TY - JOUR
AU - PARK HO YEON
AU - Kyoung-jae Kim
TI - Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques
JO - Journal of The Korea Society of Computer and Information
PY - 2020
VL - 25
IS - 8
PB - The Korean Society Of Computer And Information
SP - 181
EP - 188
SN - 1598-849X
AB - In this study, we conduct a comparative analysis of how various word embedding techniques affect the performance of sentiment analysis. Sentiment analysis is an opinion mining technique that uses natural language processing to identify and extract subjective information from text, and it can be used to classify the sentiment of product reviews or comments. Since sentiment can be classified as either positive or negative, it can be treated as a general classification problem. For sentiment analysis, the text must be converted into a form that a computer can process. In natural language processing, this transformation of text such as a word or document into a vector is called word embedding. Various techniques, such as Bag of Words, TF-IDF, and Word2Vec, are used as word embedding techniques. Until now, there have not been many studies on which word embedding techniques are suitable for sentiment analysis. In this study, Bag of Words, TF-IDF, and Word2Vec are used to compare and analyze the performance of movie review sentiment analysis. The research data set for this study is the IMDB data set, which is widely used in text mining. The results show that TF-IDF and Bag of Words both outperformed Word2Vec, and that TF-IDF performed better than Bag of Words, although the difference was not very significant.
KW - sentiment analysis;Bag of words;TF-IDF;Word2Vec;machine learning
DO - 10.9708/jksci.2020.25.08.181
ER -
PARK HO YEON and Kyoung-jae Kim. (2020). Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques. Journal of The Korea Society of Computer and Information, 25(8), 181-188.
PARK HO YEON and Kyoung-jae Kim. 2020, "Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques", Journal of The Korea Society of Computer and Information, vol.25, no.8, pp.181-188. Available from: doi:10.9708/jksci.2020.25.08.181
PARK HO YEON, Kyoung-jae Kim "Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques" Journal of The Korea Society of Computer and Information 25.8 pp.181-188 (2020) : 181.
PARK HO YEON, Kyoung-jae Kim. Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques. 2020; 25(8), 181-188. Available from: doi:10.9708/jksci.2020.25.08.181
PARK HO YEON and Kyoung-jae Kim. "Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques" Journal of The Korea Society of Computer and Information 25, no.8 (2020) : 181-188. doi: 10.9708/jksci.2020.25.08.181
PARK HO YEON; Kyoung-jae Kim. Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques. Journal of The Korea Society of Computer and Information, 25(8), 181-188. doi: 10.9708/jksci.2020.25.08.181
PARK HO YEON; Kyoung-jae Kim. Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques. Journal of The Korea Society of Computer and Information. 2020; 25(8) 181-188. doi: 10.9708/jksci.2020.25.08.181