@article{ART001641512,
author={Jiyoung Woo},
title={The Spam Detection Model for Web Forums using Text Mining Techniques},
journal={Journal of Knowledge Information Technology and Systems},
issn={1975-7700},
year={2012},
volume={7},
number={1},
pages={159--166}
}
TY - JOUR
AU - Jiyoung Woo
TI - The Spam Detection Model for Web Forums using Text Mining Techniques
JO - Journal of Knowledge Information Technology and Systems
PY - 2012
VL - 7
IS - 1
PB - Korea Knowledge Information Technology Society
SP - 159
EP - 166
SN - 1975-7700
AB - Spam in discussion web forums inconveniences users and lowers the value of the forum as an open source of user opinion. Because the importance of postings is evaluated in terms of the number of authors involved, spam distorts opinion analysis by adding unnecessary data. We propose an automatic detection model for spam postings in web forums. We extract text features from posting contents using text mining techniques from a linguistic perspective and then perform supervised learning to distinguish spam from normal postings. Significant features are derived through the learning process, and the automatic detection model is built on those features. To build the automatic detection model, four evaluators were asked to identify spam postings in advance. We adopted Naive Bayes, Support Vector Machine (SVM), and decision tree classifiers, which are known to perform well in data and text mining tasks. The extracted text features make it possible to recognize spam and to detect newly posted spam automatically. We apply the proposed model to the YahooFinance-Walmart forum, which is the world's largest Walmart-related web forum.
KW - Web forum;Social media;Spam;Posting quality;Text mining
DO -
UR -
ER -
Jiyoung Woo. (2012). The Spam Detection Model for Web Forums using Text Mining Techniques. Journal of Knowledge Information Technology and Systems, 7(1), 159-166.
Jiyoung Woo. 2012, "The Spam Detection Model for Web Forums using Text Mining Techniques", Journal of Knowledge Information Technology and Systems, vol.7, no.1 pp.159-166.
Jiyoung Woo "The Spam Detection Model for Web Forums using Text Mining Techniques" Journal of Knowledge Information Technology and Systems 7.1 pp.159-166 (2012) : 159.
Jiyoung Woo. The Spam Detection Model for Web Forums using Text Mining Techniques. 2012; 7(1), 159-166.
Jiyoung Woo. "The Spam Detection Model for Web Forums using Text Mining Techniques" Journal of Knowledge Information Technology and Systems 7, no.1 (2012) : 159-166.
Jiyoung Woo. The Spam Detection Model for Web Forums using Text Mining Techniques. Journal of Knowledge Information Technology and Systems, 7(1), 159-166.
Jiyoung Woo. The Spam Detection Model for Web Forums using Text Mining Techniques. Journal of Knowledge Information Technology and Systems. 2012; 7(1) 159-166.