@article{ART001963402,
author={YOU Eun-Soon and 최건희 and Seung-Hoon Kim},
title={Study on Extraction of Keywords Using TF-IDF and Text Structure of Novels},
journal={Journal of The Korea Society of Computer and Information},
issn={1598-849X},
year={2015},
volume={20},
number={2},
pages={121-129}
}
TY - JOUR
AU - YOU Eun-Soon
AU - 최건희
AU - Seung-Hoon Kim
TI - Study on Extraction of Keywords Using TF-IDF and Text Structure of Novels
JO - Journal of The Korea Society of Computer and Information
PY - 2015
VL - 20
IS - 2
PB - The Korean Society Of Computer And Information
SP - 121
EP - 129
SN - 1598-849X
AB - With the explosive growth of information about books, there is a growing number of customers who find it difficult to pick a book. Against the backdrop, the importance of a book recommendation system becomes greater, through which appropriate information about books could be offered then to encourage customers to buy a book in the end. However, existing recommendation systems based on the bibliographical information or user data reveal the reliability issue found in their recommendation results. This is why it is necessary to reflect semantic information extracted from the texts of a book’s main body in a recommendation system. Accordingly, this paper suggests a method for extracting keywords from the main body of novels, as a preceding research, by using TF-IDF method as well as the text structure. To this end, the texts of 100 novels have been collected then to divide them into four structural elements of preface, dialogue, non-dialogue and closing. Then, the TF-IDF weight of each keyword has been calculated. The calculation results show that the extraction accuracy of keywords improves by 42.1% in performance when more weight is given to dialogue while including preface and closing instead of using just the main body.
KW - Keyword;TFIDF;Novel Structure;Book Recommendation System;Dialog Weight
DO -
UR -
ER -
YOU Eun-Soon, 최건희 and Seung-Hoon Kim. (2015). Study on Extraction of Keywords Using TF-IDF and Text Structure of Novels. Journal of The Korea Society of Computer and Information, 20(2), 121-129.
YOU Eun-Soon, 최건희 and Seung-Hoon Kim. 2015, "Study on Extraction of Keywords Using TF-IDF and Text Structure of Novels", Journal of The Korea Society of Computer and Information, vol.20, no.2 pp.121-129.
YOU Eun-Soon, 최건희, Seung-Hoon Kim "Study on Extraction of Keywords Using TF-IDF and Text Structure of Novels" Journal of The Korea Society of Computer and Information 20.2 pp.121-129 (2015) : 121.
YOU Eun-Soon, 최건희, Seung-Hoon Kim. Study on Extraction of Keywords Using TF-IDF and Text Structure of Novels. 2015; 20(2), 121-129.
YOU Eun-Soon, 최건희 and Seung-Hoon Kim. "Study on Extraction of Keywords Using TF-IDF and Text Structure of Novels" Journal of The Korea Society of Computer and Information 20, no.2 (2015) : 121-129.
YOU Eun-Soon; 최건희; Seung-Hoon Kim. Study on Extraction of Keywords Using TF-IDF and Text Structure of Novels. Journal of The Korea Society of Computer and Information, 20(2), 121-129.
YOU Eun-Soon; 최건희; Seung-Hoon Kim. Study on Extraction of Keywords Using TF-IDF and Text Structure of Novels. Journal of The Korea Society of Computer and Information. 2015; 20(2) 121-129.