A K-Nearest Neighbor Algorithm for Categorical Sequence Data

Seung-Joon Oh (오승준)

Journal of The Korea Society of Computer and Information
Abbr : JKSCI
2005, 10(2), pp.215~222
Publisher : The Korean Society Of Computer And Information
Research Area : Engineering > Computer Science

Seung-Joon Oh ¹

¹Gyeonggi University of Science and Technology

Candidate

ABSTRACT

Recently, there has been enormous growth in the amount of commercial and scientific data, such as protein sequences, retail transactions, and web logs. Such datasets consist of sequence data that have an inherent sequential nature. In this paper, we study how to classify these sequence datasets. There are several kinds of techniques for data classification, such as decision tree induction, Bayesian classification, and K-NN. In our approach, we use a K-NN algorithm for classifying sequences. In addition, we propose a new similarity measure to compute the similarity between two sequences, together with an efficient method for measuring that similarity.
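
As a rough illustration of the approach described in the abstract, the following minimal sketch classifies categorical sequences with K-NN. The abstract does not define the proposed similarity measure, so this sketch substitutes a normalized longest-common-subsequence (LCS) ratio as a stand-in; it is not the measure from the paper. The names `lcs_length`, `similarity`, and `knn_classify`, the default `k=3`, and the toy data are all assumptions made for illustration.

```python
from collections import Counter
from typing import Hashable, List, Sequence, Tuple


def lcs_length(a: Sequence[Hashable], b: Sequence[Hashable]) -> int:
    """Length of the longest common subsequence, computed row by row."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(a: Sequence[Hashable], b: Sequence[Hashable]) -> float:
    """Stand-in similarity in [0, 1]: LCS length over the longer length.

    Assumption: this is NOT the paper's measure, only a placeholder.
    """
    if not a and not b:
        return 1.0
    return lcs_length(a, b) / max(len(a), len(b))


def knn_classify(
    query: Sequence[Hashable],
    training: List[Tuple[Sequence[Hashable], str]],
    k: int = 3,
) -> str:
    """Return the majority label among the k most similar training sequences."""
    ranked = sorted(
        training,
        key=lambda item: similarity(query, item[0]),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy labelled sequences (hypothetical data, for illustration only).
training = [
    ("ABCD", "x"),
    ("ABCE", "x"),
    ("ABDD", "x"),
    ("WXYZ", "y"),
    ("WXYY", "y"),
]
print(knn_classify("ABCF", training, k=3))  # prints "x"
```

This brute-force version compares the query against every training sequence, so its cost grows with both the training set size and the sequence lengths; an efficient similarity computation, as the abstract proposes, matters for exactly that reason.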

KEYWORDS

Data Mining, Classification, Sequences

Journal of The Korea Society of Computer and Information 2025 KCI Impact Factor : 1.01