본문 바로가기
  • Home

A Implementation of Optimal Multiple Classification System using Data Mining for Genome Analysis

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2018, 23(12), pp.43-48
  • DOI : 10.9708/jksci.2018.23.12.043
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science
  • Received : September 7, 2018
  • Accepted : December 12, 2018
  • Published : December 31, 2018

Jeong, Yu-jeong 1 Choi Gwang Mi 1

1조선대학교

Accredited

ABSTRACT

In this paper, more efficient classification result could be obtained by applying the combination of the Hidden Markov Model and SVM Model to HMSV algorithm gene expression data which simulated the stochastic flow of gene data and clustering it. In this paper, we verified the HMSV algorithm that combines independently learned algorithms. To prove that this paper is superior to other papers, we tested the sensitivity and specificity of the most commonly used classification criteria. As a result, the K-means is 71% and the SOM is 68%. The proposed HMSV algorithm is 85%. These results are stable and high. It can be seen that this is better classified than using a general classification algorithm. The algorithm proposed in this paper is a stochastic modeling of the generation process of the characteristics included in the signal, and a good recognition rate can be obtained with a small amount of calculation, so it will be useful to study the relationship with diseases by showing fast and effective performance improvement with an algorithm that clusters nodes by simulating the stochastic flow of Gene Data through data mining of BigData.

Citation status

* References for papers published after 2022 are currently being built.