본문 바로가기
  • Home

A Performance Analysis Based on Hadoop Application's Characteristics in Cloud Computing

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2010, 15(5), pp.49-56
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science

금태훈 1 Won Joo Lee 2 Chang Ho Jeon 3

1한양대 컴퓨터공학과
2인하공업전문대학
3한양대학교

Accredited

ABSTRACT

In this paper, we implement a Hadoop based cluster for cloud computing and evaluate the performance of this cluster based on application characteristics by executing RandomTextWriter, WordCount, and PI applications. A RandomTextWriter creates given amount of random words and stores them in the HDFS(Hadoop Distributed File System). A WordCount reads an input file and determines the frequency of a given word per block unit. PI application induces PI value using the Monte Carlo law. During simulation, we investigate the effect of data block size and the number of replications on the execution time of applications. Through simulation, we have confirmed that the execution time of RandomTextWriter was proportional to the number of replications. However, the execution time of WordCount and PI were not affected by the number of replications. Moreover, the execution time of WordCount was optimum when the block size was 64~256MB. Therefore, these results show that the performance of cloud computing system can be enhanced by using a scheduling scheme that considers application's characteristics.

Citation status

* References for papers published after 2023 are currently being built.