본문 바로가기
  • Home

A Novel Method of Improving Cache Hit-rate in Hadoop MapReduce using SSD Cache

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2015, 20(8), pp.1-6
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science

김종찬 1 안재훈 1 김영환 1 Kiman Jeon 1

1전자부품연구원

Accredited

ABSTRACT

The MapReduce Program of Hadoop Distributed File System operates on any unspecified nodes due to distributed-parallel process and block replicate for data stability. Since it is difficult to guarantee the cache locality when a Solid State Drive is used as a cache in hadoop, cache hit-rate is decreased. In this paper, we suggest a method to improve cache hit rate by pre-loading the input data of the MapReduce onto the SSD cache. To perform this method, we estimated the blocks that are used on each node by using capacity scheduler and block metadata. Eventually we could increase the performance of SSD cache by loading the blocks onto SSD cache before the Map Task run.

Citation status

* References for papers published after 2023 are currently being built.