본문 바로가기
  • Home

A Block Relocation Algorithm for Reducing Network Consumption in Hadoop Cluster

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2014, 19(11), pp.9-15
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science

김준상 1 김창현 2 Won Joo Lee 3 Chang Ho Jeon 1

1한양대학교
2한양대 컴퓨터공학과
3인하공업전문대학

Accredited

ABSTRACT

In this paper, We propose a block reallocation algorithm for reducing network traffic in Hadoop cluster. The scheduler of Hadoop cluster receives a job from users. And the job is divided into multiple tasksassigned to nodes. At this time, the scheduler allocates the task to the node that satisfied data locality. Ifa task is assigned to the node that does not have the data(block) to be processed, the task is processedafter the data transmission from another node. There is difference of workload among nodes becauseblocks in cluster have different access frequency. Therefore, the proposed algorithm relocates blocks according to the task allocation pattern of Hadoop scheduler. Eventually, workload of nodes are leveled,and the case of the task processing in a node that does not have the block to be processing is reduced. Thus, the network traffic of the cluster is also reduced. We evaluate the proposed block reallocationalgorithm by a simulation. The simulation result shows maximum 23.3% reduction of network consumptionthan default delay scheduling for jobs processing.

Citation status

* References for papers published after 2023 are currently being built.