
An Optimization of Hashing Mechanism for the DHP Association Rules Mining Algorithm

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2010, 15(8), pp.13-21
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science

Hyung-Bong Lee 1 Kwon Ki Hyeon 2

1 Gangneung-Wonju National University
2 Kangwon National University

Accredited

ABSTRACT

One of the most distinctive features of the DHP association rules mining algorithm is that, at phase k-1, it counts the support of hash key combinations composed of k items and uses those counts to prune candidate large itemsets, which improves performance. Ideally, each hash key combination would have its own count variable, but allocating that many variables is infeasible because of memory shortage. The algorithm therefore uses a direct hashing mechanism in which several hash key combinations collide and are counted in the same hash bucket. However, the direct hashing mechanism is inefficient because the distribution of hash key combinations is unbalanced, owing to characteristics of the mining process itself. This paper proposes a mapped perfect hashing function that maps the region of hash key combinations into a contiguous integer space for phase 3 and maximizes the efficiency of the direct hashing mechanism. Performance tests on 42 test data sets show that the proposed hashing mechanism improves performance by 7.3% on average compared with the existing method, with a maximum improvement of 16.9%. The results also show that the proposed method is more efficient when transactions or large itemsets are long, or when the total number of items is large.
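
The contrast between direct hashing and a perfect hash over 3-item combinations can be illustrated with a minimal Python sketch. It is not the paper's function: the toy bucket hash, the dense item index, and the combinatorial ranking formula are all assumptions chosen for illustration, and the names `bucket_of`, `direct_hash_counts`, `build_item_index`, `perfect_hash_3`, and `perfect_hash_counts_3` are hypothetical.

```python
from itertools import combinations
from math import comb


def bucket_of(combo, num_buckets):
    """Toy direct hash (an assumption): fold item ids into a bucket number."""
    h = 0
    for item in combo:
        h = h * 31 + item
    return h % num_buckets


def direct_hash_counts(transactions, k, num_buckets):
    """Count every k-item combination into shared buckets; collisions are allowed."""
    buckets = [0] * num_buckets
    for t in transactions:
        for combo in combinations(sorted(set(t)), k):
            buckets[bucket_of(combo, num_buckets)] += 1
    return buckets


def build_item_index(items):
    """Map the surviving items onto dense indices 0..n-1 (illustrative mapping step)."""
    return {item: i for i, item in enumerate(sorted(set(items)))}


def perfect_hash_3(combo, index):
    """Rank a 3-item combination in the combinatorial number system.

    For dense indices a < b < c the rank a + C(b,2) + C(c,3) is unique
    and covers 0..C(n,3)-1 with no gaps, so no two combinations collide.
    """
    a, b, c = sorted(index[x] for x in combo)
    return a + comb(b, 2) + comb(c, 3)


def perfect_hash_counts_3(transactions, index):
    """Count 3-item combinations with one counter per combination."""
    counts = [0] * comb(len(index), 3)
    for t in transactions:
        kept = sorted({x for x in t if x in index})  # ignore items outside the index
        for combo in combinations(kept, 3):
            counts[perfect_hash_3(combo, index)] += 1
    return counts
```

With direct hashing, a bucket count is only an upper bound on the support of any one combination that falls into it, so pruning is weaker when buckets are unevenly loaded. With the ranking above, each counter holds the exact support of a single combination, at the cost of C(n,3) counters for n indexed items.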
