본문 바로가기
  • Home

Automatic Generation of Code-clone Reference Corpus

  • Journal of Software Assessment and Valuation
  • Abbr : JSAV
  • 2011, 7(1), pp.29-39
  • Publisher : Korea Software Assessment and Valuation Society
  • Research Area : Engineering > Computer Science
  • Received : April 3, 2011
  • Accepted : June 20, 2011
  • Published : June 30, 2011

Hyo-Sub Lee 1 Doh, Kyung-Goo 1

1한양대학교

ABSTRACT

To evaluate the quality of clone detection tools, we should know how many clones the tool misses. Hence we need to have the standard code-clone reference corpus for a carefully chosen set of sample source codes. The reference corpus available so far has been built by manually collecting clones from the results of various existing tools. This paper presents a tree-pattern-based clone detection tool that can be used for automatic generation of reference corpus. Our tool is compared with CloneDR for precision and Bellon's reference corpus for recall. Our tool finds no false positives and 2 to 3 times more clones than CloneDR. Compared to Bellon's reference corpus, our tools shows the 93%-to-100% recall rate and detects far more clones.

KEYWORDS

Citation status

* References for papers published after 2023 are currently being built.