본문 바로가기
  • Home

An Effective Classification Scheme for Microsoft Windows Executables using Frequency Information of Strings

  • Journal of Software Assessment and Valuation
  • Abbr : JSAV
  • 2016, 12(1), pp.27-36
  • Publisher : Korea Software Assessment and Valuation Society
  • Research Area : Engineering > Computer Science
  • Received : May 18, 2016
  • Accepted : June 19, 2016
  • Published : June 30, 2016

Cho Daehee 1 SEONG JE CHO 1

1단국대학교

ABSTRACT

In order to filter and block illegal software efficiently, software classification is employed. In this paper, we propose a new software classification scheme that effectively categorizes Microsoft Windows executable files using frequency information of strings. The proposed scheme first extracts strings from the .text, .data, .rdata, and .rsrc sections of each Windows executable, and then computes and maintains a score table of the strings by combining Local Document Frequency (LDF), Inverse Document Frequency (IDF), and Inverse Category Frequency (ICF). If a suspicious program (Windows executable) got useful string information, we calculate scores of the strings in the suspicious program based on the score table of strings of each category, sums up the scores, and obtains a total score per each category. Finally, we classify the suspicious program into a specific category which represents the highest total score. For verifying the effectiveness of the proposed scheme, we perform experiments with nine categories and 55 programs each category. The experimental results show that about 75% of the 495 executables are correctly classified.

Citation status

* References for papers published after 2023 are currently being built.