본문 바로가기
  • Home

Machine Learning-based Application-Layer Network Fingerprinting for Dark Web Criminal Service Classification

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2025, 30(12), pp.195~204
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science
  • Received : November 14, 2025
  • Accepted : December 12, 2025
  • Published : December 31, 2025

Jinwoo Shin 1 Dong-Won Kang 1 Jiyeon Kim 1

1대구대학교

Accredited

ABSTRACT

As drug and hacking crimes exploiting the anonymity of the Dark Web have rapidly increased, automated technologies capable of detecting them quickly are needed. Previous approaches such as HTML-based static content analysis are vulnerable to content concealment and dynamic changes, and transport layer (L3/4) traffic analysis is limited to network transmission characteristics such as packet size and transmission rate, failing to reflect the structural and content characteristics of websites. To overcome these limitations, this paper collects not only L3/4 metrics but also application layer (L7) metrics, and analyzes effective metrics for classifying service types of Dark Web crime sites through machine learning-based learning. To this end, approximately 12,847 Dark Web sites related to drugs and hacking were collected, and a dataset was constructed by extracting 46 network metrics of L3/4 and L7 generated during site access. Furthermore, machine learning algorithms such as XGBoost, Random Forest, Logistic Regression, and SVM were developed as single models learning metrics of each layer and hybrid models integrating metrics of both layers. As a result, the XGBoost model trained only on L7 metrics recorded the highest performance with F1-scores of 0.996 for drug crime site type classification and 0.933 for hacking crime site type classification.

Citation status

* References for papers published after 2024 are currently being built.