본문 바로가기
  • Home

An Automated Method for Detecting and Collecting Evidence of Illegal OTT Content Distribution Using Large Language Model

  • Journal of Software Assessment and Valuation
  • Abbr : JSAV
  • 2025, 21(2), pp.33~42
  • Publisher : Korea Software Assessment and Valuation Society
  • Research Area : Engineering > Computer Science
  • Received : May 7, 2025
  • Accepted : June 20, 2025
  • Published : June 30, 2025

Byuong-Chan Park 1 Lee, Jae Chung 2 Seok-Yoon Kim 1 Youngmo Kim 1

1숭실대학교
2(주)비욘드테크

Accredited

ABSTRACT

This paper proposes an automated method for detecting and collecting evidence of illegal OTT content distribution using a Large Language Model (LLM) and a Large Action Model (LAM). Traditional manual web crawling methods are limited in their ability to adapt to structural changes in websites and incur high maintenance costs. To address these issues, we design a fully automated system architecture encompassing web structure analysis, code generation, similarity-based judgment, and evidence preservation. In the proposed method, the LLM analyzes the HTML structure of a website and generates code for information extraction, while the LAM executes the code to collect data. Additionally, the system incorporates Retrieval-Augmented Generation (RAG) to enhance semantic similarity comparison and includes a feedback loop that iteratively improves the results when confidence is low. This approach allows the system to flexibly adapt to various web structures and enhances the reliability and efficiency of evidence collection.

Citation status

* References for papers published after 2023 are currently being built.