본문 바로가기
  • Home

A Data-centric Drift Management Methodology based on Data Quality Assessment in Machine Learning System

  • Journal of Software Forensics
  • Abbr : JSAV
  • 2026, 22(1), pp.97~110
  • Publisher : Korea Software Assessment and Valuation Society
  • Research Area : Engineering > Computer Science
  • Received : March 9, 2026
  • Accepted : March 20, 2026
  • Published : March 31, 2026

Okjoo Choi 1 Won Sun Shin 2

1한국과학기술원
2주식회사 비전21테크

Accredited

ABSTRACT

Machine learning (ML) systems often suffer from degradation in prediction accuracy and reliability due to data drift that occurs over time. Conventional approaches primarily focus on restoring performance through model retraining; however, it is essential to recognize that the root cause of drift lies in the data itself. In this study, we propose the Q-DcDM framework, which integrates data requirement–driven quality assessment with drift analysis to proactively manage drift before the model training and inference stages. The proposed framework enables early detection of potential drift in the initial phase of the ML lifecycle, thereby preventing performance degradation. Experimental results on the UCI Wine Quality and CIFAR-10 datasets demonstrate that the proposed approach improves average model performance by 14.8% and 12.3%, respectively, while also significantly reducing the frequency of model retraining.

Citation status

* References for papers published after 2024 are currently being built.