본문 바로가기
  • Home

A Lightweight CNN Model for Alcohol-intoxicated Detection Using Mel-Spectrogram Voice Features

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2025, 30(10), pp.53~60
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science
  • Received : August 21, 2025
  • Accepted : September 24, 2025
  • Published : October 31, 2025

Younguk Yun 1

1연세대학교

Accredited

ABSTRACT

In this paper, we propose TinyAlcoCNN, a lightweight deep learning model designed to non-invasively detect alcohol consumption based on voice data. The proposed model adopts a 2D-CNN architecture that takes Mel-spectrograms as input and is trained on approximately 40,000 Korean voice samples. To support real-time applications, the dataset was preprocessed using the Whisper API for automatic segmentation. Experimental results demonstrate that TinyAlcoCNN achieves a training accuracy of 0.9982 and an inference accuracy of 1.000, while maintaining efficiency with approximately one million parameters and 13.9 million FLOPs. These results confirm both the effectiveness and computational efficiency of the model. This study highlights the feasibility of voice-based alcohol detection and suggests potential for broader applications, including personalized services, through multilingual expansion and integration with mobile systems.

Citation status

* References for papers published after 2024 are currently being built.