본문 바로가기
  • Home

MoTUNet: A MobileNetV2-Transformer U-Net for Water Body Segmentation

  • Journal of Internet of Things and Convergence
  • Abbr : JKIOTS
  • 2025, 11(2), pp.63~75
  • Publisher : The Korea Internet of Things Society
  • Research Area : Engineering > Computer Science > Internet Information Processing
  • Received : March 6, 2025
  • Accepted : April 8, 2025
  • Published : April 30, 2025

Kimsay Pov 1 Tara Kit 1 TAEKYUNG KIM 2 Youngsun Han 3

1국립부경대학교 인공지능융합학과
2충북대학교
3국립부경대학교

Accredited

ABSTRACT

Efficient real-time water body segmentation is crucial for applications such as flood detection, but balancing accuracy and inference efficiency remains challenging. In this paper, we propose MoTUNet (MobileNetV2-Transformer U-Net), designed to optimize both accuracy and inference speed for water body segmentation. Its performance is evaluated against several popular segmentation models such as U-Net, DeepLabV3+, PSPNet, PAN, and LinkNet. All models use MobileNetV2 as an encoder to reduce computational complexity while preserving feature extraction, and the Kaggle RIWA dataset is used for training and evaluation. The key metrics include Intersection over Union (IoU), precision, recall, F1-score, frames per second (FPS), and the average inference latency. Our results show that U-Net and DeepLabV3+ achieve the highest accuracy, while PSPNet is the most efficient in terms of FPS. MoTUNet provides an optimal balance by being 97.20% and 64.49% faster than U-Net and DeepLabV3+ at a 512×512 input size, and 81.71% and 58.18% faster at a 256×256 input size, while maintaining competitive segmentation accuracy

Citation status

* References for papers published after 2023 are currently being built.