본문 바로가기
  • Home

Analysis of Portability Performance of Large-Scale Grid Processing Based on GPU Memory Architecture Differences

  • Journal of Software Forensics
  • Abbr : JSF
  • 2026, 22(2), pp.143~154
  • DOI : 10.29056/jsf.2026.06.13
  • Publisher : Korea Software Assessment and Valuation Society
  • Research Area : Engineering > Computer Science
  • Received : June 1, 2026
  • Accepted : June 20, 2026
  • Published : June 30, 2026

Kim Min-Soo 1 Kim Jae Woong 2 Jang Mi-young 1

1공주대학교
2국립공주대학교

Accredited

ABSTRACT

This paper analyses the performance characteristics observed when porting large-scale grid-resampling CUDA code from a PC discrete-GPU environment to a Jetson AGX Orin-based UMA GPU environment. In a PC environment, CPU memory and dedicated GPU memory are physically separated, whereas the Jetson shares physical memory between the CPU and GPU. Therefore, the same memory method does not necessarily produce equivalent performance across the two platforms. Explicit-copy memory (Device Memory), Unified Memory, and Zero-copy were evaluated under linear, local-offset, and scatter access patterns. The results showed that Device Memory provided the most stable performance on the PC. On the Jetson, Unified Memory and Zero-copy showed potential advantages depending on data size and memory-management conditions. Scatter access degraded performance on both platforms, indicating that both memory architecture and data-access patterns should be considered when porting CUDA code.

Citation status

* References for papers published after 2024 are currently being built.