CTA-Aware Prefetching and Scheduling for GPU

Koo, G; Jeon, H; Liu, ZH; Kim, NS; Annavaram, M

Koo, G (reprint author), Univ Southern Calif, Los Angeles, CA 90089 USA.

2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018; (): 137

Abstract

Albeit GPUs are supposed to be tolerant to long latency of data fetch operation, we observe that L1 cache misses occur in a bursty manner for many mem......

Full Text Link