TF-MVP: Novel Sparsity-Aware Transformer Accelerator with Mixed-Length Vector Pruning

Yoo, E; Park, G; Min, JG; Kwon, SJ; Park, B; Lee, D; Lee, Y

Lee, Y (通讯作者),Pohang Univ Sci & Technol, Pohang, South Korea.

2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023; ():

Abstract

We present the energy-efficient TF-MVP architecture, a sparsity-aware transformer accelerator, by introducing novel algorithm-hardware co-optimization......

Full Text Link