FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks, ASPLOS’23
Coming soon
ASPLOS’23, slides, poster, video