-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Open
Labels
Description
Which component requires the feature?
CuTe DSL
Feature Request
I'd love to be able to control L2 cache eviction when doing TMA load and TMA store (e.g. evict_first, evict_last)
Additional context
This is important for some attention kernels, as we used it in FA3, e.g. here:
https://github.com/Dao-AILab/flash-attention/blob/413d07e9deef1e3c793c7de59d7146b43ae4d558/hopper/mainloop_fwd_sm90_tma_gmma_ws.hpp#L753
Chillee and thakkarV