Closed
Description
Our CUDA DSS implementation relies heavily on DataLayouts internals. As a result, any material change to DataLayouts means rewriting nearly all of ext/cuda/topologies_dss.jl. I think we need to refactor this code so that it no longer depends on internals.
This is the biggest piece that blocks #2045.