I notice that there exist an assertion ``` DG_HOST_ASSERT(fused_kv_cache.stride(1) == head_dim_with_sf); ``` When we sliced paged kvcache by layer_idx, the sliced paged kvcache may not be contiguous.