Add a_1_128_w_128_128 (DeepSeek style) float8 scaling for inference #11624
Annotations
1 warning
          | 
                      
                          build_docs (3.11)
                        
                      
                       WARNING conda.cli.main_config:_set_key(451): Key auto_activate_base is an alias of auto_activate; setting value with latter
 | 
Artifacts
Produced during runtime
          | Name | Size | Digest | |
|---|---|---|---|
| 
                        
                          Doc-Build
                        
                       | 6.06 MB | sha256:df5827d7a118b5aa5fbb5ff9f6492737b1cc11bdf2ae4d274a739257a0b8c98a |  |