Enable CPU benchmark for VLLM perf dashboard (#44)
* first draft to enable CPU benchmark
* Update .github/workflows/vllm-benchmark.yml
Co-authored-by: Huy Do <huydhn@gmail.com>
* fix for ROCm changes
* change to use public cpu vllm postmerge registry
* target on 4 NUMA node EMR machine
* Update vllm-benchmark.yml
* Update vllm-benchmark.yml
* Fix CPU suffix
* Rename benchmark files to include the device name
Signed-off-by: Huy Do <huydhn@gmail.com>
* Fix model selection for CPU devices
Signed-off-by: Huy Do <huydhn@gmail.com>
* Update the workflow
Signed-off-by: Huy Do <huydhn@gmail.com>
* Another try
Signed-off-by: Huy Do <huydhn@gmail.com>
* Use python3
Signed-off-by: Huy Do <huydhn@gmail.com>
* Testing 1 2 3
Signed-off-by: Huy Do <huydhn@gmail.com>
* Does this work?
Signed-off-by: Huy Do <huydhn@gmail.com>
* Debug
Signed-off-by: Huy Do <huydhn@gmail.com>
* Typo
Signed-off-by: Huy Do <huydhn@gmail.com>
* Fix Docker usage
Signed-off-by: Huy Do <huydhn@gmail.com>
* Missing ON_CPU?
Signed-off-by: Huy Do <huydhn@gmail.com>
* Testing 1 2 3
Signed-off-by: Huy Do <huydhn@gmail.com>
* Fix the upload script
Signed-off-by: Huy Do <huydhn@gmail.com>
* Update .github/workflows/vllm-benchmark.yml
Co-authored-by: Louie Tsai <louie.tsai@intel.com>
* Typo
Signed-off-by: Huy Do <huydhn@gmail.com>
* Sanitize the device type
Signed-off-by: Huy Do <huydhn@gmail.com>
* Wrong variable
Signed-off-by: Huy Do <huydhn@gmail.com>
* c7i.metal-24xl has only 1 NUMA node
Co-authored-by: Louie Tsai <louie.tsai@intel.com>
---------
Signed-off-by: Huy Do <huydhn@gmail.com>
Co-authored-by: Tsai, Louie <louie.tsai@intel.com>