Skip to content

Actions: HabanaAI/vllm-fork

pre-commit

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,580 workflow runs
2,580 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Enable embedding accuracy test with hpu
pre-commit #2639: Pull request #1424 synchronize by akarnows
June 23, 2025 07:56 6m 1s dev/akarnowski/embed_acc_test
June 23, 2025 07:56 6m 1s
Fix multimodal warmup
pre-commit #2638: Pull request #1465 synchronize by adobrzyn
June 23, 2025 07:51 6m 2s adobrzyn/mm_bucketing_fix
June 23, 2025 07:51 6m 2s
Fix multimodal warmup
pre-commit #2637: Pull request #1465 opened by adobrzyn
June 23, 2025 07:40 6m 3s adobrzyn/mm_bucketing_fix
June 23, 2025 07:40 6m 3s
Integrate DP with PD
pre-commit #2636: Pull request #1461 synchronize by xinyu-intel
June 23, 2025 06:56 4m 42s dev/xinyu/pd_dp_integration
June 23, 2025 06:56 4m 42s
Enabled BnB NF4 inference on Gaudi
pre-commit #2635: Pull request #1457 synchronize by rsshaik1
June 23, 2025 06:33 4m 49s tests_bnb
June 23, 2025 06:33 4m 49s
DP: Optimizations for Data Parallel Attention
pre-commit #2633: Pull request #1463 synchronize by xinyu-intel
June 23, 2025 06:20 4m 52s dev/xinyu/dp_opt
June 23, 2025 06:20 4m 52s
DP: Fix init_device for DP
pre-commit #2630: Pull request #1464 opened by xinyu-intel
June 23, 2025 05:49 6m 6s dev/xinyu/fix_init_dev_dp
June 23, 2025 05:49 6m 6s
Enabled BnB NF4 inference on Gaudi
pre-commit #2629: Pull request #1457 synchronize by rsshaik1
June 23, 2025 05:19 6m 9s tests_bnb
June 23, 2025 05:19 6m 9s
DP: Optimizations for Data Parallel Attention
pre-commit #2628: Pull request #1463 synchronize by xinyu-intel
June 23, 2025 04:44 5m 20s dev/xinyu/dp_opt
June 23, 2025 04:44 5m 20s
Enabled BnB NF4 inference on Gaudi
pre-commit #2627: Pull request #1457 synchronize by rsshaik1
June 23, 2025 04:28 6m 5s tests_bnb
June 23, 2025 04:28 6m 5s
DP: Optimizations for Data Parallel Attention
pre-commit #2625: Pull request #1463 synchronize by xinyu-intel
June 23, 2025 03:54 6m 13s dev/xinyu/dp_opt
June 23, 2025 03:54 6m 13s
Integrate DP with PD
pre-commit #2624: Pull request #1461 synchronize by xinyu-intel
June 23, 2025 03:49 5m 56s dev/xinyu/pd_dp_integration
June 23, 2025 03:49 5m 56s
DP: Optimizations for Data Parallel Attention
pre-commit #2623: Pull request #1463 opened by xinyu-intel
June 23, 2025 03:42 6m 20s dev/xinyu/dp_opt
June 23, 2025 03:42 6m 20s
[PD] reduce a D2D copy.
pre-commit #2622: Pull request #1462 opened by jikunshang
June 23, 2025 03:14 3m 47s jikunshang:ds_r1_0620
June 23, 2025 03:14 3m 47s
Integrate DP with PD
pre-commit #2621: Pull request #1461 opened by xinyu-intel
June 23, 2025 02:46 6m 4s dev/xinyu/pd_dp_integration
June 23, 2025 02:46 6m 4s
[P/D] add acc test script of hpu pd disagg
pre-commit #2620: Pull request #1394 synchronize by zhenwei-intel
June 23, 2025 01:51 4m 51s lzw/add_test_script
June 23, 2025 01:51 4m 51s
Revise the README
pre-commit #2619: Pull request #1459 opened by taotod
June 22, 2025 12:39 6m 5s taotod:aice/v1.21.0
June 22, 2025 12:39 6m 5s
Fix the script file typo in README file
pre-commit #2618: Pull request #1458 opened by taotod
June 22, 2025 09:00 3m 51s taotod:deepseek_r1
June 22, 2025 09:00 3m 51s
Bucketing refactoring
pre-commit #2616: Pull request #1414 synchronize by adobrzyn
June 20, 2025 11:04 5m 34s adobrzyn/bucketing_refactor
June 20, 2025 11:04 5m 34s