Skip to content

Actions: stanford-crfm/helm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
4,039 workflow run results
4,039 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Scenario tests
Scenario tests #120: Scheduled
November 17, 2024 15:35 8m 50s main
November 17, 2024 15:35 8m 50s
November 16, 2024 16:44 1m 0s
Scenario tests
Scenario tests #119: Scheduled
November 16, 2024 15:34 13m 16s main
November 16, 2024 15:34 13m 16s
Add CASEHold scenario
Test #7618: Pull request #3164 opened by yifanmai
November 15, 2024 19:39 13m 39s yifanmai/fix-casehold
November 15, 2024 19:39 13m 39s
Release AIR-Bench v1.3.0 leaderboard (#3163)
Build Frontend #149: Commit 145cbe1 pushed by yifanmai
November 15, 2024 19:06 48s main
November 15, 2024 19:06 48s
Release AIR-Bench v1.3.0 leaderboard (#3163)
Frontend #664: Commit 145cbe1 pushed by yifanmai
November 15, 2024 19:06 1m 2s main
November 15, 2024 19:06 1m 2s
Release AIR-Bench v1.3.0 leaderboard
Frontend #663: Pull request #3163 opened by yifanmai
November 15, 2024 18:28 1m 7s yifanmai/fix-air-bench-v1.3.0
November 15, 2024 18:28 1m 7s
Scenario tests
Scenario tests #118: Scheduled
November 15, 2024 15:35 11m 28s main
November 15, 2024 15:35 11m 28s
Added COT Metric and Adapter to MMLU Pro
Test #7617: Pull request #3162 opened by siyagoel
November 15, 2024 11:57 13m 3s siyagoel/mmlupro_with_cot
November 15, 2024 11:57 13m 3s
Add Casual Conversation V2 audio scenario (#3158)
Test #7615: Commit dd8a58a pushed by teetone
November 15, 2024 07:13 12m 57s main
November 15, 2024 07:13 12m 57s
Add Casual Conversation V2 audio scenario
Test #7614: Pull request #3158 synchronize by ImKeTT
November 14, 2024 20:20 13m 44s ImKeTT:fairness_audio_scenarios
November 14, 2024 20:20 13m 44s
Adding WildBench
Scenario tests #117: Pull request #3150 synchronize by liamjxu
November 14, 2024 20:14 11m 42s jialiang/wildbench
November 14, 2024 20:14 11m 42s
Adding WildBench
Test #7613: Pull request #3150 synchronize by liamjxu
November 14, 2024 20:14 12m 33s jialiang/wildbench
November 14, 2024 20:14 12m 33s
Add Casual Conversation V2 audio scenario
Test #7612: Pull request #3158 synchronize by ImKeTT
November 14, 2024 20:13 13m 15s ImKeTT:fairness_audio_scenarios
November 14, 2024 20:13 13m 15s
Add Casual Conversation V2 audio scenario
Test #7611: Pull request #3158 synchronize by ImKeTT
November 14, 2024 20:11 5m 16s ImKeTT:fairness_audio_scenarios
November 14, 2024 20:11 5m 16s
Adding WildBench
Scenario tests #116: Pull request #3150 synchronize by liamjxu
November 14, 2024 20:10 12m 15s jialiang/wildbench
November 14, 2024 20:10 12m 15s
Adding WildBench
Test #7610: Pull request #3150 synchronize by liamjxu
November 14, 2024 20:10 13m 12s jialiang/wildbench
November 14, 2024 20:10 13m 12s
Adding WildBench
Scenario tests #115: Pull request #3150 synchronize by liamjxu
November 14, 2024 20:03 5m 10s jialiang/wildbench
November 14, 2024 20:03 5m 10s
Adding WildBench
Test #7609: Pull request #3150 synchronize by liamjxu
November 14, 2024 20:03 5m 25s jialiang/wildbench
November 14, 2024 20:03 5m 25s
Scenario tests
Scenario tests #114: Scheduled
November 14, 2024 15:35 12m 52s main
November 14, 2024 15:35 12m 52s
Adding WildBench
Scenario tests #113: Pull request #3150 synchronize by liamjxu
November 14, 2024 08:06 11m 13s jialiang/wildbench
November 14, 2024 08:06 11m 13s
Adding WildBench
Test #7608: Pull request #3150 synchronize by liamjxu
November 14, 2024 08:06 12m 21s jialiang/wildbench
November 14, 2024 08:06 12m 21s
Added Metric for COT
Test #7607: Pull request #3159 synchronize by siyagoel
November 14, 2024 08:00 12m 47s siyagoel/cotmetric
November 14, 2024 08:00 12m 47s
Adding WildBench
Test #7606: Pull request #3150 synchronize by liamjxu
November 14, 2024 07:50 7m 41s jialiang/wildbench
November 14, 2024 07:50 7m 41s