Actions: stanford-crfm/helm

Test

2,868 workflow run results

Add CASEHold scenario
Test #7618: Pull request #3164 opened by yifanmai
November 15, 2024 19:39 13m 39s yifanmai/fix-casehold
Added COT Metric and Adapter to MMLU Pro
Test #7617: Pull request #3162 opened by siyagoel
November 15, 2024 11:57 13m 3s siyagoel/mmlupro_with_cot
Add Casual Conversation V2 audio scenario (#3158)
Test #7615: Commit dd8a58a pushed by teetone
November 15, 2024 07:13 12m 57s main
Add Casual Conversation V2 audio scenario
Test #7614: Pull request #3158 synchronize by ImKeTT
November 14, 2024 20:20 13m 44s ImKeTT:fairness_audio_scenarios
Adding WildBench
Test #7613: Pull request #3150 synchronize by liamjxu
November 14, 2024 20:14 12m 33s jialiang/wildbench
Add Casual Conversation V2 audio scenario
Test #7612: Pull request #3158 synchronize by ImKeTT
November 14, 2024 20:13 13m 15s ImKeTT:fairness_audio_scenarios
Add Casual Conversation V2 audio scenario
Test #7611: Pull request #3158 synchronize by ImKeTT
November 14, 2024 20:11 5m 16s ImKeTT:fairness_audio_scenarios
Adding WildBench
Test #7610: Pull request #3150 synchronize by liamjxu
November 14, 2024 20:10 13m 12s jialiang/wildbench
Adding WildBench
Test #7609: Pull request #3150 synchronize by liamjxu
November 14, 2024 20:03 5m 25s jialiang/wildbench
Adding WildBench
Test #7608: Pull request #3150 synchronize by liamjxu
November 14, 2024 08:06 12m 21s jialiang/wildbench
Added Metric for COT
Test #7607: Pull request #3159 synchronize by siyagoel
November 14, 2024 08:00 12m 47s siyagoel/cotmetric
Adding WildBench
Test #7606: Pull request #3150 synchronize by liamjxu
November 14, 2024 07:50 7m 41s jialiang/wildbench
Adding WildBench
Test #7605: Pull request #3150 synchronize by liamjxu
November 14, 2024 07:40 7m 31s jialiang/wildbench
Added Metric for COT
Test #7604: Pull request #3159 synchronize by siyagoel
November 14, 2024 07:35 7m 46s siyagoel/cotmetric
Adding WildBench
Test #7603: Pull request #3150 synchronize by liamjxu
November 14, 2024 06:52 7m 46s jialiang/wildbench
Adding WildBench
Test #7602: Pull request #3150 synchronize by liamjxu
November 14, 2024 06:37 8m 10s jialiang/wildbench
Adding WildBench
Test #7601: Pull request #3150 synchronize by liamjxu
November 14, 2024 06:21 7m 41s jialiang/wildbench
Adding WildBench
Test #7600: Pull request #3150 synchronize by liamjxu
November 14, 2024 05:41 7m 21s jialiang/wildbench
Adding WildBench
Test #7599: Pull request #3150 synchronize by liamjxu
November 14, 2024 05:29 7m 49s jialiang/wildbench
Add Casual Conversation V2 audio scenario
Test #7598: Pull request #3158 opened by ImKeTT
November 14, 2024 00:09 12m 53s ImKeTT:fairness_audio_scenarios
Adding WildBench
Test #7597: Pull request #3150 synchronize by liamjxu
November 13, 2024 23:18 13m 30s jialiang/wildbench
Adding WildBench
Test #7596: Pull request #3150 synchronize by liamjxu
November 13, 2024 23:17 5m 7s jialiang/wildbench
Adding WildBench
Test #7595: Pull request #3150 synchronize by liamjxu
November 13, 2024 21:55 13m 36s jialiang/wildbench
Fix speech schema (#3157)
Test #7594: Commit b6b2971 pushed by ImKeTT
November 13, 2024 17:37 12m 50s main