Skip to content

Actions: stanford-crfm/helm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
4,433 workflow runs
4,433 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Scenario tests
Scenario tests #213: Scheduled
December 25, 2024 15:34 9m 59s main
December 25, 2024 15:34 9m 59s
Scenario tests
Scenario tests #212: Scheduled
December 24, 2024 15:34 9m 9s main
December 24, 2024 15:34 9m 9s
MedHelm: Add VQA-RAD scenario and specs
Test #7813: Pull request #3246 opened by sashimono-san
December 24, 2024 14:06 Action required sashimono-san:feat/vqa_rad_scenario
December 24, 2024 14:06 Action required
pip in /scripts/data_overlap for jinja2, jinja2 - Update #937519456
Dependabot Updates #41: by dependabot bot
December 24, 2024 00:36 50s main
December 24, 2024 00:36 50s
pip in /scripts/data_overlap for jinja2 - Update #937519429
Dependabot Updates #40: by dependabot bot
December 24, 2024 00:36 53s main
December 24, 2024 00:36 53s
Fix LiveQA import (#3244)
Test #7811: Commit e2e7270 pushed by yifanmai
December 23, 2024 22:52 9m 59s main
December 23, 2024 22:52 9m 59s
Scenario tests
Scenario tests #211: Scheduled
December 23, 2024 15:34 8m 18s main
December 23, 2024 15:34 8m 18s
Scenario tests
Scenario tests #210: Scheduled
December 22, 2024 15:34 11m 25s main
December 22, 2024 15:34 11m 25s
Scenario tests
Scenario tests #209: Scheduled
December 21, 2024 15:34 10m 0s main
December 21, 2024 15:34 10m 0s
Fix LiveQA import
Test #7810: Pull request #3244 opened by farzaank
December 21, 2024 10:22 10m 54s farzaan/scoreutil_bugfix
December 21, 2024 10:22 10m 54s
New safety scenario: HarmBench GCG-T (#3035)
Test #7809: Commit b1fcafd pushed by farzaank
December 21, 2024 09:38 11m 5s main
December 21, 2024 09:38 11m 5s
Clean up Markdown formatting in "Enterprise benchmark" documentation …
Test #7808: Commit 1224774 pushed by yifanmai
December 20, 2024 23:59 11m 35s main
December 20, 2024 23:59 11m 35s
Make encrypt_scenario_states script idempotent (#3242)
Test #7806: Commit 099a6c0 pushed by yifanmai
December 20, 2024 23:55 11m 7s main
December 20, 2024 23:55 11m 7s
Build frontend (#3236)
Test #7805: Commit 5bd2fbb pushed by yifanmai
December 20, 2024 23:48 11m 3s main
December 20, 2024 23:48 11m 3s
Make encrypt_scenario_states script idempotent
Test #7804: Pull request #3242 opened by yifanmai
December 20, 2024 23:39 11m 35s yifanmai/fix-encrypt-script
December 20, 2024 23:39 11m 35s
New safety scenario: HarmBench GCG-T
Test #7803: Pull request #3035 synchronize by yifanmai
December 20, 2024 22:45 11m 39s farzaan/hb-gcg
December 20, 2024 22:45 11m 39s
Add MedAlign scenario (#3038)
Test #7802: Commit 5e9bf74 pushed by yifanmai
December 20, 2024 22:43 11m 7s main
December 20, 2024 22:43 11m 7s
Add landing page for HELM Capabilities leaderboard (#3241)
Frontend #693: Commit 66eba9f pushed by yifanmai
December 20, 2024 22:41 1m 0s main
December 20, 2024 22:41 1m 0s
Add landing page for HELM Capabilities leaderboard (#3241)
Build Frontend #159: Commit 66eba9f pushed by yifanmai
December 20, 2024 22:41 47s main
December 20, 2024 22:41 47s
Fix incorrect file path in CzechBankQAAnnotator (#3240)
Test #7801: Commit d74392d pushed by yifanmai
December 20, 2024 21:55 11m 45s main
December 20, 2024 21:55 11m 45s
Bump jinja2 version (#3239)
Test #7800: Commit f368e67 pushed by yifanmai
December 20, 2024 21:41 12m 13s main
December 20, 2024 21:41 12m 13s