diff --git a/CHANGELOG.md b/CHANGELOG.md index f8afdfd9a3..a80f6e19d7 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -2,6 +2,42 @@ ## [Upcoming] +## [v0.5.1] - 2024-05-06 + +### Scenarios + +- Updated VLM scenarios for VHELM (#1592) +- Added `trust_remote_code=True` for `math` and `legalbench` scenarios (#2597) + +### Models + +- Added Google Gemini 1.5 Pro as a VLM (#2607) +- Add Mixtral 8x22b Instruct, Llama 3 Chat, and Yi Chat (#2610) +- Updated VLM models for VHELM (#1592) +- Improved handling of Gemini content blocking (#2603) +- Added default instructions prefix for multiple-choice joint adaptation for Claude 3 +- Renamed some model names for consistency (#2596) +- Added Snowflake Arctic Instruct (#2599, #2591) + +### Frontend + +- Added global landing page (#2593) + +### Evaluation Results + +- [VHELM v1.0.0](https://crfm.stanford.edu/helm/mmlu/v1.0.0/) + - Initial release with Gemini 1.5 Pro, Claude 3 Sonnet, Claude 3 Opus, Gemini 1.0 Pro Vision, IDEFICS 2 and GPT-4V + +### Contributors + +Thank you to the following contributors for your work on this HELM release! + +- @farzaank +- @neelguha +- @percyliang +- @teetone +- @yifanmai + ## [v0.5.0] - 2024-04-23 ### Breaking changes @@ -425,7 +461,8 @@ Thank you to the following contributors for your contributions to this HELM rele - Initial release -[upcoming]: https://github.com/stanford-crfm/helm/compare/v0.5.0...HEAD +[upcoming]: https://github.com/stanford-crfm/helm/compare/v0.5.1...HEAD +[v0.5.1]: https://github.com/stanford-crfm/helm/releases/tag/v0.5.1 [v0.5.0]: https://github.com/stanford-crfm/helm/releases/tag/v0.5.0 [v0.4.0]: https://github.com/stanford-crfm/helm/releases/tag/v0.4.0 [v0.3.0]: https://github.com/stanford-crfm/helm/releases/tag/v0.3.0 diff --git a/setup.cfg b/setup.cfg index d11686b6c9..19674d65c5 100644 --- a/setup.cfg +++ b/setup.cfg @@ -1,6 +1,6 @@ [metadata] name = crfm-helm -version = 0.5.0 +version = 0.5.1 author = Stanford CRFM author_email = contact-crfm@stanford.edu description = Benchmark for language models