Skip to content

Commit

Permalink
Release v0.5.1 (#2606)
Browse files Browse the repository at this point in the history
  • Loading branch information
yifanmai authored May 7, 2024
1 parent e708b20 commit ccf1df5
Show file tree
Hide file tree
Showing 2 changed files with 39 additions and 2 deletions.
39 changes: 38 additions & 1 deletion CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,42 @@

## [Upcoming]

## [v0.5.1] - 2024-05-06

### Scenarios

- Updated VLM scenarios for VHELM (#1592)
- Added `trust_remote_code=True` for `math` and `legalbench` scenarios (#2597)

### Models

- Added Google Gemini 1.5 Pro as a VLM (#2607)
- Add Mixtral 8x22b Instruct, Llama 3 Chat, and Yi Chat (#2610)
- Updated VLM models for VHELM (#1592)
- Improved handling of Gemini content blocking (#2603)
- Added default instructions prefix for multiple-choice joint adaptation for Claude 3
- Renamed some model names for consistency (#2596)
- Added Snowflake Arctic Instruct (#2599, #2591)

### Frontend

- Added global landing page (#2593)

### Evaluation Results

- [VHELM v1.0.0](https://crfm.stanford.edu/helm/mmlu/v1.0.0/)
- Initial release with Gemini 1.5 Pro, Claude 3 Sonnet, Claude 3 Opus, Gemini 1.0 Pro Vision, IDEFICS 2 and GPT-4V

### Contributors

Thank you to the following contributors for your work on this HELM release!

- @farzaank
- @neelguha
- @percyliang
- @teetone
- @yifanmai

## [v0.5.0] - 2024-04-23

### Breaking changes
Expand Down Expand Up @@ -425,7 +461,8 @@ Thank you to the following contributors for your contributions to this HELM rele

- Initial release

[upcoming]: https://github.com/stanford-crfm/helm/compare/v0.5.0...HEAD
[upcoming]: https://github.com/stanford-crfm/helm/compare/v0.5.1...HEAD
[v0.5.1]: https://github.com/stanford-crfm/helm/releases/tag/v0.5.1
[v0.5.0]: https://github.com/stanford-crfm/helm/releases/tag/v0.5.0
[v0.4.0]: https://github.com/stanford-crfm/helm/releases/tag/v0.4.0
[v0.3.0]: https://github.com/stanford-crfm/helm/releases/tag/v0.3.0
Expand Down
2 changes: 1 addition & 1 deletion setup.cfg
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
[metadata]
name = crfm-helm
version = 0.5.0
version = 0.5.1
author = Stanford CRFM
author_email = [email protected]
description = Benchmark for language models
Expand Down

0 comments on commit ccf1df5

Please sign in to comment.