-
Notifications
You must be signed in to change notification settings - Fork 265
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes for MMLU PRO with COT #3200
Conversation
when: "?" | ||
language: English | ||
|
||
- name: ifeval |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't delete ifeval
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This seems identical to chain_of_thought_metric.py
- delete this file.
Note: use git rm
to delete files in your commit (doc).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Delete this file (it is a copy of mmlu_scenario_pro.py
)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Delete this file. (this is a copy of mmlu_pro_scenario.py
input_prefix="What is the correct answer to this question: ", | ||
input_suffix="\nChoices:\n", | ||
output_prefix="", | ||
reference_prefix="(A) ", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete this line with reference_prefix
(just let AdapterSpec
use the default value).
what: "?" | ||
who: "?" | ||
when: "?" | ||
language: English |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
put these back for ifeval
main_name: ifeval_strict_accuracy | ||
main_name: chain_of_thought_correct # non-CoT |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should be just main_name: ifeval_strict_accuracy
, delete the other line
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
Final changes to MMLU Pro with COT