Changes for MMLU PRO with COT #3200

siyagoel · 2024-12-06T00:37:46Z

Final changes to MMLU Pro with COT

yifanmai · 2024-12-06T22:01:52Z

src/helm/benchmark/static/schema_lite_v2.yaml

-      when: "?"
-      language: English
-
-  - name: ifeval


Don't delete ifeval

yifanmai · 2024-12-06T22:03:17Z

src/helm/benchmark/metrics/chain_of_thought_metric_correctness.py

This seems identical to chain_of_thought_metric.py - delete this file.

Note: use git rm to delete files in your commit (doc).

yifanmai · 2024-12-06T22:04:41Z

src/helm/benchmark/scenarios/mmlu_pro.py

Delete this file (it is a copy of mmlu_scenario_pro.py)

yifanmai · 2024-12-06T22:05:45Z

src/helm/benchmark/scenarios/mmlu_scenario_pro.py

Delete this file. (this is a copy of mmlu_pro_scenario.py

yifanmai · 2024-12-06T22:09:05Z

src/helm/benchmark/run_specs/lite_run_specs.py

+            input_prefix="What is the correct answer to this question: ",
+            input_suffix="\nChoices:\n",
+            output_prefix="",
+            reference_prefix="(A) ",


delete this line with reference_prefix (just let AdapterSpec use the default value).

yifanmai · 2024-12-06T22:39:01Z

src/helm/benchmark/static/schema_lite_v2.yaml

-      what: "?"
-      who: "?"
-      when: "?"
-      language: English


put these back for ifeval

yifanmai · 2024-12-06T22:39:20Z

src/helm/benchmark/static/schema_lite_v2.yaml

+      main_name: ifeval_strict_accuracy
+      main_name: chain_of_thought_correct  # non-CoT


should be just main_name: ifeval_strict_accuracy, delete the other line

yifanmai

Thanks!

siyagoel and others added 27 commits November 11, 2024 15:17

Committing changes for COT metric

fad62fd

Changes for COT metrix

89460ec

Changes to COT metric

a366e24

Changes to COT Metric

d676183

Changes made to file.

de6b9b1

Changes made

6c09cbc

Committing changes

2e02fb7

Changes committed

d039a9d

orrect changes to metric

af03185

format changes

d675da0

changes

16afbbe

Merge branch 'main' into siyagoel/cotmetric

4a8e167

changes to file

d367578

changed format

23968c2

changes to file by deleting

90ac194

reformat file

7cfbb1c

changes in files for schema_lite_z2.yaml

c876828

Changes to address comments

97a9aff

changes added based on comments

6d5eb55

MMLU Pro With Metric

1398ab2

format changes to files

243057e

Changes to most recent comments

0ce9bc9

changes for format

bd7edc1

changed the correctness metric

c9b2082

Adding new file changes

616b30e

Adding a new file

d71d92d

Changes to formatting

33a65de

yifanmai requested changes Dec 6, 2024

View reviewed changes

Please enter the commit message for your changes. Lines starting

4388c8e

yifanmai requested changes Dec 6, 2024

View reviewed changes

siyagoel added 2 commits December 6, 2024 14:41

Changed schema file

ed10470

Changed test scenario to match changes

b09dd91

yifanmai approved these changes Dec 6, 2024

View reviewed changes

yifanmai merged commit ff9c7c9 into main Dec 6, 2024
12 checks passed

yifanmai deleted the mmlupro_with_cot branch December 6, 2024 23:25

yifanmai mentioned this pull request Dec 19, 2024

Fixes to HELM capabilities #3225

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changes for MMLU PRO with COT #3200

Changes for MMLU PRO with COT #3200

siyagoel commented Dec 6, 2024

yifanmai Dec 6, 2024

yifanmai Dec 6, 2024

yifanmai Dec 6, 2024

yifanmai Dec 6, 2024

yifanmai Dec 6, 2024

yifanmai Dec 6, 2024

yifanmai Dec 6, 2024

yifanmai left a comment

		main_name: ifeval_strict_accuracy
		main_name: chain_of_thought_correct # non-CoT

Changes for MMLU PRO with COT #3200

Changes for MMLU PRO with COT #3200

Conversation

siyagoel commented Dec 6, 2024

yifanmai Dec 6, 2024

Choose a reason for hiding this comment

yifanmai Dec 6, 2024

Choose a reason for hiding this comment

yifanmai Dec 6, 2024

Choose a reason for hiding this comment

yifanmai Dec 6, 2024

Choose a reason for hiding this comment

yifanmai Dec 6, 2024

Choose a reason for hiding this comment

yifanmai Dec 6, 2024

Choose a reason for hiding this comment

yifanmai Dec 6, 2024

Choose a reason for hiding this comment

yifanmai left a comment

Choose a reason for hiding this comment