Commit 09d15c3

Fix MMMU-Pro evaluation (EvolvingLMMs-Lab#296)
1 parent d674d31 commit 09d15c3

4 files changed: 6 additions, 6 deletions


lmms_eval/tasks/mmmu_pro/mmmu_pro.yaml

Lines changed: 1 addition & 1 deletion
@@ -2,4 +2,4 @@ group: mmmu_pro
 task:
 - mmmu_pro_vision
 # - mmmu_pro_composite # removing composite task in formal MMMU-Pro evaluation
-- mmmu_pro_original
+- mmmu_pro_standard
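
The group file refers to its tasks by name, so the rename has to appear both in the group list and in the `task:` field of each renamed file. The following is a minimal sketch of a consistency check for that, assuming the YAML contents are available as strings. The helper names (`group_tasks`, `task_name`, `find_missing_tasks`) and the regexes are illustrative assumptions and are not part of lmms_eval.

```python
import re
from typing import List, Optional


def group_tasks(group_yaml: str) -> List[str]:
    """Return the uncommented `- name` entries listed in a group file."""
    return re.findall(r"^\s*-\s*([A-Za-z0-9_]+)\s*$", group_yaml, flags=re.MULTILINE)


def task_name(task_yaml: str) -> Optional[str]:
    """Return the value of the top-level `task:` field, if present."""
    m = re.search(r'^task:\s*"?([A-Za-z0-9_]+)"?\s*$', task_yaml, flags=re.MULTILINE)
    return m.group(1) if m else None


def find_missing_tasks(group_yaml: str, task_yamls: List[str]) -> List[str]:
    """List group entries that no task file defines (hypothetical check)."""
    defined = {task_name(t) for t in task_yamls}
    return [t for t in group_tasks(group_yaml) if t not in defined]
```

With the pre-commit group file and the renamed task files, a check like this would report `mmmu_pro_original` as undefined; after the commit it would report nothing.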

lmms_eval/tasks/mmmu_pro/mmmu_pro_original.yaml renamed to lmms_eval/tasks/mmmu_pro/mmmu_pro_standard.yaml

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
-task: "mmmu_pro_original"
+task: "mmmu_pro_standard"
 dataset_path: MMMU/MMMU_Pro
-dataset_name: original
+dataset_name: standard
 test_split: test
 output_type: generate_until
 doc_to_visual: !function utils.mmmu_pro_doc_to_visual

lmms_eval/tasks/mmmu_pro/mmmu_pro_original_cot.yaml renamed to lmms_eval/tasks/mmmu_pro/mmmu_pro_standard_cot.yaml

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
-task: "mmmu_pro_original_cot"
+task: "mmmu_pro_standard_cot"
 dataset_path: MMMU/MMMU_Pro
-dataset_name: original
+dataset_name: standard
 test_split: test
 output_type: generate_until
 doc_to_visual: !function utils.mmmu_pro_doc_to_visual

lmms_eval/tasks/mmmu_pro/mmmu_pro_vision.yaml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ dataset_name: vision
 test_split: test
 output_type: generate_until
 doc_to_visual: !function utils.mmmu_pro_doc_to_visual
-doc_to_text: "Answer with the option letter from the given choices directly. The last line of your response should be of the following format: 'Answer: $LETTER' (without quotes) where LETTER is one of options."
+doc_to_text: "Answer with the option letter from the given choices directly."
 doc_to_target: "{{answer}}"
 # The return value of process_results will be used by metrics
 process_results: !function utils.mmmu_pro_process_results
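
The prompt change above drops the 'Answer: $LETTER' format instruction, so responses for the vision task are expected to be a bare option letter. How `utils.mmmu_pro_process_results` actually parses responses is not shown in this commit. The sketch below only illustrates a parser that would tolerate both response styles; `extract_letter` is a hypothetical helper, and the ten-letter option range is an assumption.

```python
import re
from typing import Optional


def extract_letter(response: str, options: str = "ABCDEFGHIJ") -> Optional[str]:
    """Pull an option letter out of a model response (illustrative only)."""
    # Prefer an explicit "Answer: X" marker; take the last one if repeated.
    marked = re.findall(rf"Answer:\s*\(?([{options}])\)?", response)
    if marked:
        return marked[-1]
    # Otherwise accept a bare letter, optionally written as "(B)" or "B.".
    bare = re.fullmatch(rf"\s*\(?([{options}])\)?[.:]?\s*", response)
    return bare.group(1) if bare else None
```

A response that is neither a bare letter nor marked with "Answer:" returns `None` here, which leaves the scoring decision to the caller.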
