Multimodal prototyping #2243

lintangsutawika · 2024-08-22T22:23:36Z

No description provided.

accesslint

There are accessibility issues in these changes.

lm_eval/tasks/mmmu/utils.py

accesslint

There are accessibility issues in these changes.

lm_eval/models/hf_vlms.py

lm_eval/evaluator.py

haileyschoelkopf

@baberabb and I are merging this, though we'll continue iterating on the model/task design from here!

`mmmu_val` scores for a few models we used for testing during development can be found in the MMMU-specific README. Scores tend to match or slightly exceed those of the lmms-eval implementation, although they don't always match the model authors' reported scores (for which no evaluation code has been published).

* add WIP hf vlm class
* add doc_to_image
* add mmmu tasks
* fix merge conflicts
* add lintang's changes to hf_vlms.py
* fix doc_to_image
* added yaml_path for config-loading
* revert
* add line to process str type v
* update
* modeling cleanup
* add aggregation for mmmu
* rewrite MMMU processing code based on only MMMU authors' repo (doc_to_image still WIP)
* implemented doc_to_image
* update doc_to_image to accept list of features
* update functions
* readd image processed
* update args process
* bugfix for repeated images fed to model
* push WIP loglikelihood code
* commit most recent code (generative ; qwen2-vl testing)
* preliminary image_token_id handling
* small mmmu update: some qs have >4 mcqa options
* push updated modeling code
* use processor.apply_chat_template
* add mathvista draft
* nit
* nit
* ensure no footguns in text<>multimodal LM<>task incompatibility
* add notification to readme regarding launch of prototype!
* fix compatibility check
* reorganize mmmu configs
* chat_template=None
* add interleave chat_template
* add condition
* add max_images; interleave=true
* nit
* testmini_mcq
* nit
* pass image string; convert img
* add vllm
* add init
* vlm add multi attr
* fixup
* pass max images to vllm model init
* nit
* encoding to device
* fix HFMultimodalLM.chat_template ?
* add mmmu readme
* remove erroneous prints
* use HFMultimodalLM.chat_template ; restore tasks/__init__.py
* add docstring for replace_placeholders in utils
* fix `replace_placeholders`; set image_string=None
* fix typo
* cleanup + fix merge conflicts
* update MMMU readme
* del mathvista
* add some sample scores
* Update README.md
* add log msg for image_string value

---------

Co-authored-by: haileyschoelkopf
Co-authored-by: Baber Abbasi
Co-authored-by: Baber
Co-authored-by: Hailey Schoelkopf

haileyschoelkopf and others added 21 commits July 2, 2024 12:30

add WIP hf vlm class

5b62a52

add doc_to_image

34a079e

add mmmu tasks

8bce8cf

Merge branch 'hailey-multimodal-prototyping' into multimodal-prototyping

6cc6e9c

Merge branch 'main' into multimodal-prototyping

e4db76c

fix merge conflicts

1c94a54

add lintang's changes to hf_vlms.py

9692aa0

fix doc_to_image

90ba03a

added yaml_path for config-loading

aa6c50e

revert

7c76574

add line to process str type v

1b9deaa

update

8db0a47

modeling cleanup

9b9ca7b

merge with lintang-multimodal-prototyping

df7fee6

add aggregation for mmmu

8d92a68

rewrite MMMU processing code based on only MMMU authors' repo (doc_to_image still WIP)

f410d35

implemented doc_to_image

ebf54d8

update doc_to_image to accept list of features

941b502

update functions

8e4c1d6

readd image processed

63bcbc5

update args process

15dda35

accesslint bot reviewed Aug 22, 2024

haileyschoelkopf added 2 commits August 23, 2024 17:25

bugfix for repeated images fed to model

d811a3a

push WIP loglikelihood code

2242ed3

accesslint bot reviewed Sep 3, 2024

lm_eval/models/hf_vlms.py (outdated, resolved)

commit most recent code (generative ; qwen2-vl testing)

be14ac1

accesslint bot reviewed Sep 9, 2024

lm_eval/models/hf_vlms.py (outdated, resolved)

preliminary image_token_id handling

7516b88

accesslint bot reviewed Sep 9, 2024

lm_eval/models/hf_vlms.py (resolved)

lm_eval/models/hf_vlms.py (resolved)

lm_eval/models/hf_vlms.py (outdated, resolved)

small mmmu update: some qs have >4 mcqa options

5a65d10

EleutherAI deleted a comment from accesslint bot Sep 13, 2024

haileyschoelkopf reviewed Sep 13, 2024

lm_eval/evaluator.py (outdated, resolved)

Merge branch 'main' into multimodal-prototyping

805a115

haileyschoelkopf added 2 commits September 13, 2024 13:16

remove erroneous prints

05f0dd6

Merge branch 'multimodal-prototyping' of https://github.com/EleutherAI/lm-evaluation-harness into multimodal-prototyping

4623768