Conversation

@robinhad (Contributor) commented Apr 7, 2025

This pull request enables passing options through to the vLLM engine directly. This is useful in scenarios like the following:

  1. The google/gemma-3-12b-it model weights fit on 2x24 GB GPUs, but allocating memory for its full 128k-token context window does not. The user can cap the context by specifying max_model_len directly:
    VLLM_USE_V1=0 VLLM_WORKER_MULTIPROC_METHOD=spawn lmms-eval --tasks mmmu --model vllm --model_args model_version=google/gemma-3-12b-it,tensor_parallel_size=2,gpu_memory_utilization=0.95,max_images=1,max_videos=0,max_audios=0,max_model_len=4096 --batch_size 100 --log_samples --output_path lmms-results

More generally, this enables support for all vLLM engine args via --model_args.
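For context, here is a minimal sketch of what such pass-through can look like, assuming a hypothetical build_engine wrapper (the actual lmms-eval vllm model code may differ): any keyword not consumed by the wrapper is forwarded verbatim to vllm.LLM.

```python
# Minimal sketch; build_engine is a hypothetical name, not lmms-eval's API.
from vllm import LLM


def build_engine(model_version: str, tensor_parallel_size: int = 1, **engine_kwargs):
    # Keywords the wrapper does not consume (e.g. max_model_len=4096,
    # gpu_memory_utilization=0.95) are passed straight through to vllm.LLM,
    # so any vLLM engine arg becomes configurable from the CLI.
    return LLM(
        model=model_version,
        tensor_parallel_size=tensor_parallel_size,
        **engine_kwargs,
    )


# Mirrors the CLI example above:
# build_engine("google/gemma-3-12b-it", tensor_parallel_size=2,
#              gpu_memory_utilization=0.95, max_model_len=4096)
```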

@Luodian merged commit 6ba0b5e into EvolvingLMMs-Lab:main Apr 12, 2025
1 check passed
dadwadw233 pushed a commit to dadwadw233/lmms-eval that referenced this pull request Apr 28, 2025
* Add ability to pass options to VLLM

* Add link to engine args