[Fix] Aria and LLama Vision and OpenAI compatible models #641

Luodian · 2025-04-18T14:19:29Z

Replaced AutoModelForCausalLM and AutoProcessor with AriaForConditionalGeneration and AriaProcessor in the Aria model.
Updated the pretrained model string in LlamaVision to "meta-llama/Llama-3.2-11B-Vision-Instruct".
Enhanced OpenAICompatible to support AzureOpenAI and modified API key handling for better flexibility.
Adjusted timeout parameter and refined token handling in OpenAICompatible for improved functionality.

Before you open a pull-request, please check if a similar issue already exists or has been closed before.

When you open a pull-request, please be sure to include the following

A descriptive title: [xxx] XXXX
A detailed description

If you meet the lint warnings, you can use following scripts to reformat code.

pip install pre-commit
pre-commit install
pre-commit run --all-files

Thank you for your contributions!

…dling - Added `system_prompt`, `interleave_visuals`, and `max_length` parameters to Qwen2_VL class. - Simplified device assignment logic for single process scenarios. - Improved visual processing by refining how visuals are handled and ensuring proper mapping to contexts. - Enhanced message construction to support interleaving of visuals and text based on placeholders. - Set default generation parameters and refined handling of generated outputs to ensure proper trimming and formatting.

… consistency - Updated string formatting to use double quotes for consistency. - Enhanced the `VisualPuzzles_doc_to_text` and `parse_response` functions for better clarity and structure. - Simplified conditional checks and improved whitespace handling in response parsing. - Ensured consistent handling of options and answers throughout the utility functions.

…Compatible - Replaced AutoModelForCausalLM and AutoProcessor with AriaForConditionalGeneration and AriaProcessor in the Aria model. - Updated the pretrained model string in LlamaVision to "meta-llama/Llama-3.2-11B-Vision-Instruct". - Enhanced OpenAICompatible to support AzureOpenAI and modified API key handling for better flexibility. - Adjusted timeout parameter and refined token handling in OpenAICompatible for improved functionality.

Luodian · 2025-04-18T14:24:02Z

We should close the last PR then could merge this since it's developed based on that.

#639

kcz358

Most of the codes look fine to me as this is the only way to hardcode under current logic. I will merge this PR.

A small notice for others:

We plan to rfc the doc to text these things in the future with messages format so these hardcoded models will gradually being deprecated. Please stay tuned for our update

…s-Lab#641) * Enhance Qwen model with additional parameters and improved visual handling - Added `system_prompt`, `interleave_visuals`, and `max_length` parameters to Qwen2_VL class. - Simplified device assignment logic for single process scenarios. - Improved visual processing by refining how visuals are handled and ensuring proper mapping to contexts. - Enhanced message construction to support interleaving of visuals and text based on placeholders. - Set default generation parameters and refined handling of generated outputs to ensure proper trimming and formatting. * Refactor VisualPuzzles utility functions for improved readability and consistency - Updated string formatting to use double quotes for consistency. - Enhanced the `VisualPuzzles_doc_to_text` and `parse_response` functions for better clarity and structure. - Simplified conditional checks and improved whitespace handling in response parsing. - Ensured consistent handling of options and answers throughout the utility functions. * Update model imports and parameters for Aria, LlamaVision, and OpenAICompatible - Replaced AutoModelForCausalLM and AutoProcessor with AriaForConditionalGeneration and AriaProcessor in the Aria model. - Updated the pretrained model string in LlamaVision to "meta-llama/Llama-3.2-11B-Vision-Instruct". - Enhanced OpenAICompatible to support AzureOpenAI and modified API key handling for better flexibility. - Adjusted timeout parameter and refined token handling in OpenAICompatible for improved functionality.

Luodian added 3 commits April 17, 2025 18:03

Luodian changed the title ~~Fix/models~~ [Fix] Aria and LLama Vision and OpenAI compatible models Apr 18, 2025

Luodian requested a review from kcz358 April 18, 2025 14:23

kcz358 approved these changes Apr 19, 2025

View reviewed changes

kcz358 merged commit b68fa0b into main Apr 19, 2025
1 of 2 checks passed

kcz358 deleted the fix/models branch April 19, 2025 05:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Fix] Aria and LLama Vision and OpenAI compatible models #641

[Fix] Aria and LLama Vision and OpenAI compatible models #641

Uh oh!

Luodian commented Apr 18, 2025

Uh oh!

Luodian commented Apr 18, 2025 •

edited

Loading

Uh oh!

kcz358 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Fix] Aria and LLama Vision and OpenAI compatible models #641

[Fix] Aria and LLama Vision and OpenAI compatible models #641

Uh oh!

Conversation

Luodian commented Apr 18, 2025

When you open a pull-request, please be sure to include the following

Uh oh!

Luodian commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kcz358 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Luodian commented Apr 18, 2025 •

edited

Loading