AutoModel cant load Qwen/Qwen2.5-0mni-7B #37794

liwenju0 · 2025-04-25T16:27:11Z

System Info

from transformers import AutoModel
model = AutoModel.from pretrained("Qwen/Qwen2.5-0mni-7B",torch dtype="auto"trust remote code=True)

will raise error：

Who can help?

No response

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction

from transformers import AutoModel
model = AutoModel.from pretrained("Qwen/Qwen2.5-0mni-7B",torch dtype="auto"trust remote code=True)

Expected behavior

load sucessfully

The text was updated successfully, but these errors were encountered:

jiangyukunok · 2025-04-26T03:52:45Z

@liwenju0
My understanding is AutoModel is designed for simple single-modality models, while Qwen is multi-modal and could make it hard for AutoModal to figure out how to assemble encoders/decoders.

You can workaround using the customized APIs: https://github.com/huggingface/transformers/blob/main/docs/source/en/model_doc/qwen2_5_omni.md

zucchini-nlp · 2025-04-27T18:07:17Z

@liwenju0 for multimodal models we didn't have a base model in most cases because the models are usually a composition of LM and encoders. Thus the straightforward way was to compose a generative model only. I am currently adding a base model for multimodals in #37033 but it might not be usable with official checkopints

If you wanted to obtain last hidden state from Qwen-Omni, you can still load the ConditionalGeneration and get model outputs with out.hidden_states[-1]

liwenju0 added the bug label Apr 25, 2025

liwenju0 mentioned this issue Apr 25, 2025

fix qwen2.5-omini cant be loaded from AutoModel #37795

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AutoModel cant load Qwen/Qwen2.5-0mni-7B #37794

AutoModel cant load Qwen/Qwen2.5-0mni-7B #37794

liwenju0 commented Apr 25, 2025

jiangyukunok commented Apr 26, 2025

zucchini-nlp commented Apr 27, 2025

AutoModel cant load Qwen/Qwen2.5-0mni-7B #37794

AutoModel cant load Qwen/Qwen2.5-0mni-7B #37794

Comments

liwenju0 commented Apr 25, 2025

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

jiangyukunok commented Apr 26, 2025

zucchini-nlp commented Apr 27, 2025