@liwenju0
My understanding is that AutoModel is designed for simple single-modality models, whereas Qwen is multimodal, which could make it hard for AutoModel to figure out how to assemble the encoders and decoders.
@liwenju0 for multimodal models we didn't have a base model in most cases, because these models are usually a composition of an LM and encoders. The straightforward approach was therefore to provide only the generative model. I am currently adding a base model for multimodal models in #37033, but it might not be usable with the official checkpoints.
If you want the last hidden state from Qwen-Omni, you can still load the ConditionalGeneration model, run it with output_hidden_states=True, and read out.hidden_states[-1].
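A minimal sketch of that workaround, in case it helps. It assumes `model` is an already-loaded generative model and `inputs` is the dict produced by its processor; the helper names `last_hidden_state` and `encode`, and the `StandInModel` class, are hypothetical and only there so the snippet runs without downloading weights. Depending on the model and library version, the forward pass may need to go through a sub-module rather than the top-level object.

```python
from types import SimpleNamespace


def last_hidden_state(outputs):
    """Return the final layer's hidden states from a model output object."""
    hidden_states = getattr(outputs, "hidden_states", None)
    if hidden_states is None:
        raise ValueError(
            "outputs.hidden_states is None; call the model with output_hidden_states=True"
        )
    # hidden_states is a sequence with one entry per layer; the last entry
    # corresponds to the final layer.
    return hidden_states[-1]


def encode(model, inputs):
    """Run a forward pass and return the last layer's hidden states.

    Assumption: `model` accepts the `output_hidden_states` keyword, as
    Hugging Face Transformers models generally do.
    """
    outputs = model(**inputs, output_hidden_states=True)
    return last_hidden_state(outputs)


class StandInModel:
    """Hypothetical stand-in for a real model, used only to exercise the helpers."""

    def __call__(self, output_hidden_states=False, **inputs):
        layers = ("embeddings", "layer_1", "layer_2")
        return SimpleNamespace(hidden_states=layers if output_hidden_states else None)


if __name__ == "__main__":
    print(encode(StandInModel(), {"input_ids": [[1, 2, 3]]}))
```

With a real model, `encode(model, inputs)` would return a tensor instead of the placeholder string used by the stand-in.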
System Info
will raise error:
Who can help?
No response
Reproduction
Expected behavior
The model loads successfully.