LLama 3.2-vision without second stage OCR #95
-
Hi! The service is not performing the second stage of OCR. We have two phases (or stages as you referred to):
In the near future, we will create documentation that gives an architectural overview of the entire process. :) But please clarify what you mean by "Python wrapper currently does not allow for this."
-
I think you mean that you can't set up a dynamic prompt for Ollama-based models. If you could, you could potentially do the extraction and remodeling in a single step, right?
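To make the single-step idea concrete, here is a rough sketch that talks to Ollama directly, not through this project's Python wrapper. It assumes a local Ollama server on its default port with the `llama3.2-vision` model pulled; the function names and the example prompt are hypothetical and only illustrate passing a custom prompt together with the image in one request.

```python
import base64
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_single_step_payload(image_bytes: bytes, prompt: str,
                              model: str = "llama3.2-vision") -> dict:
    """Build one request that carries both the image and a custom prompt."""
    return {
        "model": model,
        "prompt": prompt,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        # Ask for a single JSON response instead of a token stream.
        "stream": False,
    }


def extract_in_one_step(image_bytes: bytes, prompt: str,
                        url: str = OLLAMA_URL) -> str:
    """Send the image and prompt to Ollama and return the generated text."""
    payload = build_single_step_payload(image_bytes, prompt)
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

With a server running, something like `extract_in_one_step(open("invoice.png", "rb").read(), "Extract the text and return it as JSON with the fields 'vendor' and 'total'.")` would do the extraction and the restructuring in one call. The file name and field names there are made up for illustration.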
-
Hi,
Thank you for providing this service. I see that LLaMA 3.2-vision is already quite capable and might not need a second-stage OCR pass. To my knowledge, the Python wrapper currently does not allow for this.
Am I doing something wrong?