Skywork-R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning
Updated Jun 10, 2025 - Python
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
[CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation".
Sample project for multimodal decision-making and image generation using DeepSeek Janus Pro 7B, with Real-ESRGAN upscaling