Qwen3-VL / Qwen2.5-VL Demo
Space for Qwen2.5-VL-3B and 7B image + text demo.
Generate AI voice response from audio input
Generate audio from text
Transcribe audio or YouTube videos into text
Generate text responses to your queries