Vidu Q3 text-to-video and image-to-video with high visual fidelity, multiple styles, 540p/720p/1080p output, 1-16s duration, and optional audio plus background music.
Added Jan 31, 2026
Approx. Price
$0.350 per video
Model Type
both
Preview Examples
3
Generation controls available for this model.
Resolution/Aspect Options
3
Default Duration
5
16 duration options
Tunable Settings
7
Aspect Ratio (T2V only)
Default
4:3
Options (5)
Landscape (16:9), Standard (4:3), Square (1:1), Portrait (3:4) +1 more
Applies to text-to-video
Background Music
Default
Yes
Add background music
Duration
Default
5
Options (16)
1 second, 2 seconds, 3 seconds, 4 seconds +12 more
Video length in seconds (1-16)
Generate Audio
Default
Yes
Generate synchronized audio
Motion
Default
auto
Options (4)
Auto, Small, Medium, Large
Movement intensity
Resolution
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.
Artificial Analysis APIA retro taxi driving slowly on a rainy New York street at night, neon lights reflecting on the wet pavement, pedestrians with umbrellas walking hurriedly, camera tracking the taxi, cinematic film grain, photorealistic, 8k, moody atmosphere.
Text-to-Video | General style
Gentle 2D slice of life anime. Students laughing and walking down a school hallway filled with sunlight during sunset. Warm color palette, soft watercolor background textures, relaxed atmosphere, expressive character designs.
Text-to-Video | Anime style
Vibrant 3D anime idol concert. A group of female idols dancing and singing on a glittering stage with large LED screens playing graphics. cheering crowd with lightsticks. Flashy visual effects, smooth motion capture animation, bright and colorful stage lighting.
Text-to-Video | Anime style
Default
720p
Options (3)
540p, 720p, 1080p
Output resolution
Style (T2V only)
Default
general
Options (2)
General, Anime
Visual style for text-to-video