Text-to-video and image-to-video with optional end frame control. Native audio generation, cinematic realism, and consistent subjects. Supports 4/6/8 seconds at 720p or 1080p.
Added Oct 11, 2025
Approx. Price
$0.800 per video
Model Type
both
Preview Examples
2
Generation controls available for this model.
Resolution/Aspect Options
2
Default Duration
8
3 duration options
Tunable Settings
4
Aspect Ratio (T2V)
Default
16:9
Options (2)
Landscape (16:9), Portrait (9:16)
Applies to text-to-video
Duration
Default
8
Options (3)
4 seconds, 6 seconds, 8 seconds
4/6/8 seconds
Generate Audio
Default
Yes
Enable native audio generation
Resolution
Default
720p
Options (2)
720p, 1080p
Output resolution
Human preference benchmarks sourced from Artificial Analysis.
Text to Video
#21 / 83
ELO
1207.0
Appearances
5,589
95% CI
-9/9
Image to Video
#29 / 76
ELO
1245.0
Appearances
5,617
95% CI
-10/10
Release Date 2026-01 · Matched as Veo 3.1
Artificial Analysis APITime-lapse of the city transitioning through the day. Shadows shift across buildings, traffic flows in streaks of light, clouds race across the sky. The scene shifts from bright midday to warm golden hour. Smooth hyperlapse feel.
A ballerina performing an elegant pirouette on a moonlit outdoor stage. She wears a flowing white tutu that catches the silver light. Rose petals swirl around her in slow motion. The background is a dark forest with fireflies. Camera orbits slowly around her as she spins. Ethereal, dreamlike, cinematic shallow depth of field.