Pixverse v5.6 supports text-to-video and image-to-video in one model. Upload an image to animate it, or leave it blank to generate from text. 360p-1080p resolution, 5/8/10s durations, optional audio generation, prompt optimization, and negative prompts.
Added Jan 27, 2026
Approx. Price
$0.350 per video
Model Type
both
Preview Examples
2
Generation controls available for this model.
Resolution/Aspect Options
4
Default Duration
5
3 duration options
Tunable Settings
5
Aspect Ratio (T2V)
Default
16:9
Options (5)
Landscape (16:9), Standard (4:3), Square (1:1), Portrait (3:4) +1 more
Aspect ratio for text-to-video
Duration
Default
5
Options (3)
5 seconds, 8 seconds, 10 seconds
Video length (10s unavailable at 1080p)
Generate Audio
Default
No
Enable audio generation (priced add-on)
Prompt Optimization
Default
auto
Options (3)
Auto, Enabled, Disabled
System-level prompt optimization
Resolution
Default
720p
Options (4)
360p, 540p, 720p, 1080p
Video quality level
Human preference benchmarks sourced from Artificial Analysis.
Text to Video
#13 / 83
ELO
1220.0
Appearances
5,972
95% CI
-9/9
Image to Video
#11 / 76
ELO
1281.0
Appearances
5,314
95% CI
-10/10
Release Date 2026-02 · Matched as PixVerse V5.6
Artificial Analysis APIVertical video. A barista pours latte art in a cozy cafe. Hook (first second): extreme close-up of glossy espresso crema swirling as milk hits the surface. Camera: quick push-in to macro detail, then slow stabilized top-down shot as the rosette pattern forms, smooth motion for 5 seconds. Lighting: warm morning sunlight through window, soft shadows, gentle steam. Style: cinematic food videography, realistic liquids, crisp detail. Constraints: no scene cuts, no text.
Text-to-video
Use the uploaded anime battle mage illustration as the first frame. Keep the character's face, pose, outfit and background consistent. Start with a medium shot, then slowly dolly the camera in toward his outstretched hand as glowing blue energy builds up, swirling around his arm. In the last third of the clip, let the energy burst outward in a shockwave of light and particles, with slight camera shake and motion blur. Style: high-energy anime action, strong contrast, neon blue and purple highlights. Sound: rising synth and orchestral music, electric crackling, a powerful impact sound at the peak of the blast, no dialogue.
Image-to-video