Native 4K Kling O3 model for text-to-video, image-to-video, and reference-to-video generation. Supports start/end frame control, up to 7 reference images, and 3-15 second durations.
Added Apr 23, 2026
Approx. Price: $2.10 per video
Model Type: both
Preview Examples: 3
Generation controls available for this model.
Resolution/Aspect Options: 3
Default Duration: 5 seconds (13 duration options)
Tunable Settings: 3
Aspect Ratio: output aspect ratio
Default: 16:9
Options (3): Landscape (16:9), Portrait (9:16), Square (1:1)
Duration: video duration in seconds
Default: 5
Options (13): 3 seconds, 4 seconds, 5 seconds, 6 seconds +9 more
Generate Audio: enable native audio generation
Default: No
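The three tunable settings above can be checked before a request is sent. The following is a minimal sketch, not the provider's schema: the field names `aspect_ratio`, `duration`, and `generate_audio` are hypothetical, and it assumes the 13 duration options are the whole seconds from 3 through 15, which matches the 3-15 second range in the description but is not spelled out in the options list.

```python
# Allowed values taken from the settings listed above.
ASPECT_RATIOS = ("16:9", "9:16", "1:1")

# Assumption: the 13 duration options are the whole seconds 3 through 15.
DURATIONS = tuple(range(3, 16))


def build_settings(aspect_ratio="16:9", duration=5, generate_audio=False):
    """Return a settings dict, raising ValueError for unsupported values.

    The dictionary keys are hypothetical names chosen for illustration.
    Defaults mirror the card: 16:9, 5 seconds, audio off.
    """
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {ASPECT_RATIOS}")
    if not isinstance(duration, int) or duration not in DURATIONS:
        raise ValueError("duration must be a whole number of seconds, 3 to 15")
    return {
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "generate_audio": bool(generate_audio),
    }
```

Calling `build_settings()` with no arguments reproduces the listed defaults, while `build_settings(duration=16)` raises `ValueError`.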
Human preference benchmarks sourced from Artificial Analysis.
Text to Video: rank #8 / 83, ELO 1224.0, appearances 5,265, 95% CI -9/+9
Image to Video: rank #21 / 76, ELO 1263.0, appearances 4,935, 95% CI -10/+10
Release Date 2026-02 · Matched as Kling 3.0 Omni 720p (Standard)
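To give the ELO figures above some intuition, the sketch below converts a rating gap into an expected head-to-head preference rate. It assumes the conventional Elo logistic curve with a scale of 400; the source does not state how Artificial Analysis computes its ratings, so this is an illustration only. The opponent rating of 1200 is a hypothetical value.

```python
def expected_score(rating_a, rating_b, scale=400.0):
    """Expected share of comparisons won by A under the standard Elo model.

    Assumption: conventional logistic Elo with scale 400; the benchmark's
    actual methodology may differ.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Text-to-video rating from the card versus a hypothetical opponent at 1200.
share = expected_score(1224.0, 1200.0)
print(f"Expected preference rate: {share:.3f}")
```

Under this model, equal ratings give exactly 0.5, and a small gap such as 24 points moves the expected preference rate only slightly above even.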
Artificial Analysis API
Extreme close-up of rocky texture that reveals a dragon eye opening, then a rapid cinematic zoom-out to reveal epic scale.
text-to-video | https://fal.ai/models/fal-ai/kling-video/o3/4k/text-to-video
Animate a stylized cyberpunk hero portrait into a short action beat with rain, neon reflections, and fast camera panning.
image-to-video | https://fal.ai/models/fal-ai/kling-video/o3/4k/image-to-video
Use reference style cues to generate a high-speed POV descent with intense motion, snow spray, and dynamic lens artifacts.
reference-to-video | https://fal.ai/models/fal-ai/kling-video/o3/4k/reference-to-video
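The three modes and URLs listed above can be collected into a small lookup. This is a sketch under stated assumptions: the URLs are the model pages exactly as listed, and whether they double as API endpoints is not stated here. The selection rule in `pick_mode` (reference images take priority over a start image) and its parameter names are hypothetical; the limit of 7 reference images comes from the model description.

```python
# Model page URLs exactly as listed on the card.
MODEL_PAGES = {
    "text-to-video": "https://fal.ai/models/fal-ai/kling-video/o3/4k/text-to-video",
    "image-to-video": "https://fal.ai/models/fal-ai/kling-video/o3/4k/image-to-video",
    "reference-to-video": "https://fal.ai/models/fal-ai/kling-video/o3/4k/reference-to-video",
}

# The description states support for up to 7 reference images.
MAX_REFERENCE_IMAGES = 7


def pick_mode(start_image=None, reference_images=()):
    """Choose a generation mode from the inputs supplied.

    Hypothetical rule for illustration: references win over a start image,
    and a prompt alone falls back to text-to-video.
    """
    if len(reference_images) > MAX_REFERENCE_IMAGES:
        raise ValueError(f"at most {MAX_REFERENCE_IMAGES} reference images")
    if reference_images:
        return "reference-to-video"
    if start_image is not None:
        return "image-to-video"
    return "text-to-video"


mode = pick_mode(start_image="hero.png")  # hypothetical file name
print(mode, MODEL_PAGES[mode])
```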