Text-to-video lipsync. Upload a 2–10 second focal video and provide a script. Kling synthesizes a matching voiceover and animates the subject's lips and micro-expressions to match the dialogue.
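The input constraints above can be checked before a job is submitted. The following is a minimal sketch, not part of any Kling SDK: the function name `validate_lipsync_inputs` is hypothetical, and treating the 2–10 second range as inclusive is an assumption.

```python
def validate_lipsync_inputs(duration_seconds: float, script: str) -> list[str]:
    """Return a list of problems; an empty list means the inputs look usable."""
    problems = []
    # The page states a 2-10 second source clip; inclusive bounds are an assumption.
    if not (2.0 <= duration_seconds <= 10.0):
        problems.append(f"clip is {duration_seconds:.1f}s; expected 2-10s")
    # A script is required because the voiceover is synthesized from it.
    if not script.strip():
        problems.append("script is empty")
    return problems


if __name__ == "__main__":
    print(validate_lipsync_inputs(6.5, "Hello there!"))
    print(validate_lipsync_inputs(12.0, "   "))
```

Returning a list of problems rather than raising on the first one lets a caller report every issue with an upload at once.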
Approx. Price: $0.600 per video
Model Type: video-to-video
Preview Examples: 2
Generation controls available for this model.
Resolution/Aspect Options: N/A
Default Duration: N/A
Tunable Settings: 3
Voice
Default: genshin_klee2
Options (25): Genshin Klee 2, Genshin Vindi 2, Zhinen Xuesheng, AOT +21 more
Voice used for autogenerated narration
Voice Language
Default: en
Options (2): English (en), Chinese (zh)
Match the synthesized narration language to your script
Voice Speed
Default: 1
Options (5): 0.8x, 1.0x, 1.2x, 1.5x +1 more
Playback rate for autogenerated narration (0.8 – 2.0)
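The three tunable settings can be gathered into one validated object before being sent anywhere. This is a sketch under stated assumptions: the class `LipsyncSettings` and the field names `voice`, `voice_language`, and `voice_speed` are hypothetical and may not match the actual request schema. Only the defaults, the two language codes, and the 0.8 – 2.0 speed range come from the listing above. The voice ID is not validated, because only four of the 25 options are shown.

```python
from dataclasses import asdict, dataclass

SUPPORTED_LANGUAGES = {"en", "zh"}   # English and Chinese, as listed above
MIN_SPEED, MAX_SPEED = 0.8, 2.0      # playback-rate range stated above


@dataclass(frozen=True)
class LipsyncSettings:
    """Hypothetical container for the three tunable settings."""
    voice: str = "genshin_klee2"     # documented default voice
    voice_language: str = "en"       # documented default language
    voice_speed: float = 1.0         # documented default speed

    def __post_init__(self) -> None:
        if self.voice_language not in SUPPORTED_LANGUAGES:
            raise ValueError(f"unsupported language: {self.voice_language!r}")
        if not (MIN_SPEED <= self.voice_speed <= MAX_SPEED):
            raise ValueError(
                f"voice_speed {self.voice_speed} is outside {MIN_SPEED}-{MAX_SPEED}"
            )


if __name__ == "__main__":
    settings = LipsyncSettings(voice_language="zh", voice_speed=1.2)
    print(asdict(settings))
```

Validating in `__post_init__` means an out-of-range speed or an unlisted language fails when the settings are built, rather than after a paid generation request has been made.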
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.
Artificial Analysis API
Text-to-video with auto-generated voice