Audio-to-video lipsync. Upload a 2–10 second focal video and a clean vocal track (≤5 MB). Kling aligns mouth shapes and facial muscles to the audio while preserving the original footage.
Approx. Price
$0.450 per video
Model Type
video-to-video
Preview Examples
2
Generation controls available for this model.
Resolution/Aspect Options
N/A
Default Duration
N/A
Tunable Settings
0
No configurable settings are exposed for this model yet.
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.
Artificial Analysis API