Turns a single portrait and one audio track into a realistic talking avatar with accurate lip sync, expressive facial motion, and consistent identity. Optional prompt can guide mood or energy.
Added Dec 7, 2025
Approx. Price
$0.280 per video
Model Type
image-to-video
Preview Examples
3
Generation controls available for this model.
Resolution/Aspect Options
N/A
Default Duration
N/A
Tunable Settings
0
No configurable settings are exposed for this model yet.
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.
Artificial Analysis APIGallery example from Kling V2 Avatar Standard.
Gallery example from Kling V2 Avatar Standard.
Gallery example from Kling V2 Avatar Standard.