WAN 2.7 image model for text-to-image and multi-image editing. Supports prompt-driven generation and up to 4 reference images in NanoGPT.
Added Apr 1, 2026
Approx. Price
$0.030 per image
Model Type
both
Preview Examples
6
Generation controls available for this model.
Images Per Run
Up to 4
Resolution Options
6
1024*1024 (Square (1:1)), 1280*720 (Landscape (16:9)), 720*1280 (Portrait (9:16)) +3 more
Tunable Settings
5
Negative Prompt
Default
N/A
Details you don't want in the image.
Number of Images
Default
1
Prompt Expansion
Default
No
Let Alibaba expand the prompt before generation.
Resolution
Default
1024*1024
Options (6)
1024*1024 (Square (1:1)), 1280*720 (Landscape (16:9)), 720*1280 (Portrait (9:16)), 1536*1024 (Landscape (3:2)) +2 more
Watermark
Default
No
Add an AI generation watermark to the output image.
Human preference benchmarks sourced from LMArena.
Text to Image
#32 / 57
Arena Score
1099.4
Votes
28,446
Confidence Interval
1094.3 - 1104.4
Image Edit
#12 / 45
Arena Score
1303.2
Votes
40,080
Confidence Interval
1299.3 - 1307.2
Published 2026-04-23 · Matched as wan2.7-image
LMArena Dataset
a group of animals standing in line to buy coffee, side view, anthropomorphic animals, a dog, a cat, a raccoon and a rabbit waiting in a queue, holding coffee cups, modern coffee shop counter, barista in background, casual daily scene, natural behavior, soft morning light, realistic environment, cinematic composition, 35mm photography, shallow depth of field, warm tones, high detail, ultra realistic
Text-to-image • Example

Close-up portrait of a model whose face is partially covered in flowing liquid metal or an iridescent, second-skin-like substance. She has otherworldly, light purple eyes and stares directly into the camera. The background is completely blurred out, leaving only a soft halo of light. The lighting is even and ethereal, as if from a bioluminescent source. Inspired by the style of Nick Knight, the image emphasizes surreal textures and subtle color gradients, exceptionally sharp, with breathtaking detail, 16K.
Text-to-image • Example

Restore the old car into a clean cinematic night scene with a teal vintage SUV, palm-lined boulevard, bright moonlight, realistic reflections, and a crisp wide-angle street-photography look.
Image-edit • Example
Close-up portrait of a woman pressing her palm against a rain-soaked window at night, shot from outside. Heavy rain streaks distort her face through the glass, neon signs from the street below cast fragmented magenta and cyan reflections across her skin. Camera slowly pushes in at 0.3x speed. Shallow depth of field, bokeh rain drops, dual-exposure ghosting of her reflection overlapping her face. Cinematic 4K, anamorphic lens flare, 24fps.
Text-to-video • Example
Extreme close-up portrait, woman's face in profile against golden hour sun, shot with 85mm f/1.2. Sun positioned directly behind her head creating a blazing rim light halo. Individual strands of hair catch the light and glow translucent amber. A gentle breeze causes hair to drift across her cheek in slow motion. Skin subsurface scattering visible on ear and nose tip. Dust particles float through the backlit air. Hyper-realistic, 120fps slow motion playback at 24fps, anamorphic 2.39:1.
Text-to-video • Example
Portrait of a woman aging from 20 to 80 and back to 20, seamless morphing over 10 seconds. Shot in style of 1960s 16mm film with authentic gate weave, film grain, occasional splice marks and light leaks. The background remains constant - a sun-drenched window in a Paris apartment - while only her face transforms. Color grade shifts from saturated Kodachrome warmth in youth to desaturated, slightly faded tones in age, then back. No CGI aesthetic - must feel like archival film footage.
Text-to-video • Example