Pose · Hand · Face Presets — Hands Strong, Faces Medium, Body Soft (CloudDock)

The core idea (why most ControlNet stacks look stiff)

When people first discover ControlNet, they crank everything to max: high weight, full steps, multiple strong controls. The result is “perfect structure” — but also:

stiff limbs
dead mannequin posture
robotic hands
faces that look pasted on

For anime-style work, the best results usually come from a hierarchy:

Hands strong, faces medium, body soft.
Let pose guide the body gently, let face guide identity moderately, and let hands lock down anatomy where the model tends to fail.

When to use these presets

You have a pose reference but want style freedom
You want better hands without over-controlling the whole image
You want faces consistent while still allowing expression
You’re on smaller VRAM and need a predictable stack

Preset stack overview

The most common and useful trio:

Pose (OpenPose full body) — gentle global guide
Hands (OpenPose hands / hand-focused) — strong correction
Face (OpenPose face / face detail) — medium identity anchor

Diagram showing ControlNet stack: pose soft, face medium, hand strong

A good stack is not “everything strong.” It’s priorities.

Preset: Standard anime stack

This is the default we recommend for anime-style outputs. It keeps motion and mood while preventing the most common anatomy failures.

Pose (body soft)

Weight: 0.35–0.55
Start / End: 0.00 → 0.70
Goal: hint the silhouette, don’t freeze the character

Face (medium)

Weight: 0.55–0.75
Start / End: 0.10 → 0.85
Goal: keep identity & proportions, allow expression

Hands (strong)

Weight: 0.80–1.10
Start / End: 0.00 → 0.95
Goal: stop finger meltdown and “glove hands”

ControlNet panel showing three slots configured for pose, face, and hands with different weights

Three-slot stack: body soft, face medium, hands strong.

Why Start/End steps matter (the secret anti-mannequin tool)

If ControlNet controls the entire sampling process (start 0 → end 1), you get stiffness. The model never gets a chance to “draw naturally.”

Ending Pose earlier (end ~ 0.70) is the simplest way to avoid mannequin-fu:

early steps: pose guides structure
late steps: the model adds natural detail and style

Rule of thumb: Pose ends earlier. Hands end later. Face sits in the middle.

VRAM-friendly notes (A-group)

ControlNet costs VRAM. On smaller GPUs, use a “lean stack”:

Keep ControlNet slots to 2–3
Prefer moderate resolutions (don’t start at 2K)
Batch size 1 while using ControlNet
If you also use hires, keep it at 1.3× and low denoise

Common A-group OOM combo: high-res base + multiple ControlNets + hires + batch > 1. If you OOM, reduce base size first.

Quick fixes

“It looks stiff / mannequin-like.”

Lower Pose weight to 0.35–0.45
End Pose earlier (end 0.60–0.70)
Keep hands strong, not pose strong

“Face changes too much.”

Increase Face weight slightly (0.70–0.85)
Start Face a bit later (start 0.10–0.20)
Don’t overload the prompt with new identity words

“Hands still broken.”

Increase Hands weight (0.95–1.15)
Make sure the hand reference is clean (clear fingers, not tiny blur)
Use inpaint targeted repair on hands after generation

A starter prompt that plays nicely with ControlNet

Keep the prompt clean. ControlNet already gives structure.

anime illustration, 1girl, dynamic pose, clean lineart, detailed eyes,
soft lighting, high contrast, sharp focus, masterpiece

Negative starter:

extra fingers, missing fingers, deformed hands, bad anatomy, mannequin, stiff pose,
blurry, lowres, worst quality

What’s next?

Two-stage hires ladder: 1.3× first, low-noise repair second.
Go to “Two-stage Hires →”
ControlNet & Hires Fix: the full non-mushy workflow.
Go to “ControlNet & Hires Fix →”
img2img + Inpaint repair: targeted fixes for face and hands.
Go to “img2img Basics →”

Control the hands. Let the body breathe.

← Back to SD Home