Pick a design to see how the ReDesign agent parsed it.
When the design file is lost, only pixels remain.
You can't move an element, recolor a shape, fix a typo, or reflow the layout.
A vision-language controller rebuilds the layer hierarchy as a tree, coarse to fine,
verifying every step until each element is atomic.
Text layers with real fonts, vector shapes with fills, grouped elements with z-order,
recovered from a single image, ready to re-edit.
ReDesign recovers a fully editable design hierarchy from a single image.
A vision-language controller grows a layer tree breadth-first and a verifier checks every step,
producing semantic text layers with real fonts, vector shapes with fills, grouped elements with z-order, and isolated elements that survive real edits without disturbing the rest of the design.
Starting from the whole image as the root, a VLM controller expands an editable layer hierarchy
by selecting and composing specialized tool actions across modalities,
checking the validity of each editable element, and repeating until every element is atomic and editable.
Figma-909 benchmark; 909 real-world designs. ReDesign wins on every metric.
| Method | L1 ↓ | PSNR ↑ | LPIPS ↓ | PQ ↑ | F1 ↑ |
|---|---|---|---|---|---|
| VTracer | 0.0977 | 20.487 | 0.1917 | 24.64 | 0.309 |
| LayerD | 0.0704 | 16.141 | 0.3381 | 30.09 | 0.350 |
| Qwen-Image-Layered | 0.0493 | 26.192 | 0.1073 | 35.37 | 0.429 |
| Tool Agent | 0.0493 | 13.923 | 0.3869 | 45.33 | 0.527 |
| Ours | 0.0431 | 26.286 | 0.0883 | 45.37 | 0.535 |
Six atomic edits on predicted designs; measured against ground truth. ReDesign dominates on all axes.
ReDesign applies each edit cleanly to the correct element. Baselines move wrong regions or corrupt the layout.
Generative editors (Nano Banana 2) edit pixel-space: instructions leak, layouts blur, precision fails. ReDesign exposes discrete layers with explicit position and rotation; edits execute exactly.
All 909 frames redistributed under CC BY 4.0 with full attribution. Download on HuggingFace.