stepfun-ai/Step1X-Edit
Description: A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.
Language: Python
License: Apache-2.0
Stars: 2224
Forks: 102
Open issues: 32
Created: 2025-04-23T09:53:08Z
Pushed: 2026-04-29T14:31:44Z
Default branch: main
Fork: no
Archived: no
README:
🔥🔥🔥 News!!
- Apr 29, 2026: 🎉 Step Image Edit 2 is now live — a lightweight model designed for ultra-fast response and high-quality output, delivering a real-time interactive creation experience. It can complete image generation and editing tasks within 2 seconds. Feel free to try it out and share your feedback ✨✨✨
Try it here (StepFun Open Platform): https://platform.stepfun.com/docs/zh/guides/models/step-image-edit-2
API documentation: https://platform.stepfun.com/docs/zh/step-plan/integrations/image-api
- Dec 29, 2025: 🎉 RegionE delivers a 2.5× speedup for Step1X-Edit inference with no accuracy degradation, achieved with just five lines of code.
- Nov 26, 2025: 👋 We release Step1X-Edit-v1p2 (referred to as ReasonEdit-S in the paper), a native reasoning edit model with better performance on KRIS-Bench and GEdit-Bench. Technical report can be found here.
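| Models | G_SC ⬆️ | G_PQ ⬆️ | G_O ⬆️ | FK ⬆️ | CK ⬆️ | PK ⬆️ | Overall ⬆️ |
|:------------:|:------------:|:------------:|:------------:|:------------:|:------------:|:------------:|:------------:|
| Flux-Kontext-dev | 7.16 | 7.37 | 6.51 | 53.28 | 50.36 | 42.53 | 49.54 |
| Qwen-Image-Edit-2509 | 8.00 | 7.86 | 7.56 | 61.47 | 56.79 | 47.07 | 56.15 |
| Step1X-Edit v1.1 | 7.66 | 7.35 | 6.97 | 53.05 | 54.34 | 44.66 | 51.59 |
| Step1x-edit-v1p2-preview | 8.14 | 7.55 | 7.42 | 60.49 | 58.81 | 41.77 | 52.51 |
| Step1x-edit-v1p2 (base) | 7.77 | 7.65 | 7.24 | 58.23 | 60.55 | 46.21 | 56.33 |
| Step1x-edit-v1p2 (thinking) | 8.02 | 7.64 | 7.36 | 59.79 | 62.76 | 49.78 | 58.64 |
| Step1x-edit-v1p2 (thinking + reflection) | 8.18 | 7.85 | 7.58 | 62.44 | 65.72 | 50.42 | 60.93 |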
- Sep 08, 2025: 👋 We release step1x-edit-v1p2-preview, a new version of Step1X-Edit with reasoning edit ability and better performance (report to be released soon), featuring:
- Native Reasoning Edit Model: Combines instruction reasoning with reflective correction to handle complex edits more accurately. Performance on KRIS-Bench:
| Models | Factual Knowledge ⬆️ | Conceptual Knowledge ⬆️ | Procedural Knowledge ⬆️ | Overall ⬆️ | |:------------:|:------------:|:------------:| :------------:|:------------:| | Step1X-Edit v1.1 | 53.05 | 54.34 | 44.66 | 51.59 | | Step1x-edit-v1p2-preview | 60.49 | 58.81 | 41.77 | 52.51 | | Step1x-edit-v1p2-preview (thinking) | 62.24 | 62.25 | 44.43 | 55.21| | Step1x-edit-v1p2-preview (thinking + reflection) | 62.94 | 61.82 | 44.08 | 55.64 |
- Improved image editing quality and better instruction-following performance. Performance on GEdit-Bench:
| Models | G_SC ⬆️ | G_PQ ⬆️ | G_O ⬆️ | Q_SC ⬆️ | Q_PQ ⬆️ | Q_O ⬆️ |
|:------------:|:------------:|:------------:|:------------:|:------------:|:------------:|:------------:|
| Step1X-Edit (v1.0) | 7.13 | 7.00 | 6.44 | 7.39 | 7.28 | 7.07 |
| Step1X-Edit (v1.1) | 7.66 | 7.35 | 6.97 | 7.65 | 7.41 | 7.35 |
| Step1x-edit-v1p2-preview | 8.14 | 7.55 | 7.42 | 7.90 | 7.34 | 7.40 |
- Jul 09, 2025: 👋 We’ve updated the step1x-edit model and released it as step1x-edit-v1p1 (for the diffusers version, see here), featuring:
- Added support for text-to-image (T2I) generation tasks
- Improved image editing quality and better instruction-following performance.
Quantitative evaluation on GEdit-Bench-EN (Full set). G_SC, G_PQ, and G_O refer to the metrics evaluated by GPT-4.1, while Q_SC, Q_PQ, and Q_O refer to the metrics evaluated by Qwen2.5-VL-72B. To facilitate reproducibility, we have released the intermediate results of our model evaluations.

| Models | G_SC ⬆️ | G_PQ ⬆️ | G_O ⬆️ | Q_SC ⬆️ | Q_PQ ⬆️ | Q_O ⬆️ |
|:------------:|:------------:|:------------:|:------------:|:------------:|:------------:|:------------:|
| Step1X-Edit (v1.0) | 7.13 | 7.00 | 6.44 | 7.39 | 7.28 | 7.07 |
| Step1X-Edit (v1.1) | 7.66 | 7.35 | 6.97 | 7.65 | 7.41 | 7.35 |
- Jun 17, 2025: 👋 Support for Teacache and parallel inference has been added.
- May 22, 2025: 👋 Step1X-Edit now supports LoRA finetuning on a single 24GB GPU! A hand-fixing LoRA for anime characters has also been released. Download Lora
- Apr 30, 2025: 🎉 Step1X-Edit ComfyUI Plugin is available now, thanks for the community contribution! quank123wip/ComfyUI-Step1X-Edit & raykindle/ComfyUI_Step1X-Edit.
- Apr 27, 2025: 🎉 With community support, we have updated the inference code and model weights of Step1X-Edit-FP8. meimeilook/Step1X-Edit-FP8 & rkfg/Step1X-Edit-FP8.
- Apr 26, 2025: 🎉 Step1X-Edit is now live — you can try editing images directly in the online demo! Online Demo
- Apr 25, 2025: 👋 We release the evaluation code and benchmark data of Step1X-Edit. Download GEdit-Bench
- Apr 25, 2025: 👋 We release the inference code and model weights of Step1X-Edit. ModelScope & HuggingFace models.
- Apr 25, 2025: 👋 We have made our technical report available as open source. Read
📖 Introduction
We introduce a state-of-the-art image editing model, Step1X-Edit, which aims to provide performance comparable to closed-source models such as GPT-4o and Gemini 2 Flash. More specifically, we adopt a multimodal LLM to process the reference image and the user's editing instruction. A latent embedding is extracted and integrated with a diffusion image decoder to obtain the target image. To train the model, we build a data generation pipeline to produce a high-quality dataset. For evaluation, we develop the GEdit-Bench, a…
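The data flow described in the introduction (multimodal LLM, then latent embedding, then diffusion decoder) can be illustrated with a toy sketch. Everything below is an assumption made for illustration: `EditRequest`, `encode_with_mllm`, `decode_with_diffusion`, and `edit_image` are hypothetical names, not part of the Step1X-Edit codebase, and the "denoising" loop is a plain interpolation toward a target rather than a real diffusion sampler. It only shows how the two stages connect.

```python
import hashlib
import random
from dataclasses import dataclass

EMBED_DIM = 8  # toy embedding size, chosen for illustration only


@dataclass
class EditRequest:
    image: list[float]  # flattened toy "image", values in [0, 1]
    instruction: str    # the user's editing instruction


def encode_with_mllm(request: EditRequest) -> list[float]:
    """Hypothetical stand-in for the multimodal LLM: map the reference image
    and the instruction to one fixed-size latent embedding."""
    digest = hashlib.sha256(request.instruction.encode("utf-8")).digest()
    text_part = [b / 255.0 for b in digest[:EMBED_DIM]]
    mean_pixel = sum(request.image) / len(request.image)
    # Mix an image statistic into the text features so both inputs matter.
    return [0.5 * t + 0.5 * mean_pixel for t in text_part]


def decode_with_diffusion(embedding: list[float], size: int,
                          steps: int = 10, seed: int = 0) -> list[float]:
    """Hypothetical stand-in for the diffusion decoder: start from noise and
    move step by step toward a target determined by the embedding."""
    rng = random.Random(seed)
    x = [rng.random() for _ in range(size)]
    target = [embedding[i % len(embedding)] for i in range(size)]
    for step in range(steps):
        # Step size grows to 1.0 on the last step, so x ends at the target
        # (up to floating-point rounding).
        alpha = 1.0 / (steps - step)
        x = [xi + alpha * (ti - xi) for xi, ti in zip(x, target)]
    return x


def edit_image(request: EditRequest, steps: int = 10) -> list[float]:
    """Chain the two toy stages: encode the request, then decode an image."""
    embedding = encode_with_mllm(request)
    return decode_with_diffusion(embedding, size=len(request.image), steps=steps)


if __name__ == "__main__":
    req = EditRequest(image=[0.2, 0.4, 0.6, 0.8], instruction="make the sky purple")
    print(edit_image(req))
```

In the real model, the conditioning signal would come from a trained multimodal LLM and the decoder would be a trained diffusion network. The sketch keeps only the interface between the two stages.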
Notability
notability 5.0/10. New repo with solid traction.