Ji Xie PRO

sanaka87

https://horizonwind2004.github.io/

AI & ML interests

Image Generation

Recent Activity

upvoted a paper 10 days ago

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

upvoted a paper 10 days ago

GIR-Bench: Versatile Benchmark for Generating Images with Reasoning

updated a Space 20 days ago

sanaka87/undefined

Organizations

None yet

Posts 4

Post

3088

Excited to share our new work on Unified Multimodal Models (UMMs): Reconstruction Alignment (RecA)! 🚀 Just 6 × 80GB A100s × 4.5 hours is enough to boost BAGEL's performance across all tasks, and it outperforms FLUX-Kontext in image editing!

📄 Paper: https://alphaxiv.org/abs/2509.07295
💻 Code: https://github.com/HorizonWind2004/reconstruction-alignment
🤗 HF Models: sanaka87/reca-68ad2176380355a3dcedc068
✍️ DEMO: sanaka87/BAGEL-RecA
🌐 Project Page: https://reconstruction-alignment.github.io
🔥 X: https://x.com/XDWang101/status/1965908302581420204
📰 Zhihu: https://zhuanlan.zhihu.com/p/1947584568187159814
🤗 HF Daily Paper: Reconstruction Alignment Improves Unified Multimodal Models (2509.07295)

⚡ Fewer than 10k images and 27 GPU hours, with no architecture changes → SOTA, surpassing much larger open-source and private models:

📊 GenEval: 0.73 → 0.90 | 📊 DPGBench: 80.93 → 88.15
🖼️ ImgEdit: 3.38 → 3.75 | 🖌️ GEdit: 6.94 → 7.25

✅ RecA trains UMMs to reconstruct images from their own visual understanding encoder embeddings → big gains in image generation 🎨 & editing ✂️.
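
As a quick sanity check on the numbers quoted in this post, the short Python sketch below recomputes the GPU-hour budget (6 GPUs × 4.5 hours) and the before/after change on each benchmark. All figures are copied from the post itself; the helper name `relative_gain` is only illustrative, and because the benchmarks are reported on different scales, the relative change is a rough comparison at best.

```python
# Figures quoted in the post; nothing here is independently measured.
NUM_GPUS = 6          # 80GB A100s
HOURS_PER_GPU = 4.5   # wall-clock training time

gpu_hours = NUM_GPUS * HOURS_PER_GPU  # total compute budget in GPU hours

# Benchmark name -> (score before RecA, score after RecA), as quoted.
scores = {
    "GenEval": (0.73, 0.90),
    "DPGBench": (80.93, 88.15),
    "ImgEdit": (3.38, 3.75),
    "GEdit": (6.94, 7.25),
}


def relative_gain(before: float, after: float) -> float:
    """Return the improvement as a percentage of the starting score."""
    return (after - before) / before * 100


print(f"GPU hours: {gpu_hours}")
for name, (before, after) in scores.items():
    delta = after - before
    print(f"{name}: {before} -> {after} "
          f"(+{delta:.2f} absolute, {relative_gain(before, after):.1f}% relative)")
```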

Post

3086

The video for our ICEdit project is below.
🔥 Hugging Face Demo: RiverZ/ICEdit
🌐 Project Website: https://river-zhang.github.io/ICEdit-gh-pages/
🏠 GitHub Repository: https://github.com/River-Zhang/ICEdit/blob/main/scripts/gradio_demo.py
🤗 Hugging Face: sanaka87/ICEdit-MoE-LoRA
📄 arXiv Paper: In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer (2504.20690)

Collections 1

Papers 4

arxiv:2509.07295

arxiv:2504.20690

arxiv:2501.05131

arxiv:2410.12669

Spaces 3

undefined

Explore Ji Xie's AI research portfolio

Running on Zero

BAGEL

Demo for BAGEL

reca-page

Models 9

sanaka87/BAGEL-RecA

Any-to-Any • Updated Sep 14 • 45 • 25

sanaka87/OpenUni-RecA

Any-to-Any • Updated Sep 11 • 7 • 1

sanaka87/Show-o-512x512-RecA

Any-to-Any • Updated Sep 11 • 5 • 2

sanaka87/Harmon-1.5B-RecA

Any-to-Any • Updated Sep 11 • 7 • 2

sanaka87/Harmon-1.5B-RecA-plus

Text-to-Image • Updated Sep 11 • 18 • 3

sanaka87/Show-o-RecA

Text-to-Image • Updated Sep 11 • 3 • 3

sanaka87/Harmon-0.5B-RecA

Text-to-Image • Updated Sep 11 • 15 • 4

sanaka87/ICEdit-MoE-LoRA

Image-to-Image • Updated May 2 • 374 • 118

sanaka87/3DIS

Text-to-Image • Updated Feb 11 • 7

Datasets 1

sanaka87/LLaVA-Instruct-150K

Preview • Updated Sep 9 • 82