Research Snapshot

Visual Frontiers & Generative Intelligence

My research lies at the intersection of Generative Models and Reward Modeling, Autonomous Agents, and Visual Perception. I build generalized frameworks like PosterCraft and PosterOmni to redefine artistic creation, design reward models for human-aligned generation PosterReward, and empower agents with reasoning capabilities for open-world understanding.

Generative model Reward Modeling Vision Agents
11
CVPR / ICCV / ECCV
2
NeurIPS / ICLR / ICML
10
AAAI / IJCAI / ACM MM

Highlight Research

PosterOmni teaser CVPR 2026

PosterOmni — Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback

Sixiang Chen*, Jianyu Lai*, Jialin Gao*, Hengyu Shi*, Zhongying Liu*, Tian Ye, Junfeng Luo, Xiaoming Wei, Lei Zhu✉️

One model for poster creation—unifying local edits and global design for generalized multi-task image/poster-to-poster generation. ✨ Your intelligent assistant for high-quality aesthetic poster creation!

PosterReward CVPR 2026

PosterReward: Unlocking Accurate Evaluation for High-Quality Graphic Design Generation

Jianyu Lai*, Sixiang Chen*, Jialin Gao*, Hengyu Shi, Zhongying Liu, Fuxiang Zhai, Junfeng Luo, Xiaoming Wei, Lujia Wang, Lei Zhu✉️

A comprehensive reward model for design aesthetics and typography, trained on a automated preference dataset to unlock accurate evaluation for high-quality graphic design generation.

PosterCraft ICLR 2026

PosterCraft — Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Sixiang Chen*, Jianyu Lai*, Jialin Gao*, Tian Ye, Haoyu Chen, Hengyu Shi, Shitong Shao, Yunlong Lin, Song Fei, Zhaohu Xing, Yeying Jin, Junfeng Luo, Xiaoming Wei, Lei Zhu✉️

A new framework for "Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework". ✨ From your prompts to high-quality aesthetic posters!

JarvisIR CVPR 2025

JarvisIR — Elevating Autonomous Driving Perception with Intelligent Image Restoration

Yunlong Lin*, Zixu Lin*, Haoyu Chen*, Panwang Pan*, Chenxin Li, Sixiang Chen, Kairun Wen, Yeying Jin, Wenbo Li, Xinghao Ding✉️

A multimodal agent reasons about adverse scenes, calling specialized restoration experts to stabilize perception stacks for autonomous vehicles.

Selected Publications

Feb 2026

[CVPR 2026] PosterOmni — Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback

Sixiang Chen*, Jianyu Lai*, Jialin Gao*, Hengyu Shi*, Zhongying Liu*, Tian Ye, Junfeng Luo, Xiaoming Wei, Lei Zhu

One model for poster creation—unifying local edits and global design for generalized multi-task image/poster-to-poster generation. ✨ Your intelligent assistant for high-quality aesthetic poster creation!

Feb 2026

[CVPR 2026] PosterReward: Unlocking Accurate Evaluation for High-Quality Graphic Design Generation

Jianyu Lai*, Sixiang Chen*, Jialin Gao*, Hengyu Shi, Zhongying Liu, Fuxiang Zhai, Junfeng Luo, Xiaoming Wei, Lujia Wang, Lei Zhu

A comprehensive reward model for design aesthetics and typography, trained on automated preference dataset to unlock accurate evaluation for high-quality graphic design generation.

Jun 2025

[ICLR 2026] PosterCraft — Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Sixiang Chen*, Jianyu Lai*, Jialin Gao*, Tian Ye, Haoyu Chen, Hengyu Shi, Shitong Shao, Yunlong Lin, Song Fei, Zhaohu Xing, Yeying Jin, Junfeng Luo, Xiaoming Wei, Lei Zhu

A new framework for "Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework". ✨ From your prompts to high-quality aesthetic posters!

Jun 2025

[ICCV 2025] GenHaze — One-step Controllable Haze Generation for Real-World Dehazing

Sixiang Chen, Tian Ye, Yunlong Lin, Yeying Jin, Yijun Yang, Haoyu Chen, Jianyu Lai, Song Fei, Zhaohu Xing, Fugee Tsung, Lei Zhu

Proposes a one-step, reference-controllable haze generator that better matches real-world haze complexity than classic physics pipelines, creating high-quality paired data to boost real-world dehazing performance.

Apr 2025

[Tech Report 2025] An Empirical Study of GPT-4o Image Generation Capabilities

Sixiang Chen*, Jinbin Bai*, Zhuoran Zhao*, Tian Ye*, Qingyu Shi, Donghao Zhou, Wenhao Chai, Xin Lin, Jianzong Wu, Chao Tang, Shilin Xu, Tao Zhang, Haobo Yuan, Yikang Zhou, Wei Chow, Linfeng Li, Xiangtai Li, Lei Zhu, Lu Qi

Provides a systematic benchmark study of GPT-4o image generation across 20+ tasks (text-to-image, image-to-image, image-to-3D, image-to-X), summarizing strengths/limitations and what they imply for unified multimodal generation.

Feb 2025

[CVPR 2025] JarvisIR — Elevating Autonomous Driving Perception with Intelligent Image Restoration

Yunlong Lin*, Zixu Lin*, Haoyu Chen*, Panwang Pan*, Chenxin Li, Sixiang Chen, Kairun Wen, Yeying Jin, Wenbo Li, Xinghao Ding

A multimodal agent reasons about adverse scenes, calling specialized restoration experts to stabilize perception stacks for autonomous vehicles.

Feb 2025

[CVPR 2025] SnowMaster — Comprehensive Real-world Image Desnowing via MLLM with Multi-Model Feedback Optimization

Jianyu Lai*, Sixiang Chen*, Yunlong Lin, Tian Ye, Yun Liu, Song Fei, Zhaohu Xing, Hongtao Wu, Wei Wang, Lei Zhu

Uses an MMPO/DPO-enhanced MLLM as an evaluator to rank/filter pseudo-labels, enabling semi-supervised training that improves real-world desnowing without relying on dense paired GT.

Feb 2025

[CVPR 2025] Detect Any Mirrors — Boosting Learning Reliability with an Iterative Data Engine

Zhaohu Xing, Lihao Liu, Yijun Yang, Hongqiu Wang, Tian Ye, Sixiang Chen, Wenxue Li, Guang Liu, Lei Zhu

Builds an iterative data engine to harvest large-scale unlabeled data and progressively select high-reliability pseudo labels, improving mirror detection robustness and generalization in diverse scenes.

Jan 2025

[AAAI 2025] PromptHaze — Prompting Real-world Dehazing via Depth Anything

Tian Ye, Sixiang Chen, Haoyu Chen, Wenhao Chai, Jingjing Ren, Zhaohu Xing, Wenxue Li, Lei Zhu

Introduces a depth-prompting paradigm: leverages Depth Anything’s stable depth priors as prompts to guide dehazing, aiming for plug-and-play real-world dehazing under complex haze.

Jan 2025

[AAAI 2025] AGLLDiff — Guiding Diffusion Models Towards Unsupervised Training-Free Real-World Low-Light Image Enhancement

Yunlong Lin*, Tian Ye*, Sixiang Chen*, Zhenqi Fu, Yingying Wang, Wenhao Chai, Zhaohu Xing, Lei Zhu, Xinghao Ding

Proposes a training-free, unsupervised diffusion guidance framework that steers a pretrained diffusion model using attribute-based guidance for effective real-world low-light enhancement.

Jul 2024

[NeurIPS 2024] RestoreAgent — Autonomous Image Restoration Agent via Multimodal Large Language Models

Haoyu Chen, Wenbo Li, Jinjin Gu, Jingjing Ren, Sixiang Chen, Tian Ye, Renjing Pei, Kaiwen Zhou, Fenglong Song, Lei Zhu

Proposes an MLLM-driven agent that diagnoses degradations and plans a task sequence + selects expert models from a tool/model pool to restore images.

Jun 2024

[ECCV 2024] Teaching Tailored to Talent — Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint

Sixiang Chen, Tian Ye, Kai Zhang, Zhaohu Xing, Yunlong Lin, Lei Zhu

Using a prompt pool to compose weather-specific prompts on the fly, plus Depth-Anything–constrained scene prompts to stabilize background reconstruction under unseen adverse-weather combinations.

Jun 2024

[CVPR 2024 Highlight] Learning Diffusion Texture Priors for Image Restoration

Tian Ye, Sixiang Chen, Wenhao Chai, Zhaohu Xing, Jing Qin, Ge Lin, Lei Zhu

Presents DTPM (Diffusion Texture Prior Model) that explicitly models high-quality texture priors to preserve fine details and structure when applying diffusion to restoration.

Oct 2023

[ICCV 2023] Adverse Weather Removal with Codebook Priors

Tian Ye*, Sixiang Chen*, Jinbin Bai*, Shi Jun, Chenghao Xue, Jingjia Jiang, Junjie Yin, Erkang Chen, Yun Liu

Formulates adverse weather removal as matching and fusing degraded features with high-quality priors stored in a learned codebook.

Oct 2023

[ICCV 2023] Sparse Sampling Transformer with Uncertainty-Driven Ranking for Unified Removal of Raindrops and Rain Streaks

Sixiang Chen*, Tian Ye*, Jinbin Bai, Jun Shi, Erkang Chen, Lei Zhu

A transformer that uses sparse sampling attention to model global rain degradation relations and an uncertainty-driven ranking strategy to focus on hard-to-restore regions for unified deraining.

Oct 2023

[ACM MM 2023] Uncertainty-Driven Dynamic Degradation Perceiving and Background Modeling for Efficient Single Image Desnowing

Sixiang Chen, Tian Ye, Chenghao Xue, Haoyu Chen, Yun Liu, Erkang Chen, Lei Zhu

Proposes an efficient single image desnowing framework by modeling uncertainty in degradation perception and background reconstruction.

Oct 2022

[ECCV 2022 Oral] Perceiving and Modeling Density for Image Dehazing

Tian Ye*, Yunchen Zhang*, Mingchao Jiang*, Liang Chen, Yun Liu, Sixiang Chen, Erkang Chen

Argues that dehazing hinges on haze density perception, introducing Separable Hybrid Attention and density modeling to handle uneven haze distribution.