RenderFormer

Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

SIGGRAPH 2025

Chong Zeng¹,² Yue Dong² Pieter Peers³ Hongzhi Wu¹ Xin Tong²

¹State Key Lab of CAD and CG, Zhejiang University

²Microsoft Research Asia

³College of William & Mary


Introduction

We present RenderFormer, a neural rendering pipeline that directly renders an image from a triangle-based representation of a scene with full global illumination effects and that does not require per-scene training or fine-tuning.

Mesh to Image, End to End

Instead of taking a physics-centric approach to rendering, we formulate rendering as a sequence-to-sequence transformation where a sequence of tokens representing triangles with reflectance properties is converted to a sequence of output tokens representing small patches of pixels.
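As a rough illustration of this sequence-to-sequence framing, the sketch below builds one input token per triangle and counts one output token per pixel patch. The `Triangle` fields, the 13-value feature layout, and the 8x8 patch size are hypothetical choices made for this example, not the encoding described in the paper.

```python
from dataclasses import dataclass
from typing import List, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class Triangle:
    """Hypothetical scene primitive: geometry plus simple reflectance values."""
    vertices: Tuple[Vec3, Vec3, Vec3]
    diffuse: Vec3
    roughness: float


def triangle_to_token(tri: Triangle) -> List[float]:
    """Flatten one triangle into a raw feature vector.

    Assumed layout: 9 vertex coordinates, 3 diffuse values, 1 roughness value.
    """
    coords = [c for vertex in tri.vertices for c in vertex]
    return coords + list(tri.diffuse) + [tri.roughness]


def patch_grid(height: int, width: int, patch: int) -> Tuple[int, int]:
    """Return (rows, cols) of the patch grid that tiles the image exactly."""
    if height % patch or width % patch:
        raise ValueError("image size must be a multiple of the patch size")
    return height // patch, width // patch


# A two-triangle quad as the input sequence.
scene = [
    Triangle(((0, 0, 0), (1, 0, 0), (0, 1, 0)), (0.8, 0.2, 0.2), 0.5),
    Triangle(((1, 0, 0), (1, 1, 0), (0, 1, 0)), (0.8, 0.2, 0.2), 0.5),
]
input_tokens = [triangle_to_token(t) for t in scene]

# A 64x64 image split into 8x8 patches gives the output sequence length.
rows, cols = patch_grid(64, 64, 8)
num_output_tokens = rows * cols
```

In this sketch the input sequence length tracks the triangle count and the output sequence length tracks the image resolution, which is the sense in which rendering becomes a mapping between two token sequences.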

Simple Transformer Architecture with Minimal Prior Constraints

RenderFormer follows a two-stage pipeline: a view-independent stage that models triangle-to-triangle light transport, and a view-dependent stage that transforms a token representing a bundle of rays into the corresponding pixel values, guided by the triangle sequence from the view-independent stage. Both stages are based on the transformer architecture and are learned with minimal prior constraints. No rasterization, no ray tracing.
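The data flow of the two stages can be sketched with plain attention. This is a minimal illustration under strong assumptions: it uses a single untrained attention step per stage with no learned projections, and the function names `view_independent_stage` and `view_dependent_stage` are hypothetical, not taken from the released code.

```python
import math
from typing import List

Vector = List[float]


def softmax(xs: Vector) -> Vector:
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries: List[Vector], keys: List[Vector], values: List[Vector]) -> List[Vector]:
    """Scaled dot-product attention without learned projections."""
    scale = 1.0 / math.sqrt(len(keys[0]))
    dim = len(values[0])
    out = []
    for q in queries:
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)])
    return out


def view_independent_stage(triangle_tokens: List[Vector]) -> List[Vector]:
    """Self-attention: every triangle token attends to every triangle token."""
    return attend(triangle_tokens, triangle_tokens, triangle_tokens)


def view_dependent_stage(ray_bundle_tokens: List[Vector],
                         triangle_features: List[Vector]) -> List[Vector]:
    """Cross-attention: each ray-bundle token queries the triangle features."""
    return attend(ray_bundle_tokens, triangle_features, triangle_features)


# Toy inputs: three triangle tokens and two ray-bundle tokens, all 4-dimensional.
triangle_tokens = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
ray_bundle_tokens = [[0.5, 0.5, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]

scene_features = view_independent_stage(triangle_tokens)
patch_features = view_dependent_stage(ray_bundle_tokens, scene_features)
```

In this sketch `scene_features` does not depend on the ray-bundle tokens, so it could be computed once and queried again for a different camera. A real model would add learned projections, multiple layers, and a decoder from each output token to patch pixels.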

Rendering Gallery

Examples of scenes rendered with RenderFormer demonstrating various lighting conditions, materials, and geometric complexity, without any per-scene training or fine-tuning. Check out the reference images for more details.

Cornell Box

Cornell University Program of Computer Graphics

Stanford Bunny in Cornell Box

Stanford University Computer Graphics Laboratory

Lucy Statue

Stanford University Computer Graphics Laboratory

Utah Teapot

University of Utah, Utah Model Repository

Composed Scene

Fausto Javier Da Rosa and Keenan Crane

Constant Width Bodies

Crystals

Fox in the Wild

Horse and Heart

RenderFormer Logo

Interior Room

Shader Ball

Tree

Veach MIS

Videos

Check out extra video results including uncompressed videos and some reference videos.

Teaser Scenes

Dynamic demonstrations of RenderFormer's capabilities, showing object rotations, lighting changes, and material adjustments.

Cornell Box Roughness Adjustment

Bunny Roughness Adjustment

Tree Light Change

Tree Object Rotation

Fancy Scene Rotation

Composed Scene View Change

Animations

RenderFormer can render animations of scenes.

Cascade Cube Animation

Tycho Magnetic Anomaly

Animated Crab

Bohdan Lvov

Gyroscope Motion

reddification

Animated Character

mortaleiros

Marching Cubes Animation

Tycho Magnetic Anomaly

Robot Animation

Gouhadouken

Physically Based Simulations

RenderFormer can render physically simulated scenes with complex dynamics and interactions.

Bowling Ball Physics Simulation

SINOFWRATH

Rotating Box Dynamics

Constant Width Body Simulation

BibTeX

@inproceedings{zeng2025renderformer,
    title      = {RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination},
    author     = {Chong Zeng and Yue Dong and Pieter Peers and Hongzhi Wu and Xin Tong},
    booktitle  = {ACM SIGGRAPH 2025 Conference Papers},
    year       = {2025}
}