NVIDIA's GPC Teaches Game Characters to Move Like GPT Thinks

NVIDIA AI

4H AGO

2 min read

4 hrs ago

2 min read

Every time a game studio or robotics team needs a character to do something new -- crouch-walk, recover from a shove, sprint up stairs -- they typically train a brand-new controller from scratch. It is expensive, slow, and does not scale. NVIDIA's new paper, presented at SIGGRAPH 2026, asks a deceptively simple question: what if motor control worked more like language modeling?

The problem with one-controller-one-task

Developing controllers capable of completing a wide range of tasks in a natural and life-like manner is a key challenge in enabling practical applications of physics-based character animation. The field has historically solved this by training task-specific controllers -- one for locomotion, one for acrobatics, one for combat -- each requiring its own data pipeline and reinforcement learning run. The result is a fragmented ecosystem of siloed skills that cannot be easily composed or reused.

The deeper issue is that physics-based control is notoriously hard to generalize. Physics constraints are often considered as a black box and non-differentiable, which means the tricks that work for kinematic motion synthesis (generating plausible-looking poses) do not transfer cleanly to controllers that must actually push joints and maintain balance inside a physics simulator.

GPT, but for bodies

GPC introduces Generative Pretrained Controllers, which leverage tokenization and next-token modeling to create general-purpose, reusable generative controllers from large-scale motion datasets. The analogy to GPT is not just marketing. The architecture follows the same pretrain-then-finetune recipe that transformed NLP: learn a rich prior over a huge corpus, then adapt cheaply to downstream tasks.

The pipeline has two main stages:

Build a motion vocabulary. The framework uses end-to-end reinforcement learning to jointly optimize a "motion vocabulary", modeled via Finite Scalar Quantization (FSQ), along with a corresponding control policy that can map the discrete codes to physics-based controls. FSQ is a quantization technique that compresses continuous motion states into a finite set of discrete tokens -- think of it as the motion equivalent of a word-piece tokenizer.

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

Full access to in-depth AI research breakdowns
Be the first to know what's trending before it hits mainstream
Daily curated papers, repos, and industry moves

NVIDIA's GPC Teaches Game Characters to Move Like GPT Thinks

Takeaways

The problem with one-controller-one-task

GPT, but for bodies

Don't miss what's next in AI