The way AI creative tools have worked until now is simple: you prompt, the model generates, you take it or leave it. OpenArt Worlds is trying to break that loop entirely. Powered by the World Labs API, it lets creators upload a single image and walk into a persistent, navigable 3D environment they can return to, reframe, and build within over time -- more like a virtual film set than a one-shot generation.

From prompt roulette to spatial direction

The core problem OpenArt Worlds is solving is one every AI creator knows: you describe what you want, the model gives you something close, and then you prompt again, and again, hoping to land on the right shot. There is no way to hold the scene still and just move the camera. OpenArt Worlds changes that.

For the first time in AI content creation, creators are no longer at the mercy of what the model decides to generate. OpenArt Worlds lets you build a fully navigable 3D world from a text prompt or image, walk through it freely, and take the shot from exactly the angle and composition you want. Wide establishing shots, top-down views, over-the-shoulder framing -- all from the same environment, without regenerating anything.

The workflow breaks down into three steps:

  1. Build: Upload any image -- a photo, an AI-generated piece, or a concept painting -- and generate a navigable 3D world.
  2. Direct: Move freely through the environment, set camera angles, and frame the exact shot you want.
  3. Capture and Compose: At the moment of capture, creators can add characters, objects, or any creative element via prompt, outputting a production-ready 2D image.

OpenArt Worlds is a persistent AI creation space that converts images or text prompts into explorable 3D scenes. Unlike standard AI image generators that produce one-off outputs, Worlds saves every environment permanently inside your library. Come back next week, add a new character, capture a different angle -- the world is still there.

The technology underneath

The spatial engine behind this is Marble, World Labs' large world model (LWM -- a class of AI model designed to reason about and generate three-dimensional space, as opposed to flat images or text). Marble uses 3D Gaussian Splatting, a technique that balances photorealism and computational efficiency. Traditional 3D generation is either low quality or so heavy it is unusable in real-time. Gaussian Splatting represents a scene as millions of small, overlapping ellipsoids rather than polygons, which makes rendering fast and editable.

Alpha Signal

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

  • Full access to in-depth AI research breakdowns
  • Be the first to know what's trending before it hits mainstream
  • Daily curated papers, repos, and industry moves