MemCast
Marble ingests multimodal inputs and produces editable 3‑D worlds
  • Users can feed a single text prompt, a single image, or a set of images and receive a coherent 3‑D scene that matches the inputs.
  • The system supports interactive edits such as recoloring objects, moving items, or adding new geometry.
  • This multimodal flexibility bridges the gap between creative designers and technical engineers.
  • By exposing a simple API, Marble enables downstream applications ranging from game asset creation to rapid prototyping.
Fei-Fei Li, Latent Space, 00:00:28

Supporting quotes

"Marble is a generative model of 3‑D worlds… you can input things like text or image or multiple images and it will generate for you a 3‑D world that kind of matches those inputs." (Fei-Fei Li)
"Marble is the first glimpse into our model… it is a model of spatial intelligence that also intentionally designed to be useful today." (Fei-Fei Li)

From this concept

Marble: A Generative 3-D World Model and Product

Marble is World Labs' flagship system that turns text, a single image, or multiple images into editable 3-D scenes. It is deliberately built as a usable product today while also serving as a research stepping-stone toward full-scale spatial intelligence. Real-time demos suggest that streaming 3-D generation over the internet is feasible.
