The conversation distinguishes spatial reasoning (understanding, moving through, and interacting with 3-D space) from linguistic processing. Humans are born with high-bandwidth visual perception, while language is a comparatively low-bandwidth, symbolic channel. Building AI that matches human spatial acuity requires models that go beyond token-by-token prediction.
“Spatial intelligence complements linguistic intelligence for 3‑D reasoning”
“Moving AI out of the data‑center into the world requires spatial models”
“World Labs aims to build AI that understands and manipulates 3D space, a capability beyond language”