MemCast
MemCast / episode / insight
Human brains instantly infer 3‑D geometry, relationships, and future events from a single glance
  • In a live demo, Li asks the audience to raise their hands after looking at a cup for one second.
  • She explains that within that second the brain extracts the cup’s shape, its position in space, and its relation to the table, cat, and surrounding objects.
  • The brain also predicts how the cup will behave if interacted with, illustrating forward modeling.
  • This rapid, holistic processing is what she calls “spatial intelligence.”
  • The example highlights the richness of human perception compared with current AI that often processes only 2‑D pixels without context.
Fei‑Fei LiTED00:06:04

Supporting quotes

在刚才的一秒钟里,你的大脑观察了这个杯子的几何形状、它在三维空间中的位置、它与桌子、猫以及其他一切的关系。 Fei‑Fei Li
而且你可以预测接下来会发生什么。 Fei‑Fei Li
采取行动的冲动是所有拥有空间智能的生物与生俱来的,空间智能将感知与行动联系起来。 Fei‑Fei Li

From this concept

Spatial Intelligence: Bridging Perception and Action

Li defines spatial intelligence as the innate loop that ties 3-D perception to the impulse to act. She demonstrates how a single glance yields geometry, relationships, and predictions, and argues that current AI lacks this perception-action coupling. Training agents in simulated 3-D worlds can close the gap.

View full episode →

Similar insights