Li defines spatial intelligence as the innate loop that ties 3-D perception to the impulse to act. She demonstrates how a single glance yields geometry, relationships, and predictions, and argues that current AI lacks this perception-action coupling. Training agents in simulated 3-D worlds can close the gap.