Abstract
Visual scene understanding requires much more than a list of the objects present in the scene and their locations. To understand a scene, plan actions on it, and predict what will happen next, we must extract the relationships between objects (e.g., support and attachment), their physical properties (e.g., mass and material), and the forces acting upon them. One view is that we do this using a "mental physics engine" that represents this information and runs forward simulations to predict what will happen next. Over the last several years we have been testing this idea with Josh Tenenbaum using fMRI. I will review evidence that certain brain regions in the parietal and frontal lobes (but not the ventral visual pathway) behave as expected if they implement a mental physics engine: they respond more strongly when people judge physical rather than visual properties and when they view physical rather than social stimuli (Fischer et al., 2016), and they contain scenario-invariant information about object mass inferred from motion trajectories (Schwettmann et al., 2019), the stability of a configuration of objects (Pramod et al., 2022), and whether two objects are in contact with each other (Pramod et al., 2025). Most tellingly, we can decode predicted collision events from perceived collision events, as expected if these brain regions run forward simulations of what will happen next. I will discuss the engagement of this system not only by rigid "Things" but also by fluid "Stuff" (Paulun et al., 2025), and (at least under some circumstances) by language. I will argue that these findings (along with the poor performance of deep net models on many intuitive physics tasks) provide preliminary evidence for a physics engine in human parietal and frontal cortex.