Spike-Constrained Stochastic Control

Spike-Constrained Stochastic Control

2023, Aug 15    
  • We develop a version of stochastic control that accounts for computational costs in the brain.

  • Combining concepts of efficient coding with control theory, we analyze Linear Quadratic Gaussian (LQG) control. LQG is a well-understood mathematical example of optimal control that combines a Kalman filter to keep track of a hidden state with a linear regulator that minimizes integrated quadratic state and action costs. When the state dynamics are linear, and all noise sources are Gaussian, this combination is your best bet!

  • We assume the brain encodes uncertainty using a Probabilistic Population Code (PPC), which is a distributed neural representation that supports evidence integration through linear operations and marginalization through nonlinear operations

  • We implement the Kalman filter using a spiking neural network:

    • Spike trains emitted by a sensory layer of neurons with Poisson-like response variability encode observations about the Gaussian world state.

    • This neural response feeds a recurrent layer whose firing activity encodes the belief the agent would obtain by implementing recursive Bayesian estimation based on its last observation and action executed.

    • The agent decodes the hidden world state using different projections of the neural activity emitted by the recurrent layer.

    • The agent minimizes the expected total integrated state, action, and neural cost by adjusting its spike rates and control gain.

Take away

  • In this simplest version, the precision of the observation and inferences is directly proportional to the number of spikes.

  • We reveal a trade-off: an agent can obtain more utility overall by relinquishing some task performance if doing so saves enough spikes.

  • The optimal spike rate varies with the properties of the system to be controlled (for example, stability and signal-to-noise ratio).

Why it matters?

  • Previous efforts to identify metabolically efficient ways of coding sensory evidence are restricted to feedforward settings. Here, we consider the consequences of closed-loop control.

  • This work provides a foundation for a new type of bounded rational behavior that could be used to explain suboptimal computations in the brain.

  • Understanding the mechanisms that give biological organisms an advantage over machines in perception and mobility can help engineer more robust, general-purpose, and energy-efficient robots.




Code available here