Information bottleneck-based Hebbian learning rule naturally ties working memory and synaptic updates

Daruwalla, Kyle, Lipasti, Mikko (May 2024) Information bottleneck-based Hebbian learning rule naturally ties working memory and synaptic updates. Frontiers in Computational Neuroscience, 18. p. 1240348. ISSN 1662-5188

[thumbnail of 10.3389.fncom.2024.1240348.pdf] PDF
10.3389.fncom.2024.1240348.pdf - Published Version
Available under License Creative Commons Attribution.

Download (1MB)

Abstract

Deep neural feedforward networks are effective models for a wide array of problems, but training and deploying such networks presents a significant energy cost. Spiking neural networks (SNNs), which are modeled after biologically realistic neurons, offer a potential solution when deployed correctly on neuromorphic computing hardware. Still, many applications train SNNs offline, and running network training directly on neuromorphic hardware is an ongoing research problem. The primary hurdle is that back-propagation, which makes training such artificial deep networks possible, is biologically implausible. Neuroscientists are uncertain about how the brain would propagate a precise error signal backward through a network of neurons. Recent progress addresses part of this question, e.g., the weight transport problem, but a complete solution remains intangible. In contrast, novel learning rules based on the information bottleneck (IB) train each layer of a network independently, circumventing the need to propagate errors across layers. Instead, propagation is implicit due the layers' feedforward connectivity. These rules take the form of a three-factor Hebbian update a global error signal modulates local synaptic updates within each layer. Unfortunately, the global signal for a given layer requires processing multiple samples concurrently, and the brain only sees a single sample at a time. We propose a new three-factor update rule where the global signal correctly captures information across samples via an auxiliary memory network. The auxiliary network can be trained a priori independently of the dataset being used with the primary network. We demonstrate comparable performance to baselines on image classification tasks. Interestingly, unlike back-propagation-like schemes where there is no link between learning and memory, our rule presents a direct connection between working memory and synaptic updates. To the best of our knowledge, this is the first rule to make this link explicit. We explore these implications in initial experiments examining the effect of memory capacity on learning performance. Moving forward, this work suggests an alternate view of learning where each layer balances memory-informed compression against task performance. This view naturally encompasses several key aspects of neural computation, including memory, efficiency, and locality.

Item Type: Paper
Subjects: bioinformatics
bioinformatics > quantitative biology
organs, tissues, organelles, cell types and functions > tissues types and functions > neural networks
organs, tissues, organelles, cell types and functions
organs, tissues, organelles, cell types and functions > tissues types and functions
CSHL Authors:
SWORD Depositor: CSHL Elements
Depositing User: CSHL Elements
Date: 16 May 2024
Date Deposited: 31 May 2024 18:08
Last Modified: 31 May 2024 18:08
PMCID: PMC11137249
Related URLs:
URI: https://repository.cshl.edu/id/eprint/41574

Actions (login required)

Administrator's edit/view item Administrator's edit/view item