%A Scellier,Benjamin
%A Bengio,Yoshua
%D 2017
%J Frontiers in Computational Neuroscience
%C
%F
%G English
%K artificial neural network,backpropagation algorithm,biologically plausible learning rule,biologically plausible neural networks,contrastive hebbian learning,deep learning,fixed point,Hopfield Networks,Recurrent network,spike-timing dependent plasticity
%Q
%R 10.3389/fncom.2017.00024
%W
%L
%N 24
%M
%P
%7
%8 2017-May-04
%9 Original Research
%+ Benjamin Scellier,Département d'Informatique et de Recherche Opérationnelle, Montreal Institute for Learning Algorithms, Université de Montréal,Montreal, QC, Canada,benjamin.scellier@polytechnique.edu
%#
%! Equilibrium Propagation: Bridging the Gap Between Energy-Based Models and Backpropagation
%*
%<
%T Equilibrium Propagation: Bridging the Gap between Energy-Based Models and Backpropagation
%U https://www.frontiersin.org/article/10.3389/fncom.2017.00024
%V 11
%0 JOURNAL ARTICLE
%@ 1662-5188
%X We introduce Equilibrium Propagation, a learning framework for energy-based models. It involves only one kind of neural computation, performed in both the first phase (when the prediction is made) and the second phase of training (after the target or prediction error is revealed). Although this algorithm computes the gradient of an objective function just like Backpropagation, it does not need a special computation or circuit for the second phase, where errors are implicitly propagated. Equilibrium Propagation shares similarities with Contrastive Hebbian Learning and Contrastive Divergence while solving the theoretical issues of both algorithms: our algorithm computes the gradient of a well defined objective function. Because the objective function is defined in terms of local perturbations, the second phase of Equilibrium Propagation corresponds to only nudging the prediction (fixed point, or stationary distribution) towards a configuration that reduces prediction error. In the case of a recurrent multi-layer supervised network, the output units are slightly nudged towards their target in the second phase, and the perturbation introduced at the output layer propagates backward in the hidden layers. We show that the signal 'back-propagated' during this second phase corresponds to the propagation of error derivatives and encodes the gradient of the objective function, when the synaptic update corresponds to a standard form of spike-timing dependent plasticity. This work makes it more plausible that a mechanism similar to Backpropagation could be implemented by brains, since leaky integrator neural computation performs both inference and error back-propagation in our model. The only local difference between the two phases is whether synaptic changes are allowed or not. We also show experimentally that multi-layer recurrently connected networks with 1, 2 and 3 hidden layers can be trained by Equilibrium Propagation on the permutation-invariant MNIST task.