Front. Comput. Neurosci., 11 June 2013

Resolution enhancement in neural networks with dynamical synapses

  • 1Department of Physics, The Hong Kong University of Science and Technology, Hong Kong, China
  • 2State Key Laboratory of Cognitive Neuroscience and Learning, School of Cognitive Neuroscience, Lab of Neural Information Processing, Beijing Normal University, Beijing, China

Conventionally, information is represented by spike rates in the neural system. Here, we consider the ability of temporally modulated activities in neuronal networks to carry information extra to spike rates. These temporal modulations, commonly known as population spikes, are due to the presence of synaptic depression in a neuronal network model. We discuss its relevance to an experiment on transparent motions in macaque monkeys by Treue et al. in 2000. They found that if the moving directions of objects are too close, the firing rate profile will be very similar to that with one direction. As the difference in the moving directions of objects is large enough, the neuronal system would respond in such a way that the network enhances the resolution in the moving directions of the objects. In this paper, we propose that this behavior can be reproduced by neural networks with dynamical synapses when there are multiple external inputs. We will demonstrate how resolution enhancement can be achieved, and discuss the conditions under which temporally modulated activities are able to enhance information processing performances in general.

1. Introduction

An important issue in computational neuroscience is how information is represented in the neural system. It was widely accepted that spike rates of neurons carry information. This notion was further illustrated in population codes, in which the a group of neurons encode information and even represent uncertainties therein through their collective activities (Zemel and Dayan, 1999; Pouget et al., 2000). Consequently, population coding has been successfully applied to describe the encoding of spatial and directional information, such as orientation (Ben-Yishai et al., 1995), head direction (Zhang, 1996), and spatial location (Samsonovich and McNaughton, 1997). They are also used to explain information processing in the recently discovered grid cells (Fuhs and Touretzky, 2006).

An interesting question arises, namely, whether information can be encoded in other aspects of population coding besides spike rates. For example, can extra information be carried by the coding if the spikes are modulated in time, so that different spike trains modulated differently may convey different messages even though their spike rates appear to be the same. Given this possibility, the information content of population coding can be much richer than its superficial appearance as spike rates.

In this paper, we will explore the ability of population spikes to carry information extra to spike rates. Population spikes are temporal modulations of the population neuronal activity, and are also known as ensemble synchronizations, representing extensively coordinated rises and falls in the discharge of many neurons (Loebel and Tsodyks, 2002; Holcman and Tsodyks, 2006). The population spikes are due to the presence of short-term depression (STD) of the synapses, referring to the reduction of synaptic efficacy of a neuron after firing due to the depletion of neurotransmitters (Stevens and Wang, 1995; Markram and Tsodyks, 1996; Dayan and Abbott, 2001). This adds to a recently expanding list of the roles played by STD in neural information processing. For example, STD was recently suggested to be useful in expanding the dynamic range of the system (Abbott et al., 1997; Tsodyks and Markram, 1997), estimating the information of the pre-synaptic membrane potential (Pfister et al., 2010), and stabilizing the self-organized critical behavior for optimal computational capabilities (Levine et al., 2007). STD was also found to be useful in enhancing the mobility of the network state in tracking moving stimuli (Fung et al., 2012a), and hence was recently proposed to be a foundation of a potential anticipation mechanism (Fung et al., 2012b).

Previously, population spikes were found to be global synchronizations of neuronal activities. However, in order for them to encode spatial information, the population spikes that will be considered in this paper are localized ones. We will use the case of transparent motion as an example. This example illustrates the possibility that the modulation by population spikes enables the neural system to refine the resolution of direction for multiple stimuli. The prediction by the proposed mechanism has an excellent agreement with experimental results (Treue et al., 2000).

Transparent motion is one of the most well-known experiments in the psychophysical community. In the experiment, the stimulus usually contains moving dots with different directions. So, there are multiple moving directions transparently superimposed on one another. In the nervous system, the middle temporal (MT) area was found to be responsible for detecting moving directions of objects (Maunsell and Essen Van, 1983). Here, it was recently found that the neurons are heterogeneous, with some neurons responding to the pattern of moving stimuli, while others responding to the components of composite moving patterns (Rust et al., 2006). In 2000, Treue et al. found that if the directions of two groups of moving dots differ by an angle larger than the tuning width of the neurons, the observed neuronal response profile begins to split (Treue et al., 2000). However, subjects can still distinguish the two directions if their difference is as small as about 10° (Mather and Moulden, 1980), while the average direction tuning width of neurons is about 96°.

To resolve this paradox, Treue et al. proposed that when the resultant neuronal response is too board for a single direction, the perception can identify the two directions by considering the resultant neuronal response as a superposition of two individual neuronal responses of each direction. However, when the two directions differ by an angle less than the tuning width, it becomes difficult to resolve the peaks of the two superposed responses, if the curvature of the average neural activity profile is not taken into account This difficulty was also observed in simulations with distributional population codes (Zemel and Dayan, 1999). The mechanism of enhanced resolution remained unknown, and coding by firing rates may not reveal the complete picture.

In a recently proposed model on motion transparency, the enhanced resolution was achieved (Raudies et al., 2011). Two mechanisms held the key to this advance. First, as in standard neural field models, there is a local center-surround competition in the space of motion directions. Although this is not sufficient to explain the enhanced resolution, there is the second mechanism, namely, the modulatory feedback signals from higher stages of processing in the area medial superior temporal (MST) area. Motion attraction (that is, under-estimation of the directional difference) at small angular difference, and motion repulsion (that is, over-estimation) at larger angles were successfully explained. Perception repulsion can also be found in a Bayesian inference explanation on identification of audiovisual stimulus (Sato and Toyoizumi, 2007).

Here, we propose a novel mechanism for resolution enhancement based on the temporal modulation inherent in population coding. To focus on the generic issue of whether information carried in the temporal modulation of population coding can be usefully applied in a processing task, we consider a simplified model of transparent motion. We assume that inputs from different locations of the receptive field have been integrated, the directional information has been filtered, and the processing of input information can proceed without the assistance of feedback modulations. Thus our working model reduces to a single network. The working principle is a continuous attractor neural network (CANN) with dynamical synapses. Continuous attractor neural networks, also known as neural field models, are models used for describing phenomena and features observed in some brain regions where localized attractor neuronal responses are used to represent continuous information. Due to short-range excitatory interactions and long-range/global inhibitory interactions, bump-shaped neuronal response profiles are attractors of CANNs. Since the response profiles are easy to shift their positions in the space of continuous information, they are useful in tracking moving stimuli (Amari, 1977; Ben-Yishai et al., 1995; Wu et al., 2008; Fung et al., 2010) and their drifting behaviors have been studied (Itskov et al., 2011). In contrast to these studies of tracking, we will focus on stationary stimuli and their time-dependent neuronal responses.

Dynamical synapses are found to enrich the dynamical behaviors of CANNs (York and van Rossum, 2009; Fung et al., 2012a). Short-term synaptic depression (STD) can degrade the synaptic efficacies between neurons, depending temporally on the activity history of the presynaptic neuron (Tsodyks et al., 1998). In the presence of an external stimulus, the bumps can remain temporally stable if STD were absent. However, with STD, the population activity may drop after it reaches a maximum, since neurotransmitters have been consumed. After the drop, neurotransmitters are recovered and the neuronal population is ready to respond to the external stimulus again. This results in periodic bursts of local neuronal responses, referred to as population spikes. As we shall see, the temporal modulation induced by STD, together with input fluctuations, enable the system to reduce the angle of resolution in transparent motion down to one-fourth to one-third of the tuning width of the neuron.

In the rest of this paper, we will begin with an introduction of the CANN model and its basic properties. After that, we will discuss simulation results showing that our model is able to represent acute difference in transparent stimuli. At the end, there is a discussion section concluding our proposed mechanism.

2. Model and Method

In the continuous attractor neural network model, we specify the dynamics and the state of the system by the neuronal current. For neurons with preferred stimulus x in the range −L/2 ≤ xL/2, its neuronal current is denoted by u(x, t). The dynamics of u(x, t) is given by Fung et al. (2012a)


τs is the timescale of u(x, t). It is usually of the order of the magnitude of 1 ms. ρ is the density of neurons over the space spanned by {x}. J(xx′) is a translational invariant excitatory coupling given by


where a is the range of excitatory connection and J0 is the average strength of the coupling. r(x, t) is the neural activity related to u(x, t) by


Here, Θ is a step function centered at 0. The denominator, B(t) ≡ 1 + kρ ∫ dxu(x′, t)2, in this formula is the global inhibition, controlled by the inhibition parameter k. This type of global inhibition can be achieved by shunting inhibition (Heeger, 1992; Hao et al., 2009). Iext(x, t) is the external input to the system, which will be defined in the latter part of this section.

In the integral of Equation (1), p(x, t) is the available fraction of neurotransmitters of the presynaptic neurons. Neurotransmitters are consumed when a neuron sends chemical signals to its postsynaptic neurons. However, the recovery time of the neurotransmitters is considerably longer than τs. This process can be modeled by Tsodyks et al. (1998) and Fung et al. (2012a)


τd is the timescale of recovery process of neurotransmitters. The recovery process usually takes 25–100 ms. Here, we choose τd = 50τs. These two differential equations, Equations (1) and (4), are found to be consistent with the model proposed by Tsodyks et al. (1998).

The stimulus fed to the system consists of n components, each with a Gaussian profile and a time-dependent fluctuation in strength. It is given by


Here, zi's are the peak positions of the components, and aI is the width of the Gaussian profiles. If not specified, it was assumed to be the same as the synaptic interaction range a used in Equation (2). A0 is the average relative magnitude of one input component, while δAi(t) is a random fluctuation with standard deviation σA in amplitude of input components.

Note that when the Gaussian profiles have strong overlaps, the components cannot be resolved, as illustrated in Figures 1AC. We consider the amplitude fluctuations of each component to be independent of each other, i.e., 〈δAiδAj〉 = 0, where the average is over time. These fluctuations provide a cue for the system to distinguish different components (Figure 1D). This is consistent with the psychophysical experiment which showed that spatial and temporal randomness is important for perception of motion transparency (Qian et al., 1994). Since the fluctuations vanish when averaged over time, a system responding only to time-averaged inputs is unable be able to detect the components. Here, the role of STD is to modulate the network state, so that it responds to one input component once a time.


Figure 1. (A–C) The profile of two superposed Gaussian functions with the same height. f(x) ≡ {exp[(x − Δz/2)2/(2a2)] + exp[(x + Δz/2)2/(2a2)]}/2. Red solid line: y = f(x) with different Δz. Dashed line: y = f(x) with Δz = 0 as a reference. (A) Δz = 0. (B) Δz = tuning width = 2a. (C) Δz = 110% tuning width = 2.2a. (D) The profile of two superposed Gaussian functions with different heights to illustrate how the amplitude fluctuations provide a cue to distinguish the components. g(x) ≡ {A0 exp[(x − Δz/2)2/(2a2)] + A1 exp[(x + Δz/2)2/(2a2)]}. Dashed line: y = f(x) with Δz = 0 as a reference. Red solid line: y = g(x) with Δz = tuning width, A0 = 0.4 and A1 = 0.6. Blue solid line: y = g(x) with Δz = tuning width, A0 = 0.6 and A1 = 0.4.

To model the situation that the maximum strength of the input profile is invariant, we consider the input in Equation (1) to be


where A is the fixed maximum magnitude of the external input. As the external input profile is set to have a constant maximum, only the ratio σA/A0, rather than the magnitudes of A0 and σA, is relevant in our studies.

It is convenient to rescale the dynamical variables as follows. We first consider the case without STD when β = 0, and the synaptic interaction range a « L. In this case, p(x, t) = 1 in Equation (1). For kkcρJ02/(82πa), the network holds a continuous family of Gaussian-shaped stationary states when Iext(x, t) = 0. These stationary states are

u˜(x)=u˜0exp(|xz|24a2) ,(7)


r˜(x)=r˜0exp(|xz|22a2) .(8)

where u˜(x) is the rescaled variable ρJ0u(x), and u˜0 is the rescaled bump height. The parameter z, i.e., the center of the bump, is a free parameter, implying that the stationary state of the network can be located anywhere in the space x. In this paper, we assume that the variable is represented solely by the peak position of the neural activity profile. This assumption is one of the most direct ways to interpret the population code. However, there are other ways to interpret population codes. For example, Treue et al. (2000) proposed that the curvature of the average of the neural activity carries information represented by the neural population code, although the mechanism achieving this objective is not clear (Treue et al., 2000). On the phenomenological level, distributional population coding and double distributional population coding were proposed to represent information in population coding with more sophistication (Zemel and Dayan, 2000; Sahani and Dayan, 2003).

The tuning width of a neuron, defined as the standard deviation of the firing rate profile multiplied by 2, is therefore 2a. In the present work, we rescale the neuronal current as u˜(x,t)ρJ0u(x,t), together with the corresponding rescaling of other variables given by A˜ρJ0A, k˜k/kc, β˜τdβ/(ρ2J02). By using these rescaling rules, the dynamics of the system should only depend on k˜, β˜, τds, σA/A0, zi's and A˜. Below, only these parameters will be specified.

In each simulation, the variables u(x, t) are modeled to be located at N discrete positions uniformly distributed in the space of preferred stimuli {x}. To do massive simulations, all simulation results are generated by using N = 80. We have verified that the dynamics of the system is independent to N, and the number of neurons should not affect the conclusion. The boundary condition of the space is periodic. The range of the network is 360° and the tuning width of the neurons is 96°, following the experimental estimates in Treue et al. (2000). To solve differential equations in Equations (1) and (4), we used the Runge-Kutta Prince-Dormand (8,9) method provided by the GNU Scientific Library. Initial conditions of u(x, t)'s is zero, while p(x, t)'s are initially 1. The local error of each evolution step is less than 10−6. The random number generator used to generate the Gaussian random number is the generator proposed by Lüscher (1994). The Gaussian fluctuation is updated every 50τs.

3. Results

3.1. Population Spikes

We first consider the response of the network when the input consists of one component. We explore the network behavior by varying the parameters k˜, β˜, and A˜. We found a rich spectrum of behaviors including population spikes, static bumps, and moving bumps. The full picture will be reported elsewhere. For the purpose of the present paper, we fix k˜ and β˜ at a typical value and consider the behavior when A˜ increases. As shown in the top panel of Figure 2, the network cannot be triggered to have significant activities when the input is weak. In the bottom panel, the input is so strong that the network response is stabilized to a static bump with time-independent amplitude. An interesting case arises in the middle panel for moderately strong input, where population spikes can be observed. Population spikes are the consequence of the presence of STD. They are caused by a rapid rise of neuronal activity due to the external stimulus. Then in a time of the order of τd, the neurotransmitters are consumed, leading to a rapid drop in neuronal activity. When the neurotransmitters recover, the neurons become ready for the next population spike, resulting in the interesting periodic behavior. Population spikes have been found before as synchronization of neuronal activities, and their potential role in processing information was appreciated, but no specific context of such applications was identified (Loebel and Tsodyks, 2002), Here, we will present an example that spatially localized population spikes endow the neural system a capacity of reading-out input components.


Figure 2. Firing rates r˜(x, t) (A, D, and G), available fraction of neurotransmitters p(x, t) (B, E, and H) and corresponding input (C, F, and I) for various magnitudes of single-peaked external inputs. (A–C) A˜=0.4, (D–F) A˜=0.8, and (G–I) A˜=2.0. Other parameters: k˜=0.5, β˜=0.24, a = 48π/180, and τd = 50τs.

3.2. Network Activities for Two Stimuli

Next, we consider inputs with two components separated by Δz > 0 and study the network behavior when Δz gradually increases. Without loss of generality, we choose z1 = Δz/2 and z2 = −Δz/2. The relative fluctuation is σA/A0 = 0.3.

When the separation is small, the positions of the population spikes fluctuate around the mid-position of the two stimuli, as illustrated in Figure 3A. The two components cannot be resolved.


Figure 3. Raster plot of firing rates r˜ for (A) Δz = 0.5, (B) Δz = 1.0, and (C) Δz = 2.0. White dashed lines: positions of stimuli. Parameters: other parameters: k˜=0.5, β˜=0.24, a = 48π/180, A˜=0.8, σδAi/A0 = 0.3, and τd = 50τs.

When the separation increases to the extent that the two components remain barely resolved, an interesting change in the spiking pattern occurs as shown in Figure 3B. The positions of the population spike peaks begin to center around the two input components, although the shoulders of the population spikes remain overlapping considerably. Note that in this regime, the profile of the neuronal activities remain unresolved when they are averaged over time. However, due to the presence of STD, it is likely that a population spike is produced at the position of the component which happens to be higher due to height fluctuations. Hence in this regime, the population spike peaks are no longer aligned at the center. Rather, they are arranged in two rows, each around the two components. Furthermore, the two rows of population spikes tend to fire alternately. This implies that although it is hard to resolve the two components by considering the time-averaged signals, the temporal modulation by the alternating population spikes may be utilized for resolution enhancement.

When the separation increases further, the population spikes form two groups clearly, as shown in Figure 3C. The two components are clearly resolved.

To compare our model with experimental results, we measure the time average of neuronal activities as a function of preferred stimuli of neurons and the separation of the two stimuli, shown in Figure 4A. We found that this result is very similar to the experimental results reported by Treue et al. [Figure 2C in Treue et al. (2000)]. The peak of the average profile of neuronal activities splits near Δz ~ 1.0× tuning width. However, the time-averaged data cannot explain why subjects can resolve separations much less than the tuning width.


Figure 4. (A) Time average of firing rates r˜ as a function of the preferred stimuli of neurons, x, and the separation between the two stimuli, Δz. Contour lines: r˜t=1 (dotted-dashed line), r˜t=2 (dashed line), r˜t=3 (solid line), r˜t=4 (dotted line). Parameters: same as Figure 3. (B) The average neural activity recorded by Treue et al. (2000) (with license number 3125800919243 for the reuse purpose). (C) Contours of the distribution of peak positions higher than 6.2 as a function of preferred stimuli, x, and the separation between the two stimuli, Δz. White dashed line: positions of the two stimuli. L1, one-third of the tuning width. L2, tuning width. Parameters, same as Figure 3.

3.3. Extraction of Modulated Information

To demonstrate that the neuronal activities carry the information about two stimuli, we collect statistics on the peak positions of the population spikes. Here the peak position is calculated by maxxr˜(x). In Figure 4C, we present the contour plot of the distribution of peak positions in the space of the preferred stimuli of neurons and separation between the two stimuli in units of the tuning width. To focus on peaks with significant information only, we counted only population spikes with maximum amplitudes above an appropriately chosen threshold. Each column in Figure 4C is a normalized histogram with 80 bins. In order to obtain a relatively smooth distribution, the sampling process lasted for 100,000 τs. The mean of the separation between peak positions is plotted in Figure 5 as a function of Δz. We found that in this setting, the system can detect the input separation down to one-fourth of the tuning width. We note that in Figure 4C, when the difference between the components is too small, Δz ≲ 1/4 tuning width, population spikes occur at the middle of the net external input profile with a relatively small variance. However, when the network starts to resolve the two components, there are notable variances on positions of the population spikes in each component. The standard deviation of the positions of the population spikes in each component is roughly of the order of 0.1 times the tuning width, which is roughly 20°, as shown in Figure 5.


Figure 5. The mean separation of peak positions of r(x, t) shown in Figure 4C. Symbols, simulation. Dashed line, diagonal line representing perfect distinguishability.

To investigate whether the statistics with long sampling period is applicable to sampling periods in actual experiments, we have also collected statistics for 500τs. (In the experiment done by Treue et al., subjects took 500 ms to perform the discriminational task.) The result is shown in Figure A2A in Appendix. Although the distribution is rougher because of the relatively small sampling size, enhanced resolution down to 0.3 tuning width is still visible.

Furthermore, when the separation between the two stimuli lies between one-third and three-halves of the tuning width, the system slightly overestimated the separation of the two profiles. If we take the tuning width to be 96° (Treue et al., 2000), this range will be approximately from 30° to 140°. This is consistent with the experimental results of Braddick et al. (2002), in which subjects overestimated some moving direction difference in transparent motion experiments. However, it was reported in Figure 4 in Treue et al. (2000) that the perceived separation of movement direction starts to underestimate the truth when the stimulus separation increases above 40°. Since the range corresponding to “motion repulsion” reported by Braddick et al. (2002). is different from that reported by Treue et al., it seems that the range of differences between stimuli corresponding to “motion repulsion” is different for different experimental settings.

We have also tested the effects of choosing the widths of external input components to be different from the tuning width of the neuronal response. We found that the results for different stimulus strengths in Figure A1 in Appendix are qualitatively the same as that in Figure 4C.

The result shown in Figure 4C is not particular for the chosen set of parameters. In Figure 6, there is a phase diagram along with some selected parameters. In Figure 6A, the colored region is the region for population spikes with one stimulus. If A˜ and β˜ are chosen from this region, as far as we have observed, similar results can be obtained by choosing appropriate thresholds. If A˜ and β˜ are outside the colored region, no matter what the threshold was, the result shown in Figure 4C cannot be reproduced. This result suggests that population spikes are important to resolution enhancement.


Figure 6. (A) The phase diagram of population spikes over the parameter space spanned by (A˜,β˜) with the parameter k˜=0.5 and τds = 50. (B–H) are distributions of the occurence of peak positions as function of Δz. The numbers at the top of (B–H) are thresholds used to sample peak positions. Parameters: (B) A˜=0.4 and β˜=0.35. (C) A˜=0.65 and β˜=0.35. (D) A˜=0.9 and β˜=0.35. (E) A˜=0.4 and β˜=0.2. (F) A˜=0.6 and β˜=0.2. (G) A˜=0.7 and β˜=0.1. (H) A˜=0.9 and β˜=0.2.

3.4. Network Response with Multiple Stimuli

We further test the response of our model to more than two stimuli. Figure 7 shows the case for three stimuli of equal amplitude, whose peak positions are labeled by the white dashed lines. However, the contours of the distribution of population spikes are double-peaked, similar to those in Figure 5. This result suggests that, if there are three stimuli overlapped together, the network response should give only two groups of neuronal responses. Also, it predicts that peaks of population spikes should occur at positions that underestimate the separation between the outermost stimuli. A similar result for shorter sampling periods comparable to actual experiments can be found in Figure A2B in Appendix.


Figure 7. Contours of the distribution of peak positions higher than 6.2 as a function of preferred stimuli, x, and the separation between the two outermost stimuli, Δz, in the case of three equally strong stimuli. White dashed line: positions of three stimuli. Horizontal dotted line: the case comparable to the three-stimulus experiment reported by Treue et al., 2000. Vertical dashed lines: perception (±40°) reported by subjects in the experiment in units of the tuning width (96°). Parameters: same as Figure 5.

We found that the experimental result of multiple stimuli reported by Treue et al. is consistent with this prediction. In their paper, it was reported that, when there were three groups of moving dots moving at directions ±50° and 0°, the subjects would report that there were only two moving directions at ±40°. This consistency is shown in Figure 7, where the vertical dotted line L labels the position that the outermost stimuli are directed at ±50° when the tuning width is 96°, and the pair of horizontal dashed lines labels ±40° correspondingly.

4. Conditions for Resolution Enhancement

We have demonstrated the phenomenon of resolution enhancement due to modulations of population spikes. To see whether this picture can be generalized to other cases and what alternative models are to be excluded, we summarize the general conditions of its occurrence. To appreciate the significance of each condition, we will consider the alternative scenarios in the presence and absence of the various conditions.

4.1. Short-Term Synaptic Depression

Without the STD, the steady state of the neuronal activity profile becomes centered at either one of the two input stimuli. In Figure 8A, when the difference between the input profiles is large, Δz/a = 3.7 for instance, the neuronal activity is trapped by the input profile near x = 1.55. This case is not consistent with experiments, because when the separation between the input profiles is large enough, the neuronal activity should be able to identify both stimuli. This shows that STD plays the following roles in this phenomenon.


Figure 8. (A) Raster plot of firing rate r˜ of the network with two stimuli and without STD. Parameters: k˜=0.5, β˜=0, A˜=0.8, a = 48π/180, σA/A0 = 0.3, and Δz = 3.1. (B) Rastor plot of firing rate r˜ of the network with two stimuli with weak net input profile. Parameters: k˜=0.5, β˜=0.24, A˜=0.4, a = 48π/180, σA/A0 = 0.3, and Δz = 2.5. (C) Rastor plot of firing rate r˜ of the network with two stimuli without height fluctuations in the external input profile. Parameters: k˜=0.5, β˜=0.24, A˜=0.8, a = 48π/180, σA/A0 = 0, and Δz = 1.67. (D) Contours of the distribution of peak positions for all peak heights. White dashed line: positions of the two stimulus components. Parameters: k˜=0.5, β˜=0.24, A˜=0.8, a = 48π/180, σA/A0 = 0.3, and Δz = 1.0.

First, STD gives rise to the temporal modulation characterized by the population spikes, in which rapid rises in population activities alternate periodically with drops due to the consumption of neurotransmitters. Spiking activities enable the activity profile to jump from one stimulus position to another easily.

Second, the presence of STD enhances the mobility of the activity profiles. Due to the consumption of neurotransmitters in the active region, the profile tends to relocate itself to less active regions. This is the cause of the increased mobility when the activity profile tracks the movement of external stimuli, as well as their anticipatory tracking as a possible mechanism for delay compensation (Fung et al., 2012a,b). In the parameter regime where the stationary profile becomes unstable in its position, and population spikes become the attractor state, the network tends to establish a population spike in new locations, preventing itself from being trapped by one stimulus. This results in population spikes centered at alternating stimuli and hence the temporal modulation.

For example, if the two stimuli are strongly overlapped, the average neuronal response concentrates at the in-between region of the two stimuli, as shown in Figure 3B. In this case, the time-average profile of the dynamical variable p(x, t) has a dip centered at the midpoint between two stimuli, as shown in Figure 9. Since, in our model, there are fluctuations of the magnitude of each component of the external input, population spikes occur near the positions of the stimuli, labeled by the blue lines in Figure 9. Since the synaptic efficacies of the presynaptic neurons are stronger in the side region further away from the other stimulus, population spikes are more likely to happen in the outer region rather than the inner region. So, the separation between the two groups of population spikes can be larger than the separation between the two stimuli. This is also the reason why only two groups of population spikes can be observed in the case with three stimuli (Figure 7). STD also explains the slight over-estimation of the perceived positions when the separation of the stimuli is around the tuning width.


Figure 9. The time-averaged dynamical variable p(x, t). Symbols and red line: measurement from the simulation. Blue lines: positions of two stimuli. Parameters: k˜=0.5, β˜=0.24, a = 48π/180, τds = 50, A˜=0.8, and Δz = tuning width of attractor states.

Third, when STD is not sufficiently strong, we observe that sloshers rather than population spikes are formed (Folias, 2011). These sloshers are bumps that oscillate back and forth around the external stimuli, as shown in Figure 10. The height of the bumps is highest when they slosh to the extreme positions, but due to the weaker STD, the height variation in a cycle is not as extreme as those in the population spikes. The positional extent of their oscillations is mainly determined by the restoring attraction from the external input, and is effectively insensitive to the stimulus profile. Hence in the task of resolving the stimulus directions, the performance is degraded by the very flat part of the curve of the perceived separation when the stimuli have strong overlaps, as shown in Figure 6G.


Figure 10. Raster plot of firing rates r˜ at Δz = 0.1, showing a slosher. White dashed lines: positions of the stimuli. Other parameters: k˜=0.5, β˜=0.1, a = 48π/180, A˜=0.8, σA/A0 = 0.2, and τd = 50τs.

There are also other variants of the model that demonstrate the significance of STD in similar ways. For example, in recurrent networks with local inhibition, we may replace B(t) in Equation (3) by B′(x, t) given by

B(x,t)=1+ρkdxexp (|xx|22b2)u(x,t)2.(9)

To stabilize the neural activity, the range of the local inhibition, b, has to be larger than the range of excitatory connection, a. However, if a is as large as 48°, this local inhibition can be fairly replaced by B(t) with appropriate k˜. In the presence of STD, the discrimination performance is comparable to that in Figure 5, but the resolution is poor otherwise.

4.2. Suitably Strong Input Profiles

Suitably strong input magnitude is needed to produce the temporally modulated patterns, as illustrated in Figure 6. First, when the magnitude of the external input is too small, no significant system-driven neuronal activity can be observed. Fluctuations of external input components cannot stimulate the population spike, as the activation by input profiles was not strong enough. Second, even when the magnitude of the external input is larger, population spikes can be produced but the stimulus is too weak to pin them at the position of the stimuli. Since the mobility of the population spikes is enhanced by STD, moving population spikes are formed, as illustrated in Figure 8B. Since the population spikes move away from the stimulus positions after their formation, they cannot be used to encode the stimulus positions and also become part of the noisy background affecting the recognition of the stimulus positions. When the stimulus is too strong, population spikes cannot be generated and the resolution degrades.

4.3. Fluctuations in Input Profiles

Fluctuations on external input components is important to the behavior in Figure 3. If there were no fluctuations in the input profiles, the net input profile will have only one peak for Δz < 2a. As a result, there is effectively one bell-shaped input profile if the difference between two stimuli is too small, and the network response will also be single-peaked, as shown in Figure 8C. Hence fluctuations in the external input play the role of rendering the components distinguishable. As shown in Figure 11, recognition of input location always follows a strong input on the same side at the current step, and a strong input on the other side in the previous step, suggesting that a sudden shift in input bias provides condition for reliable recognition. In fact, the noise fluctuations act as the signals themselves, without which the single-peaked input provides little information about the components. Results in Figure 11 also illustrate that, statistically, the system is able to give valid responses to stimulus changes in a single step. This explains why the network yields discrimination performance equally well for short and long sampling periods, as demonstrated in a comparison between Figures 4C, A2A.


Figure 11. Population spikes' positions conditional on input fluctuations. One step refers to 50τs, which is the temporal interval between every update in the Gaussian fluctuation δAi(t). Input bias is defined as (δA1(t) − δA2(t))/max(A0 + δA1(t), A0 + δA2(t)). The color code indicates the average position of population spikes above threshold within one step in unit of the tuning width (TW). Gray color means the average position is within the true position of either input ±0.01 TW. True positions of inputs: z1 = Δz/2, z2 = − Δz/2. (A) Δz = 0.33 TW. (B) Δz = 0.40 TW. Parameters: same as Figure 3.

The fluctuations may come from randomness in the inputs. Psychophysical experiments show that spatial and temporal randomness is important for perceptions of motion transparency. For example, regularly spaced lines moving in opposite directions do not give the perception of transparent motion, whereas randomly spaced lines are able to do so (Qian et al., 1994). The input signals come from different locations of the visual field, and fluctuations arise when the perceived objects move from one location to another. Fluctuations may also arise when feedback signals from advanced stages of processing guide the system to shift its attention from one specific component to another.

Functionally, fluctuations facilitate the resolution of the directional inputs in the following two aspects. Spatially, it breaks the symmetry of the input profile. Temporally, it provides the time-dependent signals that induce the population spikes centered at the component that happens to be strengthened by fluctuations. This enables the system to recognize the temporally modulated inputs. On the other hand, for systems processing only time-averaged inputs, the height fluctuations vanish when averaged over time, so that the components cannot be detected.

4.4. Thresholding

Even after temporal modulation, resolution based on the network response can still carry large errors. As shown in Figure 3C, there are obviously two groups centering around the positions of the two components, but in between the two components, there is a region with moderate neuronal activities. If the network includes neuronal activities of all magnitudes, the errors in estimating the component positions will be large, especially when Δz is small. Indeed, Figure 8D shows that without imposing any thresholds on the neuronal activities, the network cannot resolve the two components until the separation exceeds the tuning width.

In order to solve this problem, we introduce a threshold on the maximum firing rates. We collect statistics of the peak positions of the firing rate profile when their height exceeds the threshold. The result is shown in Figure 5, indicating a significant improvement of resolution compared with Figure 8D. The effects of the threshold value on the resolution performance are shown in Figure 12. When the threshold is low, the components are not resolved even at a separation of 0.5 times the tuning width. On the other hand, when the threshold is too high, the statistics of peak positions becomes too sparse to be reliable. In an intermediate range of thresholds that is not too narrow, the resolution of the components can be achieved down to separations of 0.3–0.4 times the tuning width.


Figure 12. Effect of thresholding on the statistics of peak positions. Distributions of peak positions higher than different thresholds (shown on the left) when two stimuli are separated by Δz (shown at the bottom) are plotted in patches. The scales of all the patches are the same, shown on the lower-left patch. Distributions are normalized to the maximum value. Red bars mark the positions of two stimuli. Other parameters: k˜=0.5, β˜=0.1, a = 48π/180, A˜=0.8, σA/A0 = 0.2, and τd = 50τs.

4.5. Recurrent Connections

Finally, we would like to stress the importance of recurrent connections in achieving resolution enhancement. With no recurrence, population spikes cannot be generated and the amplification of the difference between nearly overlapping inputs cannot be achieved. Let us consider a purely feedforward network, with weaker but spatially broader inhibition than excitation,

τsdudt(x,t)=u(x,t)+ρdx[JEexp(|x   −x|22a2)                  JIexp (|xx|22b2)]p(x,t)Iext(x,t)(10)

where JE > JI and Iext(x, t) is the same as that in recurrent network in Equation (6). Although in this feedforward network STD can still modulate the synaptic efficacy so that neuronal activities prefer the side region to the midpoint between two stimuli, temporal modulation, which is essential to population spikes, cannot be realized without feedback. As mentioned above, population spikes make it easier for the activity profile to switch off on one side and grow up on the other. As shown in Figure 13, the resolution enhancement in the purely feedforward network is poor. In fact, the behavior is very similar to those in the non-spiking region even when the architecture is recurrent, as shown in Figures 6A,E.


Figure 13. (A) Raster plot of firing rate r for purely feedforward network. Δz = 0.5 TW. White dashed line: positions of the two stimuli. (B) Contours of the distribution of peak positions higher than 0.45 as a function of preferred stimuli,x, and the separation between the two stimuli, Δz. White dashed line: positions of the two stimuli. Parameters: JI = 0.3JE, b = 3a, τdβ/ρJE = 0.2, ρJEA = 0.8, σδAi/A0 = 0.3, and τd = 50τs.

5. Discussion

In this paper, we have demonstrated how STD plays the role of generating population spikes that can carry information extra to spike rates. We have used the example of resolving transparent motion with two components in a continuous attractor neural network, and have shown that the temporal modulation of the firing rates enables the network to enhance the resolution of motion transparency, thereby providing a possible explanation to the longstanding mystery of resolving separations narrower than the tuning width of the neurons, and resulting in input-output relations that can have excellent agreement with experimental results (Treue et al., 2000). The role played by STD was further clarified by comparison with alternate scenarios under 4 general conditions.

First, the strength of STD should be sufficiently strong. Weaker STD may result in the network response being pinned by one of the two components, or slosher modes that span a range of positions effectively independent of the component separations. On the other hand, sufficiently strong STD can give rise to population spikes, endowing them the freedom to alternate between the two components. Equally important is the provision of temporal modulation by the population spikes, so that the firing patterns indeed contain information of the stimuli, even though the time-averaged firing rate can only resolve separations larger than the tuning width of neurons, as shown in Figure 4 and found experimentally by Treue et al. (2000). The role played by temporally modulated signals in transparent motions can be tested in future experiments.

Second, the strength of the input should be sufficiently strong. Otherwise, no population spikes can be produced. Even for moderately strong input, the population spikes become moving ones, and fail to represent the stimulus positions.

Third, fluctuations in the input profiles are also important. They provide the temporally sensitive signals when the two components cannot be resolved in the time-averaged input. They correspond to the “unbalanced motion signals” in the detection of transparent motion with opposite moving directions (Qian et al., 1994).

Fourth, thresholds are needed to extract the information of the stimuli contained in the firing patterns, since they are able to truncate background activities that interfere the signals from the two components.

Our proposed model is not the first model or mechanism to explain the behavior of the discriminational task in transparent motion experiments. It was suggested that the curvature of the average neural activity may provide information of multiple stimuli, but the neural activity is wider than expected (Treue et al., 2000). Other proposals require more complex structures to achieve the task. For example, a population to encode uncertainty is needed to differentiate between multiplicity and uncertainty (Sahani and Dayan, 2003), and additional internal structures are needed to provide feedback information (Raudies et al., 2011). While admittedly involving additional structures and layers can augment the functionality of the brain, our work shows that it is possible to achieve with little additional structure the performance consistent with experiments in Treue et al. (2000) and Braddick et al. (2002). An interesting future direction is to consider whether firing rates multiplexed with temporal modulations can be an instrument to achieve the differentiation between multiplicity and uncertainty posed in Sahani and Dayan (2003).

The ability of STD to generate temporally modulated response is also applicable to other brain tasks, such as switching between percepts in competitive neural networks (Kilpatrick, 2012). Compared with other conventional neural network models processing time-averaged or static neuronal response profiles, the temporal component provides an extra dimension to encode acute stimuli, so that information processing performance can be significantly enhanced.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.


This work is supported by the Research Grants Council of Hong Kong (grant numbers 605010, 604512, and N_HKUST606/12) and the National Foundation of Natural Science of China (No. 31221003, No. 31261160495, No. 91132702).


Abbott, L. F., Varela, J. A., Sen., K., and Nelson, S. B. (1997). Synaptic depression and cortical gain control. Science 275, 220–224. doi: 10.1126/science.275.5297.221

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Amari, S. I. (1977). Dynamics of pattern formation in lateral-inhibition type neural fields. Biol. Cybern. 27, 77–87.

Pubmed Abstract | Pubmed Full Text

Ben-Yishai, B., Bar-Or, R. L., and Sompolinsky, H. (1995). Theory of orientation tuning in visual cortex. Proc. Natl. Acad. Sci. U.S.A. 92, 3844–3848. doi: 10.1073/pnas.92.9.3844

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Braddick, O. J., Wishart, K. A., and Curran, W. (2002). Directional performance in motion transparency. Vis. Res. 42, 1237–1248. doi: 10.1016/S0042-6989(02)00018-4

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Dayan, P., and Abbott, L. F. (2001). Theoretical Neuroscience. Cambridge; London: MIT Press.

Folias, S. E. (2011). Nonlinear analysis of breathing pulses in a synoptically coupled neural network. SIAM J. Appl. Dyn. Syst. 10, 744–787. doi: 10.1137/100815852

CrossRef Full Text

Fuhs, M. C., and Touretzky, D. S. (2006). A spin glass model of path integration in rat medial entorhinal cortex. J. Neurosci. 26, 4266–4276. doi: 10.1523/JNEUROSCI.4353-05.2006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Fung, C. C. A., Wong, K. Y. M., Wang, H., and Wu, S. (2012a). Dynamical synapses enhance neural information processing: gracefulness, accuracy and mobility. Neural Compute. 24, 1147–1185. doi: 10.1162/NECO_a_00269

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Fung, C. C. A., Wong, K. Y. M., and Wu, S. (2012b). “Delay compensation with dynamical synapses,” in Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012 (New York, NY: Curran Associates, Inc.).

Fung, C. C. A., Wong, K. Y. M., and Wu, S. (2010). A moving bump in a continuous manifold: a comprehensive study of the tracking dynamics of continuous attractor neural networks. Neural Compute. 22, 752–792. doi: 10.1162/neco.2009.07-08-824

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Hao, J., Wang, X., Dan, Y., Poor, M., and Zhang, X. (2009). An arithmetic rule for spatial summation of excitatory and inhibitory inputs in pyramidal neurons. Proc. Natl. Acad. Sci. U.S.A. 106, 21906–21911. doi: 10.1073/pnas.0912022106

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Heeger, D. (1992). Normalization of cell responses in cat striate cortex. Vis. Neurosci. 9, 181–197.

Pubmed Abstract | Pubmed Full Text

Holcman, D., and Tsodyks, M. (2006). The emergence of up and down states in cortical networks. PLoS Comput. Biol. 2, 174–181. doi: 10.1371/journal.pcbi.0020023

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Itskov, V., Hansel, D., and Tsodyks, M. (2011). Short-term facilitation may stabilize parametric working memory trace. Front. Comput. Neurosci. 5, 1–19. doi: 10.3389/fncom.2011.00040

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kilpatrick, Z. P. (2012). Short term synaptic depression improves information transfer in perceptual multistability. Available online at: http://arxiv.org/abs/1212.0076

Levine, A., Herrmann, J. M., and Giesel, T. (2007). Dynamical synapses causing self-organized criticality in neural networks. Nat. Phys. 3, 857–860. doi: 10.1038/nphys758

Loebel, A., and Tsodyks, M. (2002). Computation by ensemble synchronization in recurrent networks with synaptic depression. J. Compute. Neurosci. 13, 111–124.

Pubmed Abstract | Pubmed Full Text

Lüscher, M. (1994). A portable high-quality random number generator for lattice field theory simulations. Compute. Phys. Commun. 79, 100–110. doi: 10.1016/0010-4655(94)90232-1

CrossRef Full Text

Markram, H., and Tsodyks, M. (1996). Redistribution of synaptic efficacy between neocortical pyramidal neurons. Nature 382, 807–810. doi: 10.1038/382807a0

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Mather, G., and Moulden, B. (1980). A simultaneous shift in apparent direction: further evidence for a “distribution-shift” model of direction coding. Q. J. Exp. Psychol. 32, 325–333. doi: 10.1080/14640748008401168

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Maunsell, J. H., and Essen Van, D. C. (1983). Functional properties of neurons in middle temporal visual area of the macaque monkey. I. Selectivity for stimulus direction, speed, and orientation. J. Neurophysiol. 49, 1127–1147.

Pubmed Abstract | Pubmed Full Text

Pfister, J.-P., Dayan, P., and Lengyel, M. (2010). Synapses with short-term plasticity are optimal estimators of presynaptic membrane potentials. Nat. Neurosci. 13, 1271–1275. doi: 10.1038/nn.2640

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Pouget, A., Dayan, P., and Zemel, R. (2000). Information processing with population codes. Nat. Rev. Neurosci. 1, 125–132. doi: 10.1038/35039062

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Qian, N., Andersen, R. A., and Adelson, E. H. (1994). Transparent motion perception as detection of unbalanced motion signals. I. Psychophysics. J. Neurosci. 14, 7357–7366.

Pubmed Abstract | Pubmed Full Text

Raudies, F., Mingolla, E., and Neumann, H. (2011). A model of motion transparency processing with local center-surround interactions and feedback. Neural Comput. 23, 2868–2914. doi: 10.1162/NECO_a_00193

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Rust, N. C., Mante, V., Simoncelli, E. P., and Movshon, J. A. (2006). How MT cells analyze the motion of visual patterns. Nat. Neurosci. 9, 1421–1431. doi: 10.1038/nn1786

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Sahani, M., and Dayan, P. (2003). Doubly distributional population codes: simultaneous representation of uncertainty and multiplicity. Neural Comput. 15, 2255–2279. doi: 10.1162/089976603322362356

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Samsonovich, A., and McNaughton, B. L. (1997). Path integration and cognitive mapping in a continuous attractor neural network model. J. Neurosci. 17, 5900–5920.

Pubmed Abstract | Pubmed Full Text

Sato, Y., and Toyoizumi, T. (2007). Bayesian inference explains perception of unity and ventriloquism aftereffect: identification of common sources of audiovisual stimuli. Neural Comput. 19, 3335–3355. doi: 10.1162/neco.2007.19.12.3335

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Stevens, C. F., and Wang, Y. (1995). Facilitation and depression at single central synapses. Neuron 14, 795–802. doi: 10.1016/0896-6273(95)90223-6

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Treue, S., Hol, K., and Dauber, H.-J. (2000). Seeing multiple directions of motion—physiology and psychophysics. Nat. Neurosci. 3, 270–276. doi: 10.1038/72985

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Tsodyks, M., and Markram, H. (1997). The neural code between neocortical pyramidal neurons depends on neurotransmitter release probability. Proc. Natl. Acad. Sci. U.S.A. 94, 719–723.

Pubmed Abstract | Pubmed Full Text

Tsodyks, M., Pawelzik, K., and Markram, H. (1998). Neural networks with dynamic synapses. Neural Comput. 10, 821–835. doi: 10.1162/089976698300017502

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wu, S., Hamaguchi, K., and Amari, S. (2008). Dynamics and computation of continuous attractors. Neural Comput. 20, 994–1025. doi: 10.1162/neco.2008.10-06-378

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

York, L. C., and van Rossum, M. C. W. (2009). Recurrent networks with short term synaptic depression. J. Comput. Neurosci. 27, 607–620. doi: 10.1007/s10827-009-0172-4

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Zemel, R. S., and Dayan, P. (1999). Distributional Population Codes and Multiple Motion Models. Advances in NIPS, Vol. 11. Cambridge, MA: MIT Press, 174–180.

Zemel, R. S., and Dayan, P. (2000). Distributional Population Codes and Multiple Motion Models. Advances in Neural Information Processing Systems, Vol. 11. Cambridge, MA: MIT Press.

Zhang, K. (1996). Representation of spatial orientation by the intrinsic dynamics of the head-direction cell ensemble: a theory. J. Neurosci. 16, 2112–2126.

Pubmed Abstract | Pubmed Full Text



Figure A1. Contours of the distribution of peak positions higher than (A) 6.2 and (B) 5.5 as a function of preferred stimuli, x, and the separation between the two stimuli, Δz. White dashed line: positions of two stimuli. Parameters: same as Figure 3, except aI = 0.95a for (A) and aI = 1.05a for (B).


Figure A2. The population spike occurrence counted within 500τs, which is comparable to the timescale in typical experiments. (A) Situations that there are two stimuli. (B) Situations with three stimuli. Other parameters: (A) same as Figure 4, (B) same as Figure 7.

Keywords: continuous attractor neural network, neural field model, short-term synaptic depression, short-term synaptic plasticity, transparent motion

Citation: Fung CCA, Wang H, Lam K, Wong KYM and Wu S (2013) Resolution enhancement in neural networks with dynamical synapses. Front. Comput. Neurosci. 7:73. doi: 10.3389/fncom.2013.00073

Received: 01 January 2013; Accepted: 15 May 2013;
Published online: 11 June 2013.

Edited by:

Misha Tsodyks, Weizmann Institute of Science, Israel

Reviewed by:

Florentin Wörgötter, University Goettingen, Germany
Taro Toyoizumi, RIKEN/BSI, Japan

Copyright © 2013 Fung, Wang, Lam, Wong and Wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.

*Correspondence: K. Y. Michael Wong, Department of Physics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, China e-mail: phkywong@ust.hk;
Si Wu, State Key Lab of Cognitive Neuroscience and Learning, School of Cognitive Neuroscience, Lab of Neural Information Processing, Beijing Normal University, No.19, XinJieKoWai St., HaiDian District, Beijing 100875, China e-mail: wusi@bnu.edu.cn

Present address: Kin Lam, Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, USA