Introduction
Binocular Rivalry (BR) is the perceptual competition resulting when two different images are presented simultaneously to corresponding retinal locations (Wheatstone, 1838). A “race” between the two monocular images leads to continuous perceptual alternations, with successive periods of dominance and suppression. The spontaneous character of these alternations (Levelt, 1965) is the hallmark of rivalry, and a rigorous debate regarding its origin is whether it is based on competition between the two eyes or between the two stimuli (Blake and Logothetis, 2002). The traditional version of this debate has been challenged by compelling evidence supporting that rivalry involves a cascade of neuronal events spanning through multiple levels of the visual hierarchy, thus, favoring an integrative approach (see Sterzer et al., 2009; Blake and Wilson, 2011). However, the interplay between low-level and high-level visual processing as well as their exact role in the instigation of rivalry remains to be clarified. Several attempts to shed light on this question focus on adaptation as a crucial variable (Alais et al., 2010; Kang and Blake, 2010; Theodoni et al., 2011a).
Adaptation is a ubiquitous property of the visual system (Clifford et al., 2007) and has been used as a powerful tool for dissecting the involvement of high-level, feature-based, and lower-level, eye-based factors in BR (e.g., Blake et al., 2006; van Boxtel et al., 2008; Lin and He, 2009; Bartels and Logothetis, 2010). It has in this way contributed in establishing a multi-level localization of rivalry (Wilson, 2003). Furthermore, adaptation per se could be the driving force behind spontaneous perceptual alternations (e.g., Kang and Blake, 2010). The present review aims to bring together evidence supporting the idea that adaptation is a key constituent process in generating perceptual switches which, together with other important constraints such as noise and voluntary control (Kim et al., 2006; van Ee, 2009; Paffen and Alais, 2011), could lead to a complete and successful explanation of this intriguing perceptual phenomenon.
The potential role of adaptation in triggering perceptual alternations
Local versus global adaptation
An adaptation-based explanation of BR was firstly introduced in early, interocular-competition theories, which assigned a causal role to early, local inhibitory mechanisms (Blake, 1989). Stimulus-competition theories, on the other hand, have treated perceptual alternations as the result of a perceptual inference process, which controls these early mechanisms via feedback from higher visual areas (Logothetis, 1998; Leopold and Logothetis, 1999). Both views thus accept a crucial role for adaptation, making pertinent the need to clarify the stage at which adaptation-based interactions trigger bistability. Could adaptation provide a causal explanation for BR? If so, is it local or global adaptation processes that enjoy a primary role in triggering these perceptual switches? With the term local adaptation, we refer to adaptation occurring at an early, perhaps monocular level that is governed by local, eye-based processes. Global adaptation, on the other hand, refers mostly to feature/pattern-based processing at later processing stages of the visual system, with neurons having more complex receptive field properties (Alais and Blake, 1998). Such a distinction is consistent with the fact that suppression-depth is stronger for complex stimuli (Alais and Melcher, 2007), the former also probably increasing as one ascends the stages of visual hierarchy (Nguyen et al., 2003). Besides, it is well-established that low- and high-level rivalry are differentially affected by attention (Meng and Tong, 2004; van Ee et al., 2005) or noise (van Ee, 2009) and, despite the fact that they might share a common computational mechanism (Klink et al., 2008b), seem to take place at different processing stages (Wilson, 2003).
Given the multi-stage localization of both rivalry and adaptation (Clifford et al., 2007), the crucial question concerns how adaptation mechanisms at different levels interact in order to produce a perceptual change. It has been suggested that perceptual switches arise from adaptation of the currently dominant neuronal representation, the latter being gradually weakened via competing inhibition from the antagonistic representation, eventually succumbing to suppression that leads to a perceptual switch (Kang and Blake, 2010). These interactions are thought to occur at an early processing level (Blake, 1989; Wilson, 2003) even though mutual inhibition may rise from multiple cortical areas as well (Seely and Chow, 2011). Accordingly, initial dominance biases at specific locations of the visual field can be broken down by preceding spatiotopic adaptation, indicating that periods of adaptation can induce local perceptual switches (Carter and Cavanagh, 2007). Furthermore, the longer the duration of adaptation, the later the first breakout from suppression (van Ee, 2011), supporting a model of competing neuronal populations which encode the two alternative stimulus interpretations (Wilson, 2003). The spatial “profile” of this escape from suppression takes place in an inhomogeneous manner, arising from local random differences in adaptation at the monocular processing level, which might determine local perceptual switches (van Ee, 2011). A similar conclusion about the primacy of local adaptation comes from a study by Alais and Melcher (2007), focusing on the coherence of rivalry alternations (vs. piecemeal rivalry) and the suppression-depth of complex (faces) and simple (gratings) stimuli. They have shown that global adaptation processes are unable to overcome the stochastic character of rivalry alternations in local zones, even when a complex stimulus is rivaling with a simple one. These findings suggest that global processes fail to exert a stabilizing influence in rivalry alternations, thus assigning a primary role to local adaptation processes (but see van Boxtel et al., 2008). Along the same lines, Carlson and He (2004) have also shown that, unless there are local incompatibilities between rivaling stimuli, global differences alone fail to produce rivalry. Results like these do reject a role of global, top-down influences in rivalry, but rather suggest that it is local conflict that counts more. Contrary to these findings, other studies suggest that perceptual differences can induce rivalry despite physical similarities: physically identical random-dot displays, which are phenomenally different due to adaptation, can engage in rivalry (Blake et al., 1998), and BR can be induced by identical stimuli within a different chromatic surround (Andrews and Lotto, 2004). Thus, the emerging question concerns the conditions which are sufficient for bistability to be triggered: could perceptual differences account for the perceptual alternations or is the eye-of-origin information the driving force?
By interrupting rivalry at various time points after a period of dominance, Bartels and Logothetis (2010) have found a strong eye-of-origin stabilizing effect, which declines over time and a higher-level image/perceptual contribution which initially exhibits a small stabilizing effect, but later strongly destabilizes perception. This complementary variation over time underlines the need to include both eye-based and stimulus-based processes in a rivalry account, with the latter influencing eye-based processes possibly via feedback signals. Along these lines, recent findings show a weak modulation of early visual cortex activity by perception (Keliris et al., 2010), which becomes stronger in higher areas, as reported previously (Leopold and Logothetis, 1996). In a similar vein, Stuit et al. (2011) have examined the differential contribution of eye- and stimulus-factors as potential grouping cues in synchronizing the dominance of spatially segregated visual targets (Kovacs et al., 1996). They have shown that, even though both kind of cues act independently to promote grouping, the eye-of-origin information has a superior value as a grouping-cue. Taken together, these findings suggest that both levels of processing should be included in an adaptation-based explanation of BR. Nevertheless, it is rather difficult to separate the contribution of eye-based and feature-based processes in BR even at the earliest stage of the competition since, in addition to eye-of-origin signals, single cells in V1 also contain feature-related information (see Keliris et al., 2010) and top-down processing can influence the very early stage of rivalry onset (Klink et al., 2008a).
Locating causality within the cortex
It is obvious that unraveling the contribution of eye- and stimulus-related factors is still of high importance in rivalry literature (see Paffen and Alais, 2011), with serious implications regarding the neural localization of the triggering point of alternations. Accepting the primacy of image-content information, results in treating alternations as the outcome of inferential processes that concern perceptual interpretations (Sterzer et al., 2009). In this context, perceptual changes could be an expression of the frequent re-evaluation of the interpretations of the sensory input (Leopold and Logothetis, 1999). Rivalry starts because perceptual conflict about meaning is maximized (Sterzer et al., 2009), rather than just because fusion fails (Wolfe, 1986; Blake, 1989). The idea is further supported by imaging data, showing prefrontal and parietal activation during perceptual rivalry transitions (Lumer et al., 1998; Zaretskaya et al., 2010). Activation of right inferior frontal cortex is shown to have a temporal precedence, thus suggesting a causal role to these areas in the initiation of perceptual changes (Sterzer and Kleinschmidt, 2007). Whether these findings also indicate that top-down processes initiate the reorganization of activity in early visual cortex (Sterzer et al., 2009), remains speculative. If so, the adaptation of early, local inhibitory processes could play a role in destabilizing the activity underpinning the currently dominant percept, thus driving higher-order evaluative processes to initiate a perceptual reorganization (Sterzer and Kleinschmidt, 2007). Although such a scenario would give primacy to global adaptation mechanisms, this “trigger hypothesis” has been recently challenged by evidence toward the opposite direction (Knapen et al., 2011). By using more realistic rivalry simulations in the rivalry-mimicking condition,1 no difference was found in frontoparietal activation between endogenous (actual rivalry) and exogenous (replay) transitions, with a difference observed in occipital cortex alone (see figure 3 in Knapen et al., 2011). This pattern of results implies that changes observed in frontoparietal activity may be the consequence of alternations rather than their cause, possibly reflecting changes in sensory experience, attentional state or task demand processes (Knapen et al., 2011). The existence of a broad network involved causally in BR transitions has been challenged by Kamphuisen et al. (2008). Using an MEG frequency tagging method in order to track how stimulus-based activity relates to perception, they have located the source of rivalry alternations mainly in early visual areas of the occipital lobe. In accordance, Pearson et al. (2007) have provided evidence for a causal role of early visual cortex activity in BR: they have found that TMS applied over V1/V2 during conventional and swap/stimulus rivalry can induce perceptual changes in the former but not in the latter case. On the other hand, TMS applied in parietal cortex during BR results in disrupting alternations rhythm by lengthening dominance durations, implying that causality could be located in higher areas as well (Zaretskaya et al., 2010; see also Paffen and Alais, 2011 for a review on the controversial role frontoparietal areas in BR).
The role of attention
Although a possible causal role of frontoparietal cortex in BR alternations has not been yet firmly established, the findings mentioned above suggest a tight link between BR and attention, since there seems to be a major topographical overlap between the cortical networks mediating both phenomena (Paffen and Alais, 2011). The susceptibility of the alternation rate to attentional manipulations has been widely used as an indicator of top-down influences in BR, even though its sensitivity as a measure has been doubted (Meng and Tong, 2004). Alternations are enhanced once attention is driven to the stimuli and attenuated when it is withdrawn (Meng and Tong, 2004; van Ee et al., 2005; Paffen et al., 2006), or even eliminated by the total absence of attention (Zhang et al., 2011—but see Roeber et al., 2011). Attentional changes have been long suggested as the cause of alternations, a core-issue on the rivalry debate since its rediscovery by Wheatstone (1838; see also Blake and Logothetis, 2002). Nevertheless, though well-established, the role of voluntary control in BR is clearly limited (Paffen and Alais, 2011), especially when it comes to rivalry between low-level stimuli which, compared to ambiguous figures and rivalry between more complex stimuli, seems to be less subjective to (endogenous) attention (Meng and Tong, 2004; van Ee et al., 2005) and more prone to noisy variations (see van Ee, 2009).
The dependency of BR on bottom-up factors such as stimulus strength on the one hand and attention on the other, might seem contradictory, but could perhaps be explained by evidence for the apparent but quantifiable effects of attention on low-level stimulus characteristics (van Ee et al., 2005; Paffen et al., 2006; Klink et al., 2008b). Attention seems to boost the effective contrast of the attended stimulus (Carrasco et al., 2004; Chong and Blake, 2006), which in turn affects both the alternation rate and the dominance durations, following Levelt's predictions (see Paffen et al., 2006; Klink et al., 2008b). Within this framework, attention may be the counterpart of adaptation, attenuating the build-up of the latter by preventing its effects from developing fully, similar to the way in which stimulus strength's manipulations directly affect the time course of the recovery from adaptation (Wilson, 2003). In order to directly address the influence of adaptation on perceptual switches, Kang and Blake (2010) have created a novel “on-line” adaptation paradigm by physically removing and later reintroducing the suppressed stimulus, so that brief periods of monocular adaptation are interspersed during the actual duration of BR. They have shown that dominance duration decreases with increasing adaptation duration (see also van Ee, 2011). Furthermore, Blake et al. (2003) have found that when the conflicting stimuli are made to move continuously across the retina, alternations slow down significantly—a result also attributed to the removal of local adaptation. The similarity between attentional and adaptational manipulations, suggests that attention is another important variable that has to be incorporated in models explaining rivalry dynamics. Such an account must also take into consideration new experimental evidence showing that such an effect can also be multimodal (e.g., van Ee et al., 2009; Zhou et al., 2010; Chen et al., 2011). It is perhaps worth mentioning here that perceptual stabilization caused by intermittent presentation of rivalrous stimuli (Leopold et al., 2002), an extreme version of lengthening dominance durations (van Ee et al., 2005), has also been explained as the result of the insufficient development of the inhibitory-adaptational interactions eventually leading to a perceptual transition (Chen and He, 2004)2. However, it has been come evident by psychophysical and computational studies that local adaptation (or its absence) cannot account for stabilization caused by intermittent presentation (see Pearson and Clifford, 2005; Brascamp et al., 2007, 2009; Noest et al., 2007; also Pearson and Brascamp, 2008 for a detailed review).
Can adaptation fully explain perceptual switches? The role of noise
The studies reviewed above make an adaptation-based explanation of rivalry highly plausible. Still, there are predictions of the adaptation-cross inhibition model that a solely adaptation-based account cannot explain (Alais et al., 2010; Kang and Blake, 2010). For example, according to this model, visual sensitivity should change over time in a way that reflects the reciprocal changes in the adaptation level of the antagonistic neuronal populations (Noest et al., 2007). Although earlier studies have failed to support this idea (e.g., Fox and Check, 1972), Alais et al. (2010) have used a novel probe-method to show that sensitivity during both rivalry states is not stable: performance during dominance declines over time, while the opposite is true during suppression. This complementary variation in dominance and suppression sensitivity over time corresponds exactly with the gradual reciprocal changes predicted by the adaptation-inhibition model. This finding, for the first time experimentally confirmed by Alais et al., overcomes what was previously thought an inconsistency between theory and observed data, supporting further the idea of adaptation as a crucial mechanism in BR (Alais et al., 2010).
Another intriguing aspect of BR is the irregular and unpredictable character of dominance shifts observed in other multistable perceptual phenomena as well (Leopold and Logothetis, 1999; Gigante et al., 2009). This random nature of perceptual fluctuations, thought to stem from stochastic variation (Lehky, 1995; Kim et al., 2006), contradicts the fairly regular character of alternations that should be observed if transitions were mediated solely by deterministic processes such as adaptation and cross inhibition (Shpiro et al., 2009; van Ee, 2009; Kang and Blake, 2010). Therefore, noise has also been incorporated in computational models of adaptation-inhibition based perceptual decisions, in order to account for the irregular character of perceptual alternations (e.g., Wilson, 2003, 2007; Noest et al., 2007; Shpiro et al., 2009). It has been suggested that it exerts a crucial, perhaps dominating role (Brascamp et al., 2006; Lankheet, 2006). Stochastic resonance is a noisy-driven process with the virtue of allowing for deterministic (adaptational) influences to take place as well, as in the case of rivalry (van Ee, 2009). It seems to be the key-process emerging from the coupling between adaptation, inhibition, and noise, controlling perceptual transitions (Kim et al., 2006). It has been suggested that it is noise in the adaptation of percept-related neurons, not noise in the cross-inhibition neurons, which accounts for transitions (Kim et al., 2006; van Ee, 2009; Theodoni et al., 2011a). This seems to be also true for the serial correlations observed between successive dominance durations, once experimental impurities are controlled for (van Ee, 2009; see also Kang and Blake, 2010). In support of this notion, recent computational studies of the relative contribution of noisy-driven or adaptation-driven models in generating bistability, have shown that noise is a crucial variable that needs to operate in balance with adaptation in generating perceptual alternations (Shpiro et al., 2009; Theodoni et al., 2011a). Other computational models examining the occurrence of alternations during intermittent viewing, go further and incorporate higher-order variables such as perceptual memory of the type Leopold et al. (2002) have suggested (Wilson, 2007; Brascamp et al., 2009), as well as voluntary control (Klink et al., 2008a). A recent computational model simulating perceptual decisions regarding higher-order, ambiguous visual stimuli (face/hand) has suggested that adaptation-driven transitions due to afterhyperpolarization currents, can explain oscillatory perceptual alternations better than noise-driven transitions, which are due to the probabilistic spike times of neurons (Theodoni et al., 2011b). It becomes clear that none of these factors should be excluded from a complete explanation of rivalry alternations. When it comes to causality, however, adaptation and noise seem to offer the most plausible explanation.