Auditory Time-Interval Perception as Causal Inference on Sound Sources

Sawai, Ken-ichi; Sato, Yoshiyuki; Aihara, Kazuyuki

doi:10.3389/fpsyg.2012.00524

ORIGINAL RESEARCH article

Front. Psychol., 28 November 2012

Sec. Perception Science

Volume 3 - 2012 | https://doi.org/10.3389/fpsyg.2012.00524

Ken-ichi Sawai ¹*
Yoshiyuki Sato ²
Kazuyuki Aihara ¹

1. Institute of Industrial Science, University of Tokyo, Tokyo, Japan
2. Graduate School of Information Systems, University of Electro-Communications, Tokyo, Japan

Abstract

Perception of a temporal pattern on a sub-second time scale is fundamental to conversation, music perception, and other kinds of sound communication. However, its mechanism is not fully understood. A simple example is hearing three successive sounds separated by short time intervals. The following misperception of the latter interval is known: the latter interval is underestimated when the former is a little shorter or much longer than the latter, and overestimated when the former is a little longer or much shorter than the latter. Although this misperception of auditory time intervals for simple stimuli might be a cue to understanding the mechanism of time-interval perception, no existing model explains it comprehensively. Because a previous experiment demonstrated that the illusory perception does not occur for stimulus sounds with different frequencies, it is plausible that the underlying mechanism of time-interval perception involves causal inference on sound sources, with different frequencies providing cues for different causes. We construct a Bayesian observer model of this time-interval perception and introduce into it a probabilistic variable representing the causality of sounds. As prior knowledge, the observer assumes that a single sound source produces periodic and short time intervals, which is consistent with several previous works. We conducted numerical simulations and confirmed that our model can reproduce the misperception of auditory time intervals. A similar phenomenon has also been reported in the visual and tactile modalities, though the time ranges for these are wider. This difference can be interpreted as a difference in time resolution, given that the time resolutions of vision and touch are lower than that of audition, which suggests the existence of a common mechanism for temporal pattern perception across modalities.

1 Introduction

Temporal pattern processing is necessary for all sensory modalities and these patterns contain much essential information for our brain to learn what happens in the external world. Therefore, revealing the temporal perception system is fundamental to understanding the sensory processing system, but it is not fully understood yet.

Hearing three rapid successive sounds is a good situation for investigating the time-perception system. One reason for this is that the temporal accuracy of our auditory system is higher than those for other modalities (Burr et al., 2009; Vroomen and Keetels, 2010; Occelli et al., 2011); that is, auditory experimental results reflect the actual time-perception mechanism better. In addition, a combination of two time intervals is the simplest situation of temporal pattern perception. With regard to hearing three rapid sounds on a hundred-millisecond scale, it is known that our brain sometimes misestimates the second interval depending on the relative length of the two intervals. Concretely speaking, the second interval, T₂, is perceived as shorter than the actual length in the case where T₂ is equal to or a little longer than the first interval, T₁. This perceptual underestimation phenomenon was named “time-shrinking” (Nakajima et al., 1991). This illusion vanishes as the total length T₁ + T₂ increases. In addition, though the degrees of misestimation are not so large as those for the case of the time-shrinking illusion, the following phenomena on the perception of T₂ have also been observed (Miyauchi and Nakajima, 2005; Figure 1A): overestimation of T₂ when T₂ is a little shorter than T₁; underestimation of T₂ when T₂ is much shorter than T₁; and overestimation of T₂ when T₂ is much longer than T₁. The time-shrinking illusion has been examined in other articles as well (Nakajima et al., 1992; ten Hoopen et al., 1993, 2006; Suetomi and Nakajima, 1998; Miyauchi and Nakajima, 2007; Mitsudo et al., 2009). Furthermore, it was reported that this phenomenon occurs in other sensory modalities such as visual (Arao et al., 2000) and tactile (van Erp and Spapé, 2008) senses. This fact suggests that there is a common time-perception system among sensory modalities.

Figure 1

A time-perception model has been proposed to explain the time-shrinking illusion (Nakajima et al., 2004). In this model, it is assumed that the subjective duration of a time-interval is proportional to the sum of the actual length and a constant length. It is also assumed that if the neural system judges the two neighboring intervals as similar, the estimating process for the latter interval is shortened and the latter interval is thus underestimated. By these assumptions, this model can quantitatively mimic the time-shrinking illusion, namely, the underestimation of T₂ caused by a shorter preceding interval T₁. However, the other misestimation phenomena when hearing three successive sounds are out of the scope of this model and cannot be reproduced by the model.

In the present study, we consider that the perceptual phenomena mentioned above result from effective information processing in our neural system. Sensory information, which our brain uses to infer what happens in the world, inevitably has uncertainty caused by both internal noise in our nervous system (Faisal et al., 2008) and ubiquitous fluctuation in the external world. Therefore, our brain must cope with these kinds of uncertainty; otherwise, we may misunderstand the situation or regard the same experiences as different. One reasonable way for the brain to cope with the uncertainty is to exploit prior knowledge, that is, experience and statistics pertaining to the situation. This strategy can be formulated by using Bayesian inference. Bayesian modeling is a powerful method for describing the human perception mechanism and has been applied to visual temporal perception (Miyazaki et al., 2005; Jazayeri and Shadlen, 2010), and more widely to human perception (see Vilares and Körding, 2011, for a recent review).

2 Materials and Methods

To consider the perceptual phenomena of hearing three rapid sounds, we assume a Bayesian observer who tries to solve a common source identification problem for each pair of neighboring sounds. Further, prior to hearing, the observer assumes that sounds from the same source have short and equal intervals. The assumption of prior knowledge of short time intervals for stimuli from the same source is based on previous works, which showed that the closer the two sources, the shorter are the perceived time intervals (Akerboom et al., 1983, for audition; and Goldreich, 2007; Kuroki et al., 2010, for tactile sensation). With respect to the assumption of equal intervals, we can find many examples of signals aligned at almost equal intervals: heartbeats, a swinging pendulum, and so on. This may be because simple dynamical systems tend to generate periodic orbits, such as limit cycles, which are observed as periodic signals.

Here, we propose that the perception of sound intervals involves inference of the causal relationship among sounds. Although there is little direct evidence for this notion, some auditory perceptual phenomena can be associated with some form of causal judgment. For example, the time-shrinking illusion vanishes when the temporal pattern is marked by sounds with quite different frequencies (Remijn et al., 1999). For this case, we consider that sounds with different frequencies have been judged as coming from independent sources. Therefore, the perceptual estimation of the latter time interval is different from that for a sound sequence composed of the same frequency. This view that sound frequency indicates source identity is also supported by an auditory psychological phenomenon (Deutsch, 1975). The perception of a common source is a kind of causal inference and should be important for making an effective inference (Körding et al., 2007; Sato et al., 2007; Shams and Beierholm, 2010). We will discuss this point further in Discussion.

Our Bayesian model assumes that our neural system cannot observe the true time instants t₁, t₂, and t₃ of the sounds, but only the observed times s₁, s₂, and s₃, which include noise (Figure 2A). The index of each variable indicates the order of occurrence in the sound sequence. Our brain then infers the true interval durations T₁ = t₂ − t₁ and T₂ = t₃ − t₂ from the observation. To estimate them, our Bayesian observer composes a conditional probability, called a posterior probability, P(T₁,T₂|s₁,s₂,s₃). Bayes' theorem enables us to represent the posterior probability as

$$P(T_1, T_2 \mid s_1, s_2, s_3) = \frac{P(s_1, s_2, s_3 \mid T_1, T_2)\, P(T_1, T_2)}{P(s_1, s_2, s_3)}. \tag{1}$$

Figure 2

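Since the denominator on the right side can be obtained by integrating the numerator over T₁ and T₂, we need to consider only terms P(s₁,s₂,s₃|T₁,T₂) and P(T₁,T₂) in the numerator. The first term of the numerator represents how the observational values are obtained, and is formulated as

$$\begin{aligned} P(s_1, s_2, s_3 \mid T_1, T_2) &= \int P(s_1, s_2, s_3 \mid T_1, T_2, t_2)\, P(t_2)\, dt_2 \\ &\propto \int P(s_1, s_2, s_3 \mid t_1, t_2, t_3)\, dt_2 \\ &= \int P(s_1 \mid t_1)\, P(s_2 \mid t_2)\, P(s_3 \mid t_3)\, dt_2, \end{aligned} \tag{2}$$

where we assume that distribution P(t₂) is constant; knowing T₁, T₂, and t₂ is equivalent to knowing t₁, t₂, and t₃ in the second line, and the noise distributions for the timings of the three sounds are assumed to be independent from each other in the third line. We set the distribution of the observation noise as a Gaussian distribution with the width σ_o and the center at a true value given as

$$P(s_i \mid t_i) = \frac{1}{\sqrt{2\pi}\,\sigma_o} \exp\left(-\frac{(s_i - t_i)^2}{2\sigma_o^2}\right), \qquad i = 1, 2, 3. \tag{3}$$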

Here, we consider standard deviation σ_o to be constant with time. By substituting equation (3) into equation (2) and integrating over t₂, we obtain the following formula (see Appendix for the details of this derivation):

$$P(s_1, s_2, s_3 \mid T_1, T_2) \propto \exp\left(-\frac{(T_1 - S_1)^2 + (T_1 - S_1)(T_2 - S_2) + (T_2 - S_2)^2}{3\sigma_o^2}\right), \tag{4}$$

where we introduce variables S₁ = s₂ − s₁ and S₂ = s₃ − s₂, which represent the observed interval durations. Note that, given T₁ and T₂, t₁ and t₃ are not independent from t₂ but change with t₂. Therefore, the integral range of t₂ in equation (2) is (−∞, ∞). This term stands for the likelihood of the true intervals. Due to the cross term (T₁ − S₁)(T₂ − S₂), this function has a negative correlation between T₁ and T₂, as shown in Figure 2B.
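As a minimal sketch of this step, the closed-form likelihood can be checked against a direct numerical integration of the product of three Gaussians over t₂. The closed form used below, exp(−[(T₁ − S₁)² + (T₁ − S₁)(T₂ − S₂) + (T₂ − S₂)²] / (3σ_o²)), was rederived from equations (2) and (3) as described in the text and should be read as an assumption; the function names and the integration grid are hypothetical.

```python
import numpy as np

SIGMA_O = 25.0  # ms, observation noise (value taken from Table 1)

def likelihood_closed_form(T1, T2, S1, S2, sigma_o=SIGMA_O):
    """Unnormalized likelihood of the true intervals (assumed closed form)."""
    a = T1 - S1
    b = T2 - S2
    return np.exp(-(a**2 + a * b + b**2) / (3.0 * sigma_o**2))

def likelihood_numeric(T1, T2, s1, s2, s3, sigma_o=SIGMA_O):
    """Integrate P(s1|t1) P(s2|t2) P(s3|t3) over t2 on a fine grid,
    with t1 = t2 - T1 and t3 = t2 + T2."""
    t2 = np.linspace(s2 - 10 * sigma_o, s2 + 10 * sigma_o, 4001)
    dt = t2[1] - t2[0]

    def gauss(s, t):
        return np.exp(-(s - t)**2 / (2 * sigma_o**2)) / (np.sqrt(2 * np.pi) * sigma_o)

    integrand = gauss(s1, t2 - T1) * gauss(s2, t2) * gauss(s3, t2 + T2)
    return integrand.sum() * dt

# Observed onsets (ms): S1 = 100, S2 = 120
s1, s2, s3 = 0.0, 100.0, 220.0
ratio_a = likelihood_numeric(90.0, 130.0, s1, s2, s3) / likelihood_closed_form(90.0, 130.0, 100.0, 120.0)
ratio_b = likelihood_numeric(120.0, 100.0, s1, s2, s3) / likelihood_closed_form(120.0, 100.0, 100.0, 120.0)
```

If the closed form is right, the ratio between the numerical integral and the closed form does not depend on (T₁, T₂); it equals the normalization constant 1/(2√3 π σ_o²) that the proportionality sign absorbs.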

Then, we formulate term P(T₁,T₂) in equation (1). This term does not relate to s_i(i = 1, 2, 3); that is, what our neural system has observed. Thus, this probability function represents knowledge acquired prior to the event. We model the prior knowledge of two neighboring time intervals as follows, assuming that the observer solves a source identification problem. First, our brain infers from the three successive sounds whether each pair of two neighboring sounds comes from the same source. To consider the source identification inference, we introduce variable C that represents which of the three sounds are from the same source. Here, our brain is not considered to make a judgment that the first and third sounds come from the same source while at the same time the second sound comes from another source. Thus, C represents the following four cases:

1. each sound is from an independent source,
2. the first and second sounds come from the same source and the third from another source,
3. the second and third sounds come from the same source and the first from another source,
4. all three sounds are from the same source.

Then, we assign 1, 2, 3, and 4 as the value of C to the above cases, respectively. Using the variable C, we formulate the prior distribution as

$$P(T_1, T_2) = \sum_{j=1}^{4} P(T_1, T_2 \mid C = j)\, P(C = j). \tag{5}$$

We treat the probabilities of C appearing in equation (5) as model parameters, and denote P(C = j)(j = 1, 2, 3, 4) by p_j.

Next, we formulate prior distributions P(T₁,T₂|C) for C = 1, 2, 3, and 4, by using the assumption of equal and short intervals for sounds from the same source. The assumption is formulated as follows:

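For C = 1, there is no bias for the sound intervals. Thus, the prior distribution is a two-dimensional uniform distribution:

$$P(T_1, T_2 \mid C = 1) = \frac{1}{L^2}, \qquad 0 \le T_1, T_2 \le L, \tag{6}$$

where L is a parameter defining the integration range.

For C = 2 and C = 3, the two sounds that come from the same source are expected to have a short interval (Figures 2E,F). Each prior distribution factorizes into a Gaussian-shaped term for the same-source interval and a uniform term for the other interval:

$$P(T_1, T_2 \mid C = 2) = P(T_1 \mid C = 2)\, P(T_2 \mid C = 2), \quad P(T_1 \mid C = 2) \propto \exp\left(-\frac{T_1^2}{2\sigma_p^2}\right), \quad P(T_2 \mid C = 2) = \frac{1}{L}, \tag{7}$$

$$P(T_1, T_2 \mid C = 3) = P(T_1 \mid C = 3)\, P(T_2 \mid C = 3), \quad P(T_1 \mid C = 3) = \frac{1}{L}, \quad P(T_2 \mid C = 3) \propto \exp\left(-\frac{T_2^2}{2\sigma_p^2}\right), \tag{8}$$

where standard deviation σ_p is a parameter that controls the bias toward short intervals. P(T₁|C = 2) gives the distribution of an interval wherein the two marker sounds are from the same source, and P(T₂|C = 2) gives the distribution of an interval wherein the two sounds come from different sources.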
For C = 4, the three markers are expected to have short and equal intervals. This distribution is expressed as a two-dimensional Gaussian distribution, with the center at the origin and a positive correlation between the two variables T₁ and T₂ (Figure 2G); in its expression, equation (9), Z is the normalization term, and σ_q and σ_r are constant parameters. The prior distribution must also satisfy an additional condition, which determines the constants Z, σ_q, and σ_r in equation (9).

New parameters σ_q and σ_r control the shape of the distribution. Since we intend the distribution to have a positive correlation between T₁ and T₂, σ_q should be greater than σ_r.
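As a hedged illustration, one parameterization consistent with this description writes the C = 4 prior as a Gaussian in the sum and the difference of the two intervals. This specific form is an assumption made here for illustration and is not necessarily the original equation (9).

```latex
% Assumed form: sigma_q sets the spread of T_1 + T_2, sigma_r the spread of T_1 - T_2
P(T_1, T_2 \mid C = 4) = \frac{1}{Z}
  \exp\left( -\frac{(T_1 + T_2)^2}{2\sigma_q^2} - \frac{(T_1 - T_2)^2}{2\sigma_r^2} \right),
  \qquad T_1, T_2 \ge 0 .

% Before truncation to non-negative intervals:
\mathrm{Var}(T_1) = \mathrm{Var}(T_2) = \frac{\sigma_q^2 + \sigma_r^2}{4},
\qquad
\mathrm{Cov}(T_1, T_2) = \frac{\sigma_q^2 - \sigma_r^2}{4} .
```

Under this assumed form the correlation is positive exactly when σ_q > σ_r, in line with the requirement stated above. The Table 1 values also satisfy σ_q² + σ_r² = 97.5² + 22.2² ≈ 9999 ≈ 4σ_p² for σ_p = 50 ms, which is what one would obtain if each interval were required to have the same spread σ_p as a single same-source interval; this reading of the condition is likewise an assumption.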

By substituting equations (6)–(9) into equation (5), we have prior distribution P(T₁,T₂). The obtained prior distribution has a large peak at the origin of the T₁-T₂ plane, and also has high values along the T₁ and T₂ axes, and along the 45° line from the T₁-axis (Figure 2C).

Then, we obtain the posterior distribution P(T₁,T₂|s₁,s₂,s₃) by multiplying the likelihood function of equation (4) and the prior distribution (Figure 2D).

3 Result

We conducted a numerical simulation to show the validity of our model. The parameter values used in the simulation are shown in Table 1. Our model has too many parameters for their values to be fitted reliably to experimental data. Thus, we chose and adjusted the parameter values so that their time scales are physically plausible. For example, because the measured time resolution of the auditory system changes with the measurement method, a specific value of the time resolution parameter σ_o cannot be determined. Therefore, we set it so that the time scale is similar to existing psychological results (Grondin and Plourde, 2007, for example). The value of L is chosen so as to cover the time range in which the stimuli are presented.

Table 1

Parameter	Value	Description
σ_o	25 ms	Time resolution of auditory system. The smaller this value is, the smaller is the perceptual shift
σ_p	50 ms	Strength of the bias toward short intervals. The smaller this value is, the stronger is the bias and the larger is the perceptual shift
(σ_q, σ_r)	(97.5 ms, 22.2 ms)	Strength of the bias toward equal intervals. The larger the value of σ_q is, the stronger is the bias and the larger is the perceptual shift for the case that the two intervals are similar
(p₁, p₂, p₃, p₄)	(0.01, 0.01, 0.01, 0.97)	Probability distribution of C. The larger the value of p₁ is, the smaller is the perceptual shift. Increasing p₂ and p₃ results in a larger perceptual shift for the case that the two intervals are dissimilar. Increasing p₄ results in a larger perceptual shift for the case that the two intervals are similar
L	500 ms	Integration range. The shorter this value is, the smaller is the perceptual shift especially for the case that the two intervals are similar

Parameter values in the simulation.

In this simulation, we calculated the expectation value of the marginal distribution of T₂ and regarded the value as a result of the Bayesian observer’s inference. Although there are some other decision-making strategies, such as maximizing the posterior probability, we chose calculating the expectation value because of its low computational cost. However, the simulation result of the maximum a posteriori strategy was not qualitatively different from that of the expectation value. In addition, it is yet to be ascertained which rule should be applied to a Bayesian inference (see Jazayeri and Shadlen, 2010, for this issue).

Using this simulation, our model reproduced the time-shrinking illusion; that is, the large underestimation of T₂ when T₂ is a little longer than T₁, due to the assumption of equal intervals. However, the amount of overestimation when T₂ is a little shorter than T₁ was smaller than the above underestimation. We also observed overestimation and underestimation of T₂ when T₂ is much longer and shorter than T₁, respectively. Moreover, our model simulation showed that the underestimation and overestimation decrease as the total length increases and that there is underestimation of T₂ when T₂ = T₁ (Figure 1B). These properties of our model were also observed in psychological experiments (Figure 1A).
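The simulation described above can be sketched on a grid as follows. This is a minimal sketch under stated assumptions, not the authors' code: the parameter values come from Table 1, but the likelihood closed form and the shapes of the four prior components (uniform, Gaussian-shaped in one interval, and a Gaussian in T₁ + T₂ and T₁ − T₂ for C = 4) are assumed forms chosen to match the verbal description. The function names, the grid step, and the use of noise-free observations (S₁ = T₁, S₂ = T₂) are also hypothetical choices.

```python
import numpy as np

# Table 1 values (ms). The functional forms below are assumptions.
SIGMA_O, SIGMA_P, SIGMA_Q, SIGMA_R, L = 25.0, 50.0, 97.5, 22.2, 500.0
P_C = (0.01, 0.01, 0.01, 0.97)  # (p1, p2, p3, p4)

def _normalize(density, cell):
    """Scale a grid density so that it integrates to 1 over the grid."""
    return density / (density.sum() * cell)

def posterior_on_grid(S1, S2, step=2.0):
    """Posterior P(T1, T2 | S1, S2) on a [0, L] x [0, L] grid (axis 0 is T1)."""
    t = np.arange(0.0, L + step, step)
    T1, T2 = np.meshgrid(t, t, indexing="ij")
    cell = step * step

    # Likelihood with a negative correlation between T1 and T2
    a, b = T1 - S1, T2 - S2
    likelihood = np.exp(-(a**2 + a * b + b**2) / (3.0 * SIGMA_O**2))

    # Prior components for C = 1, 2, 3, 4 (assumed shapes)
    components = [
        np.ones_like(T1),                                   # no bias
        np.exp(-T1**2 / (2 * SIGMA_P**2)),                  # short T1
        np.exp(-T2**2 / (2 * SIGMA_P**2)),                  # short T2
        np.exp(-(T1 + T2)**2 / (2 * SIGMA_Q**2)
               - (T1 - T2)**2 / (2 * SIGMA_R**2)),          # short and equal
    ]
    prior = sum(p * _normalize(c, cell) for p, c in zip(P_C, components))
    return t, _normalize(likelihood * prior, cell)

def estimate_T2(S1, S2, step=2.0):
    """Expectation of the marginal posterior distribution of T2."""
    t, post = posterior_on_grid(S1, S2, step)
    marginal_T2 = post.sum(axis=0) * step  # integrate over T1
    return float((t * marginal_T2).sum() * step)

if __name__ == "__main__":
    for S1, S2 in [(120.0, 160.0), (150.0, 150.0), (160.0, 100.0)]:
        shift = estimate_T2(S1, S2) - S2
        print(f"S1={S1:.0f} ms, S2={S2:.0f} ms, shift of T2 = {shift:+.1f} ms")
```

Under these assumptions the estimate of T₂ falls below the presented value both when T₂ is a little longer than T₁ and when the two intervals are equal. The magnitudes depend on the assumed prior shapes and should not be compared directly with Figure 1B.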

3.1 Explanation of the perception of three rapid sounds

Here, we explain how our model reproduces the behavior of the human auditory system. First, when the two time intervals are similar, the observed time-interval pair lies near the diagonal line on the T₁-T₂ plane. Thus, the perception of the three sounds shifts from the noisy observation toward the prior knowledge for the case in which all three sounds originate from the same source. As a result, the two intervals are perceived as more similar to each other than they were observed to be. That is, T₂ is underestimated if T₂ is a little longer than T₁ (Point A₁ in Figure 3), and T₂ is overestimated if T₂ is a little shorter than T₁ (Point A₂ in Figure 3). In addition, the degree of underestimation is larger than that of overestimation because the peak of the prior distribution is at the origin due to the expectation of short intervals. The expectation of short intervals also causes the underestimation of T₂ when T₂ = T₁.
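A worked example may make this shift concrete. The sketch below considers only the C = 4 case, assumes that its prior is a Gaussian in T₁ + T₂ and T₁ − T₂ with standard deviations σ_q and σ_r (an assumed parameterization), and ignores the restriction to non-negative intervals. Under these assumptions the sum and the difference of the intervals decouple, because the observed sum S₁ + S₂ = s₃ − s₁ has noise variance 2σ_o², the observed difference S₁ − S₂ = 2s₂ − s₁ − s₃ has noise variance 6σ_o², and the two noises are uncorrelated.

```latex
% Gaussian prior times Gaussian likelihood, separately for sum and difference
E[T_1 + T_2 \mid S_1, S_2] = \frac{\sigma_q^2}{\sigma_q^2 + 2\sigma_o^2}\,(S_1 + S_2),
\qquad
E[T_1 - T_2 \mid S_1, S_2] = \frac{\sigma_r^2}{\sigma_r^2 + 6\sigma_o^2}\,(S_1 - S_2).

% Table 1 values: sigma_o = 25, sigma_q = 97.5, sigma_r = 22.2 (ms)
\frac{\sigma_q^2}{\sigma_q^2 + 2\sigma_o^2} \approx 0.884,
\qquad
\frac{\sigma_r^2}{\sigma_r^2 + 6\sigma_o^2} \approx 0.116.

% Example: S_1 = 120 ms, S_2 = 160 ms
E[T_1 + T_2] \approx 247.5, \quad E[T_1 - T_2] \approx -4.6
\;\Rightarrow\; E[T_1] \approx 121.4 \text{ ms}, \quad E[T_2] \approx 126.1 \text{ ms}.
```

The difference between the intervals is shrunk much more strongly than their sum, so the two intervals are drawn toward each other, and the mild shrinkage of the sum adds the overall bias toward short intervals. The numbers apply to this simplified single-component approximation only, not to the full mixture model.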

Figure 3

Next, when the intervals are dissimilar, the time-interval pair is located either near the T₁-axis or the T₂-axis on the T₁-T₂ plane. Therefore, perception is biased toward the T₁-axis or the T₂-axis by prior knowledge when the first two or the latter two sounds come from the same source, respectively. In addition, since the likelihood function has a negative correlation between T₁ and T₂, perception shifts along the negative correlation. Thus, T₂ is perceived as longer than the actual duration if T₂ is longer than T₁, and vice versa (Points B₁ and B₂ in Figure 3, respectively).

In addition, the prior distribution becomes flatter as the distance from the origin and the axes on the T₁-T₂ plane increases. Therefore, the effect of the prior is weak in such areas (Point C in Figure 3).

Discussion

Our model succeeds in replicating the human perception of a simple temporal pattern. This result suggests that our brain unconsciously judges the causality of sounds and expects short and equal intervals in temporal patterns.

In our model, we assumed that the observer inferred the causal relationship among sounds. Although there is little evidence for this assumption, we can propose some experiments that could verify it. For example, we propose an experiment in which subjects hear three rapid sounds and report which of the three sounds come from the same source. The rate of each judgment on source identification can be predicted by calculating P(estimated C|T₁,T₂) from the present model. In addition, this experiment would also provide feedback on the parameter values of (p₁, p₂, p₃, p₄), which are rather arbitrary in this study. By extending our model, we can also predict that the temporal pattern of sounds alters the perception of their spatial locations. Although we modeled the perception of time intervals marked by sounds in this article, we can also model the spatial perception of the sounds in almost the same form of causal inference and easily combine it with the current model. From this combined model, we predict that the same spatial patterns of sounds are perceived as spatially different if the patterns are temporally different. This is because the inference on the causal relationship among sounds is made from their temporal and spatial pattern in this model, and thus varies with temporal difference even if the actual spatial patterns are the same.
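The source-identification prediction mentioned above can be sketched as follows. This is an illustrative sketch under assumptions, not the authors' procedure: it computes the posterior probability of each value of C for a single noise-free observation (S₁, S₂), whereas predicting judgment rates for given true intervals would additionally require averaging over observation noise. The prior shapes, the likelihood closed form, and the function name are assumptions; the parameter values are those of Table 1.

```python
import numpy as np

# Table 1 values (ms). The functional forms below are assumptions.
SIGMA_O, SIGMA_P, SIGMA_Q, SIGMA_R, L = 25.0, 50.0, 97.5, 22.2, 500.0
P_C = np.array([0.01, 0.01, 0.01, 0.97])  # (p1, p2, p3, p4)

def source_posterior(S1, S2, step=2.0):
    """Return [P(C=1|S), P(C=2|S), P(C=3|S), P(C=4|S)] for observed intervals."""
    t = np.arange(0.0, L + step, step)
    T1, T2 = np.meshgrid(t, t, indexing="ij")

    a, b = T1 - S1, T2 - S2
    likelihood = np.exp(-(a**2 + a * b + b**2) / (3.0 * SIGMA_O**2))

    shapes = [
        np.ones_like(T1),                                   # C = 1
        np.exp(-T1**2 / (2 * SIGMA_P**2)),                  # C = 2
        np.exp(-T2**2 / (2 * SIGMA_P**2)),                  # C = 3
        np.exp(-(T1 + T2)**2 / (2 * SIGMA_Q**2)
               - (T1 - T2)**2 / (2 * SIGMA_R**2)),          # C = 4
    ]
    # Evidence of each case: likelihood averaged over the normalized prior
    evidence = np.array([(likelihood * s).sum() / s.sum() for s in shapes])
    weights = P_C * evidence
    return weights / weights.sum()

if __name__ == "__main__":
    print(source_posterior(150.0, 150.0))  # similar intervals
    print(source_posterior(40.0, 300.0))   # short first interval, long second
```

With these assumed forms, similar intervals favor the single-source case C = 4, while a short first interval followed by a long second one favors C = 2, in which only the first two sounds share a source.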

Our model has several parameters, and there exists some arbitrariness in their setting. For instance, even if we change the value of L from that in Table 1 to another value, we can reproduce a result similar to Figure 1B by adjusting parameters (p₁, …, p₄). In this article, we choose quite a high value for p₄ relative to the other three parameters. Although we assumed that inference was made based on observed time of sounds, in reality, we observe other features of sounds such as direction, pitch, color, volume, and so on, and all of these provide cues for the causal relationship among sounds. In the experiment we reproduced, all of these other features were kept the same for the series of three sounds, which strongly suggests that the sounds had come from the same source. We interpret (p₁, …, p₄) as including the cues from those other features. Thus, it might be natural to assume that p₄, which is the probability of all of the sounds coming from the same source, is considerably higher than the other possibilities. This suggests that time-interval perception depends on other sound features and, if presented with visual stimuli, also depends on visual features such as color, size, or location. In fact, it was confirmed that the result of time-interval perception differs according to the combination of stimulus pitches (Remijn et al., 1999).

Our model could be improved by trying to replicate the experimental facts about the perception of T₁. It was reported that the direction of the perceptual shift of T₁ follows the same pattern as that of T₂; that is, T₁ is underestimated when T₁ is a little longer or much shorter than T₂, and T₁ is overestimated when T₁ is a little shorter or much longer than T₂ (Miyauchi and Nakajima, 2005). This qualitative property of T₁ perception can be predicted by our model. However, in that experiment, the magnitude of each perceptual shift of T₁ was found to be less than that of T₂. Since the present model is symmetric between T₁ and T₂, it cannot mimic the difference between the perception of T₁ and that of T₂. Future work will consider how to refine the present model to reproduce experimental results on the perception of T₁.

In auditory science, a central issue is how a single sound stream is discerned within a mixture of multiple sounds. This ability of the auditory system is called “auditory scene analysis” or “auditory scene segregation” (Bregman, 1990), and is regarded as an important key to understanding the auditory system. Because this sound-separating mechanism should involve perceptual source identification, our model may contribute to the study of sound segregation from the temporal aspect.

Finally, let us consider the time-perception mechanisms for other sensory modalities. From the psychological experiments on the visual (Arao et al., 2000) and tactile (van Erp and Spapé, 2008) time-shrinking illusions, it is known that time ranges for these modalities are broader than those for audition. The underlying reason can be understood by using the present model as follows, given that the visual and tactile time resolutions are lower than the auditory one. The perceptual bias of our model becomes weaker in a longer time scale. However, for a low-temporal-resolution modality, the perceptual bias is still relatively strong, because the observation has much uncertainty. Thus, the illusion occurs in a wider range. Though we can give a possible explanation for the difference among the modalities, the time-perception mechanisms in the sub-second scale for the other sensory modalities have not been well studied. Therefore, more research is needed before concluding that a time-perception system is shared by all sensory modalities.

Statements

Acknowledgments

The authors would like to thank Ryota Miyauchi and Yoshitaka Nakajima for kindly providing the high-quality image used for Figure 1A. This research is supported by the Aihara Innovative Mathematical Modelling Project, the Japan Society for the Promotion of Science (JSPS) through the “Funding Program for World-Leading Innovative R&D on Science and Technology (FIRST Program),” initiated by the Council for Science and Technology Policy (CSTP), and by Grant-in-Aid for Young Scientists (B) (23700309) from the Japan Society for the Promotion of Science.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Akerboom, S., ten Hoopen, G., Olierook, P., and van der Schaaf, T. (1983). Auditory spatial alternation transforms auditory time. J. Exp. Psychol. Hum. Percept. Perform. 9, 882–897. doi:10.1037/0096-1523.9.6.882
Arao, H., Suetomi, D., and Nakajima, Y. (2000). Does time-shrinking take place in visual temporal patterns? Perception 29, 819–830. doi:10.1068/p2907rvw
Bregman, A. S. (1990). Auditory Scene Analysis: The Perceptual Organization of Sound. Boston: The MIT Press.
Burr, D., Silva, O., Cicchini, G. M., Banks, M. S., and Morrone, M. C. (2009). Temporal mechanisms of multimodal binding. Proc. R. Soc. Lond. B Biol. Sci. 276, 1761–1769. doi:10.1098/rspb.2008.1899
Deutsch, D. (1975). Two-channel listening to musical scales. J. Acoust. Soc. Am. 57, 1156–1160. doi:10.1121/1.380573
Faisal, A. A., Selen, L. P. J., and Wolpert, D. M. (2008). Noise in the nervous system. Nat. Rev. Neurosci. 9, 292–303. doi:10.1038/nrn2258
Goldreich, D. (2007). A Bayesian perceptual model replicates the cutaneous rabbit and other tactile spatiotemporal illusions. PLoS ONE 2, e333. doi:10.1371/journal.pone.0000333
Grondin, S., and Plourde, M. (2007). Discrimination of time intervals presented in sequences: spatial effects with multiple auditory sources. Hum. Mov. Sci. 26, 702–716. doi:10.1016/j.humov.2007.07.009
Jazayeri, M., and Shadlen, M. N. (2010). Temporal context calibrates interval timing. Nat. Neurosci. 13, 1020–1028. doi:10.1038/nn.2590
Körding, K. P., Beierholm, U., Ma, W. J., Quartz, S., Tenenbaum, J. B., and Shams, L. (2007). Causal inference in multisensory perception. PLoS ONE 2, e943. doi:10.1371/journal.pone.0000943
Kuroki, S., Watanabe, J., Kawakami, N., Tachi, S., and Nishida, S. (2010). Somatotopic dominance in tactile temporal processing. Exp. Brain Res. 203, 51–62. doi:10.1007/s00221-010-2212-8
Mitsudo, T., Nakajima, Y., Remijn, G. B., Takeichi, H., Goto, Y., and Tobimatsu, S. (2009). Electrophysiological evidence of auditory temporal perception related to the assimilation between two neighboring time intervals. Neuroquantology 7, 114–127.
Miyauchi, R., and Nakajima, Y. (2005). Bilateral assimilation of two neighboring empty time intervals. Music Percept. 22, 411–424. doi:10.1525/mp.2005.22.3.411
Miyauchi, R., and Nakajima, Y. (2007). The category of 1:1 ratio caused by assimilation of two neighboring empty time intervals. Hum. Mov. Sci. 26, 717–727. doi:10.1016/j.humov.2007.07.008
Miyazaki, M., Nozaki, D., and Nakajima, Y. (2005). Testing Bayesian models of human coincidence timing. J. Neurophysiol. 94, 395–399. doi:10.1152/jn.01168.2004
Nakajima, Y., ten Hoopen, G., Hilkhuysen, G., and Sasaki, T. (1992). Time-shrinking: a discontinuity in the perception of auditory temporal patterns. Percept. Psychophys. 51, 504–507. doi:10.3758/BF03211646
Nakajima, Y., ten Hoopen, G., Sasaki, T., Yamamoto, K., Kadota, M., Simons, M., et al. (2004). Time-shrinking: the process of unilateral temporal assimilation. Perception 33, 1061–1079. doi:10.1068/p5061
Nakajima, Y., ten Hoopen, G., and van der Wilk, R. (1991). A new illusion of time perception. Music Percept. 8, 431–448. doi:10.2307/40285504
Occelli, V., Spence, C., and Zampini, M. (2011). Audiotactile interactions in temporal perception. Psychon. Bull. Rev. 18, 429–454. doi:10.3758/s13423-011-0070-4
Remijn, G., van der Meulen, G., ten Hoopen, G., Nakajima, Y., Komori, Y., and Sasaki, T. (1999). On the robustness of time-shrinking. J. Acoust. Soc. Jpn. E 20, 365–373. doi:10.1250/ast.20.365
Sato, Y., Toyoizumi, T., and Aihara, K. (2007). Bayesian inference explains perception of unity and ventriloquism aftereffect: identification of common sources of audiovisual stimuli. Neural Comput. 19, 3335–3355. doi:10.1162/neco.2007.19.12.3335
Shams, L., and Beierholm, U. R. (2010). Causal inference in perception. Trends Cogn. Sci. (Regul. Ed.) 14, 425–432. doi:10.1016/j.tics.2010.07.001
Suetomi, D., and Nakajima, Y. (1998). How stable is time-shrinking? J. Music Percept. Cogn. 4, 19–25.
ten Hoopen, G., Hilkhuysen, G., Vis, G., Nakajima, Y., Yamaguchi, F., and Sasaki, T. (1993). A new illusion of time perception – II. Music Percept. 11, 15–38. doi:10.2307/40285597
ten Hoopen, G., Sasaki, T., Nakajima, Y., Remijn, G., Massier, B., Rhebergen, K. S., et al. (2006). Time-shrinking and categorical temporal ratio perception: evidence for a 1:1 temporal category. Music Percept. 24, 1–22. doi:10.1525/mp.2006.24.1.C1
van Erp, J. B. F., and Spapé, M. M. A. (2008). “Time-shrinking and the design of tactons,” in Proceeding EuroHaptics ’08 Proceedings of the 6th International Conference on Haptics: Perception, Devices and Scenarios, 289–294.
Vilares, I., and Körding, K. P. (2011). Bayesian models: the structure of the world, uncertainty, behavior, and the brain. Ann. N. Y. Acad. Sci. 1224, 22–39. doi:10.1111/j.1749-6632.2011.05965.x
Vroomen, J., and Keetels, M. (2010). Perception of intersensory synchrony: a tutorial review. Atten. Percept. Psychophys. 72, 871–884. doi:10.3758/APP.72.4.871

Appendix

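In this Appendix, we derive equation (4) from equations (2) and (3). First, by substituting equation (3) into equation (2), and using t₁ = t₂ − T₁ and t₃ = t₂ + T₂, the likelihood function is written as

$$P(s_1, s_2, s_3 \mid T_1, T_2) \propto \int_{-\infty}^{\infty} \exp\left(-\frac{(s_1 - t_2 + T_1)^2 + (s_2 - t_2)^2 + (s_3 - t_2 - T_2)^2}{2\sigma_o^2}\right) dt_2 .$$

Then, by introducing variables S₁ = s₂ − s₁ and S₂ = s₃ − s₂, and substituting u for t₂ − s₂, we obtain equation (4) as follows:

$$\begin{aligned} P(s_1, s_2, s_3 \mid T_1, T_2) &\propto \int_{-\infty}^{\infty} \exp\left(-\frac{\left(u - (T_1 - S_1)\right)^2 + u^2 + \left(u + (T_2 - S_2)\right)^2}{2\sigma_o^2}\right) du \\ &= \int_{-\infty}^{\infty} \exp\left(-\frac{3\left(u - \frac{(T_1 - S_1) - (T_2 - S_2)}{3}\right)^2}{2\sigma_o^2} - \frac{(T_1 - S_1)^2 + (T_1 - S_1)(T_2 - S_2) + (T_2 - S_2)^2}{3\sigma_o^2}\right) du \\ &\propto \exp\left(-\frac{(T_1 - S_1)^2 + (T_1 - S_1)(T_2 - S_2) + (T_2 - S_2)^2}{3\sigma_o^2}\right). \end{aligned}$$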

Summary

Keywords

time-interval perception, Bayesian inference, source identification, causal inference

Citation

Sawai K, Sato Y and Aihara K (2012) Auditory Time-Interval Perception as Causal Inference on Sound Sources. Front. Psychology 3:524. doi: 10.3389/fpsyg.2012.00524

Received

31 July 2012

Accepted

06 November 2012

Published

28 November 2012

Volume

3 - 2012

Edited by

Hirokazu Tanaka, Japan Advanced Institute of Science and Technology, Japan

Reviewed by

Hirokazu Tanaka, Japan Advanced Institute of Science and Technology, Japan; Hiroyuki Kambara, Tokyo Institute of Technology, Japan

This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.

*Correspondence: Ken-ichi Sawai, Institute of Industrial Science, The University of Tokyo, 4–6–1 Komaba, Meguro-ku, Tokyo 153-8505, Japan. e-mail: ken1@sat.t.u-tokyo.ac.jp

This article was submitted to Frontiers in Perception Science, a specialty of Frontiers in Psychology.

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
