- 1Graduate Institute of Musicology, National Taiwan University, Taipei, Taiwan
- 2Graduate Institute of Brain and Mind Sciences, National Taiwan University, Taipei, Taiwan
Objective: This systematic review aims to explore how individual and group musical activities influence social bonding and emotion regulation through the oxytocinergic system.
Methods: Following the PRISMA 2020 guidelines, a systematic search of PubMed, Embase, Scopus, Web of Science Core Collection, and PsycInfo was conducted to identify studies up to October 2024, supplemented by a manual search. One reviewer screened studies, extracted data, and assessed the study quality. Framework synthesis and narrative synthesis were conducted to integrate findings.
Results: A total of 1,865 records were identified. After reviewing the full-text papers, 20 studies (seven randomized controlled trials and 13 quasi-experiments) were included, which involved 877 participants across healthy and clinical populations. The reviewed interventions included singing, playing instruments, listening to music, and music therapy. Most studies reported improvements in psychosocial outcomes, such as reduced anxiety and depression or enhanced social cognition, but they do not always align with peripheral oxytocin (OXT) changes. However, certain psychosocial outcomes or contexts revealed relatively consistent patterns in OXT responses, suggesting the presence of context-dependent modulation. Short-term interventions often reported detectable peripheral OXT changes, which only partially reflected the temporary activity of magnocellular OXT neurons in the hypothalamus. No significant changes in baseline peripheral OXT levels were observed after long-term interventions.
Conclusion: Music-induced OXT responses are context-dependent. The bidirectional modulation of OXT supports social bonding and emotion regulation in musical contexts. Clinicians and music therapists should carefully consider therapeutic goals, individual differences, and environmental factors when designing music therapy.
1 Introduction
There is growing interest in music therapy, which has shown promise in promoting mental health and wellbeing (Bernatzky et al., 2011; Bhandarkar et al., 2024). Recent studies have begun to explore the underlying neural mechanisms of music's effects, with a particular focus on oxytocin (OXT), a crucial neuropeptide for social bonding and emotion regulation (Olff et al., 2013).
OXT is synthesized by the paraventricular nucleus (PVN) and supraoptic nucleus (SON) of the hypothalamus and released into the central nervous system (CNS) and endocrine system to exert its effects (Meyer-Lindenberg et al., 2011). In the endocrine system, OXT is secreted by the posterior pituitary into the bloodstream to support reproductive processes like uterine contraction and lactation (Grewen et al., 2010; Maud et al., 2018). In the CNS, OXT is released from dendritic or axonal terminals to specific brain regions to modulate neurotransmitter release, synaptic plasticity, and neural network activity (Chini et al., 2017; Jurek and Neumann, 2018). For example, OXT promotes the release of γ-aminobutyric acid (GABA), which reduces neuronal excitability and produces anxiolytic effects (Sobota et al., 2015; Yuki, 2023). OXT engages in intracellular signaling to regulate gene expression (Jurek and Neumann, 2018; Uvnas-Moberg, 2024) and synaptic plasticity in the amygdala (Gur et al., 2014), hippocampus (Lin and Hsu, 2018), and prefrontal cortex (Ninan, 2011) to support long-term adaptations in emotional responses and social behaviors (Marlin and Froemke, 2017). OXT can suppress the expression of corticotropin-releasing hormone (CRH) and reduce downstream cortisol secretion to moderate stress responses (Bowling, 2023; Harvey, 2020; Jurek and Neumann, 2018).
OXT also plays a role in social cognition (Harvey, 2020). It enhances an individual's ability to recognize, process, and respond to social cues. Notably, these effects are influenced by contextual factors (e.g., safe or threatening environments), and individual factors, including sex, hormonal status, gene variations, attachment style, history of childhood trauma, and the presence of psychiatric disorders, influence the OXTergic system responsiveness (Olff et al., 2013). These findings suggest that the role of OXT is to enhance sensitivity to environmental changes for appropriate behavioral selection (Steinman et al., 2019).
It is unclear how music modulates the OXTergic system to influence emotion regulation and social bonding. Although empirical studies have suggested that music can lead to peripheral OXT changes, the findings are inconsistent. Some studies reported OXT increases after musical activities (Grape et al., 2003; Nilsson, 2009), while others observed no significant change (Palumbo et al., 2022) or even decreases (Eerola et al., 2021; Fancourt et al., 2016; Schladt et al., 2017). The discrepancies may arise from differences in participant characteristics, types of music-based intervention, study designs, OXT measurements, and contextual factors (Engel et al., 2019; MacLean et al., 2019; Tabak et al., 2023). Besides, most studies have been conducted in laboratory or clinical settings with limited ecological validity, which raises questions about whether music-induced OXT changes observed under controlled conditions can be generalized to real-life contexts, especially collective musical rituals.
To address the gaps, this systematic review synthesizes current evidence on how music modulates the OXTergic system in humans. By considering the contextual factors, population characteristics, and intervention features, the review refines the theoretical model of music-induced OXT response to offer guidance to optimize music-based interventions for populations with social impairments. For a rapidly developing field, it will provide a reliable foundation for future research and clinical applications.
The objective of the current review is to explore how individual and group musical activities influence mental health through the OXTergic system in terms of social bonding and emotion regulation. We will address the following questions:
(1) How do OXT levels change in different populations in music-based interventions?
(2) Which characteristics of music-based interventions can be associated with significant changes in OXT levels and psychosocial outcomes?
(3) What are the strengths and limitations of OXT measurements in music-based interventions?
2 Methods
2.1 Search strategy
The systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guidelines. Literature searches were conducted on November 23, 2024, in five databases: PubMed, Embase, Scopus, Web of Science Core Collection, and PsycInfo. Search terms combined keywords related to music, ritual, OXT, emotion regulation, and social bonding. A detailed search strategy is presented in the Supplementary material S1.
2.2 Eligibility criteria
Studies were included if they met the following criteria: (1) original, peer-reviewed, full-text articles published in English up to October 2024; (2) study designs were randomized controlled trials (RCTs), quasi-experimental designs, or observational studies; (3) populations including healthy individuals, clinical populations, or both; (4) interventions involving music therapy or musical activities (e.g., music listening, singing, or playing instruments) delivered individually or in groups; and (5) outcomes reporting OXT measurements along with at least one psychosocial, clinical, or subjective outcome. Exclusion criteria included: (1) non-human studies; (2) reviews or editorials without original data; (3) studies focusing solely on pharmacotherapy (unless compared to music-based interventions); (4) conference abstracts or book chapters; and (5) non-peer-reviewed papers. There were no restrictions regarding participant age, sample size, geographical location, or comparator.
2.3 Study selection
All identified records were imported into reference management software, and duplicates were removed. Titles and abstracts were screened independently by the first author. Full texts of potentially relevant articles were then assessed for eligibility. Uncertainties were resolved through discussion with the second author.
2.4 Quality assessment
The quality of included studies was assessed using the Joanna Briggs Institute (JBI) critical appraisal tools for RCTs and for quasi-experimental studies. The tools evaluate the validity of study design, methodology, statistical conclusions, and risk of bias (Barker et al., 2024, 2023). The quality assessment was conducted by the first author and verified by the second author.
2.5 Data extraction, analysis, and synthesis
From the full texts of selected articles and their supplementary material, one reviewer extracted the data, including study characteristics, participant characteristics, intervention details, outcome measures, results, and statistical analyses.
Quantitative OXT changes were transformed into qualitative categories: increase, decrease, or no significant change. Psychosocial outcomes were grouped into broader domains: anxiety, depression, empathy, social bonding, trust, stress and relaxation, as well as emotion and mood. Contextual descriptions were extracted and categorized into four dimensions: (1) nature of the activity (e.g., passive listening, active music-making, or music therapy); (2) stress signals (e.g., levels of physical or mental stress); (3) social cues (e.g., presence of others); and (4) familiarity (e.g., familiarity with the music, participants, or environments).
The reviewer conducted a framework synthesis and a narrative synthesis. The framework synthesis was based on the model proposed by Bowling (2023), following a five-stage process (Brunton et al., 2020): familiarization, framework selection, indexing, charting, and mapping/interpretation. The narrative synthesis was used to summarize patterns within and across studies, explore relationships, and assess the robustness of findings in terms of methodological quality (Impellizzeri and Bizzini, 2012). Finally, a convergent design (Hong et al., 2017) was used to integrate quantitative and qualitative data.
3 Results
3.1 Study selection and study characteristics
A PRISMA flow diagram is presented in Figure 1, showing study identification and inclusion of eligible studies. A total of 1,865 records were identified, of which 1,860 were from electronic databases, and five were from other sources. After removing duplicates, 902 records were screened based on titles and abstracts. Thirty-one full-text articles were assessed for eligibility, and 11 studies were excluded (five duplicates, two conference abstracts, two with wrong outcomes, and two without OXT data). Finally, 20 studies (seven RCTs and 13 quasi-experiments) were included in the systematic review.
Table 1 summarizes the characteristics of the included studies, which were published between 2003 and 2023 across 11 countries. The majority of the studies were conducted in Europe (n = 12, 60%), followed by North America (n = 5, 25%), and Asia (n = 3, 15%). All studies comprised 877 participants. Sample sizes ranged from 4 to 193 participants (median = 25.5).
The reviewed music-based interventions encompassed singing, playing instruments, listening to music, and music therapy. Most studies used pieces of classical music as materials. Only two studies adopted quasi-ritualistic music-based interventions grounded in culture: one study from Japan used group taiko drumming (Yuhi et al., 2017), and one study from Indonesia examined a school-based intervention program combining Indonesian traditional music (Maramis et al., 2021).
One reviewer used the JBI critical appraisal tools to assess study quality. Among the studies, 17 were rated as low risk of bias, two as moderate risk, and one as high risk. Tables 2, 3 present the detailed results of the quality assessment. No studies were excluded based on quality alone.
3.2 Population characteristics and OXT responses
The included studies encompassed diverse populations, with a primary focus on healthy adults. Other studies examined clinical populations, including cancer patients and their carers Fancourt et al., (2016), pregnant women Wulff et al., (2021), preterm infants Filippa et al., (2021) and their mothers Filippa et al., (2023), individuals with Williams syndrome Dai et al., (2012), post-surgical patients (Nilsson, 2009), post-stroke patients (Palumbo et al., 2022), children with autism spectrum disorder (ASD; Corbett et al., 2011), and maltreated children (Yuhi et al., 2017). Two studies (Grape et al., 2003; Kreutz, 2014) recruited mixed samples that varied in age, sex, health status, and singing experiences, as detailed in Table 1.
Participant ages ranged from 34.8 gestational weeks (preterm infants) to 72.8 years (healthy older adults). Most studies had a predominance of female participants. Baseline peripheral OXT levels showed notable differences between populations. In typical control individuals, the patterns of plasma OXT response to music demonstrated low inter-individual variability (Dai et al., 2012), although they had different baseline OXT levels. By contrast, patients with Williams syndrome exhibited higher baseline plasma OXT levels and greater variability in response to music Dai et al., (2012). In pregnant and postpartum mothers (Filippa et al., 2023; Wulff et al., 2021), baseline peripheral OXT levels were low, with small yet statistically significant changes after musical activities. The patterns of OXT response to music were not influenced by sex (Bowling et al., 2022; Filippa et al., 2021). Other covariates the phase of the menstrual cycle, the use of hormonal contraception, and the musical sophistication index did not change the patterns (Eerola et al., 2021).
Tables 4, 5 summarize the peripheral OXT changes induced by music across populations and contexts (also see Supplementary material S2). These results revealed the context-dependence of OXT responses to music. After group singing, OXT levels decreased in healthy young adults Bowling et al., (2022); Keeler et al., (2015); Schladt et al., (2017) but increased in healthy older adults Good and Russo, (2022). OXT reductions were observed in cancer patients and their carers after choral singing Fancourt et al., (2016). For preterm infants Filippa et al., (2021) and their mothers Filippa et al., (2023), as well as pregnant women Wulff et al., (2021), maternal singing to infants or fetuses increased their OXT levels. Listening to slow-tempo music generally triggered OXT release in several populations, including healthy males Ooishi et al., (2017), pregnant women with trait anxiety Wulff et al., (2021), and post-surgical patients Nilsson, (2009). Interestingly, listening to sad music led to significant OXT decreases in healthy females with high empathy but not in healthy females with low empathy Eerola et al., (2021). However, in children with ASD Corbett et al., (2011) and post-stroke patients Palumbo et al., (2022), no significant changes in peripheral OXT levels were observed after long-term group music therapy.
3.3 Psychosocial outcomes and OXT changes
Across the included studies, associations between psychosocial outcomes and peripheral OXT changes varied by population, context, and measurement time point. Table 6 summarizes the relationships between psychosocial outcomes, OXT changes, and context characteristics.
Long-term intervention programs did not lead to significant changes in peripheral OXT levels, despite the improvements in psychosocial outcomes. For instance, after participating in interactive improvised group music-making therapy for 6 weeks, post-stroke patients showed improvements in depression and motor functions without significant OXT changes Palumbo et al., (2022). Similarly, a 3-month structured musical theater therapy improved the ASD children's skills of theory of mind and memory for faces, but no significant plasma OXT changes were observed Corbett et al., (2011).
Short-term music-based interventions, such as a single session of singing, often resulted in improvements in anxiety, though not always accompanied by changes in peripheral OXT levels. Maternal singing led to anxiety reduction and OXT increases in mothers of preterm infants Filippa et al., (2023), whereas in other contexts (e.g., healthy adults engaging in choir singing), anxiety reductions occurred accompanied by OXT decreases Schladt et al., (2017). Additionally, unpleasant music stimuli under stress conditions both increased anxiety and plasma OXT levels in healthy males Jezova et al., (2013).
Empathy-related outcomes unveiled noteworthy patterns. For high-empathy healthy females, listening to unfamiliar sad instrumental music elicited stronger self-reported emotional responses, accompanied by significant decreases in plasma OXT levels. In contrast, low-empathy healthy females reported smaller emotional improvements but showed modest OXT increases Eerola et al., (2021).
In an RCT, Riedl et al. (2017) examined whether music listening in a trust game influenced the trust behavior and perceived trustworthiness. During the trust game, no significant changes in OXT levels were observed in the participants in the music conditions, but they tended to show greater trust behavior and rated faces and nicknames as more trustworthy compared to those in the no-music conditions, although the trends did not reach statistical significance.
All studies on choral singing reported that participants had experienced social bonding, and OXT levels declined in most cases Bowling et al., (2022); Fancourt et al., (2016); Keeler et al., (2015), except in improvised choral singing, where OXT levels increased Keeler et al., (2015).
Last, participants frequently reported subjective improvements in relaxation and mood scales after musical activities across studies. However, these changes were not always accompanied by increases in peripheral OXT levels Bowling et al., (2022); Eerola et al., (2021); Fancourt et al., (2016); Good and Russo, (2022); Grape et al., (2003); Kreutz, (2014); Maramis et al., (2021); Nilsson, (2009); Wulff et al., (2021).
Overall, improvements in psychosocial outcomes do not correspond with changes in peripheral OXT levels, and consistent OXT response patterns were observed only for specific psychosocial outcomes.
3.4 Effects of contextual factors on patterns of OXT responses and psychosocial outcomes
3.4.1 Patterns of OXT responses
Analyses of OXT responses across studies consistently revealed interaction effects, which indicated that patterns of OXT change were context-dependent (see Table 7). Kreutz (2014) demonstrated a significant time × condition interaction for salivary OXT responses (F(1,21) = 7.988, p < 0.05). The dependence on activity was further supported by several studies: Filippa et al. (2021) reported a significant time × condition interaction (χ2(2) = 6.99, p = 0.03), as did Good and Russo (2022) (z = −2.142, p = 0.032). Schladt et al. (2017) observed a significant time × context interaction (F(4) = 7.27, p < 0.001), and Bowling et al. (2022) found a significant time × vocal mode interaction (χ2(1) = 5.7, p = 0.018). Evidence for OXT's sensitivity to stress signals was provided by Ooishi et al. (2017), who observed a significant time × tempo interaction (F(1,22) = 13.44, p = 0.0014). However, Bowling et al. (2022) found no significant time × social context interaction (p = 0.230) or time × vocal mode × social context interaction (p = 0.190), indicating that OXT responses were not significantly influenced by the explicitly manipulated social setting in their study.
3.4.2 Patterns of psychosocial outcomes
Psychosocial outcomes exhibited varying degrees of context-dependence (see Table 7). Kreutz (2014) found a significant time × condition interaction for positive feelings (F(1,20) = 9.655, p < 0.01). Schladt et al. (2017) observed significant time × context interactions for excitement (F(1) = 6.51, p = 0.015) and happiness (F(1) = 5.27, p = 0.028), but not for worry or sadness. Jezova et al. (2013) examined the effects of pleasant vs. unpleasant music and found a significant time × condition interaction for state anxiety (F(1, 26) = 5.10, p < 0.05). In Bowling et al. (2022), the social bonding measure (IOS scale) showed a significant time × vocal mode interaction (χ2(1) = 7.06, p = 0.008). In contrast, the subjective emotion measure (PANAS) in the same study exhibited a significant three-way interaction among time, vocal mode, and social context (χ2(1) = 10.54, p = 0.001). The findings indicate that changes in psychosocial outcomes, particularly positive affect measures, were significantly influenced by activity and social cues.
3.5 The characteristics of OXT measurements
All the included studies used peripheral OXT measurements as biomarkers, and they varied in collection protocols, sample types, and quantification methods, as summarized in Table 8. Twelve studies provided details on the time of sample collection. Of these, nine studies scheduled collection during the afternoon or evening hours to avoid interference from the cortisol awakening response (CAR). The remaining studies adopted different protocols: one study collected samples across many time points throughout the day (morning, afternoon, and evening), and another between 9:00 and 13:00. Notably, Maramis et al. (2021) was the sole study that conducted sample collection during the CAR period in the early morning.
In short-term interventions, samples were often collected immediately before and after a single session to measure acute OXT changes. Six studies collected samples at three or more time points (Dai et al., 2012; Eerola et al., 2021; Jezova et al., 2013; Nilsson, 2009; Riedl et al., 2017; Schladt et al., 2017), whereas the others relied on pre-test and post-test measurements. Long-term interventions assessed baseline OXT levels before and after the entire program, which lasted from several weeks to months. However, long-term interventions generally showed no significant changes in baseline OXT levels.
Sample types included plasma (n = 5), serum (n = 4), and saliva (n = 11). Generally, plasma and serum samples showed higher OXT concentrations (ranging from tens to hundreds of pg/ml) compared to saliva samples (approximately 0–20 pg/ml). In terms of quantification methods, 12 studies used enzyme immunoassay (EIA), of which four applied sample extraction procedures prior to EIA. Five studies used radioimmunoassay (RIA), one study used electrochemiluminescence assay, and two used fluorescence immunoassay. Studies that omitted sample extraction before EIA tended to report higher OXT concentrations (Dai et al., 2012; Grape et al., 2003; Keeler et al., 2015; Maramis et al., 2021; Palumbo et al., 2022; Yuhi et al., 2017) than those that included extraction steps (Bowling et al., 2022; Eerola et al., 2021; Nilsson, 2009; Ooishi et al., 2017).
Therefore, methodological differences in OXT measurements may contribute to inconsistencies in OXT levels reported across studies.
4 Discussion
4.1 The bidirectional modulation of OXT
4.1.1 Context-dependence of OXT responses
High-quality studies have shown that music can lead to both increases and decreases in peripheral OXT levels. Overall, clinical condition appears to be the most influential factor in modulating OXT release. For instance, individuals with Williams syndrome exhibit exaggerated OXT release in response to their favorite positive music, suggesting dysregulation of the OXTergic system (Dai et al., 2012). In pregnant and postpartum mothers, the OXTergic system is highly active yet peripheral OXT levels tend to be low (Filippa et al., 2023; Wulff et al., 2021), since pregnancy inhibits OXT secretion but enhances OXT receptor (OXTR) expression and enzymatic degradation (Uvnas-Moberg, 2024).
Context-dependence can explain the variability in OXT responses across studies. Under basal conditions, central and peripheral OXT levels show no significant correlation (Kagerbauer et al., 2013; Valstad et al., 2017), since peripheral OXT reflects only the activity of magnocellular OXT neurons projecting to the posterior pituitary but central release involves both magnocellular and parvocellular OXT neurons (Bowling, 2023). However, during acute stress or strong physiological or social stimulation, coordinated OXT release in the CNS and periphery may occur and produce higher correlations between central and peripheral OXT levels (Martin et al., 2018; Valstad et al., 2017). As illustrated in Figure 2, music can induce such release by shaping the context, which is comprised of several elements, including the type of activity, stress signals, social cues, and familiarity with the music or social setting. The auditory signals travel through the classical pathway to the auditory cortex, while contextual inputs, including all sensory inputs, stress signals, and social cues, go through the non-classical pathway to the limbic system and the hypothalamus. The inputs are integrated in the hypothalamus, which may trigger coordinated or independent release of OXT in the CNS and periphery, and activate or deactivate the hypothalamic-pituitary-adrenal (HPA) axis.
Nevertheless, the dimensions that constitute the context differ in their importance. Our contextual analysis (Tables 6, 7) indicates that activity and stress signals have combined effects on OXT responses, with qualitative analysis showing that activity is the primary driver, while stress signals may reverse these effects under certain conditions. Social cues and familiarity showed limited predictive power. Statistical evidence also supports the patterns. Schladt et al. (2017) found a significant time × context interaction, showing that OXT changes over time depend on contexts, consistent with other studies (Filippa et al., 2021; Good and Russo, 2022; Kreutz, 2014). Bowling et al. (2022) distinguished two contextual factors, vocal mode and social context, and identified vocal mode (corresponding to activity type) as the key factor that influences OXT response patterns, with social context showing no significant effect. Ooishi et al. (2017) confirmed OXT's responsiveness to stress signals, as evidenced by a significant time × tempo interaction effect in a music-listening task that manipulated arousal or stress levels via tempo.
As for the weak effect of the social context, we attribute this to potential issues in the current experimental paradigm. Just as HPA axis activation requires high-stress conditions, OXT modulation may need sufficiently demanding social cues. Keeler et al. (2015) observed the influence of social cues through improvised group singing, which involved trust, synchrony, and real-time interaction. Instead, the structured singing tasks in Schladt et al. (2017) and Bowling et al. (2022) required participants to focus on vocal performance with limited spontaneous social interactions and should have been insufficient to reveal the effects of social cues. As a result, the context-dependence of OXT responses to music is specifically attributed to its reliance on two factors, namely the activity type and the stress signals, based on the statistical findings.
4.1.2 Functional adaptation of OXT to musical contexts
OXT exhibits temporally and spatially specific release patterns, and peripheral release can occur independently of or in parallel with central release (Jurek and Neumann, 2018). In the circulation, OXT follows a pulsatile release pattern, and pulse characteristics are strongly linked with baseline socio-emotional functioning. Mean OXT pulse height and pulse mass were negatively correlated with avoidant attachment and positively correlated with perceived social support (Baskaran et al., 2017). It suggested that temporal dynamics of peripheral OXT release carry crucial psychosocial information.
In the CNS, spatial specificity of OXT release allows for nuanced modulation. OXT activates OXTRs, which are coupled with Gq or Gi/o proteins, resulting in diverse downstream effects (Jurek and Neumann, 2018). For example, OXT release directly activates dopaminergic neurons and indirectly inhibits them via local GABAergic interneurons, but the relative magnitudes of the two mechanisms differ in the ventral tegmental area (VTA) and substantia nigra pars compacta (SNc; Xiao et al., 2017). OXT functions as a allosteric modulator of μ opioid receptor (MOR), and enhances the MOR signaling without altering receptor affinity (Meguro et al., 2018). In the striatum, OXTR activation on astrocytes inhibits glutamate release (Amato et al., 2022), and D2R-OXTR heterocomplexes facilitate dopaminergic signaling (Hernandez-Mondragon et al., 2023; Pérez de la Mora et al., 2016). The differential modulation of OXT supports its excitatory and inhibitory effects, which depend on cell type, receptor coupling, and local neurotransmitter interactions.
Thus, OXT increases or decreases play different roles in reward processing, social cognition, and social behaviors:
(1) Reward processing: the ventral striatum is central in reward processing. The nucleus accumbens (NAcc) is responsible for recognizing the salience and valence of stimuli (Cooper and Knutson, 2008). The left NAcc recognizes pleasure, while the right NAcc responds to all noteworthy stimuli whether they are positive or negative. When OXT increases, it may promote dopaminergic signaling (Xiao et al., 2017) and enhance the output of the striatum.
(2) Empathy network: the ventromedial prefrontal cortex (vmPFC), a core area of the empathy network (Taruffi et al., 2021), normally receives inhibitory OXT projections. Increased OXT reduces activity in vmPFC, accelerates the response to social cues, reduces recall accuracy, and blurs the boundaries between self and others (Zhao et al., 2016). Decreased OXT might enhance the local activity of the empathy network and right striatum, strengthen emotional responses, and promote deeper emotional engagement with music, though direct evidence for this pattern in musical contexts remains to be established.
(3) Social behavior: in both approach and avoidance behaviors, increased OXT reduces right striatum activity, potentially suppressing excessive emotional interference and promoting appropriate behavioral selection. Although OXT may weaken overall motivation, it improves the accuracy of approach to positive stimuli (Yao et al., 2018). This effect may reflect a compensatory mechanism in which motivation control shifts toward the left striatum when the right striatal activity is reduced.
The bidirectional modulation of OXT suggests that the OXTergic system adapts flexibly to musical context. Happy music primarily activates the left NAcc (Trost et al., 2012), and OXT release may facilitate dopaminergic signaling (Dai et al., 2012). Sad music selectively activates the right NAcc over the left (Trost et al., 2012), and is often accompanied by peripheral OXT decreases, especially in females with high trait empathy (Eerola et al., 2021).
Furthermore, there is a bidirectional relationship between the OXTergic system and the HPA axis (Bowling, 2023; Harvey, 2020; Jurek and Neumann, 2018). Under acute stress conditions, three temporal patterns may occur:
(1) Concurrent release of OXT and adrenocorticotropic hormone (ACTH)/cortisol (Keeler et al., 2015);
(2) An initial OXT increase that attenuates HPA activation (Jezova et al., 2013; Keeler et al., 2015);
(3) HPA activation that selectively inhibits OXT release from the PVN (Jezova et al., 2013; Keeler et al., 2015).
Engaging in low-exertion social activities under low-stress conditions is conducive to OXT release since OXT is primarily involved in physiological relaxation rather than psychological relaxation (Ooishi et al., 2017). Slow-tempo music is more likely to induce OXT release without inhibition by the HPA axis (Ooishi et al., 2017; Wulff et al., 2021), as it is usually perceived as more pleasant and relaxing (Ooishi et al., 2017). In contrast, fast-tempo music or singing may initially activate the HPA axis due to increased arousal or stress, which can temporarily suppress OXT release, even though the HPA responses may decrease by the end of the intervention (Bowling et al., 2022; Fancourt et al., 2016; Good and Russo, 2022; Grape et al., 2003; Jezova et al., 2013; Keeler et al., 2015; Ooishi et al., 2017; Schladt et al., 2017). Additionally, prolonged singing sessions (e.g., over an hour) may lead to fatigue and limit other social interactions (Fancourt et al., 2016). If the reward from music is insufficient to counteract the earlier inhibition, OXT levels may remain unchanged or even decrease, as observed in studies where the participants sang touching sublime religious songs (Bowling et al., 2022; Schladt et al., 2017).
In summary, the findings challenge the assumption that higher OXT levels are beneficial. Instead, both increases and decreases in OXT levels may support adaptive emotional and social functions. Importantly, the context-dependence of music and OXT emphasizes the need to consider baseline physiological and psychological states, intervention types, and environmental factors.
4.2 Peripheral OXT and subjective outcomes
Discrepancies between peripheral OXT levels and psychosocial outcomes have led to ongoing debate about the relationship between central and peripheral OXT activity. While context-dependence may account for the variability and direction of peripheral OXT responses, subjective outcomes exhibit different context-dependence.
Evidence from Bowling et al. (2022) implied the dissociation. The patterns of OXT responses were mainly determined by vocal mode (time × vocal mode interaction) rather than social cues. IOS scale also showed a significant time × vocal mode interaction, which suggested that social bonding might be associated with the physiological processes behind vocal synchrony. However, PANAS showed a significant three-way interaction (time × vocal mode × social context), indicating that the changes of affective experiences were dependent on both the activity and social cues. In addition, there were no significant correlations between the pre-post change of OXT and self-reported emotion, mood or stress scores (Fancourt et al., 2016; Ooishi et al., 2017). Thus, different outcome measures are sensitive to different contextual factors. OXT responses should be regarded as intrinsic physiological adaptations to musical activities. Since OXT secretory dynamics (e.g., pulse characteristics) are more likely to reflect subjective socio-emotional functioning than a single measurement or pre-post change (Baskaran et al., 2017), pre-post changes of OXT often fail to predict subjective outcomes, which are heavily influenced by placebo effects, expectations, and the infeasibility of blinding in music-based interventions. As a result, perceived benefits may reflect cognitive and cultural interpretations of musical meaning.
Behavioral changes following musical activities are likely mediated by central mechanisms not fully captured by peripheral OXT measurements (Martins et al., 2020). For example, social cognition and reward processing can be mediated by OXT and OXTR, while social communication relies more on arginine-vasopressin (AVP) and its receptor (V1aR; Song and Albers, 2018). Since OXT and AVP share structural similarities and can activate each other's receptors, this phenomenon, “cross-talk” (Song and Albers, 2018), is likely to participate in shaping social behaviors in musical contexts. Therefore, peripheral OXT levels have limited predictive value for subjective outcomes. When interpreting outcomes regarding social cognition, social reward, or social behavior, we recommend considering the OXT-AVP systems that mediate the adaptive responses to social stimuli.
4.3 Clinical implications
The bidirectional modulation of OXT suggests that therapeutic strategies need to move beyond merely increasing OXT levels, as OXT enhances the salience of both positive and negative social cues, encouraging social approach in safe contexts but inducing avoidance in aversive environments (Steinman et al., 2019). From a translational medicine perspective, the context-dependence of OXT responses implies that existing music-based interventions could be refined by considering the activity type and stress signals involved.
For social anxiety disorder (SAD) patients, social cues belong to intense stress signals since they usually interpret social cues negatively and lack motivation for positive social interaction. Based on our contextual analysis, at the initial stage, treatment should begin with a safe setting using neutral, calming, or peaceful music, with limited social interactions. It is advisable that the musical emotions not completely simulate anxiety to avoid overwhelming negative feelings in patients. Social engagement can be gradually introduced only after basic trust is established and the emotional state is stabilized. We recommend structured group singing in a small choir (Keeler et al., 2015) for SAD patients, as it provides appropriately limited social cues which is sufficient for therapeutic effects but not overwhelming.
Some typical characteristics of ASD, such as deficits in imitation, emotional empathy, and attributing intentions to others, reflect the mirror system dysfunction (Cattaneo and Rizzolatti, 2009). For ASD children, rhythmic musical activities could mobilize the mirror systems, improve self-control, and help process acoustic prosody for conversational quality (Wynn et al., 2022). Because rhythmic music involves precise timing prediction and synchronization, patients need to pay more attention to social cues. Patients who are sensitive to OXT will benefit from improvement in social information processing. Thus, group drumming (Yuhi et al., 2017) and musical theater therapy (Corbett et al., 2011) are recommended for individuals with ASD. The therapy design requires setting up lessons of different levels and breaking them down into manageable parts to learn. Rhythm complexity needs to increase gradually, from simple, accessible patterns to more complex synchronization tasks, to avoid frustrations for beginners.
Understanding the mechanisms through which music influences emotion regulation and social bonding benefits the design of music therapy, as it enables a better understanding of which strategies are more effective in mobilizing the neural networks to achieve therapeutic goals.
4.4 Strengths and limitations of the included study
Several included studies demonstrated methodological rigor that strengthens confidence in their findings. The RCTs provided stronger evidence for causal relationships between music-based interventions and OXT changes. Some studies accounted for biological and environmental confounders, used validated OXT assays with extraction procedures (Gan et al., 2023), and employed real-world settings that enhanced ecological validity. An advantage was the predominant use of saliva samples. Unlike plasma OXT, which fluctuates rapidly due to its 7-min half-life (Shafer et al., 2025) and pulsatile release pattern (Baskaran et al., 2017), salivary OXT can provide more stable measurements since it has a longer half-life (Quintana et al., 2018).
However, the studies exhibit several limitations that necessitate discussion. A primary concern is the reliance on pre-post measurements of peripheral OXT, which may not reflect central OXT activity (Kagerbauer et al., 2013; Valstad et al., 2017) or be correlated with attachment style, perceived social support, and emotional awareness (Baskaran et al., 2017). Timing issues were also evident in some studies. Specifically, OXT's dramatic fluctuations within CAR period could potentially mask intervention effects. Comparability across studies was hindered by inconsistencies in sampling methods and quantification techniques (Lefevre et al., 2017), as well as insufficient reporting of musical and contextual details. Moreover, common issues like small sample sizes and the lack of negative controls reduced the internal validity. While fasting and exercise restrictions did not have significant influence on OXT levels (Engel et al., 2019), other behavioral confounders, such as hospital anxiety or sexual activity prior to sampling, still need to be considered (Jurek and Neumann, 2018).
4.5 Strengths and limitations of the current study
This review addressed the research questions through comprehensive search strategies, predefined eligibility criteria, and structured syntheses that integrated heterogeneous findings. Our contextual analysis provides a key insight by identifying four distinct contextual dimensions and their different contribution on OXT modulation.
However, there were several limitations in the review. First, it was mainly conducted by one reviewer, which may introduce selection bias, and restrictions to English publications could have excluded relevant studies. None of the included studies examined OXTR gene polymorphisms, which associate with social behavior variability in clinical populations (Harrison et al., 2015) and may influence individual responses to music. Additionally, no studies directly investigated receptor interactions such as functional complexes of OXTR with dopamine, serotonin, and opioid receptors. Only Fancourt et al. (2016) reported correlated declines of OXT and beta-endorphin after choral singing. Thus, the interpretation scope of the review is limited.
5 Conclusion
The context-dependence of OXT responses to music has significant implications for clinical practice and research. Evidence supports a bidirectional relationship in which music induces subtle emotional responses, which are further modulated by OXT. The effectiveness of music-based interventions may not rely on the direction of OXT change but on the adaptive modulation of OXT. For clinical applications, music-based interventions should be carefully tailored to specific populations and contexts. Larger sample sizes, standardized outcome measures, and well-controlled experimental designs are needed in the future. Given the limitations of peripheral OXT measurements, we suggest that future studies explore central OXT signaling or receptor interactions. Integrating genetic studies and peptide assessments, the use of OXTR agonists or antagonists, combined with neuroimaging techniques such as functional magnetic resonance imaging (fMRI), may provide deeper insights into how music modulates brain activity through the OXTergic system.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
T-HC: Conceptualization, Data curation, Formal analysis, Visualization, Writing – original draft. C-GT: Conceptualization, Supervision, Writing – review & editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This work was supported by a grant project [NSC 112-2410-H-002-071] from Ministry of Science and Technology, Taiwan.
Acknowledgments
The article includes content from T-HC's Master's thesis (Chu, 2025), submitted to National Taiwan University. The thesis has been deposited in the university repository but is not publicly accessible currently. The present manuscript contains updated analyses and revised discussion compared to the thesis. All final decisions and revisions were made by the authors.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was used in the creation of this manuscript. The author(s) used ChatGPT (GPT-4o model, OpenAI) to improve the clarity and readability of the English in the manuscript. The use of this AI tool was limited to language editing and did not affect the scientific content, interpretations, or conclusions.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcogn.2025.1678665/full#supplementary-material
Supplementary material S1 | Systematic review protocol.
Supplementary material S2 | Peripheral OXT changes induced by music over time.
References
Amato, S., Averna, M., Guidolin, D., Pedrazzi, M., Pelassa, S., Capraro, M., et al. (2022). Heterodimer of A2A and oxytocin receptors regulating glutamate release in adult striatal astrocytes. Int. J. Mol. Sci. 23:2326. doi: 10.3390/ijms23042326
Barker, T. H., Habibi, N., Aromataris, E., Stone, J. C., Leonardi-Bee, J., Sears, K., et al. (2024). The revised JBI critical appraisal tool for the assessment of risk of bias for quasi-experimental studies. JBI Evid. Synth. 22, 378–388. doi: 10.11124/JBIES-23-00268
Barker, T. H., Stone, J. C., Sears, K., Klugar, M., Tufanaru, C., Leonardi-Bee, J., et al. (2023). The revised JBI critical appraisal tool for the assessment of risk of bias for randomized controlled trials. JBI Evid. Synth. 21, 494–506. doi: 10.11124/JBIES-22-00430
Baskaran, C., Plessow, F., Silva, L., Asanza, E., Marengi, D., Eddy, K. T., et al. (2017). Oxytocin secretion is pulsatile in men and is related to social-emotional functioning. Psychoneuroendocrinology 85, 28–34. doi: 10.1016/j.psyneuen.2017.07.486
Bernatzky, G., Presch, M., Anderson, M., and Panksepp, J. (2011). Emotional foundations of music as a non-pharmacological pain management tool in modern medicine. Neurosci. Biobehav. Rev. 35, 1989–1999. doi: 10.1016/j.neubiorev.2011.06.005
Bhandarkar, S., Salvi, B. V., and Shende, P. (2024). Current scenario and potential of music therapy in the management of diseases. Behav. Brain Res. 458:114750. doi: 10.1016/j.bbr.2023.114750
Bowling, D. L. (2023). Biological principles for music and mental health. Transl. Psychiatry 13:374. doi: 10.1038/s41398-023-02671-4
Bowling, D. L., Gahr, J., Ancochea, P. G., Hoeschele, M., Canoine, V., Fusani, L., et al. (2022). Endogenous oxytocin, cortisol, and testosterone in response to group singing. Horm. Behav. 139:105105. doi: 10.1016/j.yhbeh.2021.105105
Brunton, G., Oliver, S., and Thomas, J. (2020). Innovations in framework synthesis as a systematic review method. Res. Synth. Methods 11, 316–330. doi: 10.1002/jrsm.1399
Cattaneo, L., and Rizzolatti, G. (2009). The mirror neuron system. Arch. Neurol. 66, 557–560. doi: 10.1001/archneurol.2009.41
Chini, B., Verhage, M., and Grinevich, V. (2017). The action radius of oxytocin release in the mammalian CNS: from single vesicles to behavior. Trends Pharmacol. Sci. 38, 982–991. doi: 10.1016/j.tips.2017.08.005
Chu, T.-H. (2025). The role of music in modulating oxytocin levels to enhance social bonding and emotion regulation: a systematic review (master's thesis). National Taiwan University, Taipei, Taiwan.
Cooper, J. C., and Knutson, B. (2008). Valence and salience contribute to nucleus accumbens activation. NeuroImage 39, 538–547. doi: 10.1016/j.neuroimage.2007.08.009
Corbett, B. A., Gunther, J. R., Comins, D., Price, J., Ryan, N., Simon, D., et al. (2011). Brief report: theatre as therapy for children with autism spectrum disorder. J. Autism Dev. Disord. 41, 505–511. doi: 10.1007/s10803-010-1064-1
Dai, L., Carter, C. S., Ying, J., Bellugi, U., Pournajafi-Nazarloo, H., Korenberg, J. R., et al. (2012). Oxytocin and vasopressin are dysregulated in Williams Syndrome, a genetic disorder affecting social behavior. PLoS ONE 7:e38513. doi: 10.1371/journal.pone.0038513
Eerola, T., Vuoskoski, J. K., Kautiainen, H., Peltola, H. R., Putkinen, V., Schäfer, K., et al. (2021). Being moved by listening to unfamiliar sad music induces reward-related hormonal changes in empathic listeners. Ann. N. Y. Acad. Sci. 1502, 121–131. doi: 10.1111/nyas.14660
Engel, S., Laufer, S., Miller, R., Niemeyer, H., Knaevelsrud, C., Schumacher, S., et al. (2019). Demographic, sampling- and assay-related confounders of endogenous oxytocin concentrations: a systematic review and meta-analysis. Front. Neuroendocrinol. 54:100775. doi: 10.1016/j.yfrne.2019.100775
Fancourt, D., Williamon, A., Carvalho, L. A., Steptoe, A., Dow, R., Lewis, I., et al. (2016). Singing modulates mood, stress, cortisol, cytokine and neuropeptide activity in cancer patients and carers. Ecancermedicalscience 10:631. doi: 10.3332/ecancer.2016.631
Filippa, M., Monaci, M. G., Spagnuolo, C., Di Benedetto, M., Serravalle, P., and Grandjean, D. (2023). Oxytocin levels increase and anxiety decreases in mothers who sing and talk to their premature infants during a painful procedure. Children 10:334. doi: 10.3390/children10020334
Filippa, M., Monaci, M. G., Spagnuolo, C., Serravalle, P., Daniele, R., Grandjean, D., et al. (2021). Maternal speech decreases pain scores and increases oxytocin levels in preterm infants during painful procedures. Sci. Rep. 11:17301. doi: 10.1038/s41598-021-96840-4
Gan, H.-W., Leeson, C., Aitkenhead, H., and Dattani, M. (2023). Inaccuracies in plasma oxytocin extraction and enzyme immunoassay techniques. Compr. Psychoneuroendocrinol. 15:100188. doi: 10.1016/j.cpnec.2023.100188
Good, A., and Russo, F. A. (2022). Changes in mood, oxytocin, and cortisol following group and individual singing: a pilot study. Psychol. Music 50, 1340–1347. doi: 10.1177/03057356211042668
Grape, C., Sandoren, M., Hansson, L. O., Ericson, M., and Theorell, T. (2003). Does singing promote well-being?: an empirical study of professional and amateur singers during a singing lesson. Integr. Physiol. Behav. Sci. 38, 65–74. doi: 10.1007/BF02734261
Grewen, K. M., Davenport, R. E., and Light, K. C. (2010). An investigation of plasma and salivary oxytocin responses in breast- and formula-feeding mothers of infants. Psychophysiology 47, 625–632. doi: 10.1111/j.1469-8986.2009.00968.x
Gur, R., Tendler, A., and Wagner, S. (2014). Long-term social recognition memory is mediated by oxytocin-dependent synaptic plasticity in the medial amygdala. Biol. Psychiatry 76, 377–386. doi: 10.1016/j.biopsych.2014.03.022
Harrison, A. J., Gamsiz, E. D., Berkowitz, I. C., Nagpal, S., and Jerskey, B. A. (2015). Genetic variation in the oxytocin receptor gene is associated with a social phenotype in autism spectrum disorders. Am. J. Med. Genet. B Neuropsychiatr. Genet. 168, 720–729. doi: 10.1002/ajmg.b.32377
Harvey, A. R. (2020). Links between the neurobiology of oxytocin and human musicality. Front. Hum. Neurosci. 14:350. doi: 10.3389/fnhum.2020.00350
Hernandez-Mondragon, J. C., Hernandez-Hernandez, D. A., Crespo-Ramirez, M., Prospero-Garcia, O., Rocha-Arrieta, L., Fuxe, K., et al. (2023). Evidence for the existence of facilitatory interactions between the dopamine D2 receptor and the oxytocin receptor in the amygdala of the rat relevance for anxiolytic actions. Front. Pharmacol. 14:1251922. doi: 10.3389/fphar.2023.1251922
Hong, Q. N., Pluye, P., Bujold, M., and Wassef, M. (2017). Convergent and sequential synthesis designs: implications for conducting and reporting systematic reviews of qualitative and quantitative evidence. Syst. Rev. 6:61. doi: 10.1186/s13643-017-0454-2
Impellizzeri, F. M., and Bizzini, M. (2012). Systematic review and meta-analysis: a primer. Int. J. Sports Phys. Ther. 7, 493–503.
Jezova, D., Hlavacova, N., Makatsori, A., Duncko, R., Loder, I., Hinghofer-Szalkay, H., et al. (2013). Increased anxiety induced by listening to unpleasant music during stress exposure is associated with reduced blood pressure and ACTH responses in healthy men. Neuroendocrinology 98, 144–150. doi: 10.1159/000354202
Jurek, B., and Neumann, I. D. (2018). The oxytocin receptor: from intracellular signaling to behavior. Physiol. Rev. 98, 1805–1908. doi: 10.1152/physrev.00031.2017
Kagerbauer, S. M., Martin, J., Schuster, T., Blobner, M., Kochs, E. F., Landgraf, R., et al. (2013). Plasma oxytocin and vasopressin do not predict neuropeptide concentrations in human cerebrospinal fluid. J. Neuroendocrinol., 25, 668–673. doi: 10.1111/jne.12038
Keeler, J. R., Roth, E. A., Neuser, B. L., Spitsbergen, J. M., Waters, D. J. M., Vianney, J. M., et al. (2015). The neurochemistry and social flow of singing: bonding and oxytocin. Front. Hum. Neurosci. 9:518. doi: 10.3389/fnhum.2015.00518
Kreutz, G. (2014). Does singing facilitate social bonding? Music Med. 6, 51–60. doi: 10.47513/mmd.v6i2.180
Lefevre, A., Mottolese, R., Dirheimer, M., Mottolese, C., Duhamel, J.-R., Sirigu, A., et al. (2017). A comparison of methods to measure central and peripheral oxytocin concentrations in human and non-human primates. Sci. Rep. 7:17222. doi: 10.1038/s41598-017-17674-7
Lin, Y. T., and Hsu, K. S. (2018). Oxytocin receptor signaling in the hippocampus: role in regulating neuronal excitability, network oscillatory activity, synaptic plasticity and social memory. Prog. Neurobiol. 171, 1–14. doi: 10.1016/j.pneurobio.2018.10.003
MacLean, E. L., Wilson, S. R., Martin, W. L., Davis, J. M., Nazarloo, H. P., Carter, C. S., et al. (2019). Challenges for measuring oxytocin: the blind men and the elephant? Psychoneuroendocrinology 107, 225–231. doi: 10.1016/j.psyneuen.2019.05.018
Maramis, M. M., Setiawati, Y., Febriyanti, N., Fitriah, M., Atika Salim, R., et al. (2021). Effects of playing angklung and practicing silence on emotion, cognition and oxytocin levels in children: a preliminary study. Malays. J. Med. Sci. 28, 105–117. doi: 10.21315/mjms2021.28.3.10
Marlin, B. J., and Froemke, R. C. (2017). Oxytocin modulation of neural circuits for social behavior. Dev. Neurobiol. 77, 169–189. doi: 10.1002/dneu.22452
Martin, J., Kagerbauer, S. M., Gempt, J., Podtschaske, A., Hapfelmeier, A., Schneider, G., et al. (2018). Oxytocin levels in saliva correlate better than plasma levels with concentrations in the cerebrospinal fluid of patients in neurocritical care. J. Neuroendocrinol. 30:e12596. doi: 10.1111/jne.12596
Martins, D., Gabay, A. S., Mehta, M., and Paloyelis, Y. (2020). Salivary and plasmatic oxytocin are not reliable trait markers of the physiology of the oxytocin system in humans. eLife 9:e62456. doi: 10.7554/eLife.62456.sa2
Maud, C., Ryan, J., McIntosh, J. E., and Olsson, C. A. (2018). The role of oxytocin receptor gene (OXTR) DNA methylation (DNAm) in human social and emotional functioning: a systematic narrative review. BMC Psychiatry 18:154. doi: 10.1186/s12888-018-1740-9
Meguro, Y., Miyano, K., Hirayama, S., Yoshida, Y., Ishibashi, N., Ogino, T., et al. (2018). Neuropeptide oxytocin enhances μ opioid receptor signaling as a positive allosteric modulator. J. Pharmacol. Sci. 137, 67–75. doi: 10.1016/j.jphs.2018.04.002
Meyer-Lindenberg, A., Domes, G., Kirsch, P., and Heinrichs, M. (2011). Oxytocin and vasopressin in the human brain: social neuropeptides for translational medicine. Nat. Rev. Neurosci. 12, 524–538. doi: 10.1038/nrn3044
Nilsson, U. (2009). Soothing music can increase oxytocin levels during bed rest after open-heart surgery: a randomised control trial. J. Clin. Nurs. 18, 2153–2161. doi: 10.1111/j.1365-2702.2008.02718.x
Ninan, I. (2011). Oxytocin suppresses basal glutamatergic transmission but facilitates activity-dependent synaptic potentiation in the medial prefrontal cortex. J. Neurochem. 119, 324–331. doi: 10.1111/j.1471-4159.2011.07430.x
Olff, M., Frijling, J. L., Kubzansky, L. D., Bradley, B., Ellenbogen, M. A., Cardoso, C., et al. (2013). The role of oxytocin in social bonding, stress regulation and mental health: an update on the moderating effects of context and interindividual differences. Psychoneuroendocrinology 38, 1883–1894. doi: 10.1016/j.psyneuen.2013.06.019
Ooishi, Y., Mukai, H., Watanabe, K., Kawato, S., and Kashino, M. (2017). Increase in salivary oxytocin and decrease in salivary cortisol after listening to relaxing slow-tempo and exciting fast-tempo music. PLoS ONE 12:e0189075. doi: 10.1371/journal.pone.0189075
Palumbo, A., Aluru, V., Battaglia, J., Geller, D., Turry, A., Ross, M., et al. (2022). Music upper limb therapy-integrated provides a feasible enriched environment and reduces post-stroke depression: a pilot randomized controlled trial. Am. J. Phys. Med. Rehabil. 101, 937–946. doi: 10.1097/PHM.0000000000001938
Pérez de la Mora, M., Perez-Carrera, D., Crespo-Ramirez, M., Tarakanov, A., Fuxe, K., and Borroto-Escuela, D. O. (2016). Signaling in dopamine D2 receptor-oxytocin receptor heterocomplexes and its relevance for the anxiolytic effects of dopamine and oxytocin interactions in the amygdala of the rat. Biochim. Biophys. Acta Mol. Basis Dis. 1862, 2075–2085. doi: 10.1016/j.bbadis.2016.07.004
Quintana, D. S., Westlye, L. T., Smerud, K. T., Mahmoud, R. A., Andreassen, O. A., Djupesland, P. G., et al. (2018). Saliva oxytocin measures do not reflect peripheral plasma concentrations after intranasal oxytocin administration in men. Horm. Behav. 102, 85–92. doi: 10.1016/j.yhbeh.2018.05.004
Riedl, R., Javor, A., Gefen, D., Felten, A., and Reuter, M. (2017). Oxytocin, trust, and trustworthiness: the moderating role of music. J. Neurosci. Psychol. Econ. 10, 1–8. doi: 10.1037/npe0000070
Schladt, T. M., Nordmann, G. C., Emilius, R., Kudielka, B. M., de Jong, T. R., Neumann, I. D., et al. (2017). Choir versus solo singing: effects on mood, and salivary oxytocin and cortisol concentrations. Front. Hum. Neurosci. 11:430. doi: 10.3389/fnhum.2017.00430
Shafer, S. L., Ririe, D. G., Miller, S., Curry, R. S., Hsu, D. T., Sullivan, G. M., et al. (2025). Plasma pharmacokinetics of intravenous and intranasal oxytocin in nonpregnant adults. Br. J. Anaesth. 134, 1513–1522. doi: 10.1016/j.bja.2024.12.046
Sobota, R., Mihara, T., Forrest, A., Featherstone, R. E., and Siegel, S. J. (2015). Oxytocin reduces amygdala activity, increases social interactions, and reduces anxiety-like behavior irrespective of NMDAR antagonism. Behav. Neurosci. 129, 389–398. doi: 10.1037/bne0000074
Song, Z., and Albers, H. E. (2018). Cross-talk among oxytocin and arginine-vasopressin receptors: relevance for basic and clinical studies of the brain and periphery. Front. Neuroendocrinol. 51, 14–24. doi: 10.1016/j.yfrne.2017.10.004
Steinman, M. Q., Duque-Wilckens, N., and Trainor, B. C. (2019). Complementary neural circuits for divergent effects of oxytocin: social approach versus social anxiety. Biol. Psychiatry 85, 792–801. doi: 10.1016/j.biopsych.2018.10.008
Tabak, B. A., Leng, G., Szeto, A., Parker, K. J., Verbalis, J. G., Ziegler, T. E., et al. (2023). Advances in human oxytocin measurement: challenges and proposed solutions. Mol. Psychiatry 28, 127–140. doi: 10.1038/s41380-022-01719-z
Taruffi, L., Skouras, S., Pehrs, C., and Koelsch, S. (2021). Trait empathy shapes neural responses toward sad music. Cogn. Affect. Behav. Neurosci. 21, 231–241. doi: 10.3758/s13415-020-00861-x
Trost, W., Ethofer, T., Zentner, M., and Vuilleumier, P. (2012). Mapping aesthetic musical emotions in the brain. Cereb. Cortex 22, 2769–2783. doi: 10.1093/cercor/bhr353
Uvnas-Moberg, K. (2024). The physiology and pharmacology of oxytocin in labor and in the peripartum period. Am. J. Obstet. Gynecol. 230, S740–S758. doi: 10.1016/j.ajog.2023.04.011
Valstad, M., Alvares, G. A., Egknud, M., Matziorinis, A. M., Andreassen, O. A., Westlye, L. T., et al. (2017). The correlation between central and peripheral oxytocin concentrations: a systematic review and meta-analysis. Neurosci. Biobehav. Rev. 78, 117–124. doi: 10.1016/j.neubiorev.2017.04.017
Wulff, V., Hepp, P., Wolf, O. T., Balan, P., Hagenbeck, C., Fehm, T., et al. (2021). The effects of a music and singing intervention during pregnancy on maternal well-being and mother-infant bonding: a randomised, controlled study. Arch. Gynecol. Obstet. 303, 69–83. doi: 10.1007/s00404-020-05727-8
Wynn, C. J., Barrett, T. S., and Borrie, S. A. (2022). Rhythm perception, speaking rate entrainment, and conversational quality: a mediated model. J. Speech Lang. Hear. Res. 65, 2187–2203. doi: 10.1044/2022_JSLHR-21-00293
Xiao, L., Priest, M. F., Nasenbeny, J., Lu, T., and Kozorovitskiy, Y. (2017). Biased oxytocinergic modulation of midbrain dopamine systems. Neuron 95, 368–384.e5. doi: 10.1016/j.neuron.2017.06.003
Yao, S., Zhao, W., Geng, Y., Chen, Y., Zhao, Z., Ma, X., et al. (2018). Oxytocin facilitates approach behavior to positive social stimuli via decreasing anterior insula activity. Int. J. Neuropsychopharmacol. 21, 918–925. doi: 10.1093/ijnp/pyy068
Yuhi, T., Kyuta, H., Mori, H. A., Murakami, C., Furuhara, K., Okuno, M., et al. (2017). Salivary oxytocin concentration changes during a group drumming intervention for maltreated school children. Brain Sci. 7:152. doi: 10.3390/brainsci7110152
Yuki, U. (2023). Oxytocin: an expansive review of its mechanisms, functions, and therapeutic potential. World J. Adv. Res. Rev. 19, 1264–1272. doi: 10.30574/wjarr.2023.19.1.1499
Keywords: music therapy, oxytocin, context-dependence, social cognition, affect regulation, social behavior, ritual music
Citation: Chu T-H and Tsai C-G (2026) Music's context-dependent influence on oxytocin, social bonding, and emotion regulation: a systematic review. Front. Cognit. 4:1678665. doi: 10.3389/fcogn.2025.1678665
Received: 03 August 2025; Revised: 24 November 2025;
Accepted: 28 November 2025; Published: 02 January 2026.
Edited by:
Kumiko Toyoshima, Osaka Shoin Women's University, JapanReviewed by:
Alan Harvey, University of Western Australia, AustraliaMadhavi Rangaswamy, Christ University, India
Copyright © 2026 Chu and Tsai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chen-Gia Tsai, dHNhaWNoZW5naWFAbnR1LmVkdS50dw==