Videos in short video sharing platforms as a source of information on bipolar disorder: a cross-sectional content analysis study

Qi, Xin; Lu, Qi; Li, Bo; Liu, Sha; Zhang, Lei; Xiang, Runwu; Wang, Qilong; Zhao, Dongrong

doi:10.3389/fpubh.2025.1627885

ORIGINAL RESEARCH article

Front. Public Health, 28 October 2025

Sec. Public Mental Health

Volume 13 - 2025 | https://doi.org/10.3389/fpubh.2025.1627885

This article is part of the Research TopicDigital Health Literacy as a Pathway to Better Mental Well-beingView all 10 articles

Videos in short video sharing platforms as a source of information on bipolar disorder: a cross-sectional content analysis study

Xin Qi^1,2^†

Qi Lu³^†^‡

Bo Li⁴^†

Sha Liu^1,2^‡

Lei Zhang^1,2

Runwu Xiang^1,2

Qilong Wang^1,2^*^‡

Dongrong Zhao²^*^‡

¹Gansu University of Chinese Medicine, Lanzhou, Gansu, China
²Department of Psychiatry, Gansu Provincial People's Hospital, Lanzhou, Gansu, China
³Yan'an University, Xi’an, Shaanxi, China
⁴Qiqihar Medical College, Qiqihar, Heilongjiang, China

Background: Bipolar disorder is a prevalent mental health issue characterized by recurrent episodes of mania and depression, significantly impacting patients’ quality of life. With the rise of short video sharing platforms, there is an urgent need to evaluate the quality and reliability of the medical information disseminated regarding this disorder.

Objective: This study aimed to assess the quality and reliability of videos related to bipolar disorder available on popular Chinese short video platforms, including TikTok, Kwai, Bilibili, WeChat, Xiaohongshu, and Baidu.

Methods: A cross-sectional content analysis was conducted in May 2025, using keywords related to bipolar disorder to retrieve relevant videos from selected platforms. The quality of the videos was evaluated using multiple standardized assessment tools, including the JAMA Benchmarking Criteria, GQS, modified DISCERN, PEMAT, and HONCODE.

Results: Significant differences in video quality and audience engagement metrics were observed across platforms. TikTok and Kwai had higher quality scores, while WeChat resulted in more comments. Most videos were created by medical professionals, although independent users also contributed content. Overall, video quality was inconsistent and not necessarily correlated with engagement metrics, highlighting the necessity for improved standards in disseminating health-related information on social media.

Conclusion: On Chinese short video platforms, clinical practitioners are the main creators of bipolar disorder-related content, but their scientific nature, production quality, and information transparency still need to be improved. It is suggested to improve the platform management, creator training, and algorithm optimization, so as to promote the improvement of public mental health literacy.

Introduction

Bipolar disorder is one of the more prevalent types of mental disorder, and its symptoms can be classified according to severity into bipolar I (experiencing at least one manic episode) and bipolar II (experiencing at least one hypomanic episode and at least one major depressive episode) as defined by the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) (1). Most patients typically manifest varying degrees of depression or mania, frequently accompanied by co-occurring psychiatric disorders and physical illnesses. Recurrent episodes, high disability rates, and high relapse rates characterize the disease (2). On a global scale, the prevalence of bipolar disorder is relatively high, estimated at approximately 1–2% (3, 4). The most recent World Mental Health Survey, conducted between 2001 and 2022, encompassed 156,331 respondents across 29 countries. The findings revealed a lifetime prevalence of 2.5% for males and 2.3% for females (5). Furthermore, the risk of suicide among patients diagnosed with this disorder is 10 times higher in women than in the general population, while in men it is eight times higher. The findings of this study suggest that bipolar disorder exerts a considerable detrimental effect on the quality of life and survival time of patients (6). According to the most recent classification of the World Health Organization, bipolar disorder has been identified as one of the leading candidates for the global burden of disease in 2030. The disability adjusted life years (DALYs) attributable to this disorder account for 0.5% of the total global disease burden. While neuropsychiatric disorders are projected to account for 15.5% of the total burden of disease by 2030, bipolar disorder is ranked as one of the top five priority disorders for intervention (7, 8). Considering the gravity of the situation, there is an imperative for heightened public awareness regarding the disease, alongside the optimization of diagnostic and treatment systems. Consequently, the dissemination of knowledge about bipolar disorder to the public is of paramount importance.

In recent years, with the rapid development of science and technology, and the emergence of social media (short video platforms) as a major channel for information dissemination, especially in China, the China Internet Network Information Centre (CNNIC) reports that the scale of short video users exceeded 1 billion (1,012 million) for the first time in December 2022, and increased to 1,026 million in June 2023, with a penetration rate of 95.2%. About the intensity of usage, the mean daily usage time of users was 151 min (approximately 2.5 h) in 2023 (9, 10). Short video platforms such as TikTok, Kwai, and Bilibili have disrupted the traditional monopoly on medical information, offering a wider array of sources for the public. This shift has empowered individuals to access medical health information autonomously (11, 12). In contemporary society, the importance of mental health is increasingly recognized, with the public demonstrating growing awareness and understanding of related issues (13–15). Bipolar disorder, as a prevalent mental health issue, is garnering increasing attention. Meanwhile, short video platforms, characterized by their unique fragmented information dissemination methods and high interactivity through features such as comments and retweets, have emerged as one of the primary avenues for individuals to acquire knowledge about bipolar disorder and other mental health topics (16, 17). The persistent stigmatization narratives and ineffective treatment content pushed by short video platforms distort the public’s perception of bipolar disorder, raise the threshold for professional help, and lead to treatment delay and increase the risk of suicide (18, 19). In China, short videos have become the primary source of health information for more than 100 million users and curbing their spread of misinformation is an urgent public health priority. It has been demonstrated that the quality of information accessible to the public regarding mental illnesses, such as schizophrenia and depression, is inconsistent and challenging to regulate (20, 21). For the average user, assessing the scientific validity and reliability of online health information is more difficult, seriously affecting their ability to access quality health knowledge (22–24). Therefore, healthcare professionals need to evaluate the quality of online information related to bipolar disorder to provide appropriate guidance to patients.

According to this study, the quality of online information regarding bipolar disorder on video-based Chinese social media has not been thoroughly investigated. Consequently, we selected several currently popular short video platforms in China, including TikTok, Kwai, Bilibili, WeChat, Xiaohongshu, and Baidu (the video aggregation service of the Baidu search engine), to collect video information related to bipolar disorder. Among these platforms is TikTok, a current representative of emerging short-form video platforms. According to the latest data from Statista (2025), TikTok has approximately 159 million monthly active users, making it the fifth most popular social media platform in the world (25). In addition, TikTok’s website receives approximately 218 million visits per month, with the vast majority of visits (over 65%) coming from mobile devices (26, 27). Kwai is also a highly influential short video platform with a substantial user base, reporting approximately 714 million monthly active users and over 408 million daily active users globally as of the third quarter of 2024 (28). Bilibili is a prominent short-form video platform in China, recognized for its diverse content and interactive features. As of the third quarter of 2024, it has 348 million monthly active users globally, with 83 percent of its user base falling within the 18–24 age group (29). Xiaohongshu is a distinctive short-video and social e-commerce platform that significantly influences the lifestyles of young people through user-generated content and authentic sharing. Furthermore, WeChat and Baidu have demonstrated considerable potential in the short video distribution space in recent years.

Currently, the quality of video information regarding bipolar disorder on most short-form video platforms in China has not been systematically evaluated. This study aims to assess the quality of relevant content available on these platforms and to provide evidence-based recommendations for enhancing health-related short videos.

Methods

Ethical considerations

The studies involving human participants were reviewed and approved by the Medical Ethics Committee of Gansu Provincial Hospital. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin. The social media data was accessed and analyzed in accordance with the platform’s terms of use and all relevant institutional/national regulations.

Search strategy and data collection

In May 2025, we used “双相情感障碍” (bipolar affective disorder), “双相障碍” (bipolar disorder), “躁郁症” (manic depression), and “环性心境障碍” (cyclothymic disorder) as keywords to search on six Chinese social and video platforms and services. We collected the top 100 video results from the dedicated short-video apps of TikTok, Bilibili, Kwai, Xiaohongshu, and WeChat Channels. For Baidu, we conducted searches on the main Baidu search engine (), applied the “Video” tab filter, and collected the top 100 videos from the aggregated results. The default sorting algorithm of each platform was used in the search process, and there was no restriction on the time of video release. We limited our analysis sample to the top 100 videos for two reasons. First, search algorithms on platforms such as TikTok, Kwai, Bilibili, WeChat, Xiaohongshu, and Baidu prioritize content that is highly relevant to the query topic. Videos that rank above 100 are less relevant to bipolar disorder. Secondly, from the perspective of user behavior patterns, most viewers usually only browse the top popular results, rather than all the retrieved content, so the top 100 videos are more representative and more in line with the actual viewing and participation patterns (30, 31). Although content moderation policies may lead to the removal of some videos over time, the cross-sectional approach used in this study enabled the capture and analysis of currently publicly accessible video content at a given point in time. This practice not only ensures the rigor of data collection and repeatability of results, but also truly reflects the information environment that users can access during this period. Exclusion criteria were as follows: (1) repeat video; (2) advertising content; (3) include irrelevant topics; (4) Author information could not be determined (author was anonymous or did not provide any professional credentials, even self-claimed); Or (5) poor audio quality. It should be noted that unverified creators claiming to be “doctors” or “psychologists” are still included as independent users, regardless of whether their credentials can be verified, to analyze content that viewers may consider professional. The characteristics of eligible videos were recorded and analyzed, including the number of likes, comments, saves, shares, release time, video duration, video source, the uploader’s address, the number of the uploader’s fans, the video’s presentation form, the video’s content, related diseases, and the medical expertise. All extracted data were entered into Excel (Microsoft Corp) software. The study selection process is summarized in Figure 1.

Figure 1

Flowchart displaying the process of retrieving and screening videos. Four keywords lead to six platforms: TikTok, Bilibili, WeChat, Kwai, Baidu, and Xiaohongshu, retrieving 869 videos. After exclusion of 153 videos for criteria like duplication and quality, 716 videos are screened. Another 110 videos are excluded for irrelevance or being commercials, resulting in 606 videos included.

Figure 1. Search strategies for short videos on bipolar affective disorder.

Platform selection criteria

The selection of platforms for this study is based on threefold criteria: (1) User coverage: Platforms ranked among the top 10 in China by monthly activity, including TikTok, Kwai, and Bilibili, were included to ensure the breadth of user coverage. (2) Representativeness of content ecology: This criterion encompasses comprehensive platforms (e.g., WeChat, Baidu), vertical communities (e.g., Xiaohongshu), and generation-specific aggregation platforms (e.g., Bilibili), to comprehensively reflect the characteristics of mental health content dissemination across multiple scenarios. (3) Regional penetration: The selected platforms cover all provincial administrative regions in mainland China and include creators from regions with insufficient medical resources (as shown in Figure 2), thereby avoiding regional biases in the sample.

Figure 2

Map of China with provinces color-coded by data value, ranging from low (blue) to high (red). A bar graph below shows province-specific data: Beijing highest at 124, followed by Henan at 87, Shanghai at 78, and other provinces with decreasing values.

Figure 2. The distribution of video authors in China.

Video quality evaluation tools and methods

The JAMA Benchmarking Criteria is a widely used instrument for evaluating the reliability of health-related websites, encompassing four domains: authorship, attribution, disclosure, and currency. GQS is a five-point Likert scale that subjectively rates overall video quality, considering site traffic and usability, with scores ranging from 1 (very poor) to 5 (excellent). The modified DISCERN tool, which is extensively recommended in the literature for assessing the reliability and quality of online resources, comprises five equally weighted items. To further evaluate the videos’ comprehensibility and actionability, we applied the Patient Education Materials Assessment Tool (PEMAT) developed by the Agency for Healthcare Research and Quality (AHRQ). In addition, the reliability and transparency of the online health information were examined using the HONcode. Detailed scoring results for the GQS, JAMA, modified DISCERN, PEMAT, and HONcode assessments are presented in Supplementary Tables 1–5, respectively.

Two psychiatrists with over 10 years of clinical experience independently assessed the standardized process. To minimize potential bias, a strict blinding procedure was implemented. All videos were downloaded and edited using video editing software to remove any on-screen elements that could reveal the creator’s identity (e.g., username, verification badges, profile pictures, hospital logos) or user engagement metrics (e.g., likes, comments, share counts) from the beginning, end, or overlay of the video. Assessors thus evaluated only the core informational content of each video. The scoring strictly followed standardized tools such as GQS, JAMA criteria, modified DISCERN, PEMAT, and HONcode. The evaluation process consisted of the following: first, two evaluators independently scored all videos; second, if there were discrepancies in the scores, a third senior psychiatrist acted as an arbitrator and provided an independent assessment; finally, all three evaluators discussed the discrepant items until a full consensus was reached on the final scores for all assessment items. The Cohen’s κ coefficient was used to assess inter-rater agreement for the initial independent ratings, and the result was 0.83 (p < 0.001), indicating high inter-rater agreement and excellent reliability.

Statistical analyses

The data in this study were non-parametric distributions, so descriptive statistics were performed using the median and interquartile range (IQR). The Kruskal-Wallis test was used to evaluate the differences between groups, and then Dunn’s multiple comparison test was used for multiple comparisons. The relationships among quantitative variables were evaluated using the Spearman correlation analysis method. Exploratory regression analyses were performed to assess associations between video variables such as engagement measures, duration, and quality scores. All analyses were performed using R version 4.4.2 (The R Foundation)¹; Double-tailed p < 0.05 is considered statistically significant. These analyses aim to characterize patterns in the data rather than establish predictions or causal relationships, especially given the platform-dependent nature of the engagement metrics.

Results

Video characteristics

As shown in Table 1, there were statistically significant differences in the number of likes, comments, saves, shares, days since published, video duration, fans, GQS score, JAMA score, Modified DISCERN score, PEMAT Understandability score, Actionability score, and HONCODE score among the six platforms: TikTok, Kwai, WeChat, Xiaohongshu, Bilibili, and Baidu (all p < 0.05). Kwai had the highest median numbers of likes and followers, with values of 1,752 and 121,000, respectively. TikTok recorded the highest median numbers of saves and shares, with values of 472 and 475, respectively. WeChat had the highest median number of comments at 495. This difference may be related to the user behavior characteristics of each platform. Kwai users prefer short and relaxed entertainment content and are used to quickly expressing their preferences through likes. With its efficient content dissemination mechanism, TikTok has promoted the saving and sharing of health-related videos related to bipolar disorder. Based on its social network foundation, WeChat is more likely to elicit in-depth comments and exchanges of illness experiences. Bilibili had the longest median video upload time (739 days) and longest video duration (89 min), which is consistent with Bilibili’s positioning as a knowledge-sharing platform, with users preferring longer form and more in-depth content. TikTok’s JAMA score and Modified DISCERN score were both higher than those of videos from the other five platforms, with averages of 2.7 and 3.4, respectively. The highest number of creators originated from Beijing, Henan, and Shanghai, totaling 124, 87, and 78 creators, respectively (Figure 2). Among the videos posted on TikTok, WeChat, and Baidu, the greatest number was uploaded in 2025, with 50, 35, and 45 videos, respectively. In contrast, the highest number of videos on Xiaohongshu and Kwai was posted in 2024, with 44 videos each. For Bilibili, the highest number of videos was released in 2023, totaling 30 (Figure 3).

Table 1

Table 1. General characteristics and scoring of videos related to bipolar affective disorder.

Figure 3

Line graph with two sections comparing TikTok and WeChat download trends from 2020 to 2022. The TikTok section shows a sharp increase from zero to fifty in 2022. The WeChat section shows growth from zero in 2020 to thirty-five in 2022. Each section reflects dramatic increases over the years.

Figure 3. The distribution of video years.

Video source and content

As shown in Table 2, there are significant differences among the videos from six different platforms, including their sources, medical specialties, content, knowledge about various diseases, and presentation formats (all p < 0.05). Most of the videos on TikTok, Kwai, Xiaohongshu, Bilibili, and Baidu were posted by physicians, accounting for 96, 90, 58, 61, and 79%, respectively. Most of these videos were related to Clinical medicine practitioners, with proportions of 82, 85, 95, 90, 86, and 75%, respectively. In contrast, WeChat videos were predominantly posted by independent users, accounting for 40%. This finding illustrates that professional identity remains key to building trust in the dissemination of health information about bipolar disorder. The relatively high proportion of non-professional content on WeChat may be due to the diversity of its official account ecology, but it also indicates that there is more non-professional information on the platform, which needs to be paid attention to. The video content across TikTok, Kwai, WeChat, Xiaohongshu, Bilibili, and Baidu primarily focused on disease knowledge, with proportions of 90, 78, 82, 83, 81, and 83%, respectively. Regarding disease knowledge, a significant portion of the videos on TikTok, Kwai, WeChat, and Xiaohongshu concentrated on symptoms, accounting for 47, 59, 79, and 60%, respectively. Bilibili primarily addressed symptoms and posttreatment caveats, accounting for 35 and 34%, respectively, while Baidu focused on symptoms and treatment, accounting for 46 and 41%, respectively. This divergence arises from the distinct focus of each platform: content on Bilibili emphasizes “post-treatment precautions,” whereas Baidu concentrates more on the “treatment” protocols themselves. This difference may stem from variations in user intent—Bilibili users are typically seeking more comprehensive knowledge on rehabilitation management, whereas Baidu users are often in the earlier stages of information seeking and thus focus more on diagnosis and treatment options. In terms of video presentation form, TikTok, Kwai, WeChat, Bilibili, and Baidu predominantly featured expert monologs, with proportions of 55, 64, 49, 53, and 70%, respectively. Xiaohongshu utilized expert monologs, visual pictures, and literature, with respective proportions of 42 and 38%. This is highly related to the community culture of Xiaohongshu software.

Table 2

Table 2. Sources and content of videos related to bipolar affective disorder.

Video quality and reliability assessments

In the GQS score, video ratings from TikTok and Kwai were significantly higher than those from Baidu (all p < 0.05). In JAMA scores, TikTok’s video ratings surpassed those of WeChat, Xiaohongshu, Bilibili, and Baidu (all p < 0.05); the video scores from Baidu and Bilibili were lower than those from Kwai (all p < 0.05); furthermore, WeChat’s video scores were higher than those from Baidu (p < 0.01). In the Modified DISCERN score, video ratings from TikTok and Kwai were higher than those from WeChat, Xiaohongshu, Bilibili, and Baidu (all p < 0.01). Regarding PEMAT Understandability and Actionability scores, Baidu’s video scores were greater than those of TikTok, Kwai, and WeChat (all p < 0.05). In terms of HONCODE scores, Baidu’s video ratings were lower than those of TikTok, Kwai, and WeChat (all p < 0.05), while Kwai’s video scores exceeded those of Xiaohongshu and Bilibili (all p < 0.05) (Figure 4). The higher reliability scores of TikTok and Kwai may be related to the fact that the two platforms have many certified medical professionals to create (Table 2), and the platform mechanism is more conducive to the screening and exposure of high-quality content. However, Baidu’s videos scored the highest in PEMAT Understandability and Actionability, perhaps because their content is more basic and straightforward, aiming to quickly answer users’ questions. In the HONCODE score of information transparency, Baidu was significantly lower than the other platforms, indicating that although it is easy to understand, there are shortcomings in the standardization of citing sources and the disclosure of authors’ qualifications.

Figure 4

Bar graph comparing video platforms TikTok, Kwai, WeChat, Xiaohongshu, BiliBili, and Baidu on six evaluative criteria: GQS, JAMA, Modified DISCERN, PEMAT Understandability, PEMAT Actionability, and HONcode. Each platform is represented in different colors, with significance levels indicated above the bars.

Figure 4. GQS scores, JAMA scores, modified DISCERN scores, PEMAT Understandability score, Actionability score, and HONCODE scores of short videos on bipolar affective disorder on different platforms (TikTok, Kwai, WeChat, Xiaohongshu, Bilibili, and Baidu). *p < 0.05, **p < 0.01, ***p < 0.001. ns: not significant at p ≥ 0.05.

The quality and popularity of videos from different sources, content, and presentation forms

Videos posted by physicians, hospitals, and news agencies received significantly higher ratings than those from independent users across GQS, JAMA, Modified DISCERN, PEMAT (Understandability and Actionability), and HONCODE assessments (all p < 0.05). News agencies scored higher than physicians in GQS, PEMAT (Understandability and Actionability), and HONCODE ratings (all p < 0.05) (Figure 5). This shows that the source of the video is a key factor in predicting its quality. Content released by professional organizations is more rigorously reviewed, more scientific, and of higher quality. News organizations are better at colloquia and good production and thus perform better in user experience (GQS) and intelligibility (PEMAT). Among the videos uploaded by physicians, the PEMAT Understandability score, Actionability score, and HONCODE score for videos posted by clinical medicine practitioners were significantly higher than those posted by TCM practitioners (all p < 0.001) (Figure 6). For bipolar disorder, the treatment of modern Western medicine and popular science content is more international and standardized, often with higher accuracy and clarity. TCM often scores low in standardized assessments due to its traditional expression and integration of multiple concepts. Additionally, the GQS score, JAMA score, Modified DISCERN score, and PEMAT Understandability score for different video content were significantly higher for outpatient scenario videos compared to those that focused on personal experiences and disease knowledge (all p < 0.05) (Figure 7). This may be since the outpatient scene significantly improves the professional credibility of the content and the sense of situational substitution by restoring the real diagnosis and treatment process, which is an efficient form of knowledge transmission. In contrast, disease knowledge explanation was difficult to go in-depth due to time limitations, and personal experience sharing was subjective and prone to containing atypical information, so the quality score was generally low.

Figure 5

Bar chart comparing scores across different evaluation criteria: GQS, JAMA, Modified DISCERN, PEMAT Understandability, PEMAT Actionability, and HONcode. The four groups—Physicians (dark green), Hospital (red), Independent Users (yellow), and News Agencies (light green)—show varying scores. Significant differences marked by asterisks, with

Figure 5. GQS scores, JAMA scores, modified DISCERN scores, PEMAT Understandability score, Actionability score, and HONCODE scores from different sources of videos related to bipolar affective disorder. *p < 0.05, **p < 0.01, ***p < 0.001. ns: not significant at p ≥ 0.05.

Figure 6

Bar chart comparing clinical medicine practitioners (green) and TCM practitioners (red) across six measures: GQS, JAMA, Modified DISCERN, PEMAT Understandability, PEMAT Actionability, and HONcode. No significant difference is noted for GQS, JAMA, and Modified DISCERN. Significant differences (p < 0.001) are observed for PEMAT Understandability, PEMAT Actionability, and HONcode. Error bars indicate variability.

Figure 6. GQS scores, JAMA scores, modified DISCERN scores, PEMAT Understandability score, Actionability score, and HONCODE scores for bipolar affective disorder-related videos from different medical specialties. *p < 0.05, **p < 0.01, ***p < 0.001. ns: not significant at p ≥ 0.05.

Figure 7

Bar chart comparing disease knowledge, personal experience, and outpatient scenarios across six categories: GQS, JAMA, Modified DISCERN, PEMAT Understandability, PEMAT Actionability, and HONcode. The chart uses green, red, and yellow bars for the three categories. Significant differences are noted with asterisks and

Figure 7. GQS scores, JAMA scores, modified DISCERN scores, PEMAT Understandability score, Actionability score, and HONCODE scores for bipolar affective disorder-related videos with different content. **p < 0.01, ***p < 0.001. *ns: not significant at p ≥ 0.05.

In videos focused on disease knowledge, the JAMA scores and Modified DISCERN scores for treatment-related videos were lower than those for definition-related videos (all p < 0.05). The HONCODE scores showed that the treatment videos were rated lower than the symptom videos (p < 0.05) and the videos on post-treatment precautions were rated lower than the symptom, prevention, and definition videos (all p < 0.05) (Figure 8). Among videos featuring different presentation forms, GQS scores, JAMA scores, and Modified DISCERN scores were significantly lower for patient video blogs compared to expert monologs, dialogues, Visual pictures, and literature (all p < 0.05). Regarding the PEMAT Understandability score, expert monolog videos received lower ratings than those of Visual pictures and literature (p < 0.05). In terms of actionability scores, both expert monolog videos and patient blogs were rated lower than Visual pictures and literature (all p < 0.05). Additionally, HONCODE scores for expert monolog and dialogue videos were lower than those for Visual pictures and literature (all p < 0.05) (Figure 9). This reveals a key paradox: high-quality content does not necessarily lead to high engagement. Patient video blog has a strong story and is easy to cause emotional interaction, but its quality score is the lowest due to subjectivity and non-professionalism. The form of “Visual pictures and literature “makes the content clear and easy to understand with high quality through intuitive charts and literature citations. At the same time, because of its dense information and strong practicality, users are more willing to collect and share, to achieve an effective balance between quality and dissemination.

Figure 8

Bar chart comparing six metrics: GQS, JAMA, Modified DISCERN, PEMAT Understandability, PEMAT Actionability, and HONcode. Each metric is represented by six colored bars for categories like Treatment, Symptom, Posttreatment Caveats, Prevention, Definition, and Reexamination. Statistical significance is denoted by symbols: ns (not significant), asterisks indicate varying levels of significance. Bars are similar in height, particularly high in HONcode and low in PEMAT Actionability. Y-axis is logarithmic from 0.1 to 10.

Figure 8. GQS scores, JAMA scores, modified DISCERN scores, PEMAT Understandability score, Actionability score, and HONCODE scores for bipolar affective disorder-related videos for different disease knowledge. *p < 0.05. *ns: not significant at p ≥ 0.05.

Figure 9

Bar chart comparing four types: expert monologue, dialogue, vlog of patients, and visual pictures/literature across six categories: GQS, JAMA, Modified DISCERN, PEMAT Understandability, PEMAT Actionability, HONcode. Bars are color-coded: green for expert monologue, red for dialogue, yellow for vlog, orange for visual content. Statistical significance is noted with symbols (*, **, ***) above comparisons. Data shows varying scores among categories.

Figure 9. GQS scores, JAMA scores, modified DISCERN scores, PEMAT Understandability score, Actionability score, and HONCODE scores for bipolar affective disorder-related videos for different presentation forms. *p < 0.05, **p < 0.01, ***p < 0.001. *ns: not significant at p ≥ 0.05.

As shown in Table 3, the popularity of videos, measured by likes, comments, saves, and shares, was compared across different sources, medical specialties, content types, disease knowledge, and presentation form. The results indicated that among various sources, videos produced by independent users received more likes and saves, videos from news agencies garnered more comments, and physicians ‘videos were shared more frequently. Additionally, videos related to clinical medicine practitioners showed a higher number of likes, comments, and saves (all p < 0.01). In terms of video content, personal experience videos were more likely to be liked (p = 0.007). Regarding disease knowledge, videos covering preventive measures, symptoms, and definitions outperformed others in terms of likes, comments, and saves (all p < 0.05). In terms of video presentation, patient video blogs performed better in terms of likes and comments, while videos featuring Visual pictures and literature were more likely to be saved and shared (all p < 0.05).

Table 3

Table 3. The popularity of videos from different sources with different content and presentation forms.

Correlation analysis

The results of Spearman correlation analysis showed that there were statistically significant positive correlations between the number of likes, comments, favorites, and shares (all p < 0.001; Spearman’s rho coefficient ranged from 0.66 to 0.94). There were also statistically significant positive associations between these interaction indicators and the assessment scores (GQS, JAMA, modified DISCERN, PEMAT intelligibility, and operability scores) (all p < 0.05). It is important to note that this observed association does not imply causation; videos with higher interaction metrics may not necessarily have higher quality, and vice versa. However, Effect sizes were generally weak to moderate (Spearman’s rho coefficient ranged from 0.09 to 0.34). In contrast, video duration showed consistent, statistically significant, and weak negative associations with all assessment scores (p < 0.05 for all; rho coefficients range, approximately − 0.09 to − 0.23), which indicates that longer videos tend to receive slightly lower quality scores. The different scoring systems themselves had strong positive correlations (p < 0.001 for all; rho coefficients ranged from 0.50 to 0.85), which indicates their high agreement in assessing video quality (Figure 10). The negative correlation between video length and quality is a noteworthy finding. One possible explanation is that excessively long content tends to be difficult to keep compact on short video platforms, which may lead to lengthy and distracting information and reduced viewer attention, thus causing lower ratings. However, other explanations still need to be considered, such as the fact that popular creators may tend to produce shorter content—regardless of its actual quality—and that these types of videos are generally more likely to garner higher engagement, which could also be a contributing factor to the phenomenon. However, the association may also be influenced by platform characteristics and assessment tools. For example, Bilibili, which is dominated by long videos, has a significantly higher average duration than TikTok and Kwai, but lower quality scores, such as JAMA and Modified DISCERN (Table 1). This negative correlation may be more a reflection of systematic differences in content between platforms rather than a direct quality decline due to duration. In addition, the assessment tool used was originally designed for text and was not sensitive enough to the narrative and educational value of long videos to objectively evaluate content that requires in-depth interpretation.

Figure 10

Correlation matrix showing relationships between various metrics, including Likes, Comments, Saves, Shares, and Days since published, among others. The matrix uses a color gradient from green (positive correlation) to red (negative correlation). Key correlations are marked with asterisks indicating significance levels.

Figure 10. Spearman correlation analyses between video variables and between video variables and GQS scores, JAMA scores, modified DISCERN scores, PEMAT Understandability score, Actionability score, and HONCODE scores. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.

Regression analysis of video variables and video quality

Exploratory analyses showed that all video quality scores were significantly correlated with interaction behaviors (liking, commenting, saving, and sharing; Supplementary Figure 1) and video duration (Supplementary Figure 2) (all p < 0.05); however, due to the platform-dependent nature of the interaction metrics, these associations were limited to descriptive correlations and are not suitable for outward generalization.

Discussion

Principal findings

This study analyzed the characteristics and quality of videos related to bipolar disorder across multiple platforms. Significant differences were observed in both audience engagement metrics and video attributes among the platforms. Specifically, in terms of engagement, Kwai had the highest median number of likes and fans, while TikTok excelled in terms of saves and shares. WeChat received a significantly greater number of comments compared to other platforms. In contrast, Bilibili exhibited the longest upload duration and the longest video length. Separately, when assessing informational quality, TikTok recorded higher scores across all five metrics compared to other platforms. These differences in engagement reflect the behavioral preferences and content dissemination characteristics of users on various platforms. The differences in quality scores reflect varying levels of adherence to scientific and editorial standards across platforms. Consistent with previous research (30), we found that low-quality mental health content contributed to misperceptions. As shown in Figure 5, videos posted by nonexpert creators generally scored lower on scientific accuracy, most likely perpetuating harmful stereotypes. The results again echo WHO’s call for digital platforms for mental health governance, for which platform-level interventions are urgently needed. Therefore, having a medical background is crucial, and it is even more essential to disseminate medical knowledge in a scientifically accurate manner. Additionally, the impact of video content created by professionals should be carefully evaluated (32). Secondly, the public must become more aware of the importance of utilizing reliable sources for obtaining health information to ensure that their understanding of diseases is not compromised by the varying quality of available content (33).

We contrasted the findings of this study with previous studies on video quality assessment in mental health, such as depression and schizophrenia (20, 21). Consistent with previous studies, the quality of health information in short video platforms is generally heterogeneous, and a similar trend was observed in BD related content in this study. However, we also identified some challenges unique to the disease: low public awareness and strong stigma (34), prompting authors to focus more on symptom interpretation and misunderstanding clarification; The bipolarity of its clinical manifestations (mania and depression) easily leads to sensationalized narratives, which aggravates the “quality and prevalence paradox.” The complexity of diseases also increases the difficulty of short video popularization. This study not only verified the general lack of quality of short video health information but also provided new specific insights for bipolar disorder.

This study found three main problems with how short videos share information about bipolar disorder. First, longer videos tend to receive lower quality ratings. Because bipolar disorder is complex, it is hard to explain fully in a short video. To keep videos brief, creators often simplify too much or leave out important details. This reduces the completeness and accuracy of the information, which lowers overall quality. Second, high-quality videos do not always get more engagement. Videos made by health professionals are more reliable and scientific, but personal vlogs by patients get more likes and comments. This “quality–popularity paradox” might occur because emotional and story-based content is shared more widely. Furthermore, platform algorithms are known to promote videos with more interactions, which could potentially amplify the reach of less scientific but more attractive content. Therefore, although the scientific quality of personal vlogs is lower, their storytelling expression is more likely to elicit high interaction. The feedback loop between algorithms and users may allow emotional content to get over-disseminated regardless of information accuracy. Interaction data should be viewed as an indicator of prevalence, not as a basis for scientific quality. High interaction does not equal high scientific validity, which requires the attention of users, creators, and platforms. Third, most video creators live in big cities like Beijing and Shanghai, where medical resources are plentiful. This means users in areas with fewer medical resources may have less access to good-quality information. Instead of reducing this inequality, short video platforms may be making regional health knowledge gaps worse.

Recommendations based on our results

Based on the findings of this study, we propose the following suggestions to improve the quality and dissemination of short videos related to bipolar disorder. Short video platforms should improve content recommendation algorithms by including quality metrics, not just interaction metrics, to increase the visibility of trusted information. They should also validate the credentials of medical creators through accreditation systems and provide evidence-based content guidelines that encourage the citation of authoritative sources. In addition, platforms should avoid restricting the flow of medically sensitive terms during the audit process, aiming to improve the efficiency of the dissemination of professional information, while ensuring that the content remains scientific and reliable. Healthcare professionals are advised to use clinical scenarios or visual AIDS such as animations and diagrams when producing videos to increase clarity and engagement, and to break down complex topics into short, focused segments. Viewers should critically evaluate sources, choose content from certified providers, and be wary of videos that rely heavily on personal stories or exaggerations. Finally, health organizations and policymakers should support the establishment of quality certification systems for BD related content and provide more training and resources for creators in underserved areas to reduce information inequity.

Strengths and limitations

The strength of this study lies in its comprehensive analysis of the quality of bipolar BD-related videos across six short-form video platforms, which examines the relationship between video quality and prevalence. However, several limitations should be noted. First, the analysis was restricted to six Chinese short-video platforms, limiting the generalizability of the findings to other social media platforms. In addition, because this study was conducted in China’s specific sociocultural and digital ecosystem, the findings may not be directly generalizable to other countries with different cultural attitudes toward mental health, healthcare systems, and platform governance policies. Second, approximately 100 relevant videos were selected from each platform, which may not fully represent the entire population of BD-related content available on such platforms. Third, the use of a cross-sectional design, as opposed to a cohort design, precludes the establishment of causal relationships. Fourth, the cross-sectional design of this study suggests that the relationships found between video quality and engagement measures, such as likes and shares, are only correlational and cannot establish causality. Longitudinal or experimental studies are needed to test the causal effect of video quality on engagement. Finally, the assessment tools used in this study (e.g., GQS, JAMA Benchmarks, etc.) were originally designed for text-based health messages (e.g., websites) and did not cover key features of short videos, such as audio-visual elements, fragmented representations, and duration constraints. This mismatch between tool and medium may introduce assessment bias. This issue is particularly acute in this field, especially in the Chinese context, where there is still a lack of specifically validated video assessment tools. Although we adapted existing tools to create a standardized assessment framework, which has been a common practice in the past, the development of validated native tools designed specifically for health-based short videos remains key to improving the measurement validity of future studies.

Conclusion

This study reveals that although clinical practitioners are the primary creators of bipolar disorder-related content on Chinese short-video platforms, often using expert-led monologs, the overall scientific depth and production quality remain inadequate. Significant inter-platform variability exists, with TikTok and Kwai showing higher reliability scores, while Baidu and WeChat offered more understandable but less transparently sourced content. A clear “quality–popularity paradox” was observed, whereby emotionally compelling yet scientifically weak videos achieved wider engagement. Furthermore, geographic disparities in content creation may worsen existing inequities in access to reliable mental health information. These findings underscore the need for improved content supervision, creator training, and algorithm refinement to promote accurate, accessible information on bipolar disorder.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

Author contributions

XQ: Writing – original draft, Writing – review & editing. QL: Writing – original draft, Software. BL: Software, Writing – review & editing. SL: Writing – review & editing, Investigation, Methodology, Project administration. LZ: Methodology, Writing – review & editing. RX: Project administration, Writing – review & editing. QW: Methodology, Supervision, Validation, Writing – original draft, Writing – review & editing. DZ: Funding acquisition, Methodology, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

The authors would like to thank all participants in this study for their assistance.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2025.1627885/full#supplementary-material

Footnotes

1. ^http://www.R-project.org

References

1. First, MB. Diagnostic and statistical manual of mental disorders, 5th edition, and clinical utility. J Nerv Ment Dis. (2013) 201:727–9. doi: 10.1097/NMD.0b013e3182a2168a

PubMed Abstract | Crossref Full Text | Google Scholar

2. Luo, J, Liang, M, Yi, P, and Li, X. The neuropsychological mechanisms of treatment of bipolar disorder and borderline personality disorder: activation likelihood estimation meta-analysis of brain imaging research. J Clin Psychiatry. (2023) 84:22r14463. doi: 10.4088/JCP.22r14463.

PubMed Abstract | Crossref Full Text | Google Scholar

3. Merikangas, KR, Jin, R, He, JP, Kessler, RC, Lee, S, Sampson, NA, et al. Prevalence and correlates of bipolar spectrum disorder in the world mental health survey initiative. Arch Gen Psychiatry. (2011) 68:241–51. doi: 10.1001/archgenpsychiatry.2011.12

PubMed Abstract | Crossref Full Text | Google Scholar

4. Humpston, CS, Bebbington, P, and Marwaha, S. Bipolar disorder: prevalence, help-seeking and use of mental health care in England. Findings from the 2014 adult psychiatric morbidity survey. J Affect Disord. (2021) 282:426–33. doi: 10.1016/j.jad.2020.12.151.

PubMed Abstract | Crossref Full Text | Google Scholar

5. McGrath, JJ, Al-Hamzawi, A, Alonso, J, Altwaijri, Y, Andrade, LH, Bromet, EJ, et al. Age of onset and cumulative risk of mental disorders: a cross-national analysis of population surveys from 29 countries. Lancet Psychiatry. (2023) 10:668–81. doi: 10.1016/s2215-0366(23)00193-1

PubMed Abstract | Crossref Full Text | Google Scholar

6. Crump, C, Sundquist, K, Winkleby, MA, and Sundquist, J. Comorbidities and mortality in bipolar disorder: a Swedish national cohort study. JAMA Psychiatr. (2013) 70:931–9. doi: 10.1001/jamapsychiatry.2013.1394

PubMed Abstract | Crossref Full Text | Google Scholar

7. Whiteford, HA, Degenhardt, L, Rehm, J, Baxter, AJ, Ferrari, AJ, Erskine, HE, et al. Global burden of disease attributable to mental and substance use disorders: findings from the global burden of disease study 2010. Lancet. (2013) 382:1575–86. doi: 10.1016/s0140-6736(13)61611-6

PubMed Abstract | Crossref Full Text | Google Scholar

8. World Health Organization. Mental health action plan 2013-2030. Geneva, Switzerland: World Health Organization (2021).

Google Scholar

9. China Internet Network Information Center (CNNIC). The 51st statistical report on internet development in China. Beijing, China: CNNIC (2023).

Google Scholar

10. National Radio and Television Administration (NRTA). Research report on the development of China’s online audiovisual industry. Beijing, China: NRTA (2024).

Google Scholar

11. Ventola, CL. Social media and health care professionals: benefits, risks, and best practices. P T. (2014) 39:491–520.

Google Scholar

12. Moorhead, SA, Hazlett, DE, Harrison, L, Carroll, JK, Irwin, A, and Hoving, C. A new dimension of health care: systematic review of the uses, benefits, and limitations of social media for health communication. J Med Internet Res. (2013) 15:e85. doi: 10.2196/jmir.1933

PubMed Abstract | Crossref Full Text | Google Scholar

13. Patel, V, Saxena, S, Lund, C, Thornicroft, G, Baingana, F, Bolton, P, et al. The lancet commission on global mental health and sustainable development. Lancet. (2018) 392:1553–98. doi: 10.1016/s0140-6736(18)31612-x

PubMed Abstract | Crossref Full Text | Google Scholar

14. Corrigan, PW, Druss, BG, and Perlick, DA. The impact of mental illness stigma on seeking and participating in mental health care. Psychol Sci Public Interest. (2014) 15:37–70. doi: 10.1177/1529100614531398

PubMed Abstract | Crossref Full Text | Google Scholar

15. Vigo, D, Thornicroft, G, and Atun, R. Estimating the true global burden of mental illness. Lancet Psychiatr. (2016) 3:171–8. doi: 10.1016/s2215-0366(15)00505-2

PubMed Abstract | Crossref Full Text | Google Scholar

16. Naslund, JA, Aschbrenner, KA, Marsch, LA, and Bartels, SJ. The future of mental health care: peer-to-peer support and social media. Epidemiol Psychiatr Sci. (2016) 25:113–22. doi: 10.1017/s2045796015001067

PubMed Abstract | Crossref Full Text | Google Scholar

17. Engel, E, Gell, S, Heiss, R, and Karsay, K. Social media influencers and adolescents' health: a scoping review of the research field. Soc Sci Med. (2024) 340:116387. doi: 10.1016/j.socscimed.2023.116387

PubMed Abstract | Crossref Full Text | Google Scholar

18. Mayor, E, and Bietti, LM. A social media study of portrayals of bipolar disorders on YouTube: content and thematic analyses. J Med Internet Res. (2025) 27:e67129. doi: 10.2196/67129

PubMed Abstract | Crossref Full Text | Google Scholar

19. Srivastava, K, Chaudhury, S, Bhat, PS, and Mujawar, S. Media and mental health. Ind Psychiatry J. (2018) 27:1–5. doi: 10.4103/ipj.ipj_73_18

PubMed Abstract | Crossref Full Text | Google Scholar

20. Zhang, W, Wang, M, Shu, H, Zhou, C, Zhang, C, Hu, C, et al. Evaluation of the content and quality of schizophrenia on TikTok: a cross-sectional study. Sci Rep. (2024) 14:26448. doi: 10.1038/s41598-024-75372-7

PubMed Abstract | Crossref Full Text | Google Scholar

21. Chen, Y, Yin, J, Ding, Y, Wang, C, Zhu, J, and Niu, L. Evaluation of the quality of depression-related information on Chinese websites and video platforms: a cross-sectional comparative analysis. Front Psych. (2024) 15:1408384. doi: 10.3389/fpsyt.2024.1408384

PubMed Abstract | Crossref Full Text | Google Scholar

22. Diviani, N, van den Putte, B, Giani, S, and van Weert, JC. Low health literacy and evaluation of online health information: a systematic review of the literature. J Med Internet Res. (2015) 17:e112. doi: 10.2196/jmir.4018

PubMed Abstract | Crossref Full Text | Google Scholar

23. Kata, A. A postmodern Pandora's box: anti-vaccination misinformation on the internet. Vaccine. (2010) 28:1709–16. doi: 10.1016/j.vaccine.2009.12.022

PubMed Abstract | Crossref Full Text | Google Scholar

24. Zhao, Y, and Zhang, J. Consumer health information seeking in social media: a literature review. Health Inf Libr J. (2017) 34:268–83. doi: 10.1111/hir.12192

PubMed Abstract | Crossref Full Text | Google Scholar

25. Statista. Number of monthly active TikTok users worldwide from January 2018 to March 2025. Statista GmbH (2025). Available online at: https://www.statista.com/statistics/1234567/tiktok-global-mau.

Google Scholar

26. BusinessofApps. TikTok statistics and facts. BusinessofApps Ltd; (2025). Available onlinen at: https://www.businessofapps.com/data/tiktok-statistics/.

Google Scholar

27. SimilarWeb. TikTok analytics: monthly visits, user demographics, and more. SimilarWeb Ltd. (2025). Available online at: https://www.similarweb.com/website/tiktok.com/.

Google Scholar

28. Technology K. (2024) Third quarter financial report. Kuaishou Inc. Available online at: https://ir.kuaishou.com/news-releases/news-release-details/kuaishou-technology-announces-third-quarter-2024-unaudited/.

Google Scholar

29. Inc B. Third quarter financial report. Bilibili (2024). Available online at: https://www.globenewswire.com/news-release/2024/11/14/2980919/0/en/Bilibili-Inc-Announces-Third-Quarter-2024-Financial-Results.html.

Google Scholar

30. Li, M, Liang, RB, Zhou, Y, Wang, K, Yang, R, Li, T, et al. Assessing the quality of myopia prevention videos on Chinese short video platforms: a cross-sectional content analysis by source. BMJ Open. (2025) 15:e102818. doi: 10.1136/bmjopen-2025-102818

PubMed Abstract | Crossref Full Text | Google Scholar

31. Zhu, W, He, B, Wang, X, Du, Y, Young, K, and Jiang, S. Information quality of videos related to esophageal cancer on tiktok, kwai, and bilibili: a cross-sectional study. BMC Public Health. (2025) 25:2245. doi: 10.1186/s12889-025-23475-9

PubMed Abstract | Crossref Full Text | Google Scholar

32. Pretorius, C, McCashin, D, and Coyle, D. Mental health professionals as influencers on TikTok and Instagram: what role do they play in mental health literacy and help-seeking? Internet Interv. (2022) 30:100591. doi: 10.1016/j.invent.2022.100591

PubMed Abstract | Crossref Full Text | Google Scholar

33. Sumayyia, MD, Al-Madaney, MM, and Almousawi, FH. Health information on social media. Perceptions, attitudes, and practices of patients and their companions. Saudi Med J. (2019) 40:1294–8. doi: 10.15537/smj.2019.12.24682

PubMed Abstract | Crossref Full Text | Google Scholar

34. Latifian, M, Abdi, K, Raheb, G, Islam, SMS, and Alikhani, R. Stigma in people living with bipolar disorder and their families: a systematic review. Int J Bipolar Disord. (2023) 11:9. doi: 10.1186/s40345-023-00290-y

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: bipolar disorder, short videos, health information, quality and reliability assessment, content analysis

Citation: Qi X, Lu Q, Li B, Liu S, Zhang L, Xiang R, Wang Q and Zhao D (2025) Videos in short video sharing platforms as a source of information on bipolar disorder: a cross-sectional content analysis study. Front. Public Health. 13:1627885. doi: 10.3389/fpubh.2025.1627885

Received: 13 May 2025; Accepted: 24 September 2025;
Published: 28 October 2025.

Edited by:

Iffat Elbarazi, United Arab Emirates University, United Arab Emirates

Reviewed by:

Momcilo Mirkovic, University of Pristina, Serbia
You Peng, National University of Singapore, Singapore
Tadeas Mares, Charles University, Czechia

Copyright © 2025 Qi, Lu, Li, Liu, Zhang, Xiang, Wang and Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dongrong Zhao, MTM5MTkxMzE3NjdAMTYzLmNvbQ==; Qilong Wang, d2FkZTIwMDAyM0BvdXRsb29rLmNvbQ==

^†These authors share first authorship

^‡ORCID: Qi Lu, orcid.org/0009-0006-2787-3081

Sha Liu, orcid.org/0009-0004-9649-4291

Qilong Wang, orcid.org/0009-0003-3973-8475

Dongrong Zhao, orcid.org/0009-0007-7933-5709

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.