Abstract
Post-traumatic stress disorder (PTSD) is a very common condition with more than 3 million new cases per year in the US alone. The right diagnosis in a timely manner is key to ensuring a prompt treatment that could lead to a full recovery. Unfortunately, avoidance of trauma reminders, social stigma, self-presentation, and self-assessment biases often prevent individuals from seeking timely evaluation, leading to delays in treatment and suboptimal outcomes. Previous studies show that various mental health conditions are associated with distinct patterns of language use. Analyzing language use may also help to avoid response bias in self-reports. In this study, we analyze text data from online forum users, showing that language use differences between PTSD sufferers and controls. In all groups of PTSD sufferers, the usage of singular first-person pronouns was higher and that of plural first-person pronouns was lower than in control groups. However, the analysis of other word categories suggests that subgroups of people with the same mental health disorder (here PTSD) may have salient differences in their language use, particularly in word usage frequencies. Additionally, we show that word usage patterns may vary depending on the type of the text analyzed. Nevertheless, more studies will be needed to increase precision by further examine a variety of text types and different comorbidities. If properly developed, such tools may facilitate earlier PTSD diagnosis, leading to timely support and treatment, which are associated with better outcomes.
Introduction
Currently, in the US alone, nearly 6 million people are disabled by mental illness. This number rises each day by more than 400. This is not just a health crisis but also an economic one: direct and indirect global economic costs of mental illness were estimated at US$2.5 trillion as of 2010. As reported by the National Center for post-traumatic stress disorder (PTSD), 7–8% of the US population will experience PTSD at some point in their lifetime. To maintain economic and social stability, mental illness must be prevented and treated. In the past few decades, considerable progress has been made in understanding mental illness (Norquist and Hyman, 1999), leading to the development of more effective treatment options. Nonetheless a large portion of this population remains untreated. Nationwide epidemiologic studies show that over half of the people with mental health disorders who would benefit from treatment do not receive it (Regier et al., 1993; Kessler et al., 2001).
Numerous studies have tried to explain why some people with mental illness seek treatment while others do not. In both military and civilian populations, common factors include fear of stigmatization, as well as self-presentation and self-assessment biases (Olfson et al., 1998; Perlick et al., 2001; Link et al., 2008; Ahmedani, 2011; Bonfils et al., 2018).
Another important factor is the inability of the affected individuals to recognize the symptoms (Ahmedani, 2011), which prevents help-seeking and at best delays diagnosis and treatment. Most mental health disorders are much easier to control and reverse if they are diagnosed and treated early. Currently used mental health evaluations, such as the those based on DSM (Diagnostic and Statistical Manual of Mental Disorders), require specialized help that is not always promptly available. However, early and accurate diagnosis is key to facilitating timely and effective treatment that could lead to a full recovery (Acosta et al., 2013; Burnam et al., 2014; Brahm et al., 2016; Jones et al., 2017; Horn and Feder, 2018). In that context, individuals with mental health concerns could benefit from an anonymous and easily accessible self-testing tool that analyzes language usage patterns and correlates such patterns to mental health conditions or risk thereof. While unsuitable for any definitive diagnosis, such a tool may prompt at-risk individuals to seek timely mental health evaluation and care.
Adding to the problem, the evidence from the broadly studied “Halo effect” (definition: the extrapolation of a general impression on evaluations of individual attributes of a person) (Sappenfield, 1971; Cockayne and Samuelson, 1978; Smith, 1988) and Dr. Daniel Kahneman’s and Dr. Amos Tversky’s work show a general assessment bias in the observer, even in highly trained humans (Kahneman, 2003). Already in the 1960s Dr. Hoffman’s and colleagues’ work which was summarized by Dr. Lewis Goldberg, showed that in some situations simple algorithms could outperform clinical specialists in diagnosing diseases (Goldberg, 1968).
To address these issues, researchers in mental health related fields started using implicit or indirect evaluation methods (Pressman and Cohen, 2007; Thoma et al., 2013). In particular, language analysis may help to bypass self-presentation and self-assessment biases common in self reports (Mehl and Pennebaker, 2003; Pennebaker et al., 2003; Pennebaker, 2011). While a wide variety of semantic, syntactic and other language usage patterns may be associated with the state of a person’s mental health, word usage frequencies appear to be the least affected by incidental variations, such as education level. Some word categories appear to be especially significant for assessing an individual’s mental state via language analysis. For instance, function words, such as pronouns, account for only about 1% of all vocabulary words, but represent approximately 15% of the total words used, and appear to be strongly linked to underlying mental processes (Miller, 1995; Tausczik and Pennebaker, 2010). Language usage, including word category frequencies, is affected by a variety of factors, including age (Pennebaker and Stone, 2003), gender (Mehl and Pennebaker, 2003), and personality (Hirsh et al., 2009). However, for certain word categories, the state of an individual’s mental health may have the greatest impact on word usage frequency (Pennebaker, 1993). In fact, the approach to access mental states by analyzing patterns of language usage is supported by a growing body of research in the fields of psychology and neuroscience (Pennebaker et al., 2003). Previous studies using Linguistic Inquiry and Word Count (LIWC) have found that it is possible to characterize depression and other mental states through analyzing natural language use (Tausczik and Pennebaker, 2010; Booker et al., 2018; Kaplow et al., 2018; Tackman et al., 2018). For example, Stirman and Pennebaker showed that suicidal poets displayed a higher usage of first person pronouns in their texts and less first-plural pronouns than non-suicidal poets (Stirman and Pennebaker, 2001). Rude, Gortner, and Pennebaker demonstrated that the language of depressed students has a higher frequency of first person singular pronouns and negative emotion words, and a lower frequency of positive emotion words, compared to students that never experienced depression (Rude et al., 2004). Together, these results support the social engagement/disengagement model of depression (Fisher and Chon, 1989), and Pyszcynski and Greenberg’s self-awareness theory of depression (Pyszczynski and Greenberg, 1987). Furthermore, clinicians have observed higher use of first singular pronouns in depressed patients (Bucci and Freedman, 1981; Weintraub et al., 1981), and lower second-person pronoun usage, which may reflect a decreased sense of community (Simmons et al., 2005).
People with PTSD were shown to use significantly fewer second-person pronouns (Coppersmith et al., 2014b). Additionally, studies have shown differences in the use of causal and cognitive words during recalls of negative events (Boals and Klein, 2005), which may be an indicator of the attempt to rationalize and resolve traumatic experiences (Kross and Ayduk, 2008). The analysis of additional linguistic markers, such as the use of negative emotion words, cognition words, and insight words predicted the future mental health of college students who wrote about traumatic events (Pennebaker, 2001). Furthermore, the presence of words relating to death and dying was an indicator of treatment-resistant PTSD (Alvarez-Conrad et al., 2001). Consequently, the analysis of linguistic elements in different text types could be crucial for understanding cognitive mechanisms associated with trauma and may hold a valuable potential to diagnose and predict PTSD symptoms and subtypes. If properly developed, such technology could help individuals to self-test and public health organizations to screen for possible mental health conditions and prompt further evaluation when warranted, potentially preventing disorders from becoming chronic, debilitating, and difficult to treat.
While language analysis programs could be an effective means for tentative self-diagnosis and mental health screening, the development of such programs is impeded by the gaps and inconsistencies in our current understanding of the links between language usage and mental illness. The few existing language analysis programs are based on data from generalized mentally ill populations that commingle otherwise fairly distinct subgroups. This often results in disregarding salient individual differences and may lead to misdiagnosis.
A major hurdle for developing more specific and reliable language based diagnostic tools has been the lack of variety in the available language data. With the growth of social media, internet forums are shown to present a useful source of naturalistic writing from people that use these forums as an anonymous and inexpensive self-help tool, especially for stigmatizing mental health illnesses such as PTSD (Baker et al., 2003; Lampe et al., 2003; Berger et al., 2005). Also, a practically useful and versatile pre-diagnostic and screening language analysis tool should produce salient results even from relatively short text samples common in social media and modern communications. Recent studies show that using an automated word counting approach is an efficient way to characterize the language of online groups (Lyons et al., 2006). Patterns of emotional and cognitive expression in internet support groups were used to research depression and other affective disorders (Houston et al., 2002; Griffiths et al., 2009), and as a predictor of future mental health in cancer patients and recovering anorexics (Lieberman and Goldstein, 2006; Lyons et al., 2006).
We hypothesize that there are salient language use characteristics throughout different text samples from people affected by PTSD. The goal of this project is to identify significant indicators of the common PTSD-related pathologies, as it could help to develop novel diagnostic tools for broad screening of the general population.
Materials and Methods
Data Collection and Categorization
In this study, we analyze text data from online forum users and differentiate between people that have PTSD from those that do not. Data for this study are composed of text samples from public forums collected and screened according to previously described procedures (Coppersmith et al., 2014a). Forums users often discuss their health for various reasons, such as to seek support or advice. More specifically to mental health, users may choose an anonymous forum due to the social stigma associated with mental illness. Many forum users describe their diagnosis in a large variety of mental health conditions (Coppersmith et al., 2015). In this study we focused on PTSD. A human editor assessed each description of diagnosis and removed quotes or other disingenuous text sections (Example of disingenuous statements of diagnosis: “Omg I just messed up my makeup! I’m literally crying I have PTSD” – anonymous).
To ensure that each included forum user has a sufficient amount of data, we ensured that each user had at least 100 words, however over 80% had between 200 and 500 words.
Ideally, age and gender should be controlled when performing mental health research. A small amount of studies have taken matched samples into consideration for example, by examining a specific subgroup of population such, as college students (Rude et al., 2004). In order to have age- and gender-matched groups, we analyzed each text sample and its language as described in previously published studies (Sap et al., 2014). To obtain our final data set for each PTSD forum user, we determined or estimated the gender and age using the user profile self-description and by analyzing the user’s other posts. When compiling control groups, we selected the text samples from the users with matching estimated gender and the closest estimated age.
Data
Text samples for the PTSD group were collected from various internet forums dedicated to PTSD1 (Table 1). A large number of potentially suitable text samples were manually screened and either discarded or sorted into the groups of interest-based on the content of the excerpt itself, forum thread context, and other available anonymous information. Previous studies often compare data from people affected by PTSD vs. healthy people/general population data. In our study, we also used general population data as a control group. However, it is also important to compare PTSD data to text samples from traumatized individuals that didn’t develop PTSD to control for trauma-related word patterns unrelated to PTSD. This also allows for the possible detection of resilience-related word patterns. Therefore, our first control group consists of data from firefighter forums, where users discuss work, daily life and extreme/traumatic situations, yet, are not suffering from PTSD (Table 1). Our second control group consists of general population data (Table 1).
TABLE 1
| Subgroup | Number of text samples (excerpts) |
| Individuals with PTSD sharing daily events | 21 |
| Individuals who suffered trauma within last 12 month and are at risk for PTSD | 16 |
| Individuals who suffered trauma years ago and developed PTSD | 21 |
| Military veterans, police officers, firefighters with PTSD | 19 |
| Firefighters without PTSD (Control 1) | 19 |
| General population (Control 2) | 26 |
Description of PTSD subgroups and control groups for language use analysis.
Datasets used for study (total n = 122).
We also distinguished between narrative related to the trauma and daily life event narratives (Table 1). Forums have various discussion group categorizations. For “trauma narratives” we choose text that discussed the trauma event explicitly. We collected text samples for “daily life narratives” from journal entry discussion groups where forum users shared their daily lives with other users but made no explicit mention of trauma events (Table 1).
Analysis
We used word usage frequency analysis conceptually similar to that in LIWC methodology and program (Pennebaker et al., 2003). LIWC was shown to be effective in detecting a number of psychologically salient language usage patterns. We developed a custom software program that extended LIWC approach by combining it with character language models (CLMs) for additional word matching features, as well as additional and/or modified word categories tailored to PTSD. This provided a score even for very short texts (McNamee, 2004). Word matching included pattern matching whole words, roots, salient word parts, simple stemming, split verb/expression stemming, and others.
Based on literature (Pennebaker, 1993, 2001; Stirman and Pennebaker, 2001; Pennebaker and Stone, 2003; Kenardy et al., 2007; Tausczik and Pennebaker, 2010; Badger et al., 2011; Jaeger et al., 2014; Mott et al., 2015; Knutsen and Jensen, 2017; Westerman et al., 2017; Rometsch-Ogioun El Sount et al., 2018) and text screening, the following word categories were determined to be potentially salient for PTSD and/or depression and were used in this study:
- •
Singular first-person pronouns (related to self only);
- •
Plural first-person pronouns (related to group including self);
- •
Words positively correlated with depression (Stirman and Pennebaker, 2001; Pennebaker et al., 2003; Rude et al., 2004; Tausczik and Pennebaker, 2010; Baddeley et al., 2011; Mowery et al., 2017);
- •
Negative emotions;
- •
Mortality, death, and dying;
- •
Indicators of cognitive complexity;
- •
Words indicating causative relationships.
For each word category and population group, a standard set of usage frequency statistical data was calculated (using stats and stats-lite npm software modules), including mean, median, variance, standard deviation, and percentile distribution. Statistical significance of group differences was calculated using t-test, ANOVA, and post hoc Tukey test. For each user, we scored each text based on the character n-grams in the text with the CLMs for the condition. This method followed previous work on predicting mental health in social media (Coppersmith et al., 2014a).
This study is an analysis of existing, de-identified and publicly available data. No sensitive information was collected, and the study data is completely anonymous. As by regulation of §46.104, if the project does not include any interaction or intervention with human subjects or include any access to identifiable private information, then the project does not require IRB review and is exempt.
Results
The present study employed a computerized text analysis to examine language usage patterns in people affected by PTSD. We observed differences in linguistic markers in posts of similar word count written by different groups of people affected by PTSD and two control groups.
Texts From People With PTSD Whose Trauma Occurred Years Ago Differ From Those Written by PTSD Sufferers With Recent Trauma
We distinguished between people whose trauma occurred recently (within 12 months) and those who experienced trauma years ago, including childhood trauma. In both PTSD groups singular first-person pronoun usage was higher than in the two control groups (Mdiff = 0.068, Mdiff = 0.087; Mdiff = 0.061, Mdiff = 0.08, p < 0.001, Figure 1A). First person plural pronouns were lower in frequency in both PTSD groups compared to control 1, although in the group where trauma occurred years ago the usage was the lowest (Mdiff = 0.014, Mdiff = 0.019, p < 0.001, Figure 1B). Compared to control 2, only text from the PTSD years ago group showed a significant difference in plural first-person pronoun usage (Mdiff = 0.006; p > 0.05; Mdiff = 0.01, p < 0.05, Figure 1B). Compared to control 1, the usage of negative emotion words was only different in the PTSD group where trauma occurred years ago (Mdiff = 0.0097, p < 0.001, Figure 1C). However, compared to control 2, both PTSD groups showed a significantly higher usage of negative emotion words (Mdiff = 0.0096, p < 0.001; Mdiff = 0.0149, p < 0.001, Figure 1C). The usage of cognitive words was higher in both PTSD groups compared to control 1, but not control 2 (Mdiff = 0.038, Mdiff = 0.049, p < 0.001; Mdiff = 0.003, Mdiff = 0.007, p > 0.05, Figure 1D). In contrast to the previously published data (Jaeger et al., 2014), we could not detect a significant difference in causation words (p > 0.05, Figure 1E). Death-related word usage was higher in both PTSD groups compared to control 2 (Mdiff = 0.0019, p < 0.05; Mdiff = 0.0023, p < 0.001, Figure 1F).
FIGURE 1
Language Usage Differences Detected Comparing People With PTSD Whose Trauma Occurred in a Work Setting to PTSD Sufferers With Personal Life Related Trauma
We identified a set of variables in language usage in two different groups of PTSD, (1) people who went through trauma at work, including veterans, police officers, and firefighters (professional life), and (2) people that experienced trauma in their personal lives (civilians). The usage of singular first-person pronouns was higher in civilians, but also high in the professional group compared to both controls (Mdiff = 0.086, Mdiff = 0.052, p < 0.001; Mdiff = 0.102, Mdiff = 0.0683, p < 0.001, Figure 2A). In both groups, plural first-person pronouns occurred significantly less frequently than in the control 1. However, compared to control 2, only civilians affected by PTSD had a lower occurrence of plural first-person pronouns (Mdiff = 0.019, Mdiff = 0.015, p < 0.001; Mdiff = 0.0103, p < 0.01, Mdiff = 0.0068, p > 0.05, Figure 2B). Negative emotion words were used more frequently in civilians compared to control 1 (Mdiff = 0.0097, p < 0.05, Figure 2C). Compared to control 2, both PTSD groups had a higher frequency of negative emotion (Mdiff = 0.0149, p < 0.001; Mdiff = 0.0086, p < 0.05, Figure 2C). Analysis of cognitive words showed a higher usage frequency in both PTSD groups than in control 1 (Mdiff = 0.049, p < 0.001; Mdiff = 0.0503, p < 0.001, Figure 2D), but no difference compared to control 2 (p > 0.05, Figure 2D). There was no significant difference in causation word usage between the two PTSD groups and controls (p > 0.05, Figure 2E). Death-related word occurrence was higher in the professional life-related trauma group than in both controls with the difference being greater when compared to control 2 (Mdiff = 0.0033, p < 0.05; Mdiff = 0.004, p < 0.001, Figure 2F).
FIGURE 2
Word Patterns Vary in Different Text Types From People With PTSD: Comparing Daily Life Narratives to Trauma Narratives
We compared trauma narratives and daily life narratives from people with PTSD. In both PTSD related text types, singular first-person pronouns occurred more often than in controls. We found the highest frequency in the trauma narratives written by individuals whose trauma occurred years ago (Mdiff = 0.067, Mdiff = 0.087, Mdiff = 0.069, p < 0.001; Mdiff = 0.0827, Mdiff = 0.102, Mdiff = 0.0844, p < 0.001, Figure 3A). We detected a significantly lower usage of plural first-person pronouns in daily narratives compared to recent trauma narratives (Mdiff = 0.0063, p < 0.05, Figure 3B). Nevertheless, all three PTSD groups used fewer plural first-person pronouns than control 1, but compared to control 2 only the recent trauma group and daily narratives showed a significant difference (Mdiff = 0.019, Mdiff = 0.014, Mdiff = 0.02, p < 0.001; Mdiff = 0.0069, Mdiff = 0.0133, p < 0.001, Mdiff = 0.012, p > 0.05, Figure 3B). Negative emotion word usage was the highest in the daily narratives, but it was also higher than the controls in the narratives where trauma had occurred years ago. There was no significant difference in negative emotion word usage between recent trauma related texts and controls (Mdiff = 0.008, p < 0.05; Mdiff = 0.0135, p < 0.05; Mdiff = 0.009, p < 0.05, Mdiff = 0.015, p > 0.05, Figure 3C). Cognitive word frequency was the highest in daily narratives of PTSD sufferers. In all three PTSD groups, it was higher than in control 1, but not significantly higher than in control 2 (Mdiff = 0.039, Mdiff = 0.049, Mdiff = 0.066, p < 0.001; p > 0.05, Figure 3D). There was no significant difference in causation word usage between PTSD groups and controls (p > 0.05, Figure 3E). Death-related word usage was increased in both trauma text types compared to control 2, but not compared to control 1 (Mdiff = 0.0019, Mdiff = 0.0023, p < 0.05; p > 0.05; p > 0.05, Figure 3F).
FIGURE 3
Language Analysis Suggests Positive but Variable Correlation Between Depression and PTSD in Different Groups of Affected People
In this section of our study, we used a recently developed procedure for analyzing depression through language usage, which is based on previous studies (Stirman and Pennebaker, 2001).
We found the highest usage frequency of words positively correlated with depression in PTSD sufferers who experienced trauma years ago. However, the difference was only statistically significant compared to control 1, not to control 2 (Mdiff = 0.011, p < 0.001; p > 0.05, Figure 4). The second highest usage frequency of depression-correlated words was found in the PTSD group with professional life-related trauma (Mdiff = 0.007, p < 0.05; Mdiff = 0.009, p < 0.05, Figure 4). However, depression-correlated word usage was somewhat elevated in all PTSD groups we analyzed. Comparison of daily narratives from PTSD sufferers and daily narratives from the general population (control 2) points to an overall comorbidity of PTSD and depression as indicated by word usage (Mdiff = 0.0077, p < 0.01, Figure 4).
FIGURE 4
Discussion
Our analysis, based on text samples from PTSD online forums, shows that the language use in PTSD appears to vary from that of controls in a number of word categories, including first and third person pronouns, negative emotion words, death- and dying-related words, as well as words indicating cognitive complexity (Table 2). Furthermore, our data shows word usage differences between distinct groups of PTSD sufferers. We also found that word usage patterns are affected by the type of text. Overall, our results demonstrate the importance of taking into account additional factors, such as population subtypes and text types when analyzing language usage as a reflection of mental states and their pathologies. Considering the results of a study examining predictors of PTSD after burn injuries, the lack of psychological support seems to be a stronger predictor of PTSD than the severity or nature of the trauma (Perry et al., 1992), the development of feasibly accurate and accessible analytical tools facilitating early diagnosis appears to have value for both prevention and effective treatment of PTSD. Nonetheless, our research is limited by using online forum data set types, and more studies will be needed to increase precision by further examination of variable data resources, and a number of important or conflicting variables, such as specific patient subgroups, situational background, and co-morbidities.
TABLE 2
| Group | Singular 1-st person pronouns | Plural 1-st person pronouns | Negative emotion | Cognitive words | Causation words | Death related words | Depression correlation |
| PTSD 12 months | Increased | Low | Regular | Regular | Regular | Increased | Increased |
| PTSD years ago | Increased | Lower | Increased | Regular | Regular | Increased | Increased |
| PTSD daily | Increased | Lowest | Increased | Slight increase | Regular | Regular | Increased |
| PTSD professional | Increased | Lower | Increased | Regular | Regular | Increased | Less increased |
Summary of data analysis of language use in PTSD compared to controls.
We found the greatest differences in the usage of first-person pronouns. All groups of PTSD sufferers, regardless of text type, had a markedly higher usage of singular first-person pronouns than the controls. This suggests an increased focus on oneself (Campbell and Pennebaker, 2003; Pennebaker et al., 2003), possibly at the expense of the focus on, and collaboration with, others. In fact, we found markedly lower usage of the plural first-person pronouns in text types of daily narratives and in civilians with PTSD whose trauma occurred years ago. On the other hand, individuals with recent trauma had an only mild-to-moderate decrease in plural first-person pronoun usage, which may reflect the dynamics of early stage PTSD development. Low occurrence of plural first-person pronouns in PTSD texts, particularly in long-standing PTSD, may also be an indicator of comorbidity with depression and a predictor of suicidal tendencies, which is consistent with the social integration model of suicide (Stirman and Pennebaker, 2001).
The observed pattern of markedly higher singular first-person pronoun usage, especially when combined with lower frequency of the plural pronouns, may be salient, not only for detecting existing PTSD but also, and perhaps more importantly, for assessing the risk of developing PTSD in individuals with recent trauma, i.e., during the time window when intervention and treatment tend to be the most effective. Studies show that the type of trauma is not, by itself, a reliable predictor of PTSD, whereas certain characteristic signs and symptoms, such as persistent avoidance, are indicative (Dancu et al., 1996; Panasetis and Bryant, 2003; Briere et al., 2005). In that context, we should note the earlier findings that higher usage of singular first-person pronouns is associated with greater culpability and shame, which are predictive of developing PTSD during the first year post-trauma (Kubany et al., 2003; Negrao et al., 2005). Also, high singular first-person pronoun use in PTSD groups may be associated with trauma-related persistent dissociation, which is a strong predictor of PTSD (Briere et al., 2005). Overall, the observed pronoun usage patterns appear to have value for detecting both PTSD risk and onset.
We also found a moderately higher use of negative emotion words in all PTSD groups relative to controls. This word category is known to strongly correlate with the symptoms of depression. Notably, among PTSD groups, negative emotion words were higher in the groups with long-standing PTSD and lower (but still elevated relative to control) in the recent trauma group. We also analyzed the usage of depression-correlated words, a composite category comprising a representative sample of words commonly correlating with depression. We found that the usage of depression-correlated words was increased in all PTSD groups, but more dramatically in the group with long-standing PTSD. Our findings are consistent with prior research indicating that PTSD comorbidity with depression is common (ca. 50%) and tends to occur in people with more severe and persistent forms of PTSD (Flory and Yehuda, 2015). This hypothesis appears to be further supported by our data showing higher death-related word usage in the PTSD groups with either a very recent trauma or a trauma that occurred years ago. However, while the usage frequencies of death-related words in these groups are elevated to similar levels, the reasons may be different. In the recent trauma group, the increase in death-related word usage may reflect an acute reaction and processing of the recent trauma, especially if such trauma involved witnessing fatalities. In the group who experienced trauma far in the past, it may reflect a higher prevalence of severe, chronic depression that is linked to a higher risk of suicide (Stirman and Pennebaker, 2001; Indu et al., 2017; Ophuis et al., 2018; Dong et al., 2019). Proper interpretation of such group-related distinctions may improve the accuracy of pre-diagnostic evaluation and assist in guiding therapeutic intervention. Furthermore, it would be critical to recognize the subgroup of PTSD sufferers showing comorbidity with depression, because they generally have a higher level of neurocognitive distress (Blanchard et al., 1998; Nijdam et al., 2013; Flory and Yehuda, 2015) and are at a greater risk of suicide than people with PTSD alone (Campbell et al., 2007; Ramsawh et al., 2014). It would be also crucial to determine if current treatment options are effective in people showing comorbidity of PTSD with depression.
There are a number of different comorbidities that we didn’t analyze in this study; for example, substance use disorders, and other anxiety disorders. However, this overlap leads in many cases to diagnostic confusion and especially to the underdiagnosis of PTSD when trauma histories were not collected (Brady et al., 2000). It will be necessary to further research these various comorbidities in future studies. In our cognitive mechanism word analysis, we distinguished between causation and cognitive words. There were no significant differences in causation word frequencies between PTSD groups and controls. Cognitive word usage was increased in all PTSD groups relative to control 1 but was similar to control 2. This difference may be the most meaningful for the professional-life related PTSD group for which control 1 (firefighters without PTSD) is a better match. Cognitive word usage was somewhat higher in the daily narratives of PTSD sufferers than in trauma narratives, which is in line with previous research (Rubin, 2011).
It is essential for both research and clinical practice to be able to accurately assess psychological well-being in order to identify individuals who are at risk or have developed PTSD after trauma. Within that population, it is also critically important to detect cases at higher risk for suicide in view of a known relationship between traumatic experiences and suicidal behaviors (Knox, 2008).
In a future study, it would be interesting to conduct the analysis of the data sets from the perspective of sexual battery/abuse-related trauma, especially for the individuals who suffered trauma years ago and developed PTSD. However, we excluded this perspective from the present study because we wanted to establish salient differences across broader populations and different trauma backgrounds in order to develop an easy-to-use PTSD pre-diagnostic tool that did not require too much information from the traumatized person, since such requirement could hinder engagement and timely diagnosis.
Explicit screening methods are often deficient due to biases in self-assessment, self-presentation, and self-reporting, as well as avoidance due to lack of anonymity. Implicit screening methods, such as language analysis, have the capability to increase accuracy and reliability used either alone or in conjunction with explicit screening. Furthermore, this type of screening can provide anonymous self-prediagnosis without the risk of social stigma, thus reducing the obstacles to seeking immediate care. Facilitating prompt care could decrease PTSD rates and/or severity in people who have suffered a recent trauma. Further studies should help improve the accuracy and practical utility of language usage analysis for PTSD and other conditions. Such studies may involve further examining a variety of salient or confounding factors, such as different patient subgroups, situational context, co-morbid conditions, and variable dataset resources. Analyzing language use patterns other than word frequencies, such as syntactic and semantic structures, may yield additional insights. Overall, as a supplementary tool, language analysis has an advantage of being economical and time-efficient and also provides a convenient option for anonymous self-prediagnosis and screening.
Statements
Data availability statement
The data that support the findings of this study are available from the corresponding author (GT), upon reasonable request.
Ethics statement
This study is an analysis of existing, de-identified, and publicly available data. No sensitive information was collected, and the study data is completely anonymous. As by regulation of §46.104, if the project does not include any interaction or intervention with human subjects or include any access to identifiable private information, then the project does not require IRB review and is exempt.
Author contributions
CCu and GT conceived of the presented idea. GT developed the theory and performed the computations. CCu and CCa verified the analytical methods. KM assisted GT. All authors discussed the results and contributed to the final manuscript.
Acknowledgments
We would like to thank Emma McBrian and David Ashurov for their help in categorizing the data.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Footnotes
References
1
AcostaJ.RamchandR.JaycoxL. H.BeckerA.EberhartN. K. (2013). Interventions to prevent suicide: a literature review to guide evaluation of California’s mental health prevention and early intervention initiative.Rand Health Q.2:2.
2
AhmedaniB. K. (2011). Mental health stigma: society, individuals, and the profession.J. Soc. Work Values Ethics841–416.
3
Alvarez-ConradJ.ZoellnerL. A.FoaE. B. (2001). Linguistic predictors of trauma pathology and physical health.Appl. Cogn. Psychol.15S159–S170. 10.1002/acp.839
4
BaddeleyJ. L.DanielG. R.PennebakerJ. W. (2011). How Henry Hellyer’s use of language foretold his suicide.Crisis32288–292. 10.1027/0227-5910/a000092
5
BadgerK.RoyseD.MooreK. (2011). What’s in a story? A text analysis of burn survivors’ web-posted narratives.Soc. Work Health Care50577–594. 10.1080/00981389.2011.592114
6
BakerL.WagnerT. H.SingerS.BundorfM. K. (2003). Use of the Internet and e-mail for health care information: results from a national survey.JAMA2892400–2406. 10.1001/jama.289.18.2400
7
BergerM.WagnerT. H.BakerL. C. (2005). Internet use and stigmatized illness.Soc. Sci. Med.611821–1827. 10.1016/j.socscimed.2005.03.025
8
BlanchardE. B.BuckleyT. C.HicklingE. J.TaylorA. E. (1998). Posttraumatic stress disorder and comorbid major depression: is the correlation an illusion?J. Anxiety Disord.1221–37. 10.1016/s0887-6185(97)00047-9
9
BoalsA.KleinK. (2005). Word use in emotional narratives about failed romantic relationships and subsequent mental health.J. Lang. Soc. Psychol.24252–268. 10.1177/0261927x05278386
10
BonfilsK. A.LysakerP. H.YanosP. T.SiegelA.LeonhardtB. L.JamesA. V.et al (2018). Self-stigma in PTSD: prevalence and correlates.Psychiatry Res.2657–12. 10.1016/j.psychres.2018.04.004
11
BookerJ. A.GraciM. E.HudakL. A.JovanovicT.RothbaumB. O.ResslerK. J.et al (2018). Narratives in the immediate aftermath of traumatic injury: markers of ongoing depressive and posttraumatic stress disorder symptoms.J. Trauma Stress31273–285. 10.1002/jts.22271
12
BradyK. T.KilleenT. K.BrewertonT.LuceriniS. (2000). Comorbidity of psychiatric disorders and posttraumatic stress disorder.J. Clin. Psychiatry61(Suppl. 7), 22–32.
13
BrahmP.CortazarA.FillolM. P.MingoM. V.VielmaC.AranguizM. C. (2016). Maternal sensitivity and mental health: does an early childhood intervention programme have an impact?Fam. Pract.33226–232. 10.1093/fampra/cmv071
14
BriereJ.ScottC.WeathersF. (2005). Peritraumatic and persistent dissociation in the presumed etiology of PTSD.Am. J. Psychiatry1622295–2301. 10.1176/appi.ajp.162.12.2295
15
BucciW.FreedmanN. (1981). The language of depression.Bull. Menninger Clin.45334–358.
16
BurnamM. A.BerryS. H.CerullyJ. L.EberhartN. K. (2014). Evaluation of the California mental health services authority’s prevention and early intervention initiatives: executive summary and commentary.Rand Health Q.4:7.
17
CampbellD. G.FelkerB. L.LiuC. F.YanoE. M.KirchnerJ. E.ChanD.et al (2007). Prevalence of depression-PTSD comorbidity: implications for clinical practice guidelines and primary care-based interventions.J. Gen. Intern. Med.22711–718. 10.1007/s11606-006-0101-4
18
CampbellR. S.PennebakerJ. W. (2003). The secret life of pronouns: flexibility in writing style and physical health.Psychol. Sci.1460–65. 10.1111/1467-9280.01419
19
CockayneT. W.SamuelsonC. O.Jr. (1978). Halo effect and medical student evaluation of instruction.J. Med. Educ.53:364. 10.1097/00001888-197804000-00016
20
CoppersmithG.DredzeM.HarmanC. (2014a). “Quantifying mental health signals in twitter,” in Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: from Linguistic Signal to Clinical Reality, Baltimore, MD, 51–60.
21
CoppersmithG.HarmanC.DredzeM. (2014b). Measuring Post Traumatic Stress Disorder in Twitter.Baltimore, MD: Johns Hopkins University.
22
CoppersmithG.DredzeM.HarmanC.HollingsheadK. (2015). “From ADHD to SAD: analyzing the language of mental health on twitter through self-reported diagnoses,” in Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: from Linguistic Signal to Clinical Reality, Denver, CO.
23
DancuC. V.RiggsD. S.Hearst-IkedaD.ShoyerB. G.FoaE. B. (1996). Dissociative experiences and posttraumatic stress disorder among female victims of criminal assault and rape.J. Trauma Stress9253–267. 10.1007/bf02110659
24
DongL.KalesnikavaV. A.GonzalezR.MezukB. (2019). Beyond depression: estimating 12-months prevalence of passive suicidal ideation in mid- and late-life in the health and retirement study.Am. J. Geriatr. Psychiatry271399–1410. 10.1016/j.jagp.2019.06.015
25
FisherG. A.ChonK. K. (1989). Durkheim and the social construction of emotions.Soc. Psychol. Q.521–9. 10.2307/2786899
26
FloryJ. D.YehudaR. (2015). Comorbidity between post-traumatic stress disorder and major depressive disorder: alternative explanations and treatment considerations.Dialogues Clin. Neurosci.17141–150.
27
GoldbergL. R. (1968). Simple models or simple processes? Some research on clinical judgments.Am. Psychol.23483–496. 10.1037/h0026206
28
GriffithsK. M.CalearA. L.BanfieldM. (2009). Systematic review on internet support groups (ISGs) and depression (1): do ISGs reduce depressive symptoms?J. Med. Internet Res.11:e40. 10.2196/jmir.1270
29
HirshJ. B.DeyoungC. G.PetersonJ. B. (2009). Metatraits of the big five differentially predict engagement and restraint of behavior.J. Pers.771085–1102. 10.1111/j.1467-6494.2009.00575.x
30
HornS. R.FederA. (2018). Understanding resilience and preventing and treating PTSD.Harv. Rev. Psychiatry26158–174. 10.1097/HRP.0000000000000194
31
HoustonT. K.CooperL. A.FordD. E. (2002). Internet support groups for depression: a 1-year prospective cohort study.Am. J. Psychiatry1592062–2068. 10.1176/appi.ajp.159.12.2062
32
InduP. S.AnilkumarT. V.PisharodyR.RussellP. S. S.RajuD.SarmaP. S.et al (2017). Prevalence of depression and past suicide attempt in primary care.Asian J. Psychiatr.2748–52. 10.1016/j.ajp.2017.02.008
33
JaegerJ.LindblomK. M.Parker-GuilbertK.ZoellnerL. A. (2014). Trauma narratives: it’s what you say, not how you say it.Psychol. Trauma6473–481. 10.1037/a0035239
34
JonesN.FearN. T.WesselyS.ThandiG.GreenbergN. (2017). Forward psychiatry - early intervention for mental health problems among UK armed forces in Afghanistan.Eur. Psychiatry3966–72. 10.1016/j.eurpsy.2016.05.009
35
KahnemanD. (2003). A perspective on judgment and choice: mapping bounded rationality.Am. Psychol.58697–720. 10.1037/0003-066X.58.9.697
36
KaplowJ. B.WardeckerB. M.LayneC. M.KrossE.BurnsideA.EdelsteinR. S.et al (2018). Out of the mouths of babes: links between linguistic structure of loss narratives and psychosocial functioning in parentally bereaved children.J. Trauma Stress31342–351. 10.1002/jts.22293
37
KenardyJ.SmithA.SpenceS. H.LilleyP. R.NewcombeP.DobR.et al (2007). Dissociation in children’s trauma narratives: an exploratory investigation.J. Anxiety Disord.21456–466. 10.1016/j.janxdis.2006.05.007
38
KesslerR. C.BerglundP. A.BruceM. L.KochJ. R.LaskaE. M.LeafP. J.et al (2001). The prevalence and correlates of untreated serious mental illness.Health Serv. Res.36(6 Pt 1), 987–1007.
39
KnoxK. L. (2008). Epidemiology of the relationship between traumatic experience and suicidal behaviors.PTSD Res. Q.191–8.
40
KnutsenM.JensenT. K. (2017). Changes in the trauma narratives of youth receiving trauma-focused cognitive behavioral therapy in relation to posttraumatic stress symptoms.Psychother. Res.2999–111. 10.1080/10503307.2017.1303208
41
KrossE.AydukO. (2008). Facilitating adaptive emotional analysis: distinguishing distanced-analysis of depressive experiences from immersed-analysis and distraction.Pers. Soc. Psychol. Bull.34924–938. 10.1177/0146167208315938
42
KubanyE. S.HillE. E.OwensJ. A. (2003). Cognitive trauma therapy for battered women with PTSD: preliminary findings.J. Trauma Stress1681–91. 10.1023/A:1022019629803
43
LampeK.DoupiP.van den HovenM. J. (2003). Internet health resources: from quality to trust.Methods Inf. Med.42134–142.
44
LiebermanM. A.GoldsteinB. A. (2006). Not all negative emotions are equal: the role of emotional expression in online support groups for women with breast cancer.Psychooncology15160–168. 10.1002/pon.932
45
LinkB.CastilleD. M.StuberJ. (2008). Stigma and coercion in the context of outpatient treatment for people with mental illnesses.Soc. Sci. Med.67409–419. 10.1016/j.socscimed.2008.03.015
46
LyonsE. J.MehlM. R.PennebakerJ. W. (2006). Pro-anorexics and recovering anorexics differ in their linguistic Internet self-presentation.J. Psychosom. Res.60253–256. 10.1016/j.jpsychores.2005.07.017
47
McNameeP. M. (2004). Character N-gram tokenization for European language text retrieval.J. Inf. Retr.773–97.
48
MehlM. R.PennebakerJ. W. (2003). The sounds of social life: a psychometric analysis of students’ daily social environments and natural conversations.J. Pers. Soc. Psychol.84857–870. 10.1037/0022-3514.84.4.857
49
MillerG. A. (1995). Wordnet - a lexical database for English.Commun. ACM3839–41. 10.1145/219717.219748
50
MottJ. M.GalovskiT. E.WalshR. M.ElwoodL. S. (2015). Change in trauma narratives and perceived recall ability over a course of cognitive processing therapy for PTSD.Traumatology2147–54. 10.1037/trm0000012
51
MoweryD.SmithH.CheneyT.StoddardG.CoppersmithG.BryanC.et al (2017). Understanding depressive symptoms and psychosocial stressors on twitter: a corpus-based study.J. Med. Internet Res.19:e48. 10.2196/jmir.6895
52
NegraoC.IIBonannoG. A.NollJ. G.PutnamF. W.TrickettP. K. (2005). Shame, humiliation, and childhood sexual abuse: distinct contributions and emotional coherence.Child Maltreat.10350–363. 10.1177/1077559505279366
53
NijdamM. J.GersonsB. P.OlffM. (2013). The role of major depression in neurocognitive functioning in patients with posttraumatic stress disorder.Eur. J. Psychotraumatol.4:19979. 10.3402/ejpt.v4i0.19979
54
NorquistG.HymanS. E. (1999). Advances in understanding and treating mental illness: implications for policy.Health Aff.1832–47. 10.1377/hlthaff.18.5.32
55
OlfsonM.KesslerR. C.BerglundP. A.LinE. (1998). Psychiatric disorder onset and first treatment contact in the United States and Ontario.Am. J. Psychiatry1551415–1422. 10.1176/ajp.155.10.1415
56
OphuisR. H.OlijB. F.PolinderS.HaagsmaJ. A. (2018). Prevalence of post-traumatic stress disorder, acute stress disorder and depression following violence related injury treated at the emergency department: a systematic review.BMC Psychiatry18:311. 10.1186/s12888-018-1890-9
57
PanasetisP.BryantR. A. (2003). Peritraumatic versus persistent dissociation in acute stress disorder.J. Trauma Stress16563–566. 10.1023/B:JOTS.0000004079.74606.ba
58
PennebakerJ. W. (1993). Putting stress into words - health, linguistic, and therapeutic implications.Behav. Res. Ther.31539–548. 10.1016/0005-7967(93)90105-4
59
PennebakerJ. W. (2001). Dealing with a traumatic experience immediately after it occurs.Adv. Mind Body Med.17160–162. 10.1054/ambm.2000.0307
60
PennebakerJ. W. (2011). Your use of pronouns reveals your personality.Harv. Bus. Rev.8932–33.
61
PennebakerJ. W.MehlM. R.NiederhofferK. G. (2003). Psychological aspects of natural language. use: our words, our selves.Annu. Rev. Psychol.54547–577. 10.1146/annurev.psych.54.101601.145041
62
PennebakerJ. W.StoneL. D. (2003). Words of wisdom: language use over the life span.J. Pers. Soc. Psychol.85291–301. 10.1037/0022-3514.85.2.291
63
PerlickD. A.RosenheckR. A.ClarkinJ. F.SireyJ. A.SalahiJ.StrueningE. L.et al (2001). Stigma as a barrier to recovery: adverse effects of perceived stigma on social adaptation of persons diagnosed with bipolar affective disorder.Psychiatr. Serv.521627–1632. 10.1176/appi.ps.52.12.1627
64
PerryS.DifedeJ.MusngiG.FrancesA. J.JacobsbergL. (1992). Predictors of posttraumatic stress disorder after burn injury.Am. J. Psychiatry149931–935. 10.1176/ajp.149.7.931
65
PressmanS. D.CohenS. (2007). Use of social words in autobiographies and longevity.Psychosom. Med.69262–269. 10.1097/PSY.0b013e31803cb919
66
PyszczynskiT.GreenbergJ. (1987). Self-regulatory perseveration and the depressive self-focusing style: a self-awareness theory of reactive depression.Psychol. Bull.102122–138.
67
RamsawhH. J.FullertonC. S.MashH. B.NgT. H.KesslerR. C.SteinM. B.et al (2014). Risk for suicidal behaviors associated with PTSD, depression, and their comorbidity in the U.S. Army.J. Affect. Disord.161116–122. 10.1016/j.jad.2014.03.016
68
RegierD. A.NarrowW. E.RaeD. S.ManderscheidR. W.LockeB. Z.GoodwinF. K. (1993). The de facto US mental and addictive disorders service system. Epidemiologic catchment area prospective 1-year prevalence rates of disorders and services.Arch. Gen. Psychiatry5085–94. 10.1001/archpsyc.1993.01820140007001
69
Rometsch-Ogioun El SountC.WindthorstP.DenkingerJ.ZiserK.NikendeiC.KindermannD.et al (2018). Chronic pain in refugees with posttraumatic stress disorder (PTSD): a systematic review on patients’ characteristics and specific interventions.J. Psychosom. Res.11883–97. 10.1016/j.jpsychores.2018.07.014
70
RubinD. C. (2011). The coherence of memories for trauma: evidence from posttraumatic stress disorder.Conscious. Cogn.20857–865. 10.1016/j.concog.2010.03.018
71
RudeS.GortnerE.-M.PennebakerJ. (2004). Language use of depressed and depression-vulnerable college students.Cogn. Emot.181121–1133. 10.1080/02699930441000030
72
SapM.ParkG. J.EichstaedtJ. C.KernM. L.StillwellD.KosinskiM.et al (2014). “Developing age and gender predictive lexica over social media,” in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha.
73
SappenfieldB. R. (1971). Social desirability, the halo effect, and stereotypical perception in person perception and self-perception.Percept. Mot. Skills33683–689. 10.2466/pms.1971.33.3.683
74
SimmonsR. A.GordonP. C.ChamblessD. L. (2005). Pronouns in marital interaction: what do “you” and “I” say about marital health?Psychol. Sci.16932–936. 10.1111/j.1467-9280.2005.01639.x
75
SmithR. (1988). The halo effect.Nurs. Times84:68.
76
StirmanS. W.PennebakerJ. W. (2001). Word use in the poetry of suicidal and nonsuicidal poets.Psychosom. Med.63517–522. 10.1097/00006842-200107000-00001
77
TackmanA. M.SbarraD. A.CareyA. L.DonnellanM. B.HornA. B.HoltzmanN. S.et al (2018). Depression, negative emotionality, and self-referential language: a multi-lab, multi-measure, and multi-language-task research synthesis.J. Pers. Soc. Psychol.116817–834. 10.1037/pspp0000187
78
TausczikY. R.PennebakerJ. W. (2010). The psychological meaning of words: LIWC and computerized text analysis methods.J. Lang. Soc. Psychol.2924–54. 10.1177/0261927x09351676
79
ThomaM. V.La MarcaR.BronnimannR.FinkelL.EhlertU.NaterU. M. (2013). The effect of music on the human stress response.PLoS One8:e70156. 10.1371/journal.pone.0070156
80
WeintraubM.TavesD. R.HasdayJ. D.MushlinA. I.LockwoodD. H. (1981). Determinants of response to anorexiants.Clin. Pharmacol. Ther.30528–533. 10.1038/clpt.1981.198
81
WestermanN. K.CobhamV. E.McDermottB. (2017). Trauma-focused cognitive behavior therapy: narratives of children and adolescents.Qual. Health Res.27226–235. 10.1177/1049732315627795
Summary
Keywords
PTSD, trauma, natural language use analysis, diagnostic tool, screening
Citation
Todorov G, Mayilvahanan K, Cain C and Cunha C (2020) Context- and Subgroup-Specific Language Changes in Individuals Who Develop PTSD After Trauma. Front. Psychol. 11:989. doi: 10.3389/fpsyg.2020.00989
Received
20 December 2018
Accepted
21 April 2020
Published
15 May 2020
Volume
11 - 2020
Edited by
Gianluca Castelnuovo, Catholic University of the Sacred Heart, Italy
Reviewed by
Mary Vance, Uniformed Services University of the Health Sciences, United States; Vera Regina Rohnelt Ramires, University of the Rio dos Sinos Valley, Brazil
Updates
Copyright
© 2020 Todorov, Mayilvahanan, Cain and Cunha.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Catarina Cunha, clemos.catarina@gmail.com
This article was submitted to Psychology for Clinical Settings, a section of the journal Frontiers in Psychology
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.