Original Research ARTICLE
Declines in Connected Language Are Associated with Very Early Mild Cognitive Impairment: Results from the Wisconsin Registry for Alzheimer’s Prevention
- 1Wisconsin Alzheimer’s Institute, University of Wisconsin School of Medicine and Public Health, Madison, WI, United States
- 2Department of Neurology, University of Wisconsin School of Medicine and Public Health, Madison, WI, United States
- 3Geriatric Research Education and Clinical Center, William S. Middleton Memorial Veterans Hospital, Madison, WI, United States
- 4Wisconsin Alzheimer’s Disease Research Center, University of Wisconsin School of Medicine and Public Health, Madison, WI, United States
- 5Department of Communication Sciences and Disorders, University of Wisconsin–Madison, Madison, WI, United States
- 6Neuroscience Training Program and Department of Surgery, University of Wisconsin–Madison, Madison, WI, United States
Changes to everyday spoken language (“connected language”) are evident in persons with AD dementia, yet little is known about when these changes are first detectable on the continuum of cognitive decline. The aim of this study was to determine if participants with very early, subclinical memory declines were also showing declines in connected language. We analyzed connected language samples obtained from a simple picture description task at two time points in 264 participants from the Wisconsin Registry for Alzheimer’s Prevention (WRAP). In parallel, participants were classified as either “Cognitively Healthy” or “Early Mild Cognitive Impairment” based on longitudinal neuropsychological test performance. Linear mixed effects models were used to analyze language parameters that were extracted from the connected language samples using automated feature extraction. Participants with eMCI status declined faster in features of speech fluency and semantic content than those who were cognitively stable. Measures of lexical diversity and grammatical complexity were not associated with eMCI status in this group. These findings provide novel insights about the relationship between cognitive decline and everyday language, using a quick, inexpensive, and performance-based method.
There have been significant advances in detecting neuropathological changes early in the Alzheimer’s disease (AD) continuum, and these have spurred the development of multiple clinical trials, investigating both pharmacological and non-pharmacological early interventions. As a result, there is an urgent need for sensitive tools that measure cognitive and functional change at very early stages of disease (Snyder et al., 2014). One candidate tool is analysis of connected language in discourse, which has shown to be an informative measure of language problems across mild to moderate stages on the AD continuum (Nicholas et al., 1985; Bayles et al., 1992; Tomoeda and Bayles, 1993). “Connected language” (also referred to as “connected speech,” “spontaneous speech,” or “discourse”) refers to spoken language that is used in a continuous sequence, as in everyday conversations. The production of spontaneous speech involves the use and coordination of multiple cognitive and physiological processes, including retrieval from semantic and episodic memory, the ability to sustain and divide attention for error monitoring, and the reliance upon working memory for syntax production (Carter et al., 1998; Hartsuiker and Barkhuysen, 2006; Barbeau et al., 2012). As such, connected language analysis may be sensitive to early cognitive changes, and also may yield performance-based measures that are more representative of the actual skills needed for activities of daily living than are typical standardized language and other cognitive tests. Few studies have examined the earliest point at which connected language changes are observable [i.e., the Mild Cognitive Impairment (MCI) phase or before], and the literature that exists has significant limitations. Additionally, there are few longitudinal studies that have documented whether or not subtle changes in discourse can be reliably measured over time. Longitudinal data are important because both discourse features and the clinical expression of MCI and AD are heterogeneous (Duong et al., 2005), so measurement of within-person change over time is critical for characterizing change. Thus, there is a need for both studies of connected language in persons with early cognitive decline, and also studies that examine performance over time.
The few studies of connected language in MCI have shown similar deficiencies in semantic content and semantic processing as in AD dementia. The earliest prospective evidence of connected language change in true preclinical AD came from Cuetos et al. (2007), who compared connected language samples from a group of 19 asymptomatic individuals carrying the hereditary mutation of the Presenilin-1 gene, to similar-aged family members without the mutation, prior to evidence of significant cognitive decline. They found that mutation carriers expressed less semantic content in the preclinical phase of AD (mean age = 43) than non-carriers (Cuetos et al., 2007). Retrospective studies of novelist Iris Murdoch’s writings indicated that impoverished vocabulary and less complex syntax were evident in her 40 s, decades before her clinical diagnosis of AD (Le et al., 2011). Similarly, Berisha et al. (2015) retrospectively analyzed unscripted speeches of former U.S. President Ronald Reagan and found increases in filled pauses and decreases in unique words over time, well in advance of his diagnosis of AD. Ahmed et al. (2013b) retrospectively analyzed picture descriptions of nine autopsy-confirmed patients with AD at three stages of disease (MCI, Mild AD, Moderate AD), and detected a progressive decline in efficiency (time required to express ideas) and semantic content.
Although these findings and others (Tomoeda et al., 1996; Bschor et al., 2001; Fleming and Harris, 2008; Choi, 2009; Drummond et al., 2015) offer encouraging evidence for using connected language as a measure of change in AD, existing studies have significant limitations. First, studies either had small sample sizes or were single-case studies, limiting generalizability to other cohorts. Second, in the few studies with larger sample sizes (Forbes-McKay et al., 2013; Fraser et al., 2016), patients were in the mild to moderate stages of AD and results may not generalize to earlier stages of disease when compensatory abilities can result in more subtle impairments in functional language. Third, in the case of retrospective analyses (Garrard et al., 2005; Ahmed et al., 2013b), standardization procedures often were not reported so studies may be difficult to replicate. Fourth, the potential effects of psychological and biological factors (e.g., depression, anxiety, sleep) on connected language are not well studied, and the contribution of these potentially modifiable factors to discourse variability is unknown. In order to determine the feasibility and sensitivity of connected language analysis and its potential sources of variability, particularly for the purposes of assessment and monitoring within clinical trials, larger-scale longitudinal studies of preclinical populations are necessary.
The present study investigated connected language over time in a late middle-aged cohort enriched for risk of sporadic AD. The Wisconsin Registry for Alzheimer’s Prevention (WRAP) is an ongoing longitudinal study that began in 2001, with an objective of identifying cognitive and biomarker profiles that may be predictors of AD risk. In addition to completing an extensive neuropsychological test battery at study visits every 2 years, participants also provide a connected language sample of a picture description task. The overarching goal of the current study was to determine whether WRAP participants who were classified with sub-clinical, very early Mild Cognitive Impairment (“eMCI”) based on standard neuropsychological tests showed evidence of cognitive decline on connected language measures. Our aim was to determine whether being classified as eMCI was associated with atypical longitudinal connected language trajectories compared to cognitive stable WRAP participants. We hypothesized that eMCI would be associated with greater decline in connected language over time.
Materials and Methods
WRAP Study Sample
The WRAP is a longitudinal study of late-middle-aged adults enriched for AD risk based on parental family history. WRAP participants are asymptomatic, English speaking, and are between the ages of 40 and 65 at the time of enrollment (Sager et al., 2005; Johnson et al., 2017). Approximately 72% of participants have a parent with either autopsy-confirmed or probable AD [family history positive (FH+)] as defined by the National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer’s disease and Related Disorders Association (NINCDS-ADRDA) criteria (McKhann et al., 1984). The sample has a higher-than-average proportion of carriers of at least one ε-4 allele of the APOE gene, therefore are at higher genetic risk for sporadic AD than the general population (Liu et al., 2013). Approximately 28% of participants had no parental history of AD and met the criteria that their mothers survived to at least age 75, and their fathers to at least age 70, with no dementia [family history negative (FH-)].
The WRAP study began in 2001 and currently has enrolled 1551 participants, primarily from the upper Midwest. The sample is 71% women, 88% white, and has a mean age at baseline of 54 years. The longitudinal study design includes a baseline visit, followed by a second visit 4 years later, and serial follow-up visits every 2 years thereafter. Participants are tested at one of three sites: Madison, LaCrosse, and Milwaukee, Wisconsin. WRAP participants undergo an extensive neuropsychological battery at each visit, and collection of a spontaneous language sample began at the third wave visit in 2012. This study was approved by The University of Wisconsin–Madison’s Institutional Review Board. All subjects gave written informed consent in accordance with the Declaration of Helsinki.
Present Study Sample
For the present study, we restricted our sample to participants who had completed language samples and neuropsychological test batteries at two time points, approximately 2 years apart (n = 280). Participants also had to have had a consensus diagnosis of either “cognitively healthy” (CH) or “early Mild Cognitive Impairment” (eMCI) at each of the two visits. The main component of this experimental diagnosis is a subtle cognitive deficit less pronounced than the traditional MCI diagnosis and not requiring a cognitive complaint from the participant or informant, (3) clinical MCI, (4) clinical impairment-not MCI, or (5) dementia. For additional detailed information on the consensus diagnosis process, see Koscik et al. (2016) and Johnson et al. (2017). Because the aim of this study was to capture language problems very early on in the possible-AD continuum, we excluded participants who were diagnosed with dementia, clinical MCI or “clinical impairment-not MCI” (n = 9). We also excluded participants who were non-native speakers of English (n = 2); those who had neurological diagnoses including stroke, epilepsy, Parkinson’s disease, and multiple sclerosis at either visit (n = 2); and those whose speech recordings were incomplete (n = 1); resulting in a final sample size of 264 participants (cognitively healthy = 200; eMCI = 64).
Discourse Collection Procedure
Participants provided informed consent to have their speech recorded while describing the “Cookie Theft” picture from the Boston Diagnostic Aphasia Examination (Goodglass and Kaplan, 1983). Participants were instructed to “Tell me everything you see going on in this picture.” Evaluators provided no feedback during participants’ descriptions; however, if responses were unusually brief (e.g., one or two sentences), evaluators provided the scripted prompt, “Do you see anything else going on?” Language samples had a mean duration of 50.4 s (SD = 0.02), including prompts from the examiner. All responses were recorded using an Olympus VN-6200PC digital audio recorder.
Language samples were transcribed by a trained speech-language pathologist (KDM), and two trained graduate students, using Codes for Human Analysis of Transcripts (CHAT) (MacWhinney, 2000). Transcribers were blinded to the cognitive status of the participant. Utterances were segmented into C-Units, an established metric for discourse analysis defined as “an independent clause and all of its modifiers” (Hunt, 1965; Hughes et al., 1997). Transcripts were coded for automatic analyses by the Computer Language Analysis (CLAN) program (Macwhinney et al., 2011), including codes for filled and unfilled pauses, repetitions, revisions, semantic units, errors (semantic, phonological, lexical), and non-verbal behaviors (e.g., coughing, laughing). Semantic units, parts of speech, total utterances, grammatical relations, and other quantifiers were then automatically extracted by the CLAN program using the MOR and MEGRASP programs (MacWhinney, 2000).
Three raters analyzed 15% of samples to calculate inter-rater reliability. Reliability was calculated using the RELY program within CLAN, and agreement was 92.4% for transcription and 98% for coding of semantic units.
Our discourse measures of interest were derived from a previous study from our group (Mueller et al., 2017), which detected underlying latent factors from multiple discourse measures. The aims of the Mueller et al. (2017) study were the following: (1) to determine the most salient connected language measures for use in this group of asymptomatic, late-middle-aged individuals; (2) to address the problem of multiple comparisons by determining latent constructs to use for analysis rather than many individual measures; (3) to determine the test-retest stability of discourse measures in a group of individuals who were cognitively stable across several study visits and (4) to determine factor invariance across sex and family history of AD. In brief, factor analysis using promax rotation and principal axis factor extraction (Fabrigar et al., 1999) was used to reduce a large set of 18 language measures to a smaller number of latent factors. Table 1 shows the factors and the connected language measures that loaded on each factor. The factors represent four underlying constructs in discourse: (1) Semantic content refers to the proportion of meaningful content words relative to total words. The Semantic factor is comprised of variables measuring pronouns, verbs and nouns in spontaneous speech; (2) the Syntax factor describes the complexity of sentences, and is represented by measures of grammatical complexity and verb usage within utterances; (3) the Lexical factor describes the diversity of words and is represented by indices of unique words to total words, content units to total words; and the Fluency factor represents the flow of speaking, characterized by an index measuring dysfluencies such as speech repetitions, revisions, and filled and unfilled pauses. For additional information on factor score development, including details on the exploratory and confirmatory factor analyses and factor invariance assessments, see Mueller et al. (2017). We multiplied the Fluency factor and Semantic factors by -1 to standardized directionality across all factor scores, such that negative scores indicated worse performance.
Participant Demographics, Clinical Characteristics, Cognitive Functioning, and Baseline Language
Sample characteristics of the cognitively healthy and eMCI groups were compared using t-tests for continuous data and chi-square tests for categorical data. Neuropsychological test performance was compared with F statistics, adjusting for age, sex and literacy; in the case of highly skewed data [Mini-Mental State Examination (MMSE), Boston Naming Test (BNT)], we used Mann–Whitney U-tests. We correlated participants’ performance on standardized neuropsychological tests with language factors using Spearman’s rank correlation coefficients, due to the skewed distribution of the BNT and MMSE. We examined baseline performance of each of the four connected language factors by cognitive status, adjusting for age, sex and literacy via analysis of covariance (ANCOVA).
Relationship between Cognitive Status and Longitudinal Connected Language Performance Trajectories
Analyses were conducted using R version 3.3.2 and SPSS version 22 using linear mixed models to examine longitudinal trends in connected language factor scores and their relation to cognitive status (CH vs. eMCI), the predictor of interest. Linear mixed effects modeling allowed us to examine connected language for participants as a group (fixed effects), while accounting for variation associated with individual differences (random effect of subject-specific intercept) (Laird and Ware, 1982). “Time” was operationalized as age at each visit, in order to account for differences in baseline ages and time intervals between visits. We centered age at the average of 63 years for ease of interpretation, with precision to two decimal places for improved temporal resolution. Additional covariates were included so that fixed effects could vary by sex and literacy [scores on the WRAT-III reading subtest (Wilkinson, 1993)]. Additional covariates examined included: (1) depressive symptoms, as measured by the Center for Epidemiologic Studies Depression Scale (CES-D) (Radloff, 1977) total score; (2) anxiety, measured by a binary variable indicating whether or not a participant self-reported a diagnosis of anxiety at either visit; and (3) self-reported sleep quality, measured as a continuous variable, based on the 12-item Rand Sleep Scale Survey used in the Medical Outcomes Study (MOS).
Models were fit using the lme4 package within the R environment (Bates et al., 2014). First, we examined unconditional means and growth models (i.e., subject-level random effects). Growth terms were not significant for any outcome, so random effects used in subsequent models included only subject-specific intercepts. Subsequent models also included fixed effects of sex and literacy. We tested interactions between cognitive status and age, sex, and literacy. Only the cognitive status by age interaction was significant for the Semantic and Fluency factor models; therefore, we used the base models for the Syntax and Lexical factors. As secondary analyses, the final models were re-run with additional fixed effects of depressive symptoms, sleep and anxiety. In each case the final model selected was fit with restricted maximum likelihood estimation (REML).
Other Descriptive Connected Language Variables
We extracted connected language variables that have been shown in the literature to be sensitive to MCI or probable AD dementia, but did not load on any of our factors (see Mueller et al., 2017). We performed ANCOVAs adjusting for age, sex, and literacy in order to depict the means at two time points, and mean change, between the CH and eMCI groups.
Table 2 displays sample characteristics by cognitive status (CH, eMCI). Table 3 presents the raw scores of the neuropsychological tests adjusted for age, sex and literacy (WRAT-III reading). Both tables reflect data collected at the second speech sample visit. Our final sample consisted of data from 264 participants at two time points with a mean of 2.0 years apart (SD = 1.5), typically the 4th and 5th wave visits. At the visit with the second speech sample, there were 64 participants classified as eMCI and 200 as CN; 47 of the 64 eMCI participants were also eMCI at the first speech sample. The eMCI group was comprised of significantly more males (54%) and they were slightly older (mean age of 66 at follow-up, versus 64 in the CH group). On average, the groups were similar in terms of years of education, literacy, APOE-ε4 status, family history status and depression scores. After adjusting for age, sex and literacy, the two groups differed significantly on tests of memory (RAVLT, Logical Memory-Delayed), executive functions (Digit Symbol, Stroop Interference), and language (Boston Naming Test, category and letter verbal fluency). Figure 1 shows baseline performance of the four connected language factor scores by cognitive status. Syntax was marginally lower in the eMCI group at baseline (p = 0.09).
FIGURE 1. Boxplot showing baseline performance on four connected language factors by cognitive status, adjusting for age, sex, and literacy.
Figure 2 depicts correlations between the four connected language factors and select standardized neuropsychological test scores. The Fluency factor was correlated with AVLT delayed recall (r = 0.19), AVLT total score (0.18), and Animal Naming (0.14). The Lexical factor was negatively correlated with Letter-Number Sequencing (-0.22), letter fluency (-0.16), BNT (-0.10), Logical Memory delayed recall (-0.12), and Animal Naming (-0.15). Syntax was positively correlated with Trails B (0.11), and the MMSE (0.10). The semantic factor showed a non-significant, weakly positive correlation with the Boston Naming Test (0.08), the RAVLT delayed recall (0.08), and Trails B (0.08).
FIGURE 2. Correlations between language factors and standardized neuropsychological tests. Pearson’s correlation coefficient is shown in the lower diagonal; the upper diagonal depicts magnitude (larger circle = larger correlation) and direction (blue = negative, red = positive). CFL, letter fluency, Controlled Oral Word Association Test (Wechsler, 1987); CESD, Center for Epidemiology Scale for Depression; MMSE, Mini-Mental Status Examination (Folstein et al., 1975); LetNum, Letter Number sequencing from the WAIS-R; BNT, Boston Naming Test from the Boston Diagnostic Aphasia Examination; AVLT, Rey Auditory Visual Learning Test; AVLTrecall, 30 min delayed recall of RAVLT; StroopCW, Color-Word Stroop subtest; Trails B, Total Trails B score (Benton et al., 1994); WMSR, Logical Memory Story Recall from the WMS; DigitSym, Digit Symbol from the WAIS (Wechsler, 1981); Animals, Category fluency.
Relationships between Cognitive Status and Longitudinal Connected Language Trajectories
Results from linear mixed effects regression models examining the relationships between cognitive status and connected language factor score at each visit are presented in Table 4. All four linear mixed effects models were re-run with additional fixed effects of sleep, depression, and anxiety; none of these predictors were significant, thus Table 4 depicts the simpler models.
Figure 3 depicts simple slopes of the interaction between age and cognitive status for mixed model-predicted fluency and semantic values.
FIGURE 3. The simple slopes of the interaction between predicted language factor values and age from the linear mixed effects models.
A significant interaction between time (centered age) and cognitive status indicated that individuals who were diagnosed with eMCI declined faster than the CH group on the semantic connected language factor over time (β = -0.04, p = 0.03).
Interactions between age and cognitive status, or cognitive status and the other covariates were not significant; therefore Model 1 was used. Cognitive status was not a significant predictor of syntax performance at either visit.
No interactions were significant so we retained Model 1. A significant main effect for time for the lexical factor indicated that all individuals tended to decrease in lexical diversity from time one to time two, but cognitive status was not a significant predictor of this decline.
A significant interaction between time and cognitive status indicated that participants who were diagnosed with eMCI declined faster on fluency factor scores over time (β = -0.03, p = 0.03). Cognitive status had a significant main effect on the fluency factor, indicating that eMCI status at either both visits or the most recent visit was a significant predictor of poorer speech fluency scores (β = -0.23, p = 0.03).
Other Descriptive Connected Language Variables
Mean scores and results of ANCOVAs for descriptive connected language variables (that were not part of the factor scores) are presented in Table 5. There were significant differences between the eMCI group and the CH group for mean length of utterance (MLU) at both time points (p < 0.001), words per minute at Time 2 (p = 0.02), and semantic units per minute at Time 2 (p = 0.03). Change from Time 1 to Time 2 was not significant between groups for any of the descriptive discourse variables.
Although a decline in episodic memory is the typical defining feature of AD dementia, changes in language are also evident early in the disease course. Analyses of connected language in the mild to moderate stages of AD dementia has proved to be a sensitive measure of subtle changes in performance, but we did not know if connected language analysis would be similarly sensitive in very early stages of cognitive decline. Thus, we set out to investigate whether connected language performance was associated with a diagnosis of sub-clinical MCI in a risk-enriched cohort of healthy, late-middle-aged individuals.
Our findings suggest that, in this younger and relatively healthy cohort, subclinical declines in memory and/or executive function were associated with changes in connected language. Specifically, as individuals progressed through the study, those who showed eMCI status declined more rapidly than those who were cognitively healthy in both semantic and fluency features of connected language. Lexical diversity and syntactic complexity were not associated with eMCI status in these analyses. When we added sleep quality, depression, and anxiety to the models, the results remained the same and these factors were not significantly associated with language declines.
The literature supports our finding that eMCI participants declined more rapidly than CH participants on semantic measures. Our connected language semantic factor (Mueller et al., 2017) is weighted such that it represents a higher proportion of pronouns, a lower percentage of nouns, and an increase in use of verbs. Studies using composite scores or factor scores similar to our semantic factor (Mueller et al., 2017) have found comparable results later on the AD continuum. For example, Fraser et al. (2016) trained a machine learning classifier on 370 linguistic features of Cookie Theft picture descriptions from 167 adults with AD and 97 controls. They subsequently performed a factor analysis of successfully classifying features, which resulted in a semantic factor that included of a high proportion of pronouns, low proportion of nouns, and high verb frequency (i.e., less semantic content overall). This semantic factor differentiated speech samples of adults with AD from those of controls (Fraser et al., 2016). Ahmed et al. (2013b) used a composite score comprised of measures of pronoun ratios and verb ratios, which they termed “lexical content,” to examine 9 adults across three stages of autopsy-confirmed AD: MCI, mild AD, and moderate AD. Composite scores differentiated between MCI and mild AD stages, but trends in the difference between controls and patients with MCI did not reach statistical significance (Ahmed et al., 2013b). Our study sample may represent a stage that is even earlier on the AD continuum than MCI, and the finding of significant differences in our group is likely because of our large sample size.
The higher loading of verbs relative to nouns in our semantic factor is a phenomenon that has been explored previously in language of adults with AD. There is an emerging literature regarding putative neuroanatomical substrates of verb vs. noun storage (Damasio and Tranel, 1993; Daniele et al., 1994). Future research involving network or regional specificity to our findings may help to further understand these results.
With respect to high pronoun index, Almor et al. (1999) showed that adults with AD (n = 11) used more pronouns in relation to noun phrases than controls (n = 9) (Almor et al., 1999). While several other studies of adults with AD report a similar over-use of pronouns in connected language (Ripich and Terrell, 1988; Ulatowska et al., 1988), such findings in studies of MCI are limited. Our study adds evidence to the notion that connected language may become semantically impoverished early on the continuum of cognitive decline.
Correlations between the semantic factor and standardized measures of semantic content (BNT and animal naming) were not statistically significant. This finding is partially supported by the connected language literature, but results of that literature are mixed. Almor et al. (1999) found no correlation between increased pronoun use and picture naming scores, and theorized that pronoun measures in discourse may be more a reflection of working memory problems than of semantic memory. Nicholas et al. (1985) similarly found no correlation between discourse measures of “empty speech” (e.g., pronouns without referents) and scores on picture naming tasks. Kavé and Goral (2016) found that picture naming and semantic measures of Cookie Theft descriptions were correlated in a group of adults with AD (n = 20; r = 0.46–0.67), but not in a control group. Kavé and Goral (2016) also showed that semantic verbal fluency scores were not correlated with semantic measures of connected language in either the group with AD or the controls. Moreover, in a longitudinal study of cognitive decline (6–10 years follow-up), Hodges et al. (2006) found that scores in category fluency were among the first to decline, while the onset of other semantically based deficits varied from one to 6 years. In present study, the eMCI group differed significantly from the CH group on verbal fluency and picture naming tasks; the lack of correlation of these measures with discourse measures lends support to the idea that measures of connected language describe not only semantic knowledge and expression, but also the coordination of multiple cognitive processes and are independent of traditional clinically administered language tests. Further, the results of these correlations may change over time as participants age and experience more significant cognitive impairment.
Discourse fluency declined more rapidly in persons with eMCI, and that fluency change was a predictor of eMCI status. A few studies of MCI have shown similar results; for example, Berisha et al. (2015) found that former President Ronald Reagan’s unscripted speeches contained more filled pauses over time (e.g., “um,” “well”) during a period of 6–13 years prior to his diagnosis of AD. By contrast, discourse fluency was not a distinguishing factor in MCI, mild, or moderate AD in the 9 adults studied by Ahmed et al. (2013b), although this sample was very small. Other studies of connected language in AD dementia have shown increased repetitions and revisions (Nicholas et al., 1985; Tomoeda and Bayles, 1993), increased pauses or hesitations (Davis and Maclagan, 2009; Hoffmann et al., 2010; Gayraud et al., 2011; Sajjadi et al., 2012), and an increase in repaired errors (McNamara et al., 1992). It is possible that these mixed results with fluency measures are due to examining only select disfluent behaviors. Our fluency factor consisted of an index that included filled and unfilled pauses, repetitions, revisions, and false starts divided by the number of utterances. To our knowledge, no studies of discourse in MCI or AD dementia have examined a comprehensive fluency measure such as this, and it is possible that a ratio of fluency behaviors is more sensitive than the individual measures alone. Further, while semantic deficits are common in connected language of people with AD dementia, it is possible that at very early stages a surge in dysfluent behavior may reflect the beginnings of difficulty with word search and retrieval, foreshadowing more problems expressing semantic content. There is evidence that error monitoring does not decline until later in the disease (McNamara et al., 1992; Forbes et al., 2001); so perhaps adults with eMCI in our study noticed and repaired more errors (hence had a higher number of “revisions”), resulting in higher dysfluency scores.
Correlations between the fluency factor and standardized neuropsychological test scores were weakly positive with semantic fluency, AVLT total score, and AVLT delayed recall. These data suggest that dysfluent behavior in the eMCI group may be related to aspects of memory (semantic storage of words) (Henry et al., 2004), executive functions (search and retrieval of words) (Alvarez and Emory, 2006), and working memory (Almor et al., 1999; Hartsuiker and Kolk, 2001).
While syntax is commonly described as “intact” until more severe stages of AD, declines and differences in syntax complexity have been observed at mild and moderate stages (Croisile et al., 1996; Kemper et al., 2001). In our sample, syntactic complexity was not associated with eMCI status. Our syntax measure is comprised of a verb index (ratio of verbs to utterances), a grammatical complexity measure based on embedded clauses (Macwhinney et al., 2011), and a measure of propositional density. Snowden et al. (Snowdon et al., 1996) examined the early life free writing samples of cloistered nuns and found that lower idea density at an average age of 22 significantly increased the risk of AD 58 years later. Kemper et al. (2001) examined Nun Study data over time and found that those who met criteria for dementia later in life had lower baseline measures of idea density and grammatical complexity. These results have been explained by both the cognitive reserve hypothesis and the early neuropathology hypothesis. According to the cognitive reserve hypothesis, nuns with high linguistic abilities either had AD-type neuropathology but were able to either compensate for changes because of their high linguistic capacity. In both models we tested (logistic and linear mixed models), literacy was significantly associated with syntax, which lends some support to the cognitive reserve hypothesis. Conversely, Snowdon et al. (1996) proposed that low linguistic ability early in life might have signaled that development of neuropathology had already begun, decades before dementia was expressed. Comparing baseline syntax measures to longitudinal, multi-modal measures of amyloid deposition and tau protein formation via cerebral spinal fluid and imaging may help to clarify these associations in WRAP participants in the future.
Our lexical factor was not associated with eMCI status, which was surprising given evidence of decreases in lexical diversity and efficiency in adults with AD and MCI (Ahmed et al., 2013b; Fraser et al., 2016). The lexical factor was comprised of measures of lexical diversity (type-token ratio) and semantic unit idea density. That the lexical factor was negatively correlated with phonemic fluency was also unexpected. We examined several transcripts of those with low measures on the lexical factor and found that these picture descriptions contained an overabundance of words with fewer expression of semantic units (i.e., verbosity), as we expected, which would suggest a link between lexical aspects of discourse and eMCI status. It is possible that our lack of significant findings was related to the variable length of transcripts, as type-token ratio is influenced by sample size. One small study of 8 people with MCI and 14 controls found that the MCI group showed significantly higher “potential vocabulary size” (an adapted type-token ratio) than the controls (Aramaki et al., 2016). The authors argued that the adults with MCI may have been compensating for cognitive decline by talking more without necessarily conveying more content. An adapted type-token ratio might be more revealing in future studies, as would use of a task that elicited longer language samples.
We examined other metrics of the language samples that were not included in the factor structure, but were important to include as descriptors, such as total words and words per minute. We also examined MLU, as some studies have shown shorter utterance length for individuals with MCI and AD (Ripich et al., 2000; Ahmed et al., 2013b). We found that the eMCI group had significantly shorter utterances (MLU) at both time points than the CH group. The groups did not differ on total number of words or total number of semantic units. The fact that semantic units were similar for both groups at both time points is surprising, since previous studies of preclinical AD have shown this measure to differentiate between groups (Cuetos et al., 2007; Ahmed et al., 2013a; Mueller et al., 2016). Both groups showed an increase in total words from Time 1 to Time 2, so it is possible that talking more also allowed for more elements of the picture to be described. That the eMCI group expressed fewer words per minute and semantic units per minute indicates that the efficiency of descriptions was compromised, and the time it takes to convey content is a more sensitive measure at this stage.
Sleep, depression and anxiety were not associated with declines in connected language. Murray (2010) examined discourse variables in adults with clinical depression, mild AD, and adults with no psychiatric or neurological diagnosis. While adults with AD produced less informative discourse, adults with depression and control groups did not differ on any discourse variable. Although poor sleep quality has been associated with cognitive decline (Lim et al., 2013), no such associations have been reported with connected language. However, sustained speech analysis was able to detect participants with obstructive sleep apnea (OSA) with 89% accuracy (Blanco et al., 2013), so it is possible that a more detailed speech and voice analysis may yield more information about the relationship between sleep quality and discourse features. Anxiety has been associated with increases in dysfluency in persons who stutter (Yang et al., 2017), and with pragmatic language difficulties (Cummings, 2007), but research is lacking regarding the relationship between anxiety and connected language in the general population or in persons with cognitive decline. As our WRAP sample ages and we continue to collect discourse samples, we will be able to examine these relationships with additional data points.
Our study had strengths and limitations. A strength was the sample size, which far exceeded that of any other prospective longitudinal study of connected language in older adults at risk for AD. In addition, by using a confirmed structure of latent factors, we were able to summarize measures in broad categories of language performance and better characterize our results. The standardized and focused nature of the picture description task was another strength, as it minimized demands on episodic memory, but still yielded spontaneous language. The Computerized Language Analysis program (CLAN) also was a benefit, as it provided a quick, efficient, objective, and standardized means of extracting data; this combined with our relatively simple factor structure and the ubiquity of the Cookie Theft picture in the AD literature will support replication of our analysis in future studies.
One of the limitations of our study was the relatively short length of the language samples (average total words = 109). While there is no established minimum length for language samples in adults with MCI, for adults with non-fluent aphasia, Brookshire and Nicholas recommended an average of 300–400 words (Brookshire and Nicholas, 1994), and others recommended a minimum of 150 words (Saffran et al., 1989; Sajjadi et al., 2012). Longer samples might be needed for individuals with non-fluent aphasia, however, given their reduced language output. Fraser et al. (2016) also had a mean sample length of 100 words from patients with AD, and reported findings with some parallels to ours, so 100 words may be sufficient. The short task may also be considered a strength, as it presents a relatively low burden for participants compared to demands of typical standardized tests. If subtle changes can be detected in shorter samples, this will save time for both the participant and the clinician. Future longitudinal research in order to develop sample size recommendations in this group would be beneficial. An additional limitation includes that we did not adjust for multiple comparisons when examining the four linear mixed effects models (p = 0.03 vs. Bonferonni-adjusted criteria of p < 0.013). Future research will address this limitation as our sample size of transcribed language samples increases, and as additional time points become available.
Although our approach proved to be feasible in this large cohort, the transcription and coding of these language samples was relatively labor-intensive. The field of computational linguistics has made advances in machine learning techniques for analyzing linguistic features, but these methods also require transcriptions of spoken language. Researchers are beginning to improve upon Automatic Speech Recognition (ASR) to solve this problem, which will allow for more efficient analyses that can be translated to clinical practice for disease monitoring.
A limitation exists within the WRAP sample itself: it is predominantly white, well-educated, and primarily from the Upper Midwest region of the United States. Future projects should validate these results in more diverse cohorts.
Finally, while the very early, sub-clinical construct of eMCI can be considered a strength since no other studies have examined connected language in an at-risk cohort, the participants diagnosed with eMCI may have an etiology that is not AD. Thus one cannot conclude from these results alone that our method of connected language analysis would be a sensitive tool for detecting decline in persons with known preclinical AD, which in research can now be characterized by the presence of beta-amyloid, tau protein accumulation, and neurodegeneration, evidenced by PET-imaging, cerebrospinal fluid analysis, and MRI (Jack et al., 2016). Further, the term ‘early’ in the context of this work simply connotes cognitive decline of less severity than MCI and does not refer to any age of onset specifically. Future research will address this limitation by taking these analyses further and examining connected language in individuals with preclinical AD based on biomarker evidence of AD pathology. We also plan to continue to follow these participants and collect future language samples and biomarkers in order to determine the course of connected language change as the WRAP cohort ages.
In a review of assessment in AD and dementia, Laske et al. (2015) listed connected language analysis as one of the most promising state-of-the-art diagnostic measures for MCI and AD. The authors described spoken language as a multi-dimensional, non-invasive, and informative biological sample for the early detection of AD, primary progressive aphasia, and other dementia syndromes. Our results provide evidence that features of connected language are associated with very early, sub-clinical declines in memory in late-middle age. This research contributes toward a better understanding of early language changes in conjunction with cognitive decline. Future validation of this method in a preclinical AD sample may show that connected language analysis may be a candidate tool for only improve disease monitoring in clinical trials, but also for informing cognitive-communication interventions and caregiver education programs that maintain and enhance quality of life and life participation for adults with MCI and AD dementia.
KM and LT conceived the idea of the work. KM, LT, RK, BH, and SJ contributed to the statistical plan and study design, and the analysis and interpretation of data for the work. KM drafted the work, and LT, RK, BH, and SJ revised it critically for important intellectual content. KM, LT, RK, BH, and SJ provided final approval of the version to be published. KM, LT, RK, BH, and SJ provided agreement to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.
This research was supported by the National Institutes of Health Award (R01AG027161) and by donor funds including the Wisconsin Alzheimer’s Institute Lou Holland Fund and contributions from anonymous donors. Portions of this research were supported by resources at the Wisconsin Alzheimer’s Institute, the Wisconsin Alzheimer’s Disease Research Center and the Geriatric Research Education and Clinical Center of the William S. Middleton Memorial Veterans Hospital, Madison, WI, United States. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NIH or the Veterans Administration.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The authors gratefully acknowledge the WRAP study team who have carefully acquired the longitudinal data, Sarah Riedeman and Kristina Fiscus for transcribing the language samples, and the WRAP participants who make this research possible.
Ahmed, S., de Jager, C. A., Haigh, A. M., and Garrard, P. (2012). Logopenic aphasia in Alzheimer’s disease: clinical variant or clinical feature? J. Neurol. Neurosurg. Psychiatry 83, 1056–1062. doi: 10.1007/s00415-015-7904-3
Ahmed, S., de Jager, C. A., Haigh, A. M., and Garrard, P. (2013a). Semantic processing in connected speech at a uniformly early stage of autopsy-confirmed Alzheimer’s disease. Neuropsychology 27, 79–85. doi: 10.1037/a0031288
Ahmed, S., Haigh, A. M., de Jager, C. A., and Garrard, P. (2013b). Connected speech as a marker of disease progression in autopsy-proven Alzheimer’s disease. Brain 136(Pt 12), 3727–3737. doi: 10.1093/brain/awt269
Almor, A., Kempler, D., MacDonald, M. C., Andersen, E. S., and Tyler, L. K. (1999). Why do Alzheimer patients have difficulty with pronouns? Working memory, semantics, and reference in comprehension and production in Alzheimer’s disease. Brain Lang. 67, 202–227. doi: 10.1006/brln.1999.2055
Barbeau, E. J., Didic, M., Joubert, S., Guedj, E., Koric, L., Felician, O., et al. (2012). Extent and neural basis of semantic memory impairment in mild cognitive impairment. J. Alzheimers Dis. 28, 823–837. doi: 10.3233/JAD-2011-110989
Bates, D., Maechler, M., Bolker, B., and Walker, S. (2014). lme4: Linear mixed-effects Models Using Eigen and S4. R Package Version, 1.1-7. Available at: http://CRAN.R-project.org/package=lme4
Bayles, K. A., Tomoeda, C. K., and Trosset, M. W. (1992). Relation of linguistic communication abilities of Alzheimer’s patients to stage of disease. Brain Lang. 42, 454–472. doi: 10.1016/0093-934X(92)90079-T
Berisha, V., Wang, S., LaCross, A., and Liss, J. (2015). Tracking discourse complexity preceding Alzheimer’s disease diagnosis: a case study comparing the press conferences of Presidents Ronald Reagan and George Herbert Walker Bush. J. Alzheimers Dis. 45, 959–963. doi: 10.3233/JAD-142763
Blanco, J. L., Hernández, L. A., Fernández, R., and Ramos, D. (2013). Improving automatic detection of obstructive sleep apnea through nonlinear analysis of sustained speech. Cogn. Comput. 5, 458–472. doi: 10.1007/s12559-012-9168-x
Brookshire, R. H., and Nicholas, L. E. (1994). Speech sample size and test-retest stability of connected speech measures for adults with aphasia. J. Speech Lang. Hear. Res. 37, 399–407. doi: 10.1044/jshr.3702.399
Brown, C., Snodgrass, T., Kemper, S. J., Herman, R., and Covington, M. A. (2008). Automatic measurement of propositional idea density from part-of-speech tagging. Behav. Res. Methods 40, 540–545. doi: 10.3758/BRM.40.2.540
Bschor, T., Kuhl, K. P., and Reischies, F. M. (2001). Spontaneous speech of patients with dementia of the Alzheimer type and mild cognitive impairment. Int. Psychogeriatr. 13, 289–298. doi: 10.1017/S1041610201007682
Carter, C. S., Braver, T. S., Barch, D. M., Botvinick, M. M., Noll, D., and Cohen, J. D. (1998). Anterior cingulate cortex, error detection, and the online monitoring of performance. Science 280, 747–749. doi: 10.1126/science.280.5364.747
Croisile, B., Ska, B., Brabant, M. J., Duchene, A., Lepage, Y., Aimard, G., et al. (1996). Comparative study of oral and written picture description in patients with Alzheimer’s disease. Brain Lang. 53, 1–19. doi: 10.1006/brln.1996.0033
Cuetos, F., Arango-Lasprilla, J. C., Uribe, C., Valencia, C., and Lopera, F. (2007). Linguistic changes in verbal expression: a preclinical marker of Alzheimer’s disease. J. Int. Neuropsychol. Soc. 13, 433–439. doi: 10.1017/S1355617707070609
Daniele, A., Giustolisi, L., Silveri, M. C., Colosimo, C., and Gainotti, G. (1994). Evidence for a possible neuroanatomical basis for lexical processing of nouns and verbs. Neuropsychologia 32, 1325–1341. doi: 10.1016/0028-3932(94)00066-2
Drummond, C., Coutinho, G., Fonseca, R. P., Assuncao, N., Teldeschi, A., de Oliveira-Souza, R., et al. (2015). Deficits in narrative discourse elicited by visual stimuli are already present in patients with mild cognitive impairment. Front. Aging Neurosci. 7:96. doi: 10.3389/fnagi.2015.00096
Fabrigar, L. R., Wegener, D. T., MacCallum, R. C., and Strahan, E. J. (1999). Evaluating the use of exploratory factor analysis in psychological research. Psychol. Methods 4, 272–299. doi: 10.1007/s10803-009-0816-2
Folstein, M. F., Folstein, S. E., and McHugh, P. R. (1975). “Mini-mental state”: a practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 12, 189–198. doi: 10.1016/0022-3956(75)90026-6
Garrard, P., Maloney, L. M., Hodges, J. R., and Patterson, K. (2005). The effects of very early Alzheimer’s disease on the characteristics of writing by a renowned author. Brain 128(Pt 2), 250–260. doi: 10.1093/brain/awh341
Gayraud, F., Lee, H. R., and Barkat-Defradas, M. (2011). Syntactic and lexical context of pauses and hesitations in the discourse of Alzheimer patients and healthy elderly subjects. Clin. Linguist. Phon. 25, 198–209. doi: 10.3109/02699206.2010.521612
Henry, J. D., Crawford, J. R., and Phillips, L. H. (2004). Verbal fluency performance in dementia of the Alzheimer’s type: a meta-analysis. Neuropsychologia 42, 1212–1222. doi: 10.1016/j.neuropsychologia.2004.02.001
Hodges, J. R., Erzinçlioðlu, S., and Patterson, K. (2006). Evolution of cognitive deficits and conversion to dementia in patients with mild cognitive impairment: a very-long-term follow-up study. Dement. Geriatr. Cogn. Disord. 21, 380–391. doi: 10.1159/000092534
Hoffmann, I., Nemeth, D., Dye, C. D., Pákáski, M., Irinyi, T., and Kálmán, J. (2010). Temporal parameters of spontaneous speech in Alzheimer’s disease. Int. J. Speech Lang. Pathol. 12, 29–34. doi: 10.3109/17549500903137256
Jack, C. R., Bennett, D. A., Blennow, K., Carrillo, M. C., Feldman, H. H., Frisoni, G. B., et al. (2016). A/T/N: an unbiased descriptive classification scheme for Alzheimer disease biomarkers. Neurology 87, 539–547. doi: 10.1212/WNL.0000000000002923
Johnson, S. C., Koscik, R. L., Jonaitis, E. M., Clark, L. R., Mueller, K. D., Berman, S. E., et al. (2017). The Wisconsin Registry for Alzheimer’s Prevention: a review of findings and current directions. Alzheimers Dement. (Amst.) (in press). doi: 10.1016/j.dadm.2017.11.007
Kemper, S., Thompson, M., and Marquis, J. (2001). Longitudinal change in language production: effects of aging and dementia on grammatical complexity and propositional content. Psychol. Aging 16, 600–614. doi: 10.1037/0882-7922.214.171.1240
Koscik, R. L., Berman, S. E., Clark, L. R., Mueller, K. D., Okonkwo, O. C., Gleason, C. E., et al. (2016). Intraindividual cognitive variability in middle age predicts cognitive impairment 8–10 years later: results from the Wisconsin Registry for Alzheimer’s Prevention. J. Int. Neuropsychol. Soc. 22, 1016–1025. doi: 10.1017/S135561771600093X
Laske, C., Sohrabi, H. R., Frost, S. M., López-de-Ipiña, K., Garrard, P., Buscema, M., et al. (2015). Innovative diagnostic tools for early detection of Alzheimer’s disease. Alzheimers Dement. 11, 561–578. doi: 10.1016/j.jalz.2014.06.004
Le, X., Lancashire, I., Hirst, G., and Jokel, R. (2011). Longitudinal detection of dementia through lexical and syntactic changes in writing: a case study of three British novelists. Lit. Linguist. Comput. 26, 435–461. doi: 10.1093/llc/fqr013
Lim, A. S., Kowgier, M., Yu, L., Buchman, A. S., and Bennett, D. A. (2013). Sleep fragmentation and the risk of incident Alzheimer’s disease and cognitive decline in older persons. Sleep 36, 1027–1032. doi: 10.5665/sleep.2802
McKhann, G., Drachman, D., Folstein, M., Katzman, R., Price, D., and Stadlan, E. M. (1984). Clinical diagnosis of Alzheimer’s disease report of the NINCDS-ADRDA work group∗ under the auspices of department of health and human services task force on Alzheimer’s disease. Neurology 34, 939–944. doi: 10.1212/WNL.34.7.939
McNamara, P., Obler, L. K., Au, R., Durso, R., and Albert, M. L. (1992). Speech monitoring skills in Alzheimer’s disease, Parkinson’s disease, and normal aging. Brain Lang. 42, 38–51. doi: 10.1016/0093-934X(92)90055-J
Mueller, K. D., Koscik, R. L., Clark, L. R., Hermann, B. P., Johnson, S. C., and Turkstra, L. S. (2017). The latent structure and test-retest stability of connected language in the Wisconsin Registry for Alzheimer’s Prevention. Arch. Clin. Neuropsychol. doi: 10.1093/arclin/acx116 [Epub ahead of print].
Mueller, K. D., Koscik, R. L., Turkstra, L. S., Riedeman, S. K., LaRue, A., Clark, L. R., et al. (2016). Connected language in late middle-aged adults at risk for Alzheimer’s disease. J. Alzheimers Dis. 54, 1539–1550. doi: 10.3233/JAD-160252
Ripich, D. N., Fritsch, T., Ziol, E., and Durand, E. (2000). Compensatory strategies in picture description across severity levels in Alzheimer’s disease: a longitudinal study. Am. J. Alzheimers Dis. Other Demen. 15, 217–228. doi: 10.1177/153331750001500407
Sager, M. A., Hermann, B., and La Rue, A. (2005). Middle-aged children of persons with Alzheimer’s disease: APOE genotypes and cognitive function in the Wisconsin Registry for Alzheimer’s Prevention. J. Geriatr. Psychiatry Neurol. 18, 245–249. doi: 10.1177/0891988705281882
Sajjadi, S. A., Patterson, K., Tomek, M., and Nestor, P. J. (2012). Abnormalities of connected speech in semantic dementia vs Alzheimer’s disease. Aphasiology 26, 847–866. doi: 10.1080/02687038.2012.654933
Snowdon, D. A., Kemper, S. J., Mortimer, J. A., Greiner, L. H., Wekstein, D. R., and Markesbery, W. R. (1996). Linguistic ability in early life and cognitive function and Alzheimer’s disease in late life. Findings from the nun study. JAMA 275, 528–532. doi: 10.1001/jama.1996.03530310034029
Snyder, P. J., Kahle-Wrobleski, K., Brannan, S., Miller, D. S., Schindler, R. J., DeSanti, S., et al. (2014). Assessing cognition and function in Alzheimer’s disease clinical trials: do we have the right tools? Alzheimers Dement. 10, 853–860. doi: 10.1016/j.jalz.2014.07.158
Tomoeda, C. K., Bayles, K. A., Trosset, M. W., Azuma, T., and McGeagh, A. (1996). Cross-sectional analysis of Alzheimer disease effects on oral discourse in a picture description task. Alzheimer Dis. Assoc. Disord. 10, 204–215. doi: 10.1097/00002093-199601040-00006
Ulatowska, H. K., Allard, L., Donnell, A., Bristow, J., Haynes, S. M., Flower, A., et al. (1988). Use of discourse analysis for evaluation of the condition of patients with dementia of the Alzheimer type. Neurol. Neurochir. Pol. 22, 34–37.
Keywords: Alzheimer’s disease (AD), mild cognitive impairment (MCI), language, speech, connected speech, discourse analysis, picture description, verbal fluency
Citation: Mueller KD, Koscik RL, Hermann BP, Johnson SC and Turkstra LS (2018) Declines in Connected Language Are Associated with Very Early Mild Cognitive Impairment: Results from the Wisconsin Registry for Alzheimer’s Prevention. Front. Aging Neurosci. 9:437. doi: 10.3389/fnagi.2017.00437
Received: 20 October 2017; Accepted: 18 December 2017;
Published: 09 January 2018.
Edited by:Lutz Jäncke, University of Zurich, Switzerland
Reviewed by:Jed A. Meltzer, Baycrest Hospital, Canada
Guido Gainotti, Fondazione Policlinico Universitario A. Gemelli – Università Cattolica del Sacro Cuore, Italy
Copyright © 2018 Mueller, Koscik, Hermann, Johnson and Turkstra. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Kimberly D. Mueller, email@example.com