- 1Department of Psychology, Durham University, Durham, United Kingdom
- 2Ascentys Limited, Bradford, United Kingdom
- 3Centre for Dementia Research, Leeds Beckett University, Leeds, United Kingdom
- 4School of Social and Health Sciences, Leeds Trinity University, Leeds, United Kingdom
- 5Cumbria Partnership NHS Foundation Trust, School of Psychology, Newcastle University, Newcastle, United Kingdom
- 6Leeds and York Partnership NHS Trust, Leeds, United Kingdom
- 7School of Psychological Science, University of Bristol, Bristol, United Kingdom
- 8Tees, Esk and Wear Valleys NHS Foundation Trust, Darlington, United Kingdom
- 9Institute of Cognitive Neuroscience, University College London, London, United Kingdom
- 10The Department of Psychological Medicine, The Old Chapel, Bootham Park, York and Scarborough Teaching Hospitals NHS Foundation Trust, York, United Kingdom
Introduction: In an increasingly aging society, testing hippocampal-dependent cognition in a quick and low resource manner will be crucial in: assessing the potential benefits of lifestyle choices and interventions affecting cognitive ageing (such as those involving exercise, diet, and sleep); detecting pathological aging, such as in Alzheimer’s disease, where hippocampal degeneration occurs relatively early on.
Methods: Over 300 participants aged 18-89 completed three cognitive tests, namely the Addenbrooke’s Cognitive Examination-III (ACE-III), The Four Mountains Task (4MT), and a new task introduced here, the Spaces and Sequences Episodic Video Task (SSEVT). Hippocampal tissue is particularly vulnerable to aging, and the 4MT and SSEVT were designed to be hippocampal-dependent. Accordingly, we tested the hypothesis that 4MT and SSEVT performance would be significantly compromised by aging. As an initial proof-of-concept exploration of these tests’ ability to detect pathological aging, such as in Alzheimer’s disease, we compared 10 patients with Mild Cognitive Impairment (MCI) with matched subsamples of the older group (Healthy ageing, HA).
Results: Supporting the hippocampal-aging related hypothesis, 4MT and SSEVT scores showed appreciably stronger age-related declines than ACE-III scores. The middle-aged group (mean: ∼51 years) were significantly worse than the young group (mean: ∼21 years) on the 4MT (Cohen’s d = 0.724) and the SSEVT (Cohen’s d = 0.443); and the older group (mean: ∼71 years) were significantly worse than the middle-aged group on the SSEVT (Cohen’s d = 0.724). Neither pattern was seen for ACE-III. Suggestively, the MCI patients performed worse than the matched HA group on the 4MT (consistent with previous work), and on our novel SSEVT, but not on the ACE-III.
Discussion: We conclude that the 4MT and SSEVT may be suitable for assessing lifestyle choices and interventions affecting cognitive ageing. We also propose that these findings provide an initial proof-of-concept for these tests’ ability to detect pathological aging in its early stages and support further exploration of this with larger clinical samples.
1 Introduction
1.1 Need for early diagnosis of Alzheimer’s disease (AD)
Alzheimer’s disease (AD) is a neurodegenerative disorder characterized by progressive neuronal death (Braak and Braak, 1995; Hardy and Higgins, 1992). If identified in its prodromal stages, AD progression may be decelerated via protective lifestyle factors (Frisoni et al., 2020) such as social embeddedness and exercise (Cass, 2017; Edwards et al., 2019; Norton et al., 2014) and the emergence of anti-amyloid immunotherapies such as lecanemab raises the possibility of future pharmacological disease modification or potentially, even pharmacological treatments (Aisen et al., 2017; Sims et al., 2023; Van Dyck et al., 2023). However, for any such treatments to be effective, they must be targeted to those in the early stages of the disease. In these prodromal stages, AD present clinically as mild cognitive impairment (MCI; Albert et al., 2011), in which there is evidence of subjective and objective cognitive impairment but with intact daily living activities, but on clinical grounds alone MCI due to AD can be hard to distinguish from other causes of MCI such as anxiety. The forecasted near tripling of global AD cases from 2019 to 2050 (Nichols and Vos, 2021), currently estimated as ∼35% of over-85’s (Alzheimer’s Association, 2021), reinforces the pressing need for new screening measures able to identify the disease early.
1.2 Using hippocampal-dependent tasks to assess early stages of Alzheimer’s disease (AD) and healthy cognitive aging
Though AD pathology is widespread throughout the neocortex in its later stages, the early stages of AD is characterized by changes localized to the hippocampal formation, and to the rhinal cortices which provide input to the hippocampus (Braak et al., 1993; Gómez-Isla et al., 1996; Wang et al., 2003; Whitwell et al., 2007; Braak and Del Tredici, 2015). While such post-mortem staging evidence has been taken to suggest an obligatorily rhinal first stage, more recent evidence suggests hippocampal as well as rhinal initial staging posts of neurodegeneration (Ravikumar et al., 2024). As such, detection of hippocampal damage may aid detection of early AD, increasing diagnostic confidence when combined with blood biomarkers of β-amyloid and tau pathology that will be available for clinical use in the near future. However, measurement of hippocampal atrophy using structural brain imaging has been found on systematic review to have low sensitivity and specificity for MCI due to AD and is therefore not recommended as a standalone add-on diagnostic test (Lombardi et al., 2020).
Tests of hippocampal-dependent cognition may provide some benefit here. While neuropsychological cognitive tests cannot of course replace biomarker tests and clinical assessments, they may have some advantages as initial or additional screening tools (e.g., non-invasive, easier to upscale). In particular, there is likely strong benefit to short cognitive tests which can provide complementary diagnostic information, potentially increasing the sensitivity and, crucially, specificity afforded by current tests (Laske et al., 2015; Wood et al., 2016). More complex test batteries could be employed at later stages for specifically at-risk categories. Tasks targeting dysfunction in the Entorhinal cortex could also be promising, (e.g., Howett et al., 2019) but remain at present in the research domain and are yet to undergo the same degree of validation and large-sample testing as hippocampal tests.
Given the size of the aging population, cognitive tests of hippocampal function that can be applied at scale and at low cost and user burden will be of great benefit in assessing the potential benefits of lifestyle choices and pro-cognitive interventions such as those involving sleep, diet, stress-reduction, and exercise (e.g., Meerlo et al., 2009; Erickson et al., 2011; Duzel et al., 2016; Fjell et al., 2020). The hippocampus is one of very few brain regions where neurogenesis occurs in the adult human, and thus likely has a privileged role in mediating healthy cognition in aging. Taking exercise as an example, cross-sectional and longitudinal evidence indicates that cardiorespiratory-focused exercise boosts hippocampal volume, and spatial memory (Erickson et al., 2011; Aghjayan et al., 2021), effects at least partly dependent upon neurogenesis (Van Praag, 2008; Erickson et al., 2011; Liu and Nusslock, 2018).
In summary, hippocampal-dependent cognitive tests could therefore be useful not just as screening tools for early AD, but in exploring the nature of healthy cognitive aging and factors that affect it.
1.3 Hippocampal-dependent cognition: spatial and episodic memory
What cognitive functions depend upon the hippocampus? Many lines of evidence (e.g., reviewed O’Keefe and Nadel, 1978; Poulter et al., 2018) indicate an important role for the hippocampus in spatial memory. Briefly, these include: the discovery of cells in the hippocampal formation with spatial correlates (O’Keefe and Dostrovsky, 1971; Taube et al., 1990; Ekstrom et al., 2003; Hafting et al., 2005; Lever et al., 2009; Jacobs et al., 2013; Høydal et al., 2019; Poulter et al., 2021); impairments in spatial memory following hippocampal lesions (Broadbent et al., 2004; Kessels et al., 2001; Morris et al., 1982); and positive associations between spatial abilities on the one hand, and hippocampal volume (Schinazi et al., 2013; Woollett and Maguire, 2011) and activity (Burgess et al., 2002; Cornwell et al., 2008; Li and King, 2019) on the other.
Spatial memory likely shares underlying processes with episodic memory (e.g., O’Keefe and Nadel, 1978; Bird and Burgess, 2008; Buzsáki and Moser, 2013), and hippocampal-lesioned patients show a combination of spatial, episodic, and temporal memory deficits (Scoville and Milner, 1957; Vargha-Khadem et al., 1997). The latter may be linked to a hippocampal role in supporting memories for sequences of events (Fortin et al., 2002; Gilbert and Kesner, 2002; Kumaran and Maguire, 2006). Importantly, the hippocampus is thought crucial to episodic recollection, whilst other regions likely subserve familiarity-based recognition (Brown and Aggleton, 2001; Ranganath et al., 2004; Diana et al., 2007). Thus, a putative hippocampal-dependent episodic memory task should minimize contributions from purely familiarity-based recognition. This was an important consideration in the design of our novel “Spaces and Sequences Episodic Video task” (SSEVT) described below (see section “2.2 Tests and questionnaires”) and introduced in this report.
In addition to the lines of evidence outlined above, the fact that spatial and episodic memory are hippocampal-dependent aligns with the observed decline in hippocampal volume during healthy aging (Cansino, 2009; Heo et al., 2010). It is also consistent with the prominence of spatial and episodic memory deficits, contrasted with the relative preservation of semantic memory, in MCI and early stages of AD (Bäckman et al., 2001; Ghoshal et al., 2002; Greenaway et al., 2006; Grober et al., 2008; Di Paola et al., 2007; Lithfous et al., 2013; Mortamais et al., 2017; Murphy et al., 2008; Twamley et al., 2006).
1.4 Some limitations of existing screening tools for AD and cognitive aging
Despite the potential advantages of neuropsychological cognitive tests, the value of existing measures is constrained by several limitations. The most used tests of dementia in the United Kingdom are the Mini Mental State Examination (MMSE; Folstein et al., 1975), the Montreal Cognitive Assessment (Nasreddine et al., 2005), and Addenbrooke’s Cognitive Examination-III (ACE-III; Hsieh et al., 2013), all three being brief screens of global cognition. To date, evidence for the accuracy of these screens in identifying AD and MCI has been mixed. For example, some studies have found the ACE-III has high diagnostic accuracy in identifying mild AD (Matias-Guiu et al., 2017) and can differentiate between controls, MCI patients, and dementia patients (Potts et al., 2022). Comparatively, Beishon et al. (2019) systematic review concluded that there was insufficient evidence to recommend the use of ACE-III in screening for dementia or MCI in those at risk of cognitive decline, reporting variable sensitivity (dementia: 82%–97%; MCI: 75%–77%) and specificity (dementia: 4%–77%; MCI: 89%–92%). The latter also noted that no studies had been conducted in the primary care setting. Similarly, the accuracy of MMSE scores in the conversion of MCI to AD dementia has been shown to vary in sensitivity (27%–89%) and specificity (32%–90%, Arevalo-Rodriguez et al., 2015).
These screening tools are also subject to educational bias (Bruno and Schurmann Vignaga, 2019; Pan et al., 2022), influencing their accuracy of dementia diagnosis (Jubb and Evans, 2015). This may be because education provides a degree of “cognitive reserve” whereby a more-educated individual performs better in neuropsychological tests than a lesser educated individual, despite having the same degree of AD pathology (Wilson et al., 2019; Swanwick et al., 1999). Higher education seems to facilitate better performance on cognitive tests (Glymour et al., 2012) and a lower dementia risk (Caamaño-Isorna et al., 2006; Sharp and Gatz, 2011), consistent with cognitive reserve theory (Stern, 2012). However, some have found that higher education accelerates deterioration in AD once diagnosed (Scarmeas et al., 2006; Wilson et al., 2004) and does not affect rates of cognitive decline in healthily aging individuals (Lenehan et al., 2015). Regarding education, we suggest it is important, if difficult, to distinguish between two hypothetical factors: education as a factor conferring expertise advantages (e.g., re instructions, strategies, verbal associations) specific to a given neuropsychological task; and education as having provided a kind of directly neurotrophic factor which boosts real-world episodic memory in a more general manner dependent on healthier neural tissue. In our view, episodic memory test performance should ideally be dependent only upon this second factor.
In tasks such as the ACE-III, not only education but also verbal intelligence, proficiency in the test language, and cultural biases can modulate scores, potentially causing inequity in diagnostic provision across differing ethnocultural communities (Brown et al., 2021). These can be partially addressed by generating versions specific to a language/culture (Pan et al., 2022), but these may also run into difficulties (e.g., intersectional participants).
These existing measures also heavily rely on verbal tasks, which can be problematic when exploring medial temporal lobe functioning. For example, such tasks have limited utility in the context of patients with expressive aphasia. Visual tasks, by comparison, would not be limited by this issue. Furthermore, making tasks more reliant on visual than verbal elements has the potential to reduce the effect of education and sex on test performance (Kawano et al., 2010; Vadikolias et al., 2012) and to tap into dysfunction in the linguistically non-dominant hemisphere.
Few tests are currently available that are both suitable for application to clinical practice and can target episodic memory in an ecologically valid manner (Plancher et al., 2012). Current screening measures often rely on tasks that are not reflective of daily life and therefore may not be generalizable. Assessments that are reflective of genuine, real-life cognitive abilities are essential if we are to detect cognitive decline early, as these represent the clinical, functional impact of such cognitive changes (Bhargava et al., 2024). The most common episodic memory tests test verbal learning of word lists (e.g., RAVLT), and typically require specialized training to execute and score, as do most neuropsychological tests. This could be problematic if such tests are to be applied as screening tools on a large scale, bringing additional time and cost. Similarly, where there is possible variation in administration and interpreting scores, an element of bias and subjectivity is introduced. More importantly, verbal learning scores are often affected by secondary factors, including sex (Bolla-Wilson and Bleecker, 1986; Herlitz and Rehnman, 2008; Sundermann et al., 2019), verbal intelligence (Bolla-Wilson and Bleecker, 1986), proficiency in the test language (Blumenau and Broom, 2011), and education (Olaya et al., 2017; Van der Elst et al., 2005; Lou et al., 2022).
1.5 Could tests tapping the hippocampus fill the gap?
The above considerations suggest a need for tasks which test spatial and/or episodic memory in a hippocampal-dependent manner, while being minimally reliant on culture-specific knowledge and education-based expertise. Toward this aim, we explored the Four Mountains Test (4MT) and our novel “Spaces and Sequences Episodic Video Task” (SSEVT). The 4MT is a test of short-term, allocentric spatial memory, in which hippocampal-damaged patients show clear deficits (Hartley et al., 2007). 4MT scores correlate with hippocampal volume in healthy young adults (Hartley and Harlow, 2012). Regarding AD, 4MT scores have been shown to correlate with AD biomarkers in MCI patients (Moodley et al., 2015), predict MCI-to-AD conversion (Wood et al., 2016), and distinguish AD from Frontotemporal and Semantic Dementia (Bird et al., 2010; Pengas et al., 2010).
Our intention with the SSEVT (Figures 2, 3) was that it should be an ecologically-valid task, that it should be focused on hippocampus-dependent cognitions notably spatial and episodic memory, that it should be easy to administer, and ideally minimally dependent upon education-based expertise. See section “2.2.2 Spaces and sequences episodic video task (SSEVT)” below for further details regarding task design rationale, and specificity of each of the five types of questions. Briefly, the video in the SSEVT simulates a participant touring through a house from a first-person perspective. The tour lasts 256 s, and involves various episodic events taking place within eight rooms, with two visits to each of the eight rooms, and exposure to seven characters overall. Two characters appear three times, the others appear twice. The purpose of doubling exposures to characters and rooms is to generate interference, and bias successful strategies toward potentially hippocampus-reliant processes involving representations of context and sequences. Importantly, unlike most verbal tests of episodic memory, with a view to ease of administration, the SSEVT uses a forced-choice question format. After a 20 min interval, participants answer five sections of questions, each beginning with a practice question (Figure 2 shows example practice questions). Section 1 of the SSEVT examines character-event associations, section 2 examines object-context associations, and sections 3–5 all examine sequence memory, each with varying reliance on spatial context.
1.6 Study rationale and aims: key points
Evidence clearly shows (see section “1.3 Hippocampal-dependent cognition: spatial and episodic memory”) that hippocampal tissue integrity (e.g., volume) declines in aging. Consistent with this, hippocampal-dependent function such as episodic memory declines with aging, while other abilities such as semantic memory are relatively preserved, (e.g., Levine et al., 2002; Piolino et al., 2002). Both the 4MT and SSEVT were designed to be hippocampal-dependent (already shown for 4MT, see section “1.5 Could tests tapping the hippocampus fill the gap?”). Accordingly, to be useful as sensitive tests of hippocampal-dependent aging (while also being simple to administer and score), and thus potentially suitable for assessing the factors that modulate hippocampal-dependent aging, we reasoned that performance on the 4MT and SSEVT should show a clear decline with age. The first part of this study examined this prediction, while also exploring other potential modulators of performance on these tests, such as sex and education (see section “1.4 Some limitations of existing screening tools for AD and cognitive aging”).
The 4MT and SSEVT were also designed with a view to detecting early stages of AD. As an initial proof-of-concept test of this idea, in the second part of this study, we examined MCI-vs-Healthy Aging (HA) discrimination in three cognitive tests: the 4MT, SSEVT, and ACE-III. We compared performance of a small sample (n = 10) of MCI patients, diagnosed according to the Albert et al. (2011) criteria, against age-matched and education-matched healthy aging controls. Thus the second part of the study tested the prediction that 4MT and SSEVT would discriminate the MCI vs. HA groups.
2 Materials and methods
2.1 Participants
2.1.1 Healthy sample
A total of 376 healthy participants (161 male, 215 female) aged 18–89 years-old (M = 42.26, SD = 22.56) were recruited from the general population via opportunistic sampling, including through the Durham University Participant Pool and advertisements on social media. Racial identity was largely White British/Irish (matching the MCI sample) as follows. White (including British/Irish/European/other White), n = 249; Black (various), n = 7; Asian (including East Asian and South Asian) n = 21; Mixed, n = 33; Other/data not given/available, n = 66. Exclusion criteria were as follows: (1) Presence of significant neurological conditions; (2) Major psychiatric disorders; (3) The use of cognitive enhancing drugs e.g., Cholinesterase Inhibitors; (4) A history of alcohol excess or illicit drug use resulting in functional issues or reliance on external assistance; (5) The diagnosis of any form of dementia; (6) Individuals under the age of 18. All participants were required to be fluent in English.
2.1.2 Cognitively impaired sample
A total of 10 MCI patients (five male, five female) aged 62–86 (M = 73.30, SD = 8.47), all White British, took part in the study. All had received a diagnosis of MCI, according to the Albert et al. (2011) criteria, within the previous 3 months: this time frame was adopted to ensure that patients’ cognition had not appreciably declined further in the interim between diagnosis and testing. Participants were recruited from four different sites: Tees Esk and Wear Valleys NHS Foundation Trust (N = 1); Cumbria Partnership NHS Foundation Trust (N = 5); Leeds and York Partnership NHS Foundation Trust (N = 3); and South Tees Hospitals NHS Foundation Trust (N = 1). Exclusion criteria were as above for healthy participants. Ethical approval for this research was given by the United Kingdom Health Research Authority and the Durham University Ethics Committee.
2.2 Tests and questionnaires
2.2.1 Four Mountains Task
The 4MT is a test of working allocentric memory that assesses the participants’ ability to recall the spatial configuration of a series of computer-generated landscapes from different viewpoints; this is designed to reflect the role of the hippocampus in spatial cognition. The rationale behind the 4MT has two key foundations: (1) The hippocampus is affected early in AD pathology [see section “1.2 Using hippocampal-dependent tasks to assess early stages of Alzheimer’s disease (AD) and healthy cognitive aging” above]; (2) The hippocampus is critical in spatial memory (see section “1.3 Hippocampal-dependent cognition: spatial and episodic memory”). A test such as the 4MT, which explores allocentric, view-independent spatial memory, should be highly dependent on the integrity of hippocampal tissue, and thus sensitive to the damage caused by early stages of AD. Participants complete 15 trials in which they are first shown a landscape depicting four mountains, and then after a two second delay, have to identify which of four images shows the same initial landscape, but viewed from a different perspective. For the full procedure of the 4MT, (see Chan et al., 2016; Hartley et al., 2007; Bird et al., 2010; Moodley et al., 2015). Figure 1A below shows a plan view of the computer-generated landscape, with the seven possible viewpoints shown, whilst Figure 1B shows an example trial. Participants recorded their answers on an A4 answer sheet.

Figure 1. The Four Mountains Test. Stimuli used for the Four Mountains Test. (A) Each landscape consists of four hills of varying shape and size in a semicircular range. Each stimulus used a unique configuration of hills in addition to various combinations of non-spatial parameters, including cloud cover, lighting, texture, and vegetation color, to create naturalistic variation in the scenes. The scenes are shown as though through a virtual camera sited at one of seven viewpoints (shown by white circles), each equidistant from the center of the circular field. (B) Example of one trial within the test. The subject is first presented with a rendered stimulus image based on the topography described above in panel (A). They are then presented with four scenes comprising varying topographical landscapes. One of these scenes depicts the same topographical landscape as shown in the previous scene but viewed from a different angle (one of the seven pre- determined viewpoints). The other three images show different scenes, where the size, shape, or layout of the mountains have changed. The correct answer is the image showing the same topographical landscape. Participants complete fifteen of these trials.

Figure 2. Example practice questions from sections 1 to 2 of the Spatial and Sequence Episodic Video Task (SSEVT). During the video, participants see multiple rooms, characters, objects, and events. After a retention interval, the participant is then asked 20 multiple-choice questions about what they saw, divided into five sections. (A) An example from section 1 of the video: character questions. During the video, participants see a woman holding a tray who offers them a variety of snacks. During the questionnaire stage, they are then asked, “Who offered you a snack?”. The participant is shown four options of different characters, one of whom being the correct character seen in the video. The other three characters all appeared at some point in the video and were set against neutral backgrounds, preventing recall by familiarity alone. (B) An example of a question from section 2 (object-based questions). During the video, the participant sees a woman looking out the window of a room with blue flooring, yellow walls, and various pieces of furniture, including a black coffee table. They are later asked, “In which room did you see this table?”. The participant is then shown four different options of rooms, one of which being the correct room as seen in the video. The three other rooms contain features present in then video so that the correct answer cannot be identified by familiarity alone.

Figure 3. The practice questions for each of the five sections of the SSEVT. The SSEVT has five sections as shown in the Figure above. Sections focus upon; Person associations, object location, Event order (Immediately before/after), Event order (Multiple events), and Room order. Each section was preceded by a practice question. Responses were forced choice with one correct answer and either three (see sections “1 Introduction”–“4 Discussion”) or one (section 5) other option(s) as lures. No feedback was given for either practice or scored questions.
2.2.2 Spaces and Sequences Episodic Video Task (SSEVT)
Key design aims of this task included the following. We aimed to create an ecologically valid first-person perspective task assessing spatial and episodic recall that was easy to administer. To render scoring expertise unnecessary, we implemented a forced-choice question format. With a view to the putative hippocampal-dependence of the task, we sought to minimize the ability of familiarity-dependent recognition processes to task success. For example: (a) each of the eight rooms is visited twice, with a different event on each visit; (b) person-association questions presented characters who all appeared within the video and were therefore all familiar, with character images set against neutral backgrounds to minimize snapshot memory contributions; (c) object-room association questions presented objects against neutral backgrounds, and bird’s eye views of rooms that were not previously presented in the video itself. With these aims in mind, the SSEVT (see Supplementary Materials 1) was created by authors CL, SK, JD, CW, SS, and GM-B. The SSEVT simulates a participant touring through a house from a first-person perspective. During the 256 s video, the individual moves through a house made up of eight distinctly decorated rooms. Across these rooms the viewer is exposed to seven people (four women and three men). Two characters appear three times, whilst all others appear twice. Within each room, a particular event occurs, such as someone watching TV, talking about the weather, or hanging up a picture (see Figure 2 for example screenshots of the video). Before the video begins, the participant is presented with the following script: “The video you are about to watch is presented as if you were walking through a computerized house. The video lasts approximately 4.5 min and various things will happen as you walk through the house. Please pay attention to the video and remember what you can as you will be asked questions about the video 20 min after it finishes. The questions will assess how well you can remember; the people, the objects, the rooms, and the events.” A total of 20 min after viewing the video, the participant answers 20 forced-choice questions over five sections. Each section began with a practice question to familiarize the participant with the up-coming question format. Two practice questions are shown as examples in Figure 2.
For a visual overview and example questions for each section, see Figure 3. Section 1 assesses character-based associations. These questions require the participant to identify which of four possible individuals (all of whom appeared in the video) were involved in a given event. In Section 2, participants are shown an object and are asked in which room it appeared. This aims to tap into hippocampal functioning by assessing associations between a spatial context and its contents (object-room associations). Sections 3–5 all examine sequence memory, but each with a different reliance on spatial context. In Section 3, participants are reminded of an event that occurred (e.g., they saw a person watching TV) and are asked to recall what happened in the room immediately before it. This therefore relies on participants being able to recall the order (sequence) of events. Comparatively, Section 4 presents participants with three separate events and asks them to place them in the correct order that they occurred in the video. This again relies on memory of event order but does not link events to rooms. Finally, section 5 does not necessitate memory of events, but instead asks participants to recall the order in which they visited different rooms. Participants are shown pictures of two different rooms and asked, “Which room did you visit first” or “Which room did you visit most recently?”. For Sections 1-4, participants had four options to choose from, and each correct answer scored one point. For section 5, as they had only two options to choose from, each correct answer scored 0.5 points. The total score on the SSEVT was the sum of points from all five sections. Accordingly, with six questions in section 5, the maximum score on the SSEVT was 17 points.
2.2.3 Addenbrooke’s Cognitive Examination III (ACE-III)
The ACE-III is a screening tool for global cognition that is free to use and has been widely adopted in the United Kingdom’s NHS, and elsewhere (Hsieh et al., 2013). Due to this widespread use, the ACE-III was included in the testing battery of this project. The test is paper-based and includes items assessing attention, memory, fluency, language, and visuospatial abilities, with a maximum score of 100.
2.2.4 Spatial Abilities and Practices Questionnaire (SAPQ)
The rationale for the need for hippocampal-dependent tasks for assessing healthy aging and early stages of AD also apply to questionnaires. With this in mind, the SAPQ (Supplementary Materials 1) was developed by authors CL, GM-B, DC, and DH to assess spatial memory and navigational ability in both familiar and unfamiliar environments in the real world, and is made freely-available here. The questionnaire consists of 15 questions assessing different kinds of spatial ability, in the form of statements to endorse using a Likert scale, e.g., “I have difficulty accurately visualizing in my mind’s eye the local walking routes (e.g., to shops, pubs, parks, restaurants) to and from my home.” One important aim in designing the SAPQ was to minimize potential misunderstanding of terminology and the need for abstract reflection by including specific questions that may be friendlier to answer for cognitively aging participants. Thus, the questions often contain examples, such as “when I walk out of a large shop or shopping center,” and “local walking routes (e.g., to shops, pubs, parks, restaurants),” designed to try to jog typical memories and knowledge of habits from the participant’s mind. Another aim was to capture potential decline in spatial abilities. Accordingly, every question in the SAPQ is followed by the question “Has this ability changed within the last 12 months?”, to gauge potential decline. This was scored from −2 to +2 (indicating direction of change) and then added or deducted from the original score. One potential drawback associated with the use of self-report questionnaires is the issue of illness insight, relevant of course for MCI and AD, as insight into the disease declines as it progresses (Migliorelli et al., 1995; Starkstein et al., 1996). This kind of anosognosia (Horning et al., 2014) can affect the way in which disease-affected individuals answer questionnaires (Bédard et al., 2003; Farias et al., 2005), such as those tapping spatial navigation abilities: participants may overestimate their navigational abilities. In order to mitigate this in the present study, we allowed participants to get help (from a partner, family member, or other career) in answering questions, with a tick box to specify if the participant answered the questions alone or with assistance.
2.2.5 Social Networks and Social Embeddedness Questionnaire (SNSEQ)
Having rich and meaningful social networks (being “socially embedded”) appears to protect against cognitive decline (Giles et al., 2012; Holtzman et al., 2004). Importantly, this may be achieved through hippocampal modulation (Branchi et al., 2006; Smith et al., 2018). Larger social networks are thought to provide a form of “social buffering,” which protects against AD pathology (Davitz and Mason, 1955; Coe et al., 1978). Those with smaller social networks may therefore have a greater dementia risk (Bennett et al., 2006; Fratiglioni et al., 2004). Consequently, measuring social embeddedness may be important in understanding AD risk. With this in mind, this study introduces the SNSEQ (Supplementary Materials 2) as a freely-available tool in the assessment of healthy and pathological aging. The SNSEQ was developed by authors GM-B, JD, and CL. It consists of 16 questions in the form of statements to endorse using a Likert scale [(1). Strongly disagree, (2). Disagree, (3). Neither agree not disagree, (4). Agree, or (5). Strongly agree]. As with the SAPQ, each item on the SNSEQ incorporates a measure of change: a standard Likert-scale question which asks whether there has been a change in the last 12 months and in which direction. It includes assessment of social motivation, and reliance on others to initiate and maintain social exchanges, as well as the number and quality of social networks. As with the SAPQ (see above), participants could receive help in answering questions, with a tick box to indicate if such assistance was used.
2.2.6 UCLA Loneliness Scale (ULS) and Satisfaction with Life Scale (SWLS)
Evidence for a relationship between loneliness and cognitive-aging has so-far proved inconclusive (Lara et al., 2019a,2019b; Victor, 2021) but has been associated with hippocampal volume (Düzel et al., 2019) and a heightened AD-dementia risk (Sundström et al., 2020). Life satisfaction may also influence cognition (Llewellyn et al., 2008; Goveas et al., 2016). Life satisfaction was measured using the Satisfaction with Life Scale (SWLS; Diener et al., 1985) and subjective loneliness was measured using the UCLA Loneliness Scale (ULS; Russell et al., 1978).
2.3 Procedure
2.3.1 Healthy controls
The cognitive tasks were tested in the sequence: 4MT first, SSEVT second, and ACE-III last. Testing was conducted in a quiet environment (Durham University’s Psychology Department, private homes, and occasionally quiet public places e.g., church hall). To explore the relationship between age and test performance, for some analyses the sample was divided into three groups: Young, 18–35; Middle-aged, 36–61; and Older people, 62–89.
2.3.2 MCI patients
Patients were diagnosed as having MCI according to the (Albert et al., 2011) criteria, by one of the four NHS trusts (see section “2.1 Participants” above). To have been diagnosed with MCI, participants had already taken several cognitive tests. A few patients had undergone biomarker testing, but this varied across the NHS trusts, and the sample n would be too small for meaningful analysis. Discussions with the referring site established which tests included in the project battery each patient had already completed, to prevent unnecessary duplication and/or practice effects. Potential participants were provided with an information sheet for the project, with a number to call if they were interested in participating. A follow-up letter was then sent to their address. Before testing began, informed consent was sought from the patient for use of their data, comprising: (1) their previous clinical data, notably the results from the NHS site-specific psychological test and questionnaire battery; (2) results from our additional tests; (3) any diagnostic changes made at a future time. The project-specific battery consisted of the following tasks in the following order, although for many the ACE-III had already been administered so was not repeated: 4MT; SSEVT; TMT-B; SNSEQ; RAVLT; SAPQ; and the ACE-III. Testing was conducted either at an NHS clinic (n = 1) or at the participant’s home at their request (n = 9), taking care to implement safety guidelines in accordance with risk assessments.
2.4 Data analysis
Relationships between variables in the healthy sample were explored using both Pearson and Spearman’s correlations, with Welch ANOVAs with Games Howell post hoc tests then probing differences between age-demographics. Welch tests were used to test for differences in sex and education within the sample. Multiple regressions were used to probe whether age could predict cognitive performance, with sex and education included as covariates. Differences between MCI patients and controls were probed using Welch tests. Welch tests were used as they are generally considered more robust than student t-tests or one-way ANOVAs. Participants with a missing value(s) for a measure were simply excluded from that measure whist retaining their other scores. A maximum of 87 participants per measure were excluded in this way (minimum sample size 289). For the analysis presented in Results below, 12 participants’ ACE-III scores were removed; their scores were low outliers for their age-groups (young, middle-aged, old), presumably attributable to imperfect application of the English fluency requirement. Analysis including these 12 datapoints affects none of the results reported below, namely: (a) ACE-III had a much lower sensitivity to age (all included: β = −0.132; below: β = −0.126) than 4MT and SSEVT; (b) education positively predicted ACE-III scores (all included: β = 0.249; below: β = 0.212), more so than 4MT and SSEVT; (c) ACE-III performance did not discriminate MCI patients from Healthy-aging controls (Welch t test all included: p = 0.466; below: p = 0.348).
3 Results
3.1 Overview: healthy participant characteristics and sex differences
An overview of participant demographics and scores on each test and questionnaire for the healthy sample are shown in Table 1. The only sex difference observed was that males rated their spatial ability on the Spatial Abilities and Practices Questionnaire (SAPQ) higher than females did (t287 = 4.36, p = 2e-5, Cohen’s d = 0.516), which is in line with previous finding of sex differences in self-report of spatial abilities (Weiss et al., 2003) (Additional potential sex differences were further explored using regression analyses, but none emerged). The main focus of this report is the associations with age, education, and cognitive impairment of the three cognitive tests (4MT, SSEVT, and ACE-III). We note here at the outset that 4MT and SSEVT scores were more positively correlated with each other (N = 358, r = 0.334, p = 8e-11, rho = 0.330, p < 1e-10, Figure 4A) than with ACE-III (SSEVT: N = 344, r = 0.161, p = 0.003, rho = 0.158, p = 0.003; 4MT: N = 352, r = 0.218, p = 3.6e-5, rho = 0.231, p = 1.4e-5, Figures 4B, C). These results are consistent with the 4MT and our novel SSEVT being both designed to tap hippocampal-dependent cognition. We first consider associations of the cognitive tests with age.

Figure 4. Different between-task correlations for the three cognitive tasks: the Four Mountains task (4MT), the Spaces and Sequences Episodic Video task (SSEVT), and Addenbrooke’s Cognitive Examination, version 3 (ACE-III). Between-task correlations for the three cognitive tasks. The highest correlation (r = 0.334) was seen for scores on the two tasks designed to be hippocampus dependent: the 4MT and SSEVT (A). Weaker correlations were seen between the ACE-III and SSEVT (r = 0.161) (B), and between the between the ACE-III and 4MT (r = 0.218) (C). Scores on the Spatial Abilities and Practices Questionnaire (SAPQ) were most correlated with the 4MT (r = 0.199) (D). See main text for further details of Pearson’s and Spearman’s correlation statistics. Solid blue line represents line of best fit. To avoid overplotting at shared, discrete positions, some data points were jittered, preserving vertical position while adjusting horizontal positions only marginally.
3.2 Age-related decline in test performance was more pronounced in the 4MT and SSEVT than the ACE-III
We hypothesized (see section “1.6 Study rationale and aims: key points”) that episodic and spatial memory abilities, and thus 4MT and SSEVT scores, would appreciably decline with age. The healthy sample was divided into three groups: Young, 18–35; Middle-aged, 36–61; and Older people, 62–89 (Table 2).

Table 2. Descriptive statistics and scores on the 4MT, SSEVT, and ACE-III for the different age groups.
There was a clear effect of age group on 4MT scores [F(2, 174) = 40.93, p = 3e-15, η2 = 0.181, Figure 5A]. Young people scored higher than both Middle-aged (t = 4.94, p = 8e-6, Cohen’s d = 0.724), and Older groups (t = 8.76, p = 7e-16, Cohen’s d = 1.057), with no difference between the Middle-aged and Older people (p = 0.085).

Figure 5. The 4MT and SSEVT were more sensitive to aging than the ACE-III. (A) On the 4MT, the young group scored higher than both Middle-aged and Older groups. The maximum score on the 4MT was 15. (B) On the SSEVT, the young group scored higher than both Middle-aged and Older groups, and the Middle-aged scored higher than the Older group. The maximum score on the SSEVT was 17. (C) On the ACE-III (which has a maximum score of 100), the only group difference was that the Young group scored somewhat higher than Older group. Shown are boxplots with boxes representing 25th percentile (bottom), 75th percentile (top) and median (central line and notches). Whiskers depict the minimum and maximum calculated values for each group (minimum: Q 1 -l.5*IQR; maximum Q3 +l.5*IQR). Black dots represent individual datapoints. Red asterisks signpost outliers. Young = 18–35 years; Middle-Aged = 36–61 years; Older = 62–89 years. *p < 0.05, **p < 0.001.
There was also a clear effect of age group on SSEVT scores [F(2, 164) = 35.11, p = 2e-13, η2 = 0.161, Figure 5B), with Older people scoring lower than both Middle-aged (t = 3.52, p = 0.002, Cohen’s d = 0.557) and Young participants (t = 8.37, p = 6e-14, Cohen’s d = 1.036), and Middle-Aged people also scoring lower than Young participants (t = 3.02, p = 0.009, Cohen’s d = 0.443).
Compared to 4MT and SSEVT, the effect of age group on ACE-III scores (Figure 5C) was much weaker [F(2, 150) = 5.063, p = 0.007, η2 = 0.029], with the only group difference being between Young and Older people, (t = 3.09, p = 0.007, Cohen’s d = 0.417, other comparison p-values > 0.328.
Consistent with these group differences, the negative correlation between age and performance was higher for 4MT (N = 375, r = −0.451, p = 3e-20, rho = −0.425, p = 7e-18) and SSEVT (N = 358, r = −0.418, p = 1e-16, rho = −0.404, p = 2e-15) than for ACE-III (N = 352, r = −0.185, p = 5e-4, rho = −0.122, p = 0.022). In summary, these findings are consistent with ACE-III being a test of diverse cognitions, including those (e.g., semantic memory) relatively well-preserved in aging, and with SSEVT and 4MT being tests of more specific, hippocampal-dependent cognitions, and thus expected to decline more severely with aging.
In exploratory analyses, we also examined relationships between the five individual sections of the SSEVT and other measures. A negative correlation with age was clearly seen for Section 1–4 scores (N = 357, Section 1: r = −0.283, p = 5e-8, rho = −0.279, p = 8e-8; Section 2: r = −0.220, p = 3e-5, rho = −0.219, p = 3e-5; Section 3: r = −0.218, p = 3e-5, rho = −0.222, p = 2e-5; Section 4: r = −0.286, p = 4e-8, rho = −0.255, p = 1e-6), but this was not reliable for section 5 (r = −0.089, p = 0.094, rho = −0.071, p = 0.181). Broadly similarly, positive correlations with 4MT were seen for Sections 1–4 (N = 357, all p < 0.005), but this was not reliable for section 5 (r = 0.088, p = 0.098, rho = 0.075, p = 0.159) (Table 3).

Table 3. Correlations between Spaces and Sequences Episodic Video Task (SSEVT) Section scores and other variables.
3.3 Education positively predicted performance on ACE-III and 4MT, but not SSEVT
We next explored potential education effects. Education levels differed between the three age groups, F(2,136) = 23.02, p = 2e-9, η2 = 0.149, with older people having less formal education than both middle-aged (t = 4.55, p = 3e-5) and younger (t = 6.80, p = 1e-9) people, reflecting generational changes in United Kingdom. Accordingly, straightforward between-group analysis across educational levels (Table 4) is potentially misleading. To try tease out independent associations of age and education, and check if the age-related associations above (see section “3.2 Age-related decline in test performance was more pronounced in the 4MT and SSEVT than the ACE-III”) were robust, we ran multiple regressions on the entire participant sample, with age and education as co-predictors of cognitive test performance.

Table 4. Age and cognitive test scores on the Four Mountains Test (4MT), Spaces and Sequences Episodic Video Task (SSEVT), and Addenbrooke’s Cognitive Examination-III (ACE-III) across education levels.
In a significantly predictive model of ACE-III [F(2,349) = 14.33, AdjR2 = 0.071, p = 1e-6], education was more positively predictive (β = 0.212, p < 0.00009), than age was negatively predictive (β = -0.126, p = 0.02), of ACE-III scores.
This pattern was different to that for 4MT [F(2,372) = 51.54, AdjR2 = 0.211, p = 2e-20], where age was strongly negatively predictive (β = −0.412, p = 6e-19), and education weakly positively predictive (β = 0.118, p = 0.016), of 4MT scores.
As regards our novel SSEVT [F(2,355) = 37.71, AdjR2 = 0.171, p = 1e-15], age strongly negatively predicted (β = −0.412, p = 5e-15), while education did not predict (β = 0.02, p = 0.67), SSEVT scores.
We note two points in summary. The first point is merely confirmatory of our age-related analyses above (see section “3.2 Age-related decline in test performance was more pronounced in the 4MT and SSEVT than the ACE-III,” Figure 5); that similarly large age-related reductions in performance were seen for 4MT and SSEVT (both βs = −0.41), while a much smaller age-related effect was seen for ACE-III (β = −0.126). The second point is that the potential influence of education upon task performance varied across the three tasks. This influence was clearest for ACE-III (β = 0.212), intermediate for 4MT (β = 0.118), while there was no sign that education boosted SSEVT performance (β = 0.02, p = 0.67).
The lack of association of level of education with SSEVT scores reported here is consistent with our design aim that this task be minimally dependent upon education. However, we acknowledge sample limitations, and the need to replicate this result with a larger sample with a good variance in education levels across all ages.
3.4 Social embeddedness, loneliness, and life satisfaction cannot predict cognitive performance
In our sample, older people were not less socially embedded (N = 361, SNSEQ, r = 0.003, p = 0.950, rho = −0.019, p = 0.716) or more lonely (N = 315, ULS, r = −0.036, p = 0.524, rho = −0.035, p = 0.532), but were more satisfied with life (N = 313, r = 0.128, p = 0.023, rho = 0.118, p = 0.037). Social embeddedness (SNSEQ score) was not correlated with 4MT (N = 360, r = −0.016, p = 0.768, rho = 0.004, p = 0.938) or SSEVT performance (N = 357, r = −0.022, p = 0.677, rho = −0.014, p = 0.792), but was positively correlated with SAPQ scores (N = 289, r = 0.280, p = 1e-6, rho = 0.291, p = 5e-7) and (weakly) with ACE-III scores (N = 347, r = 0.106, p = 0.049, rho = 0.123, p = 0.022). Neither the SWLS nor ULS were predictive of any measure of cognitive performance.
3.5 Spatial Abilities and Practices Questionnaire (SAPQ)
There were no differences between age-groups on overall SAPQ score, [F(2,137) = 0.961, p = 0.385, η2 = 0.006]. Age did not predict SAPQ scores in the whole sample (β = −0.083, p = 0.160), in men (β = 0.161, p = 0.075) or in women (β = 0.033, p = 0.671). However, when “change” scores (i.e., how much each rated ability had changed in the past year) were examined in isolation from overall SAPQ score, age was negatively correlated with reported change score (N = 242, r = −0.365, p = 4.9e-9, rho = −0.410, p = 3.1e-11). That is, younger people tended to report more improvements, and older people more declines, over time. The SAPQ was positively correlated with performance on the 4MT (N = 288, r = 0.199, p = 7e-4), (Figure 4D), and SSEVT (N = 285, r = 0.178, p = 0.002), but not ACE-III (N = 276, r = 0.114, p = 0.059). Exploratory analysis found positive correlations of SAPQ scores with SSEVT scores on Section 2 spatial context, (N = 285, r = 0.157, p = 0.008, rho = 0.144, p = 0.015) and Section 4 event order, (N = 285, r = 0.177, p = 0.003, rho = 0.183, p = 0.002) of the SSEVT, but these were weak on the other Sections (Sections 1, 2, and 5, all p > 0.277). Three separate multiple regressions were run with each test score, and age, as co-predictors of overall SAPQ score. Consistent with the opening results of this section, age never significantly predicted SAPQ score. Only 4MT scores (β = 0.206, p = 0.002), and SSEVT scores (β = 0.174, p = 0.009), but not ACE-III score (β = 0.103, p = 0.093, overall model not significant), could significantly predict overall SAPQ performance.
3.6 The 4MT and SSEVT (but not ACE-III) discriminated MCI patients from healthy aging controls
As proof of concept of the ability of 4MT and SSEVT to discriminate healthy from pathological aging (section “1.6 Study rationale and aims: key points” above), we compared a small group of MCI patients (n = 10) to a sample of healthy aging controls (n = 78), matched for age and education. For further details on the matching and group scores, please see Table 5. As intended, the MCI group did not differ from controls in age (t86 = 1.04, p = 0.321), sex (X2 = 0.023, p = 0.879), or education (t86 = 1.45, p = 0.175).

Table 5. Participant demographics and scores in the MCI (N = 10) and matched healthy control group (max N = 78) on the Four Mountains Test (4MT), Spaces and Sequences Episodic Video Task (SSEVT), Addenbrooke’s Cognitive Examination III (ACE-III), Spatial Abilities and Practices Questionnaire (SAPQ), and Social Networks and Social Embeddedness Questionnaire (SNSEQ).
As expected, 4MT scores were lower in MCI patients than healthy aging controls (t86 = 2.56, p = 0.026, Cohen’s d = 0.882, Figure 6A). Similarly, as expected, SSEVT scores were lower in MCI patients than these controls (t74 = 6.86, p = 3e-8, Cohen’s d = 1.397, Figure 6B). Exploratory section-by-section analysis is shown in Table 6. The sections with the largest relative impairments in MCI were: Section 1 (character-associations, Cohen’s d = 1.216); section 5 (room order, Cohen’s d = 0.694), and Section 4 (event order, Cohen’s d = 0.511), but only that for Section 1 was statistically significant (1: t74 = 3.77, p = 0.004; 5: t74 = 1.98, p = 0.082; 4: t74 = 1.33, p = 0.223). An appreciably larger sample is needed for reliable section-by-section analysis. Given the limited MCI sample available for the SSEVT (N = 7), we compared these results with a more closely-matched control sample (N = 38: MCI vs HA: age (t43 = 1.20, p = 0.266), sex (X2 = 0.09, p = 0.933), education (t43 = 0.61, p = 0.558), which produced a similar result (t39 = 4.58, p = 5e-5, Cohen’s d = 1.172). ACE-III scores did not significantly differ between MCI patients and controls (t10 = 0.99, p = 0.348, Cohen’s d = 0.410, Figure 6C). This may partly reflect a lack of statistical power in this small sample.

Figure 6. In initial, proof-of-concept exploratory comparison, MCI patients perform significantly worse on the 4MT and SSEVT, but not on the ACE-III. Threshold plot showing 4MT scores for control group (left) and MCI patient group (right), and recommended cut-off associated with best between-group differentiation (score ≤ 8, thin horizontal black line, 80% sensitivity and 50% specificity.) (B) ROC plot for 4MT (AUC: 0.729, p = 0.005). (C) As for (A), threshold plot showing SSEVT scores for both groups, and recommended cut-off associated with best between-group differentiation (score ≤ 5.5, 100% sensitivity and 63% specificity). (D) ROC plot for SSEVT (AUC: 0.835, p < 0.001). (E) Threshold plot for ACE-III scores, for score ≤ 88 [a recommended cutoff for MCI detection in Beishon et al. (2019)], associated with 30% sensitivity and 69% specificity. (F) ROC plot for ACE-III, which did not deviate from chance classification (AUC: 0.560, p = 0.565).

Table 6. Participant scores in the mild cognitive impairment (MCI) (N = 7) and healthy control group [N = 69 for Spaces and Sequences Episodic Video Task (SSEVT)] on the sections of the SSEVT.
The Four Mountains Task performance differentiated the MCI patients and controls with an AUC (ROC, DeLong et al., 1988) of 0.729 (z = 2.787, p = 0.005); a 4MT score of ≤ 8 was associated with 80% sensitivity and 50% specificity (Figures 6A, B). For the SSEVT, the AUC was 0.835, (z = 7.157, p < 0.001); SSEVT scores of ≤ 5.5 were associated with 100% sensitivity and 63% specificity (Figures 6C, D). In contrast, ACE-III test performance did not achieve better-than-chance discrimination (AUC = 0.560, z = 0.575, p = 0.565) (Figures 6E, F).
In summary, our findings provide some preliminary, proof-of-concept evidence that 4MT and SSEVT are useful in discriminating pathological from healthy cognitive aging. Further, since most MCI patients typically progress to a diagnosis of AD, these tests may be useful in assessing early stages of AD.
3.7 MCI patients vs. healthy aging controls: questionnaire results
Mild cognitive impairment patient questionnaire data were available for the measures of social embedding (SSNEQ), and spatial abilities and practices (SAPQ). MCI patients were less socially embedded than their healthy aging controls (t78 = 2.47, p = 0.028). This is potentially consistent with social isolation as a risk factor for dementia (see section “2.2 Tests and questionnaires” above). However, caution is clearly merited: (1) the sample size of the patients is small (n = 10); (2) in this cross-sectional study, the lower social embedding at the MCI diagnosis stage may reflect a compensatory response to cognitive decline, rather than a risk factor. Neither overall SAPQ scores (t68 = 1.18, p = 0.267) nor SAPQ change scores (t35 = 0.43, p = 0.673) differed between MCI patients and controls.
4 Discussion
4.1 The SSEVT and 4MT were sensitive to aging
In the same participants, age-related decline in performance was greater on the tasks tapping spatial and episodic memory (4MT: allocentric spatial memory; SSEVT, spatial and episodic/sequence memory) than on the ACE-III. This is consistent with: (a) ACE-III being a composite test of diverse cognitive functions, including linguistic ability and semantic memory, which are relatively well-preserved in aging; (b) SSEVT and 4MT tapping hippocampal-dependent cognition, which would be expected, given hippocampal age-related neurodegeneration, to decline more severely with age.
Accordingly, SSEVT and 4MT may be useful tasks in assessments of cognitive aging. These could include correlative and intervention-type studies examining the potential “anti-aging” effects of factors such as exercise, stress reduction, good sleep, and good diet which are expected to impact upon hippocampal tissue and hippocampal-dependent cognition. One advantage of both these tasks is that they both use forced-choice question formats, and straightforward scoring, thus potentially enabling automatic and/or remote testing. In so much as both the 4MT and SSEVT test spatial memory, the marked age-related diminution in performance on these tasks is consistent with the age-associated decline in use of spatial strategies reported in a very large scale study of a mobile app-based video game (“Sea Hero Quest”) (West et al., 2023).
In contrast to these objective tests tapping spatial memory ability (4MT: allocentric spatial memory; SSEVT: spatial and episodic/sequence memory), our novel questionnaire assessing “everyday” spatial abilities (SAPQ) was not very sensitive to age. Change over the last year, (the young tending to improve, the old tending to worsen), rather than absolute score, was correlated with age.
4.2 Education positively predicted performance on ACE-III and 4MT, but not SSEVT
We found that education levels predicted performance on ACE-III and 4MT in a positive manner, (the higher the education level, the higher the scores), but did not predict SSEVT scores. The finding that education level did not predict SSEVT scores aligns with our intention for SSEVT to be minimally dependent upon education-based expertise. We intended that the SSEVT, including its instructions and questions, should be simple to understand, and that success on the SSEVT should be resistant to education-reliant strategies. This non-association with education is thus encouraging, but we should acknowledge a caveat. In our sample, older people tended to be less educated than our young and middle-aged groups. It will be important to replicate the non-association of education with SSEVT scores in a larger sample with a good variance in education levels across all ages.
A task that is relatively easy to understand, and where education cannot confer any additional benefits to task performance (e.g., in verbalizing events) would be advantageous, since the need to take account of education is a complicating factor in tests of cognitive aging and AD diagnostics. To give just one example relevant here, one study used five screening tests for diagnosing AD, and found that with all of them, but especially the ACE-III, diagnostic accuracy for AD was higher in patients with full primary education than in the less educated group (Matias-Guiu et al., 2017).
4.3 The 4MT and SSEVT discriminated MCI from healthy aging
The 4MT and SSEVT were also designed with a view to detecting early stages of pathological aging, such as in Alzheimer’s disease. Testing spatial and/or sequence memory has also been used by other groups with a view to detecting (early) Alzheimer’s disease (e.g., Bellassen et al., 2012; Tu et al., 2017; Coughlan et al., 2019; Howett et al., 2019; Park and Lee, 2021; Schöberl et al., 2020; Coughlan et al., 2023).
As an initial proof-of-concept test of this idea, we examined MCI-vs.-HA discrimination in three cognitive tests: the 4MT, SSEVT, and ACE-III. Consistent with previous evidence for such detection (e.g., Moodley et al., 2015; Wood et al., 2016), the 4MT did discriminate the MCI and HA groups. We also found that our novel task, the SSEVT, did discriminate the MCI and HA groups. In contrast, the ACE-III did not significantly discriminate the MCI and HA groups. ACE-III was likely not designed to detect specifically early-AD pathology (rhinal cortices, hippocampus), and measures some cognitive domains (attention, verbal fluency, language) generally impacted only later in AD disease progression (Grady et al., 1988; Irwin et al., 2018), and may not accurately capture MCI-specific impairments (Beishon et al., 2019). While our results are preliminary, they provide some initial proof-of-concept encouragement for the possibility of using the 4MT (further to that summarized in see section “1.5 Could tests tapping the hippocampus fill the gap?”) and SSEVT as screening tools for pathological aging including that due to early AD.
4.4 Social embedding
In this sample, MCI patients were less socially embedded (SNSEQ) than their age-matched controls. Our small MCI sample size, cross-sectional design, and opportunistic sampling prohibits any conclusive inferences. We simply note here that this result is potentially consistent with previous research suggesting that social isolation is a risk factor for cognitive deterioration [sections 2.2.5/–6 see also (e.g., Pereda-Pérez et al., 2013; Wang et al., 2019; Drinkwater et al., 2022)], but also that social isolation can be a consequence of impaired cognition, (e.g., Hultsch et al., 1999). In the healthy sample, the degree of social embedding was not correlated with 4MT or SSEVT performance, and only weakly positively correlated with ACE-III performance (r = 0.106, rho = 0.123). This issue should be investigated in larger, older-biased samples, with recruitment including more socially-isolated individuals than was possible here.
3.4 Limitations and future directions
While middle-aged people were under-represented in our sample, the sample size in the first part of this study is likely sufficient for some confidence in our findings of much greater age-related decline in performance on the 4MT and SSEVT than on the ACE-III. Effect sizes were fairly large for 4MT and SSEVT, and much smaller for ACE-III. However, as we have acknowledged above, the differing patterns of association between task performance and education, namely small effect sizes for ACE-III and 4MT, and no effect for SSEVT, warrant further study in a sample with greater variance in education at all ages.
As we outlined in see section “1.4 Some limitations of existing screening tools for AD and cognitive aging” above, our aims in exploring 4MT and SSEVT were to overcome some of the limitations associated with verbal episodic memory tasks, which lack ecological validity, can be complex to score, and strongly modulated by factors such as sex and verbal intelligence. Nevertheless, a test of verbal episodic memory could have provided a useful additional comparator task in the present study by which to evaluate the success of 4MT and SSEVT. We aim to address this in future work.
The rather small patient sample, defined without necessary accompanying biomarkers according to the Albert et al. (2011) criteria, in this study’s second part, is the present study’s main limitation, limiting our findings to being initial proof-of-concept data. Nevertheless, it seems encouraging that the SSEVT may have value for future examination of both modulators of healthy aging, and the discrimination of healthy from pathological cognitive aging. We acknowledge the limitation that our healthy aging sample was largely White British, and our MCI sample entirely White British. It would be good to replicate the results obtained here with different ethnocultural samples.
This is the first report on the use of the SSEVT. We envisage potential improvements and adaptations can be made to its use. Although the SSEVT does not directly require facial recognition, participants will benefit from the ability to recognize faces. Color blindness could also potentially impact performance negatively. This could be addressed in future work by the addition of pre-test screens for prosopagnosia (face recognition) and color blindness. To suit particular study purposes, the number of presentations can be increased, and the length of the delay between presentation and test can be lengthened. In ongoing work examining consolidation for instance, we have used delays of 30 min and 2 weeks after three presentations of the SSEVT video, with separate sets of questions for the 30 min and the 2 weeks delay.
4.6 Conclusion
The Four Mountains Test and SSEVT are highly sensitive to a diminution in cognition associated with advancing age, and detectable at middle age. Since 4MT and SSEVT were designed to test hippocampal-dependent cognition, we interpret this age-related diminution as attributable (at least partly) to hippocampal atrophy being prominent in aging and present at middle age. ACE-III, a composite test of diverse cognitive functions, was much less age-modulated than 4MT and SSEVT. The degree to which education appeared to boost test performance was variable across the tasks: moderate for ACE-III, weak for 4MT, and minimal/absent for SSEVT. The seemingly minimal modulation of SSEVT scores by education and sex may confer extra value for its potential use in testing factors promoting healthy aging, and in discriminating healthy from pathological cognitive aging.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
Ethical Approval was given by the Psychology Ethics Sub-Comittee of Durham University and NHS Health Research Authority Approval: IRAS project ID 207985, REC reference 16/NE/0336. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
GM-B: Formal Analysis, Investigation, Project administration, Resources, Writing – original draft, Writing – review and editing. MB: Data curation, Formal Analysis, Investigation, Visualization, Writing – original draft, Writing – review and editing. BF: Investigation, Writing – review and editing. SK: Resources, Software, Writing – review and editing. SS: Methodology, Resources, Writing – review and editing. CW: Methodology, Resources, Writing – review and editing. AH: Formal Analysis, Visualization, Writing – review and editing. CA: Investigation, review and editing. ST: Resources, Writing – review and editing. WN: Resources, Writing – review and editing. DH: Resources, Writing – review and editing. SD-S: Resources, Writing – review and editing. DC: Resources, Writing – review and editing. JD: Funding acquisition, Methodology, Resources, Supervision, Writing – review and editing. SP: Conceptualization, Formal Analysis, Funding acquisition, Visualization, Writing – review and editing. SE: Funding acquisition, Methodology, Project administration, Supervision, Writing – review and editing, Conceptualization. CL: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Project administration, Resources, Supervision, Writing – original draft, Writing – review and editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. We gratefully acknowledge: a joint PhD studentship to GM-B from James Cook University Hospital Voluntary Services Council and Durham University Psychology Department; BBSRC grant (BB/T014768/1) to CL (PI) and SP (Co-I); Alzheimer’s Society United Kingdom Fellowship (AS-JF-15-008) to JD (PI) and CL (Co-I Sponsor).
Acknowledgments
We thank Donald Brechin (Neuropsychology dept, James Cook University Hospital) for encouragement and help with the James Cook University Hospital Voluntary Services Council studentship to GM-B. We thank the following Durham University Undergraduate students who contributed to data collection: Helena Tessmann, Danny Deng, Thomas O’Farrell, Aiofe Talbot, Catherine Jones, Abigail Tighe, Maja Kobylak, Goretti Hurtado Barbeito, Ana Martinez Bailey, Sarah Darwish, Natalie Maier, Mingyao Xiong, Annisha Attanayake, Charlotte Cooper, Ffion Bennett, Abbie Henry, and Eleanor Roberts.
Conflict of interest
SK was employed by Ascentys Limited. SE is a director of Psychle.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnagi.2025.1547834/full#supplementary-material
References
Aghjayan, S., Lesnovskaya, A., Esteban-Cornejo, I., Peven, J., Stillman, C., and Erickson, K. (2021). Aerobic exercise, cardiorespiratory fitness, and the human hippocampus. Hippocampus 31, 817–844. doi: 10.1002/hipo.23337
Aisen, P., Touchon, J., Amariglio, R., Andrieu, S., Bateman, R., Breitner, J., et al. (2017). EU/US/CTAD task force: Lessons learned from recent and current Alzheimer’s prevention trials. J Prev Alzheimers Dis. 4, 116–124. doi: 10.14283/jpad.2017.13
Albert, M., DeKosky, S., Dickson, D., Dubois, B., Feldman, H., Fox, N., et al. (2011). The diagnosis of mild cognitive impairment due to Alzheimer’s disease: Recommendations from the national institute on aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement. 7, 270–279. doi: 10.1016/j.jalz.2011.03.008
Alzheimer’s Association (2021). Alzheimer’s disease facts and figures. Alzheimers Dement 17, 3708–3821. doi: 10.1002/alz.13809
Arevalo-Rodriguez, I., Smailagic, N., Roqué, I., Figuls, M., Ciapponi, A., Sanchez-Perez, E., et al. (2015). Mini-mental state examination (MMSE) for the detection of Alzheimer’s disease and other dementias in people with mild cognitive impairment (MCI). Cochrane Database Syst. Rev. 2015:CD010783. doi: 10.1002/14651858.CD010783.pub2
Bäckman, L., Small, B., and Fratiglioni, L. (2001). Stability of the preclinical episodic memory deficit in Alzheimer’s disease. Brain 124, 96–102. doi: 10.1093/brain/124.1.96
Bédard, M., Felteau, M., Mazmanian, D., Fedyk, K., Klein, R., Richardson, J., et al. (2003). Pilot evaluation of a mindfulness-based intervention to improve quality of life among individuals who sustained traumatic brain injuries. Disabil. Rehabil. 25, 722–731. doi: 10.1080/0963828031000090489
Beishon, L., Batterham, A., Quinn, T., Nelson, C., Panerai, R., Robinson, T., et al. (2019). Addenbrooke’s cognitive examination III (ACE-III) and mini-ACE for the detection of dementia and mild cognitive impairment. Cochrane Database Syst. Rev. 12:CD013282. doi: 10.1002/14651858.CD013282.pub2
Bellassen, V., Iglói, K., de Souza, L., Dubois, B., and Rondi-Reig, L. (2012). Temporal order memory assessed during spatiotemporal navigation as a behavioral cognitive marker for differential Alzheimer’s disease diagnosis. J. Neurosci. 32, 1942–1952. doi: 10.1523/JNEUROSCI.4556-11.2012
Bennett, D., Schneider, J., Tang, Y., Arnold, S., and Wilson, R. (2006). The effect of social networks on the relation between Alzheimer’s disease pathology and level of cognitive function in old people: A longitudinal cohort study. Lancet Neurol. 5, 406–412. doi: 10.1016/S1474-4422(06)70417-3
Bhargava, Y., Kottapalli, A., and Baths, V. (2024). Validation and comparison of virtual reality and 3D mobile games for cognitive assessment against ACE-III in 82 young participants. Sci. Rep. 14:23918. doi: 10.1038/s41598-024-75065-1
Bird, C., and Burgess, N. (2008). The hippocampus and memory: Insights from spatial processing. Nat. Rev. Neurosci. 9, 182–194. doi: 10.1038/nrn2335
Bird, C., Chan, D., Hartley, T., Pijnenburg, Y., Rossor, M., and Burgess, N. (2010). Topographical short-term memory differentiates Alzheimer’s disease from frontotemporal lobar degeneration. Hippocampus 20, 1154–1169. doi: 10.1002/hipo.20715
Blumenau, J., and Broom, Y. (2011). Performance of South African adolescents on two versions of the rey auditory verbal learning test. South Afr. J. Psychol. 41, 228–238. doi: 10.1177/008124631104100
Bolla-Wilson, K., and Bleecker, M. L. (1986). Influence of verbal intelligence, sex, age, and education on the rey auditory verbal learning test. Dev. Neuropsychol. 2, 203–211. doi: 10.1080/87565648609540342
Braak, H., and Braak, E. V. A. (1995). Staging of Alzheimer’s disease-related neurofibrillary changes. Neurobiol. Aging 16, 271–278. doi: 10.1016/0197-4580(95)00021-6
Braak, H., and Del Tredici, K. (2015). The preclinical phase of the pathological process underlying sporadic Alzheimer’s disease. Brain 138, 2814–2833. doi: 10.1093/brain/awv236
Braak, H., Braak, E., and Bohl, J. (1993). Staging of Alzheimer-related cortical destruction. Eur. Neurol. 33, 403–408. doi: 10.1159/000116984
Branchi, I., D’Andrea, I., Fiore, M., Di Fausto, V., Aloe, L., and Alleva, E. (2006). Early social enrichment shapes social behavior and nerve growth factor and brain-derived neurotrophic factor levels in the adult mouse brain. Biol. Psychiatry 60, 690–696. doi: 10.1016/j.biopsych.2006.01.005
Broadbent, N., Squire, L., and Clark, R. (2004). Spatial memory, recognition memory, and the hippocampus. Proc. Natl. Acad. Sci. U.S.A. 101, 14515–14520. doi: 10.1073/pnas.0406344101
Brown, M., and Aggleton, J. (2001). Recognition memory: What are the roles of the perirhinal cortex and hippocampus? Nat. Rev. Neurosci. 2, 51–61. doi: 10.1038/35049064
Brown, S., Livingston, G., and Mukadam, N. (2021). A national memory clinic survey to assess provision for people from diverse ethnic backgrounds in England and wales. Int. J. Environ. Res. Public Health 18:1456. doi: 10.3390/ijerph18041456
Bruno, D., and Schurmann Vignaga, S. (2019). Addenbrooke’s cognitive examination III in the diagnosis of dementia: A critical review. Neuropsychiatr. Dis. Treat. 15, 441–447. doi: 10.2147/NDT.S151253
Burgess, N., Maguire, E., and O’Keefe, J. (2002). The human hippocampus and spatial and episodic memory. Neuron 35, 625–641. doi: 10.1016/s0896-6273(02)00830-9
Buzsáki, G., and Moser, E. (2013). Memory, navigation and theta rhythm in the hippocampal-entorhinal system. Nat. Neurosci. 16, 130–138. doi: 10.1038/nn.3304
Caamaño-Isorna, F., Corral, M., Montes-Martínez, A., and Takkouche, B. (2006). Education and dementia: A meta-analytic study. Neuroepidemiology 26, 226–232. doi: 10.1159/000093378
Cansino, S. (2009). Episodic memory decay along the adult lifespan: A review of behavioral and neurophysiological evidence. Int. J. Psychophysiol. 71, 64–69. doi: 10.1016/j.ijpsycho.2008.07.005
Cass, S. (2017). Alzheimer’s disease and exercise: A literature review. Curr. Sports Med. Rep. 16, 19–22. doi: 10.1249/JSR.0000000000000332
Chan, D., Gallaher, L., Moodley, K., Minati, L., Burgess, N., and Hartley, T. (2016). The 4 Mountains test: A short test of spatial memory with high sensitivity for the diagnosis of pre-dementia Alzheimer’s disease. J. Vis. Exp. 116:54454. doi: 10.3791/54454
Coe, C., Mendoza, S., Smotherman, W., and Levine, S. (1978). Mother-infant attachment in the squirrel monkey: Adrenal response to separation. Behav. Biol. 22, 256–263. doi: 10.1016/s0091-6773(78)92305-2
Cornwell, B., Johnson, L., Holroyd, T., Carver, F., and Grillon, C. (2008). Human hippocampal and parahippocampal theta during goal-directed spatial navigation predicts performance on a virtual Morris water maze. J. Neurosci. 28, 5983–5990. doi: 10.1523/JNEUROSCI.5001-07.2008
Coughlan, G., Coutrot, A., Khondoker, M., Minihane, A., Spiers, H., and Hornberger, M. (2019). Toward personalized cognitive diagnostics of at-genetic-risk Alzheimer’s disease. Proc. Natl. Acad. Sci. U.S.A. 116, 9285–9292. doi: 10.1073/pnas.1901600116
Coughlan, G., DeSouza, B., Zhukovsky, P., Hornberger, M., Grady, C., Buckley, R., et al. (2023). Spatial cognition is associated with levels of phosphorylated-tau and β-amyloid in clinically normal older adults. Neurobiol. Aging 130, 124–134. doi: 10.1016/j.neurobiolaging.2023.06.016
Davitz, J., and Mason, D. (1955). Socially facilitated reduction of a fear response in rats. J. Comp. Physiol. Psychol. 48, 149–151. doi: 10.1037/h0046411
DeLong, E. R., DeLong, D. M., and Clarke-Pearson, D. L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 44, 837–845. doi: 10.2307/2531595
Di Paola, M., Macaluso, E., Carlesimo, G., Tomaiuolo, F., Worsley, K., Fadda, L., et al. (2007). Episodic memory impairment in patients with Alzheimer’s disease is correlated with entorhinal cortex atrophy. A voxel-based morphometry study. J. Neurol. 254, 774–781. doi: 10.1007/s00415-006-0435-1
Diana, R., Yonelinas, A., and Ranganath, C. (2007). Imaging recollection and familiarity in the medial temporal lobe: A three-component model. Trends Cogn. Sci. 11, 379–386. doi: 10.1016/j.tics.2007.08.001
Diener, E., Emmons, R., Larsen, R., and Griffin, S. (1985). The satisfaction with life scale. J. Pers. Assess. 49, 71–75. doi: 10.1207/s15327752jpa4901_13
Drinkwater, E., Davies, C., and Spires-Jones, T. (2022). Potential neurobiological links between social isolation and Alzheimer’s disease risk. Eur. J. Neurosci. 56, 5397–5412. doi: 10.1111/ejn.15373
Duzel, E., van Praag, H., and Sendtner, M. (2016). Can physical exercise in old age improve memory and hippocampal function? Brain 139, 662–673. doi: 10.1093/brain/awv407
Düzel, S., Drewelies, J., Gerstorf, D., Demuth, I., Steinhagen-Thiessen, E., Lindenberger, U., et al. (2019). Structural brain correlates of loneliness among older adults. Sci. Rep. 9:13569. doi: 10.1038/s41598-019-49888-2
Edwards, G., Gamez, N., Escobedo, G., Calderon, O., and Moreno-Gonzalez, I. (2019). Modifiable risk factors for Alzheimer’s disease. Front. Aging Neurosci. 11:146. doi: 10.3389/fnagi.2019.00146
Ekstrom, A., Kahana, M., Caplan, J., Fields, T., Isham, E., Newman, E., et al. (2003). Cellular networks underlying human spatial navigation. Nature 425, 184–188. doi: 10.1038/nature01964
Erickson, K., Voss, M., Prakash, R., Basak, C., Szabo, A., Chaddock, L., et al. (2011). Exercise training increases size of hippocampus and improves memory. Proc. Natl. Acad. Sci. U.S.A. 108, 3017–3022. doi: 10.1073/pnas.1015950108
Farias, S., Mungas, D., and Jagust, W. (2005). Degree of discrepancy between self and other-reported everyday functioning by cognitive status: Dementia, mild cognitive impairment, and healthy elders. Int. J. Geriatr. Psychiatry 20, 827–834. doi: 10.1002/gps.1367
Fjell, A., Sørensen, Ø, Amlien, I. K., Bartrés-Faz, D., Bros, D. M., Buchmann, N., et al. (2020). Self-reported sleep relates to hippocampal atrophy across the adult lifespan: Results from the Lifebrain consortium. Sleep 43:zsz280. doi: 10.1093/sleep/zsz280
Folstein, M., Folstein, S., and McHugh, P. (1975). “Mini-mental state”. A practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 12, 189–198. doi: 10.1016/0022-3956(75)90026-6
Fortin, N., Agster, K., and Eichenbaum, H. (2002). Critical role of the hippocampus in memory for sequences of events. Nat. Neurosci. 5, 458–462. doi: 10.1038/nn834
Fratiglioni, L., Paillard-Borg, S., and Winblad, B. (2004). An active and socially integrated lifestyle in late life might protect against dementia. Lancet Neurol. 3, 343–353. doi: 10.1016/S1474-4422(04)00767-7
Frisoni, G., Molinuevo, J., Altomare, D., Carrera, E., Barkhof, F., Berkhof, J., et al. (2020). Precision prevention of Alzheimer’s and other dementias: Anticipating future needs in the control of risk factors and implementation of disease-modifying therapies. Alzheimers Dement. 16, 1457–1468. doi: 10.1002/alz.12132
Ghoshal, N., García-Sierra, F., Wuu, J., Leurgans, S., Bennett, D., Berry, R., et al. (2002). Tau conformational changes correspond to impairments of episodic memory in mild cognitive impairment and Alzheimer’s disease. Exp. Neurol. 177, 475–493. doi: 10.1006/exnr.2002.8014
Gilbert, P., and Kesner, R. (2002). Role of the rodent hippocampus in paired-associate learning involving associations between a stimulus and a spatial location. Behav. Neurosci. 116, 63–71. doi: 10.1037//0735-7044.116.1.63
Giles, L., Anstey, K., Walker, R., and Luszcz, M. (2012). Social networks and memory over 15 years of followup in a cohort of older Australians: Results from the Australian longitudinal study of ageing. J. Aging Res. 2012:856048. doi: 10.1155/2012/856048
Glymour, M., Tzourio, C., and Dufouil, C. (2012). Is cognitive aging predicted by one’s own or one’s parents’ educational level? results from the three-city study. Am J Epidemiol. 175, 750–759. doi: 10.1093/aje/kwr509
Gómez-Isla, T., Price, J., McKeel, D., Morris, J., Growdon, J., and Hyman, B. (1996). Profound loss of layer II entorhinal cortex neurons occurs in very mild Alzheimer’s disease. J. Neurosci. 16, 4491–4500. doi: 10.1523/JNEUROSCI.16-14-04491.1996
Goveas, J., Rapp, S., Hogan, P., Driscoll, I., Tindle, H., Smith, J., et al. (2016). Predictors of optimal cognitive aging in 80+ women: The women’s health initiative memory study. J. Gerontol. A Biol. Sci. Med. Sci. 71, S62–S71. doi: 10.1093/gerona/glv055
Grady, C., Haxby, J., Horwitz, B., Sundaram, M., Berg, G., Schapiro, M., et al. (1988). Longitudinal study of the early neuropsychological and cerebral metabolic changes in dementia of the Alzheimer type. J. Clin. Exp. Neuropsychol. 10, 576–596. doi: 10.1080/01688638808402796
Greenaway, M., Lacritz, L., Binegar, D., Weiner, M., Lipton, A., and Munro Cullum, C. (2006). Patterns of verbal memory performance in mild cognitive impairment, Alzheimer disease, and normal aging. Cogn. Behav. Neurol. 19, 79–84. doi: 10.1097/01.wnn.0000208290.57370.a3
Grober, E., Hall, C., Lipton, R., Zonderman, A., Resnick, S., and Kawas, C. (2008). Memory impairment, executive dysfunction, and intellectual decline in preclinical Alzheimer’s disease. J. Int. Neuropsychol. Soc. 14, 266–278. doi: 10.1017/S1355617708080302
Hafting, T., Fyhn, M., Molden, S., Moser, M., and Moser, E. (2005). Microstructure of a spatial map in the entorhinal cortex. Nature 436, 801–806. doi: 10.1038/nature03721
Hardy, J., and Higgins, G. (1992). Alzheimer’s disease: The amyloid cascade hypothesis. Science 256, 184–185. doi: 10.1126/science.1566067
Hartley, T., and Harlow, R. (2012). An association between human hippocampal volume and topographical memory in healthy young adults. Front. Hum. Neurosci. 6:338. doi: 10.3389/fnhum.2012.00338
Hartley, T., Bird, C., Chan, D., Cipolotti, L., Husain, M., Vargha-Khadem, F., et al. (2007). The hippocampus is required for short-term topographical memory in humans. Hippocampus 17, 34–48. doi: 10.1002/hipo.20240
Heo, S., Prakash, R., Voss, M., Erickson, K., Ouyang, C., Sutton, B., et al. (2010). Resting hippocampal blood flow, spatial memory and aging. Brain Res. 1315, 119–127. doi: 10.1016/j.brainres.2009.12.020
Herlitz, A., and Rehnman, J. (2008). Sex differences in episodic memory. Curr. Dir. Psychol. Sci. 17, 52–56. doi: 10.3389/fpsyg.2020.00613
Holtzman, R., Rebok, G., Saczynski, J., Kouzis, A., Wilcox Doyle, K., and Eaton, W. (2004). Social network characteristics and cognition in middle-aged and older adults. J. Gerontol. B Psychol. Sci. Soc. Sci. 59, 278–284. doi: 10.1093/geronb/59.6.p278
Horning, S., Melrose, R., and Sultzer, D. (2014). Insight in Alzheimer’s disease and its relation to psychiatric and behavioral disturbances. Int. J. Geriatr. Psychiatry 29, 77–84. doi: 10.1002/gps.3972
Howett, D., Castegnaro, A., Krzywicka, K., Hagman, J., Marchment, D., Henson, R., et al. (2019). Differentiation of mild cognitive impairment using an entorhinal cortex-based test of virtual reality navigation. Brain 142, 1751–1766. doi: 10.1093/brain/awz116
Høydal, ØA., Skytøen, E. R., Andersson, S. O., Moser, M. B., and Moser, E. I. (2019). Object-vector coding in the medial entorhinal cortex. Nature 568, 400–404. doi: 10.1038/s41586-019-1077-7
Hsieh, S., Schubert, S., Hoon, C., Mioshi, E., and Hodges, J. (2013). Validation of the Addenbrooke’s cognitive examination III in frontotemporal dementia and Alzheimer’s disease. Dement. Geriatr. Cogn. Disord. 36, 242–250. doi: 10.1159/000351671
Hultsch, D. F., Hertzog, C., Small, B. J., and Dixon, R. A. (1999). Use it or lose it: Engaged lifestyle as a buffer of cognitive decline in aging? Psychol. Aging 14:245. doi: 10.1037//0882-7974.14.2.245
Irwin, K., Sexton, C., Daniel, T., Lawlor, B., and Naci, L. (2018). Healthy aging and dementia: Two roads diverging in midlife? Front. Aging Neurosci. 10:275. doi: 10.3389/fnagi.2018.00275
Jacobs, J., Weidemann, C., Miller, J., Solway, A., Burke, J., Wei, X., et al. (2013). Direct recordings of grid-like neuronal activity in human spatial navigation. Nat. Neurosci. 16, 1188–1190. doi: 10.1038/nn.3466
Jubb, M., and Evans, J. (2015). An investigation of the utility of the addenbrooke’s cognitive examination III in the early detection of dementia in memory clinic patients aged over 75 years. Dement. Geriatr. Cogn. Disord. 40, 222–232. doi: 10.1159/000433522
Kawano, N., Umegaki, H., Suzuki, Y., Yamamoto, S., Mogi, N., and Iguchi, A. (2010). Effects of educational background on verbal fluency task performance in older adults with Alzheimer’s disease and mild cognitive impairment. Int. Psychogeriatr. 22, 995–1002. doi: 10.1017/S1041610210000839
Kessels, R., de Haan, E., Kappelle, L., and Postma, A. (2001). Varieties of human spatial memory: A meta-analysis on the effects of hippocampal lesions. Brain Res. Brain Res. Rev. 35, 295–303. doi: 10.1016/s0165-0173(01)00058-3
Kumaran, D., and Maguire, E. (2006). The dynamics of hippocampal activation during encoding of overlapping sequences. Neuron 49, 617–629. doi: 10.1016/j.neuron.2005.12.024
Lara, E., Caballero, F., Rico-Uribe, L., Olaya, B., Haro, J., Ayuso-Mateos, J., et al. (2019a). Are loneliness and social isolation associated with cognitive decline? Int. J. Geriatr. Psychiatry 34, 1613–1622. doi: 10.1002/gps.5174
Lara, E., Martín-María, N., De la Torre-Luque, A., Koyanagi, A., Vancampfort, D., Izquierdo, A., et al. (2019b). Does loneliness contribute to mild cognitive impairment and dementia? A systematic review and meta-analysis of longitudinal studies. Ageing Res. Rev. 52, 7–16. doi: 10.1016/j.arr.2019.03.002
Laske, C., Sohrabi, H., Frost, S., López-de-Ipiña, K., Garrard, P., Buscema, M., et al. (2015). Innovative diagnostic tools for early detection of Alzheimer’s disease. Alzheimers Dement. 11, 561–578. doi: 10.1016/j.jalz.2014.06.004
Lenehan, M., Summers, M., Saunders, N., Summers, J., and Vickers, J. (2015). Relationship between education and age-related cognitive decline: A review of recent research. Psychogeriatrics 15, 154–162. doi: 10.1111/psyg.12083
Lever, C., Burton, S., Jeewajee, A., O’Keefe, J., and Burgess, N. (2009). Boundary vector cells in the subiculum of the hippocampal formation. J. Neurosci. 29, 9771–9777. doi: 10.1523/JNEUROSCI.1319-09.2009
Levine, B., Svoboda, E., Hay, J. F., Winocur, G., and Moscovitch, M. (2002). Aging and autobiographical memory: Dissociating episodic from semantic retrieval. Psychol. Aging 17, 677–689. doi: 10.1037/0882-7974.17.4.677
Li, A., and King, J. (2019). Spatial memory and navigation in ageing: A systematic review of MRI and fMRI studies in healthy participants. Neurosci. Biobehav. Rev. 103, 33–49. doi: 10.1016/j.neubiorev.2019.05.005
Lithfous, S., Dufour, A., and Després, O. (2013). Spatial navigation in normal aging and the prodromal stage of Alzheimer’s disease: Insights from imaging and behavioral studies. Ageing Res. Rev. 12, 201–213. doi: 10.1016/j.arr.2012.04.007
Liu, P., and Nusslock, R. (2018). Exercise-mediated neurogenesis in the hippocampus via BDNF. Front. Neurosci. 12:52. doi: 10.3389/fnins.2018.00052
Llewellyn, D., Lang, I., Langa, K., and Huppert, F. (2008). Cognitive function and psychological well-being: Findings from a population-based cohort. Age Ageing 37, 685–689. doi: 10.1093/ageing/afn194
Lombardi, G., Crescioli, G., Cavedo, E., Lucenteforte, E., Casazza, G., Bellatorre, A., et al. (2020). Structural magnetic resonance imaging for the early diagnosis of dementia due to Alzheimer’s disease in people with mild cognitive impairment. Cochrane Database Syst. Rev. 3:CD009628. doi: 10.1002/14651858.CD009628.pub2
Lou, F., Yang, G., Cai, L., Yu, L., Zhang, Y., Shi, C., et al. (2022). Effects of age, sex, and education on California verbal learning test-II performance in a Chinese-speaking population. Front. Psychol. 13:935875. doi: 10.3389/fpsyg.2022.935875
Matias-Guiu, J., Cortés-Martínez, A., Valles-Salgado, M., Rognoni, T., Fernández-Matarrubia, M., Moreno-Ramos, T., et al. (2017). Addenbrooke’s cognitive examination III: Diagnostic utility for mild cognitive impairment and dementia and correlation with standardized neuropsychological tests. Int. Psychogeriatr. 29, 105–113. doi: 10.1017/S1041610216001496
Meerlo, P., Mistlberger, R., Jacobs, B., Heller, H., and McGinty, D. (2009). New neurons in the adult brain: The role of sleep and consequences of sleep loss. Sleep Med. Rev. 13, 187–194. doi: 10.1016/j.smrv.2008.07.004
Migliorelli, R., Tesón, A., Sabe, L., Petracca, G., Petracchi, M., Leiguarda, R., et al. (1995). Anosognosia in Alzheimer’s disease: A study of associated factors. J. Neuropsychiatry Clin. Neurosci. 7, 338–344. doi: 10.1176/jnp.7.3.338
Moodley, K., Minati, L., Contarino, V., Prioni, S., Wood, R., Cooper, R., et al. (2015). Diagnostic differentiation of mild cognitive impairment due to Alzheimer’s disease using a hippocampus-dependent test of spatial memory. Hippocampus 25, 939–951. doi: 10.1002/hipo.22417
Morris, R., Garrud, P., Rawlins, J., and O’Keefe, J. (1982). Place navigation impaired in rats with hippocampal lesions. Nature 297, 681–683. doi: 10.1038/297681a0
Mortamais, M., Ash, J., Harrison, J., Kaye, J., Kramer, J., Randolph, C., et al. (2017). Detecting cognitive changes in preclinical Alzheimer’s disease: A review of its feasibility. Alzheimers Dement. 13, 468–492. doi: 10.1016/j.jalz.2016.06.2365
Murphy, K. J., Troyer, A. K., Levine, B., and Moscovitch, M. (2008). Episodic, but not semantic, autobiographical memory is reduced in amnestic mild cognitive impairment. Neuropsychologia 46, 3116–3123. doi: 10.1016/j.neuropsychologia.2008.07.004
Nasreddine, Z., Phillips, N., Bédirian, V., Charbonneau, S., Whitehead, V., Collin, I., et al. (2005). The Montreal cognitive assessment, MoCA: A brief screening tool for mild cognitive impairment. J. Am. Geriatr. Soc. 53, 695–699. doi: 10.1111/j.1532-5415.2005.53221.x
Nichols, E., and Vos, T. (2021). The estimation of the global prevalence of dementia from 1990-2019 and forecasted prevalence through 2050: An analysis for the global burden of disease (GBD) study 2019. Alzheimers Dement. 17:e051496. doi: 10.1002/alz.051496
Norton, S., Matthews, F., Barnes, D., Yaffe, K., and Brayne, C. (2014). Potential for primary prevention of Alzheimer’s disease: An analysis of population-based data. Lancet Neurol. 13, 788–794. doi: 10.1016/S1474-4422(14)70136-X
O’Keefe, J., and Nadel, L. (1978). The hippocampus as a cognitive map. Oxford: Oxford University Press.
O’Keefe, J., and Dostrovsky, J. (1971). The hippocampus as a spatial map. Preliminary evidence from unit activity in the freely-moving rat. Brain Res. 34, 171–175. doi: 10.1016/0006-8993(71)90358-1
Olaya, B., Bobak, M., Haro, J., and Demakakos, P. (2017). Trajectories of verbal episodic memory in middle-aged and older adults: Evidence from the English longitudinal study of ageing. J. Am. Geriatr. Soc. 65, 1274–1281. doi: 10.1111/jgs.14789
Pan, F., Wang, Y., Huang, L., Huang, Y., and Guo, Q. (2022). Validation of the Chinese version of Addenbrooke’s cognitive examination III for detecting mild cognitive impairment. Aging Ment. Health 26, 384–391. doi: 10.1080/13607863.2021.1881757
Park, J., and Lee, S. (2021). The fragility of temporal memory in Alzheimer’s disease. J. Alzheimers Dis. 79, 1631–1646. doi: 10.3233/JAD-200892
Pengas, G., Patterson, K., Arnold, R., Bird, C., Burgess, N., and Nestor, P. (2010). Lost and found: Bespoke memory testing for Alzheimer’s disease and semantic dementia. J. Alzheimers Dis. 21, 1347–1365. doi: 10.3233/jad-2010-100654
Pereda-Pérez, I., Popoviæ, N., Otalora, B., Popoviæ, M., Madrid, J., Rol, M., et al. (2013). Long-term social isolation in the adulthood results in CA1 shrinkage and cognitive impairment. Neurobiol. Learn. Mem. 106, 31–39. doi: 10.1016/j.nlm.2013.07.004
Piolino, P., Desgranges, B., Benali, K., and Eustache, F. (2002). Episodic and semantic remote autobiographical memory in ageing. Memory 10, 239–257. doi: 10.1080/09658210143000353
Plancher, G., Tirard, A., Gyselinck, V., Nicolas, S., and Piolino, P. (2012). Using virtual reality to characterize episodic memory profiles in amnestic mild cognitive impairment and Alzheimer’s disease: Influence of active and passive encoding. Neuropsychologia 50, 592–602. doi: 10.1016/j.neuropsychologia.2011.12.013
Potts, C., Richardson, J., Bond, R., Price, R., Mulvenna, M., Zvolsky, P., et al. (2022). Reliability of Addenbrooke’s Cognitive Examination III in differentiating between dementia, mild cognitive impairment and older adults who have not reported cognitive problems. Eur. J. Ageing 19, 495–507. doi: 10.1007/s10433-021-00652-4
Poulter, S., Hartley, T., and Lever, C. (2018). The neurobiology of mammalian navigation. Curr. Biol. 28, R1023–R1042. doi: 10.1016/j.cub.2018.05.050
Poulter, S., Lee, S., Dachtler, J., Wills, T., and Lever, C. (2021). Vector trace cells in the subiculum of the hippocampal formation. Nat. Neurosci. 24, 266–275. doi: 10.1038/s41593-020-00761-w
Ranganath, C., Yonelinas, A., Cohen, M., Dy, C., Tom, S., and D’Esposito, M. (2004). Dissociable correlates of recollection and familiarity within the medial temporal lobes. Neuropsychologia 42, 2–13. doi: 10.1016/j.neuropsychologia.2003.07.006
Ravikumar, S., Denning, A., Lim, S., Chung, E., Sadeghpour, N., Ittyerah, R., et al. (2024). Postmortem imaging reveals patterns of medial temporal lobe vulnerability to tau pathology in Alzheimer’s disease. Nat. Commun. 15:4803. doi: 10.1038/s41467-024-49205-0
Russell, D., Peplau, L., and Ferguson, M. (1978). Developing a measure of loneliness. J. Pers. Assess. 42, 290–294. doi: 10.1207/s15327752jpa4203_11
Scarmeas, N., Albert, S., Manly, J., and Stern, Y. (2006). Education and rates of cognitive decline in incident Alzheimer’s disease. J. Neurol. Neurosurg. Psychiatry 77, 308–316. doi: 10.1136/jnnp.2005.072306
Schinazi, V., Nardi, D., Newcombe, N., Shipley, T., and Epstein, R. (2013). Hippocampal size predicts rapid learning of a cognitive map in humans. Hippocampus 23, 515–528. doi: 10.1002/hipo.22111
Schöberl, F., Pradhan, C., Irving, S., Buerger, K., Xiong, G., Kugler, G., et al. (2020). Real-space navigation testing differentiates between amyloid-positive and -negative aMCI. Neurology 94, e861–e873. doi: 10.1212/WNL.0000000000008758
Scoville, W., and Milner, B. (1957). Loss of recent memory after bilateral hippocampal lesions. J. Neurol. Neurosurg. Psychiatry 20, 11–21. doi: 10.1136/jnnp.20.1.11
Sharp, E., and Gatz, M. (2011). Relationship between education and dementia: An updated systematic review. Alzheimer Dis. Assoc. Disord. 25, 289–304. doi: 10.1097/WAD.0b013e318211c83c
Sims, J., Zimmer, J., Evans, C., Lu, M., Ardayfio, P., Sparks, J., et al. (2023). Donanemab in early symptomatic Alzheimer disease: The TRAILBLAZER-ALZ 2 randomized clinical trial. JAMA 330, 512–527. doi: 10.1001/jama.2023.13239
Smith, B., Yao, X., Chen, K., and Kirby, E. D. (2018). A larger social network enhances novel object location memory and reduces hippocampal microgliosis in aged mice. Front. Aging Neurosci. 10:142. doi: 10.3389/fnagi.2018.00142
Starkstein, S., Sabe, L., Chemerinski, E., Jason, L., and Leiguarda, R. (1996). Two domains of anosognosia in Alzheimer’s disease. J. Neurol. Neurosurg. Psychiatry 61, 485–490. doi: 10.1136/jnnp.61.5.485
Stern, Y. (2012). Cognitive reserve in ageing and Alzheimer’s disease. Lancet Neurol. 11, 1006–1012. doi: 10.1016/S1474-4422(12)70191-6
Sundermann, E., Maki, P., Biegon, A., Lipton, R., Mielke, M., Machulda, M., et al. (2019). Sex-specific norms for verbal memory tests may improve diagnostic accuracy of amnestic MCI. Neurology 93, e1881–e1889. doi: 10.1212/WNL.0000000000008467
Sundström, A., Adolfsson, A., Nordin, M., and Adolfsson, R. (2020). loneliness increases the risk of all-cause dementia and Alzheimer’s disease. J. Gerontol. B Psychol. Sci. Soc. Sci. 75, 919–926. doi: 10.1093/geronb/gbz139
Swanwick, G., Coen, R., Maguire, C., Kirby, M., Walsh, J., O’Neill, D., et al. (1999). The association between demographic factors, disease severity and the duration of symptoms at clinical presentation in elderly people with dementia. Age Ageing 28, 295–299. doi: 10.1093/ageing/28.3.295
Taube, J., Muller, R., and Ranck, J. (1990). Head-direction cells recorded from the postsubiculum in freely moving rats. I. Description and quantitative analysis. J. Neurosci. 10, 420–435. doi: 10.1523/JNEUROSCI.10-02-00420.1990
Tu, S., Spiers, H., Hodges, J., Piguet, O., and Hornberger, M. (2017). Egocentric versus allocentric spatial memory in behavioral variant frontotemporal dementia and Alzheimer’s disease. J. Alzheimers Dis. 59, 883–892. doi: 10.3233/JAD-160592
Twamley, E., Ropacki, S., and Bondi, M. (2006). Neuropsychological and neuroimaging changes in preclinical Alzheimer’s disease. J. Int. Neuropsychol. Soc. 12, 707–735. doi: 10.1017/S1355617706060863
Vadikolias, K., Tsiakiri-Vatamidis, A., Tripsianis, G., Tsivgoulis, G., Ioannidis, P., Serdari, A., et al. (2012). Mild cognitive impairment: Effect of education on the verbal and nonverbal tasks performance decline. Brain Behav. 2, 620–627. doi: 10.1002/brb3.88
Van der Elst, W., van Boxtel, M., van Breukelen, G., and Jolles, J. (2005). Rey’s verbal learning test: Normative data for 1855 healthy participants aged 24-81 years and the influence of age, sex, education, and mode of presentation. J. Int. Neuropsychol. Soc. 11, 290–302. doi: 10.1017/S1355617705050344
Van Dyck, C., Swanson, C., Aisen, P., Bateman, R., Chen, C., Gee, M., et al. (2023). Lecanemab in early Alzheimer’s disease. N. Engl. J. Med. 388, 9–21. doi: 10.1056/NEJMoa2212948
Van Praag, H. (2008). Neurogenesis and exercise: Past and future directions. Neuromol. Med. 10, 128–140. doi: 10.1007/s12017-008-8028-z
Vargha-Khadem, F., Gadian, D., Watkins, K., Connelly, A., Van Paesschen, W., and Mishkin, M. (1997). Differential effects of early hippocampal pathology on episodic and semantic memory. Science 277, 376–380. doi: 10.1126/science.277.5324.376
Victor, C. (2021). Is loneliness a cause or consequence of dementia? A public health analysis of the literature. Front. Psychol. 11:612771. doi: 10.3389/fpsyg.2020.612771
Wang, B., Wu, Q., Lei, L., Sun, H., Michael, N., Zhang, X., et al. (2019). Long-term social isolation inhibits autophagy activation, induces postsynaptic dysfunctions and impairs spatial memory. Exp. Neurol. 311, 213–224. doi: 10.1016/j.expneurol.2018.09.009
Wang, L., Swank, J., Glick, I., Gado, M., Miller, M., Morris, J., et al. (2003). Changes in hippocampal volume and shape across time distinguish dementia of the Alzheimer type from healthy aging. Neuroimage 20, 667–682. doi: 10.1016/S1053-8119(03)00361-6
Weiss, E. M., Kemmler, G., Deisenhammer, E. A., Fleischhacker, W. W., and Delazer, M. (2003). Sex differences in cognitive functions. Pers. Individ. Differ. 35, 863–875. doi: 10.1016/S0191-8869(02)00288-X
West, G., Patai, Z., Coutrot, A., Hornberger, M., Bohbot, V., and Spiers, H. (2023). Landmark-dependent navigation strategy declines across the human life-span: Evidence from over 37,000 participants. J. Cogn. Neurosci. 35, 452–467. doi: 10.1162/jocn_a_01956
Whitwell, J., Przybelski, S., Weigand, S., Knopman, D., Boeve, B., Petersen, R., et al. (2007). 3D maps from multiple MRI illustrate changing atrophy patterns as subjects progress from mild cognitive impairment to Alzheimer’s disease. Brain 130, 1777–1786. doi: 10.1093/brain/awm112
Wilson, R., Li, Y., Aggarwal, N., Barnes, L., McCann, J., Gilley, D., et al. (2004). Education and the course of cognitive decline in Alzheimer disease. Neurology 63, 1198–1202. doi: 10.1212/01.wnl.0000140488.65299.53
Wilson, R., Yu, L., Lamar, M., Schneider, J., Boyle, P., and Bennett, D. (2019). Education and cognitive reserve in old age. Neurology 92, e1041–e1050. doi: 10.1212/WNL.0000000000007036
Wood, R., Moodley, K., Lever, C., Minati, L., and Chan, D. (2016). Allocentric spatial memory testing predicts conversion from mild cognitive impairment to dementia: An initial proof-of-concept study. Front. Neurol. 7:215. doi: 10.3389/fneur.2016.00215
Keywords: hippocampus, Alzheimer’s disease, cognitive aging, aging, spatial memory, sequence memory, episodic memory
Citation: Michallat-Bragg G, Bennett M, Flewitt BI, Kazmi S, Smith SJ, Wells C, Hollins A, Ash C, Thwaites S, Neil W, Howett D, Dexter-Smith S, Chan D, Dachtler J, Poulter S, Evans S and Lever C (2025) Exploring the sensitivity of episodic and spatial memory tests to healthy and pathological cognitive aging. Front. Aging Neurosci. 17:1547834. doi: 10.3389/fnagi.2025.1547834
Received: 18 December 2024; Accepted: 13 May 2025;
Published: 20 June 2025.
Edited by:
Ashok Kumar, University of Florida, United StatesReviewed by:
Rafal Czajkowski, Polish Academy of Sciences, PolandLeslie S. Gaynor, Vanderbilt University Medical Center, United States
Copyright © 2025 Michallat-Bragg, Bennett, Flewitt, Kazmi, Smith, Wells, Hollins, Ash, Thwaites, Neil, Howett, Dexter-Smith, Chan, Dachtler, Poulter, Evans and Lever. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Colin Lever, Y29saW4ubGV2ZXJAZHVyaGFtLmFjLnVr
†These authors share first authorship
‡Lead author