Abstract
Hippocampal–medial prefrontal interactions are thought to play a crucial role in mental simulation. Notably, the frontal midline/medial pFC (mPFC) theta rhythm in humans has been linked to introspective thought and working memory. In parallel, theta rhythms have been proposed to coordinate processing in the medial temporal cortex, retrosplenial cortex (RSc), and parietal cortex during the movement of viewpoint in imagery, extending their association with physical movement in rodent models. Here, we used noninvasive whole-head MEG to investigate theta oscillatory power and phase-locking during the 18-sec postencoding delay period of a spatial working memory task, in which participants imagined previously learned object sequences either on a blank background (object maintenance), from a first-person viewpoint in a scene (static imagery), or moving along a path past the objects (dynamic imagery). We found increases in 4- to 7-Hz theta power in mPFC when comparing the delay period with a preencoding baseline. We then examined whether the mPFC theta rhythm was phase-coupled with ongoing theta oscillations elsewhere in the brain. The same mPFC region showed significantly higher theta phase coupling with the posterior medial temporal lobe/RSc for dynamic imagery versus either object maintenance or static imagery. mPFC theta phase coupling was not observed with any other brain region. These results implicate oscillatory coupling between mPFC and medial temporal lobe/RSc theta rhythms in the dynamic mental exploration of imagined scenes.
INTRODUCTION
Our capacity to imagine spatially cohesive representations is associated with the medial pFC (mPFC) and medial temporal lobe (MTL) regions such as the hippocampus (Schacter et al., 2012; Schacter & Addis, 2007; Hassabis, Kumaran, Vann, & Maguire, 2007; Burgess, Becker, King, & O'Keefe, 2001). Increases in approximately 4- to 8-Hz mPFC oscillatory power, known as the frontal midline theta rhythm, are observed during internally generated behaviors such as abstract thinking and meditation (Aftanas & Golocheikine, 2001; Sasaki et al., 1996; Lehmann, Henggeler, Koukkou, & Michel, 1993; Banquet, 1973). Notably, there is growing evidence that the frontal midline theta rhythm is also implicated in working and episodic memory function (see Hsieh & Ranganath, 2014, for a recent review).
In parallel, hippocampal theta oscillations in the MTL have been hypothesized to serve as a network hub (Battaglia, Benchenane, Sirota, Pennartz, & Wiener, 2011) and global signal integrator (O'Keefe, 2006) for information from neocortical regions, including the mPFC and medial parietal/retrosplenial (RSc) cortices. The hippocampal theta rhythm is strongly associated with translational movement in rodents (O'Keefe & Nadel, 1978; Vanderwolf, 1969) but has also been observed in the human MTL and several neocortical areas including the mPFC and the medial parietal lobe/RSc during virtual navigation (Kaplan et al., 2012; Ekstrom et al., 2005; Caplan et al., 2003; Kahana, Sekuler, Caplan, Kirschen, & Madsen, 1999). Recently, several studies have found increased theta phase coupling between these regions during spatial and autobiographical memory retrieval (Fuentemilla, Barnes, Duzel, & Levine, 2014; Kaplan et al., 2014; Foster, Kaveh, Dastjerdi, Miller, & Parvizi, 2013; Watrous, Tandon, et al., 2013), but whether theta power or phase coupling contributes to spatial imagery is currently unclear.
This issue is addressed by a speculative neural level model of memory-guided visuospatial imagery (Bird & Burgess, 2008; Byrne, Becker, & Burgess, 2007; Burgess et al., 2001). It proposes that the MTL provides allocentric scene information consistent with a single viewpoint location and that this information is translated, via intermediate representations in the RSc, into an egocentric image consistent with a specific viewing direction, supported in a medial parietal “window” (PW). This “top–down” activation of the PW from MTL occurs during the first half of each theta cycle, whereas “bottom–up” updating of the MTL representation from the PW representation occurs in the second half of each cycle—allowing egocentric manipulations in the PW to propagate back to the MTL. These egocentric manipulations include selective attention to specific scene elements (allowing reactivation of their identity/perceptual properties in the MTL) and spatial updating because of the real or imagined movement of viewpoint.
According to this model, during actual movement, at the start of a theta cycle, motor efference copy from premotor areas (Graziano & Gross, 1993) drives egocentric spatial updating of locations by modifying the allocentric-to-egocentric translation to accommodate the effect of the movement. The updated egocentric representation then feeds back to update MTL representations during the second half of the theta cycle. The model proposes that the imagined movement of viewpoint (dynamic imagery) occurs in the same way, driven by mock motor efference copy from pFC, which could be used in behaviors like planning. The theta rhythmicity associated with prefrontal control of working memory maintenance (Hsieh & Ranganath, 2014) would thus have to synchronize with the theta-modulated MTL–PW interaction so as to be able to direct dynamic imagery. However, we are unaware of any attempts to investigate theta rhythmicity or phase coupling during dynamic imagery.
Using a spatial working memory task with noninvasive whole-head MEG, we tested this prediction. In the experiment, participants encoded the location of five objects overlaid on a scene or blank background (Figure 1). Participants then closed their eyes during a 20-sec eyes-closed delay period and imagined previously learned object sequences either overlaid on a blank background (object maintenance; Figure 1A), from a static first-person viewpoint in a scene (static imagery; Figure 1B), or moving along a path past the objects (dynamic imagery; Figure 1C). Afterward, participants were prompted to sequentially match each object to its respective location on the screen, in the scene, or along the path.
Experiment trial structure. After a 20-sec eyes-closed baseline period, an auditory tone alerted the participants to an upcoming encoding period (for all trial types). These encoding periods fell into one of three conditions. (A) Object maintenance: Participants were presented with a set of objects overlaid on a black background and instructed to maintain a memory of those object locations during a 20-sec eyes-closed delay period. (B) Static imagery: Participants were presented with a set of objects overlaid on a static scene and instructed to maintain a memory of those object locations from an imagined first-person viewpoint sitting on the bench set within each scene during a 20-sec eyes-closed delay period. (C) Dynamic imagery: Participants were presented with a set of objects overlaid on a static scene and instructed to maintain a memory of those object locations from a first-person viewpoint as they imagined moving through the scene on a trajectory that sequentially passed each object during a 20-sec eyes-closed delay period. After the delay period, participants were prompted with an image of each object and asked to identify its previous location at their own pace (mean = 11.8 sec) as well as report their perceived memory performance and vividness of imagination during the delay period.
Experiment trial structure. After a 20-sec eyes-closed baseline period, an auditory tone alerted the participants to an upcoming encoding period (for all trial types). These encoding periods fell into one of three conditions. (A) Object maintenance: Participants were presented with a set of objects overlaid on a black background and instructed to maintain a memory of those object locations during a 20-sec eyes-closed delay period. (B) Static imagery: Participants were presented with a set of objects overlaid on a static scene and instructed to maintain a memory of those object locations from an imagined first-person viewpoint sitting on the bench set within each scene during a 20-sec eyes-closed delay period. (C) Dynamic imagery: Participants were presented with a set of objects overlaid on a static scene and instructed to maintain a memory of those object locations from a first-person viewpoint as they imagined moving through the scene on a trajectory that sequentially passed each object during a 20-sec eyes-closed delay period. After the delay period, participants were prompted with an image of each object and asked to identify its previous location at their own pace (mean = 11.8 sec) as well as report their perceived memory performance and vividness of imagination during the delay period.
We were primarily interested in the potential co-occurrence of frontal midline/mPFC, RSc, and MTL theta oscillations, interregional mPFC theta phase coupling during the delay period, and whether these phenomena would be modulated by the context of the object sequence imagination task (blank, static spatial, or dynamic spatial). We predicted an increase in frontal midline/mPFC theta power during the delay period, because of the demands of working memory function (Hsieh & Ranganath, 2014). In addition, theta phase coupling with MTL and parietal regions, indicating the presence of functional interactions between these regions, would be specifically recruited to dynamically shift viewpoint during spatial imagery. The blank background condition provides a necessary control for the basic working memory components of the task, whereas the static imagery condition provides a control for working memory accompanied by spatial imagery in the absence of imagined movement. Although spatial updating of object locations could be performed egocentrically (Wang & Spelke, 2000; Wang & Simons, 1999), in the context of a spatial scene, the updating of object locations also involves allocentric processing of locations relative to the environment (Burgess, Spiers, & Paleologou, 2004) and thus egocentric–allocentric translation (Burgess, 2006). Thus, we predicted that mPFC–MTL/parietal theta phase locking would specifically increase with dynamic imagery demands.
METHODS
Participants
Sixteen participants (seven women; mean age = 22.8 years, SD = 4.07 years) gave written consent and were compensated for performing the experimental task, as approved by the local research ethics committee at the University College London in accordance with the Declaration of Helsinki protocols. All participants were right-handed, had normal or corrected-to-normal vision, and reported to be in good health with no prior history of drug abuse, neurological disease, or psychiatric illness.
Task
Stimuli were presented via a digital LCD projector (brightness = 1500 lumens, resolution = 1024 × 768 pixels, refresh rate = 60 Hz) onto a screen (height = 32 cm, width = 42 cm, distance from participant = ∼70 cm) that was parallel to the participant's face inside a magnetically shielded room using the Cogent (www.vislab.ucl.ac.uk/cogent.php) toolbox running in MATLAB (The MathWorks, Inc., Natick, MA). Participants were fitted with MEG-compatible earbuds to deliver auditory stimuli associated with the experiment.
Participants performed a spatial working memory task in the MEG system, during which they were asked to remember the locations of five different everyday objects (e.g., bicycle, lemon) framed by an equally sized white square and overlaid on a blank background or scene. Participants first viewed an example video of each condition recorded from a first-person viewpoint with the same sequence of objects, so they would have an idea of what to imagine and the ideal pacing of imagination during each particular delay period. Participants were instructed to repeat imagination of the sequences if they finished imagining a learned sequence before the end of the delay period. Participants then performed a single practice session of the task, before three counterbalanced pseudorandomized sessions lasting approximately 15 min each inside the scanner.
In the task, participants first had a 20-sec baseline period of eyes-closed rest. This was followed by a 3-sec habituation period, during which participants viewed either a blank background or one of four different scenes created using Unity software (Unity Technologies, San Francisco, CA). A different scene was used for each session (dirt surface environment containing a cemetery and small river with a bridge, grassy forest clearing containing walking paths and a church, a beach environment with tents and stranded boats, and for the practice session, a grassy hill environment containing a small pond and castle). Each scene had a bench in the foreground. After the end of the habituation period, participants were sequentially presented with five objects, arranged circularly in different locations over the scene/background. Each new object was highlighted with a red frame and would appear for 3 sec (with a randomized ±0.3-sec jitter) until all objects appeared overlaid on the background. The appearance of each new object was preceded by a blink period, during which the objects and scene were replaced with a fixation point and the word BLINK for 1 sec. This was followed by a 2-sec intertrial interval before the object sequence resumed with another object highlighted with a red frame. Blocks were counterbalanced for starting location and direction of object presentation (clockwise and counterclockwise). Objects were either presented in a larger circle, to give way for a plausible walking path for dynamic imagery, or clustered in a small circle in front of a bench in the foreground for all objects to be visible from a single viewpoint inside the environment. For the object maintenance condition, objects were presented with equal frequency in either large or small circles over a black background. Importantly, there were no significant differences in behavioral memory performance (t(15) = 0.011, p = .992), RTs (t(15) = −1.34, p = .198), subjective memory (t(15) = 0.983, p = .341), or vividness ratings (t(15) = 0.433, p = .671) between trials with objects arranged in large and small circles within this condition.
After the encoding phase of the experiment, participants had a 20-sec delay period. During the delay period, participants received instructions to close their eyes and either imagine sitting and passively view the locations of the objects in the scene from a static position on the bench (static imagery); imagine walking slowly through the scene, passing each object sequentially (dynamic imagery); or in the case of objects being presented over a black background, simply “remembering the locations of the objects” (object maintenance). They were cued to open their eyes with an auditory tone played into the participants' headphones.
After the auditory tone, the scene/black background from the encoding period was presented again with numbered squares (1–5) superimposed in place of the objects. Participants were then prompted with a picture of each object at the top of the screen and had to match the object to the number of its corresponding location at their own pace. Immediately after the matching of the spatial location for each object, participants had to indicate how good their memory was for that block and how vivid their imagination during the delay period had been on a 1–5 scale, with 1 being very low and 5 being very high.
Participants performed 36 blocks across three sessions, with each of the three conditions being pseudorandomized within a session. Scene order was also counterbalanced across participants. Over the course of the MEG experiment, participants learned 12 different sequences of objects (60 objects in total), once for each of the three conditions and counterbalanced for order.
MEG Recording and Preprocessing
Data were recorded continuously from 274 axial gradiometers using a CTF Omega (Vancouver) whole-head system at a sampling rate of 600 Hz in third-order gradient configuration. MEG was preferred to EEG in this experiment because of the increased number of recording channels, high signal-to-noise ratio, and reduced setup time. Participants were also fitted with four EOG electrodes to measure vertical and horizontal eye movements. MEG data analysis made use of custom MATLAB scripts, SPM8 (Wellcome Trust Centre for Neuroimaging, London, United Kingdom; Litvak et al., 2011), and Fieldtrip (Oostenveld, Fries, Maris, & Schoffelen, 2011).
For preprocessing, MEG data were epoched into 20-sec baseline periods before the encoding period and during the delay period for each of the three conditions. Trials were visually inspected, with any trial featuring head movement or muscular artifacts being removed, along with any corresponding baseline or task period to allow for consistent trial comparison. After visual inspection, a mean of 32.1 (SD = 6.04) trials from the total of 36 trials in the experiment remained for analysis. This included a mean of 10.7 (SD = 2.5) dynamic imagery trials, 10.6 (SD = 2.19) static imagery trials, and 10.8 (SD = 1.8) object maintenance trials per participant, regardless of trial-by-trial performance. To avoid further physiological artifacts, data were not analyzed until 1 sec after the beginning of eyes-closed maintenance or baseline period and not until 1 sec before the end (i.e., the central 18 sec of the 20-sec delay period).
MEG Source Reconstruction
The linearly constrained minimum variance (LCMV) scalar beamformer spatial filter algorithm from SPM8 was applied to the raw time series and used to generate source activity maps in a 10-mm grid (Barnes & Hillebrand, 2003). Co-registration to Montreal Neurological Institute coordinates was based on nasion and left and right preauricular fiducial points. The forward model was derived from a single-shell model (Nolte, 2003) fit to the inner skull surface of the inverse normalized SPM template. Previous studies have shown negligible improvements in spatial resolution by fitting MEG data to individual structural MR images, rather than a canonical template image, under realistic levels of error and head movement (Troebinger et al., 2014; Henson, Mattout, Phillips, & Friston, 2009). The beamformer source reconstruction algorithm consists of two stages: first, based on the data covariance and leadfield structure, weights are calculated, which linearly map sensor data to each source location, and second, a summary statistic based on the mean oscillatory power between experimental conditions is calculated for each voxel.
Because of the proximity of frontal midline regions to the eyes, we wished to control for any possible influence of EOG muscular artifacts during the maintenance period on the estimates of oscillatory power. We therefore computed the variance of two simultaneously recorded EOG signals across each delay period, as a proxy for the number of eye movements made during that delay period, and removed any covariance between these EOG variance values and oscillatory power measurements across voxels by linear regression (Kaplan et al., 2014). This left “residual” oscillatory power measurements for all trials whose variance could not be accounted for by changes in the variance of the EOG signal between trials, and these residual values were used as summary images for the subsequent analyses.
In addition to movements in the ocular region, phase coupling measures can be biased by concurrent changes in oscillatory power because of changes in the signal-to-noise ratio (Muthukumaraswamy & Singh, 2011). To control for any possible influence of changes in oscillatory power (or EOG artifacts) on our phase coupling measures, we repeated the control analysis described above, with additional linear regressors corresponding to oscillatory power in seed and source voxels for each trial (Kaplan et al., 2014). Similarly, we constructed linear regressors for four behavioral measures across trials—RT, memory performance (percentage of object locations remembered correctly for a given trial), subjective memory, and vividness ratings—to examine correlations between behavioral performance and phase coupling values across trials in each voxel.
Phase Coupling
Statistical Analyses
For a comparison between delay period and baseline condition, summary images for each participant were entered into a one-sample t test in SPM8. Regressors for RT, memory performance (percentage of object locations remembered correctly for a given trial), subjective memory, and vividness ratings, in addition to the “nuisance” regressors described above, were each included in the general linear model. For comparisons between conditions, summary images for each participant and each delay period condition (object maintenance, static imagery, and dynamic imagery) were entered into a second-level 1 × 3 within-participant ANOVA in SPM8.
To address the issue of multiple comparisons, we used family-wise error (FWE) correction, derived from Gaussian Random Field Theory and implemented in SPM8 (Kiebel & Friston, 2004a, 2004b; Worsley, Taylor, Tomaiuolo, & Lerch, 2004; Worsley et al., 1996). To summarize, this approach treats the data, under the null hypothesis, as continuous random fields, where the distribution of the Euler characteristic of any statistical process derived from these fields can be used as an approximation to the null distribution required for inference (Kilner, Kiebel, & Friston, 2005). A peak voxel significance threshold of p < .05 FWE-corrected for multiple comparisons across the whole-brain volume was used for the power analyses. For the theta phase coupling analyses, an MTL ROI for small-volume correction (SVC) for multiple comparisons (peak voxel FWE, p < .05) was constructed using a bilateral MTL mask encompassing the amygdala, hippocampus, and parahippocampal and lingual gyrus (to conservatively include all of the parahippocampal place area regions that respond to landmark information; Epstein, 2008) from the automated anatomical labeling toolbox for SPM (Tzourio-Mazoyer et al., 2002). All images are displayed at the p < .001 uncorrected threshold (cluster extent of at least 20 voxels after interpolation to the Montreal Neurological Institute brain) for illustrative purposes. In addition, only clusters containing a significant peak voxel are displayed.
RESULTS
Behavioral Results
Participants remembered an average of 74.7% (standard error = 5.33%) object locations correctly with a mean RT of 1980 msec for matching an individual object to its respective location after the onset of the retrieval period. On a scale of 1–5 (1 being unsatisfactory and 5 being very good), participants rated each working memory trial for subjective vividness (mean = 3.37, standard error = 0.147) and subjective memory (mean = 3.81, standard error = 0.167). There was a significant interaction between Performance and Condition (F(2, 30) = 5.66, p = .016; Figure 2A), where performance was significantly higher for object maintenance than static imagery (t(15) = 3.21, p = .006), but not dynamic imagery (t(15) = 0.715, p = .486). We observed no significant difference between memory performance in static and dynamic imagery conditions (t(15) = 1.67, p = .117).
Behavioral results. (A) Average trial-by-trial memory performance for each experimental condition during the retrieval phase. Main effect of Condition: F(2, 30) = 5.66, p = .016. (B) Average trial-by-trial RT during the retrieval phase for Objects 2–5 in the sequence (as retrieval was self-paced and very long RTs were often recorded for Object 1). (C) Average trial-by-trial subjective memory ratings made on a scale of 1–5 (1 = unsatisfactory, 5 = very good) given immediately after the retrieval phase. Main effect of Condition: F(2, 30) = 11.4, p = .001. (D) Average trial-by-trial vividness ratings made on a scale of 1–5 (1 = poor, 5 = very good). Main effect of Condition: F(2, 30) = 9.42, p = .003. All bar graphs show mean ± SEM over the 16 participants. *p < .05; **p < .005; ***p < .001.
Behavioral results. (A) Average trial-by-trial memory performance for each experimental condition during the retrieval phase. Main effect of Condition: F(2, 30) = 5.66, p = .016. (B) Average trial-by-trial RT during the retrieval phase for Objects 2–5 in the sequence (as retrieval was self-paced and very long RTs were often recorded for Object 1). (C) Average trial-by-trial subjective memory ratings made on a scale of 1–5 (1 = unsatisfactory, 5 = very good) given immediately after the retrieval phase. Main effect of Condition: F(2, 30) = 11.4, p = .001. (D) Average trial-by-trial vividness ratings made on a scale of 1–5 (1 = poor, 5 = very good). Main effect of Condition: F(2, 30) = 9.42, p = .003. All bar graphs show mean ± SEM over the 16 participants. *p < .05; **p < .005; ***p < .001.
During the retrieval period, there was no significant difference in RT when matching objects to their location among the three conditions (F(2, 30) = 3.30, p = .067; Figure 2B). There was, however, a significant difference in subjective memory ratings by condition (F(2, 30) = 11.4, p = .001; Figure 2C). Similar to measured memory performance, subjective memory ratings were significantly higher for object maintenance than static imagery (t(15) = 4.10, p = .001), but not dynamic imagery trials (t(15) = 1.54, p = .144). Subjective memory ratings were also higher for dynamic than static imagery trials (t(15) = 2.58, p = .021). In addition, there was a significant Subjective vividness ratings × Condition interaction (F(2, 30) = 9.42, p = .003; Figure 2D). Specifically, vividness ratings were significantly higher for object maintenance than static imagery (t(15) = 3.71, p = .002), but not dynamic imagery trials (t(15) = 1.71, p = .109). There was no difference in vividness ratings between dynamic and static imagery trials (t(15) = 1.55, p = .143).
To summarize, performance, subjective memory, and vividness ratings were each significantly higher during object maintenance trials than static imagery trials but not significantly different from dynamic imagery trials, and subjective memory ratings were significantly higher in dynamic imagery trials compared with static imagery trials.
Theta Power Changes and Source Reconstruction
To assess changes in frontal midline oscillatory power associated with the working memory task, we first extracted low-frequency power spectra for the middle 18 sec of eyes-closed rest (i.e., baseline) and mental imagery of object location (i.e., delay) periods collapsed across all conditions from a 20-mm spherical ROI in source space, centered on frontal midline coordinates identified by a previous study (x = 0, y = 58, z = 22; Kaplan et al., 2014). A comparison of these power spectra identified a prominent increase in theta power between the baseline and delay periods that peaked at ∼5.5 Hz (Figure 3A). We subsequently defined our frontal midline theta band of interest as a 3-Hz frequency window centered on this peak (i.e., 4–7 Hz). As expected, an examination of changes in 4- to 7-Hz power at the sensor level showed a large increase between the baseline and delay periods over frontotemporal regions (Figure 3B).
Delay versus baseline period, 4- to 7-Hz theta power changes. (A) Power spectra from a virtual electrode placed in the frontal midline region identified by a previous study (x = 0, y = 58, z = 22; Kaplan et al., 2014) for the delay and baseline periods as well as the difference in power between the two, averaged across all trials and participants. The power difference plot shows mean ± SEM across participants in gray. The power difference spectra show a single prominent peak at ∼5.5 Hz, and we focus our subsequent analyses on a 3-Hz frequency band centered on this peak (i.e., 4–7 Hz). (B) Scalp level delay versus baseline period, 4- to 7-Hz theta power changes, which show an increase across the frontal midline region. (C, left) mPFC (x = 22, y = 50, z = 0; z score = 4.51; peak voxel FWE corrected for the whole-brain volume, p = .012) theta power changes between the baseline and delay periods. Image shown at p < .001 uncorrected and overlaid on the canonical Montreal Neurological Institute 152 T1 image.
Delay versus baseline period, 4- to 7-Hz theta power changes. (A) Power spectra from a virtual electrode placed in the frontal midline region identified by a previous study (x = 0, y = 58, z = 22; Kaplan et al., 2014) for the delay and baseline periods as well as the difference in power between the two, averaged across all trials and participants. The power difference plot shows mean ± SEM across participants in gray. The power difference spectra show a single prominent peak at ∼5.5 Hz, and we focus our subsequent analyses on a 3-Hz frequency band centered on this peak (i.e., 4–7 Hz). (B) Scalp level delay versus baseline period, 4- to 7-Hz theta power changes, which show an increase across the frontal midline region. (C, left) mPFC (x = 22, y = 50, z = 0; z score = 4.51; peak voxel FWE corrected for the whole-brain volume, p = .012) theta power changes between the baseline and delay periods. Image shown at p < .001 uncorrected and overlaid on the canonical Montreal Neurological Institute 152 T1 image.
Next, we utilized the LCMV beamformer algorithm (Barnes & Hillebrand, 2003) to estimate all cortical sources that exhibited significant increases in 4- to 7-Hz theta power between the baseline and delay periods. We identified a single large cluster peaking in the mPFC (peak at x = 22, y = 50, z = 0; t(15) = 6.76; z score = 4.51; peak voxel FWE whole-brain corrected, p = .012; Figure 3C), extending into the right anterior temporal lobe/MTL. Using a 1 × 3 within-participant repeated-measures ANOVA by condition, we did not observe any significant theta power differences between conditions. Furthermore, we observed no correlations between theta power and performance, retrieval phase RT, participant memory ratings, or vividness ratings over trials. In addition, no cortical regions showed greater theta power during baseline periods, compared with the delay period.
Given that changes in the 1- to 4-Hz delta/low theta band have also been associated with human mnemonic function (Jacobs, 2014; Watrous, Lee, et al., 2013), we used the LCMV beamformer algorithm to assay changes in 1- to 4-Hz power between the baseline and delay periods across the whole brain. We observed a single subthreshold peak in the left insula that did not survive FWE correction for multiple comparisons. Notably, however, no significant changes were observed in the frontal lobe or MTL (data not shown).
Theta Phase Coupling
Next, we used the mPFC region that exhibited a significant theta power increase between the baseline and delay periods as a seed region to investigate changes in theta phase coupling across the whole brain. As in a previous study (Kaplan et al., 2014), the specific seed voxel for each participant was chosen as that with the greatest theta power increase between the baseline and delay periods within 20 mm of the group maximum, to account for variance in frontal midline theta source locations between participants (Isihara et al., 1981).
First, we used the PLI (Stam et al., 2007; see Methods), a technique that eliminates volume conduction effects, to look for increases in theta phase coupling between the mPFC seed region and all other voxels in the brain. After correcting for eye movements and oscillatory power in seed and source voxels, we found no significant increases in theta phase coupling with the mPFC anywhere in the brain between the delay and baseline periods averaged over all three conditions.
We then compared mPFC theta coupling differences between the three conditions using a 1 × 3 within-participant repeated-measures ANOVA. In a whole-brain analysis, the most significant main effect of Condition on mPFC phase coupling was found in the left posterior MTL extending into the RSc (x = −28, y = −54, z = −2; z score = 4.13, F(2, 30) = 16.1; peak voxel FWE bilateral MTL SVC, p = .011; Figure 4A). Subsequent t tests indicated that mPFC–MTL/RSc theta phase coupling was significantly higher for dynamic imagery than both object maintenance (x = −28, y = −54, z = −2; t(15) = 4.60; z score = 3.97; peak voxel FWE SVC, p = .016; Figure 4C) and static imagery (x = −30, y = −54, z = −4; t(15) = 5.35, z score = 4.45; peak voxel FWE whole-brain corrected, p = .03; Figure 4D).
Delay period 4- to 7-Hz mPFC theta phase coupling; PLI whole-brain analysis. (A) 1 × 3 interaction between Object maintenance, Static imagery, and Dynamic imagery for mPFC theta phase coupling, showing a significant cluster in the left posterior MTL (x = −28, y = −54, z = −2; z score = 4.13; peak voxel FWE for bilateral MTL, p = .011) extending into the RSc. (B) Mean mPFC PLI with 10-mm sphere around the left posterior MTL peak voxel (x = −28, y = −54, z = −2) for all three conditions versus baseline. (C) mPFC theta phase coupling for dynamic imagery versus object maintenance, showing a significant cluster in the left posterior MTL (x = −28, y = −54, z = −2; z score = 3.97; peak voxel FWE for bilateral MTL, p = .016), extending into the RSc. (D) mPFC theta phase coupling for dynamic imagery versus static imagery, showing a significant cluster in the left posterior MTL (x = −30, y = −54, z = −4; z score = 4.45; peak voxel FWE corrected for whole-brain volume, p = .03), extending into the RSc. All images shown at p < .001 uncorrected for visualization purposes and overlaid on the canonical Montreal Neurological Institute 152 T1 image. (E) Mean mPFC PLV with 10-mm sphere around the left posterior MTL peak PLI voxel in A for all three conditions versus baseline. All bar graphs show mean ± SEM over the 16 participants.
Delay period 4- to 7-Hz mPFC theta phase coupling; PLI whole-brain analysis. (A) 1 × 3 interaction between Object maintenance, Static imagery, and Dynamic imagery for mPFC theta phase coupling, showing a significant cluster in the left posterior MTL (x = −28, y = −54, z = −2; z score = 4.13; peak voxel FWE for bilateral MTL, p = .011) extending into the RSc. (B) Mean mPFC PLI with 10-mm sphere around the left posterior MTL peak voxel (x = −28, y = −54, z = −2) for all three conditions versus baseline. (C) mPFC theta phase coupling for dynamic imagery versus object maintenance, showing a significant cluster in the left posterior MTL (x = −28, y = −54, z = −2; z score = 3.97; peak voxel FWE for bilateral MTL, p = .016), extending into the RSc. (D) mPFC theta phase coupling for dynamic imagery versus static imagery, showing a significant cluster in the left posterior MTL (x = −30, y = −54, z = −4; z score = 4.45; peak voxel FWE corrected for whole-brain volume, p = .03), extending into the RSc. All images shown at p < .001 uncorrected for visualization purposes and overlaid on the canonical Montreal Neurological Institute 152 T1 image. (E) Mean mPFC PLV with 10-mm sphere around the left posterior MTL peak PLI voxel in A for all three conditions versus baseline. All bar graphs show mean ± SEM over the 16 participants.
To corroborate these findings, we made use of a more sensitive measure of phase coupling—the PLV (Lachaux et al., 1999; see Methods). We examined mPFC theta PLV within a 10-mm spherical ROI around the MTL/RSc peak voxel (x = −28, y = −54, z = −2) that displayed the most significant PLI difference between the three conditions above. Using a 1 × 3 repeated-measures ANOVA, we observed the same main effect of Condition (F(2, 30) = 10.9, p = .001). Subsequent paired t tests revealed that dynamic imagery PLV was significantly higher than both static imagery (t(15) = 4.07, p = .001) and object maintenance (t(15) = 3.54, p = .003) within this region. As with the PLI analysis, we did not observe any significant differences in PLV between the static imagery and object maintenance conditions (p > .05).
Investigating possible MTL lateralization during dynamic imagery, we found no significant difference in mPFC theta coupling with left versus right posterior MTL during dynamic imagery when we compared PLI values in the left MTL region that showed a significant increase during dynamic imagery effect with those from the right MTL region (x = 48, y = −36, z = −10; t(15) = 4.75, z score = 3.67) that was most strongly coupled to the mPFC between the baseline and delay periods. Crucially, we did not identify any other significant differences in theta phase coupling between mPFC and any other brain regions or between any other conditions.
To ascertain whether mPFC–posterior MTL theta phase coupling difference between conditions related to differences in performance measures across conditions, we tested whether the mPFC–posterior MTL theta PLI values across all delay period trials correlated with our four behavioral regressors across trials. Using a 10-mm sphere around the left posterior MTL PLI peak, we found no significant correlations (all ps > .05) with any of the four behavioral regressors: memory performance, RT, subjective memory, and vividness ratings. Finally, we investigated whether mPFC mean PLI differences between the delay and baseline periods in any other brain regions correlated with any of these performance measures across trials but did not identify any significant effects.
DISCUSSION
We have identified an increase in mPFC and anterior temporal lobe/MTL theta power during eyes-closed mental imagery of previously learned object sequences during a spatial working memory delay period compared with a preceding baseline period of eyes-closed rest. We found that the mPFC theta rhythm showed significantly stronger coupling with the left posterior MTL/RSc than any other brain region during imagined movement around learned object locations (dynamic imagery), compared with static imagery of learned object sequences in a scene (static imagery) or over a blank background (object maintenance).
Our theta power findings add to a rich literature of human working memory studies exploring theta oscillations in the frontal midline during the delay period of human working memory tasks (see Hsieh & Ranganath, 2014, for a review). We found increased mPFC theta power for mental imagery during a working memory delay period before retrieval, versus a baseline period, which parallels results from a previous MEG study that found increased mPFC theta during cued retrieval of previously learned spatial representations (Kaplan et al., 2014). Furthermore, similar to previous MEG and intracranial EEG studies, we also observed increased MTL theta power during a working memory delay period (Poch, Fuentemilla, Barnes, & Duzel, 2011; Axmacher et al., 2010; Cashdollar et al., 2009; Raghavachari et al., 2006; Tesche & Karhu, 2000). Frontal midline theta is often linked to directed attention (Ishii et al., 1999, 2014; see review by Mitchell, McNaughton, Flanagan, & Kirk, 2008), which we did not directly measure in this task. However, if there were any differences in directed attention between conditions, they did not cause any observable differences in frontal midline theta power.
Specifically, comparable with our findings, a previous study observed increased delay period mPFC and hippocampal theta power during spatial integration of object locations (Olsen, Rondina, Riggs, Meltzer, & Ryan, 2013). Unlike the study by Olsen and colleagues, we did not observe separate theta sources in the mPFC and hippocampus during the delay period. This discrepancy might be a consequence of the relatively small number of trials and participants in our task, which increase the difficulty of estimating deep sources with MEG (Quraan, Moses, Hung, Mills, & Taylor, 2011; also see Dalal et al., 2013, for evidence relating direct human hippocampal recordings to MEG source reconstruction). In addition, EEG studies have seen increased frontal midline theta oscillations corresponding to the maintenance of the serial position of items (Hsieh, Ekstrom, & Ranganath, 2011), both spatial location and temporal order judgments (Roberts, Hsieh, & Ranganath, 2013), and the number of items being maintained in working memory (Jensen & Tesche, 2002). These results allow for the possibility that the frontal midline theta rhythm might provide top–down control to maintain the serial order or relation between previously learned spatial or temporal hippocampal representations before retrieval (Olsen et al., 2013), which in the case of our spatial imagery task, would also require MTL/parietal regions related to processing different viewpoints (Byrne et al., 2007).
The mPFC delay period theta rhythm was more strongly coupled to the posterior MTL than any other brain region, further emphasizing the importance of mPFC–MTL theta phase coupling during mnemonic tasks (Backus, Schoffelen, Szebényi, Hanslmayr, & Doeller, 2016; Garrido, Barnes, Kumaran, Maguire, & Dolan, 2015; Fuentemilla et al., 2014; Kaplan et al., 2014; Guitart-Masip et al., 2013; Watrous, Tandon, et al., 2013; Anderson, Rajagovindan, Ghacibeh, Meador, & Ding, 2010). Although the mPFC–right posterior MTL phase coupling delay period effect did not significantly vary by condition, unlike the mPFC–left MTL theta coupling effect we observed, all delay period conditions were also above baseline.
Looking across the different imagery conditions revealed significant differences in mPFC theta phase coupling with the left posterior MTL/RSc according to condition. Crucially, mPFC–MTL/RSc theta coupling was higher for dynamic spatial imagery than our other two delay period imagery manipulations, and the mPFC theta rhythm displayed no significant 4- to 7-Hz phase locking with any other brain region. We also found that mPFC–MTL theta coupling did not show any effect of hemispheric lateralization for dynamic imagery, because there was no significant mPFC theta coupling with the left or right MTL during dynamic imagery. Crucially, no behavioral rating correlated with mPFC–pMTL/RSc theta coupling among the three conditions, suggesting that the increase in mPFC–pMTL/RSc theta coupling for dynamic imagery was due to specific condition demands (e.g., mental imagery for multiple viewpoints, dynamically changing scenes) rather than differences in general behavioral performance (e.g., memory strength). Similar to past studies of spatial attention in human electrophysiology (Mangun & Hillyard, 1990), future experiments should investigate the relationships between distributed neural responses at different stages of dynamic spatial imagery and measures of performance or vividness, to separate responses reflecting the types of processing involved from those reflecting successful execution of that processing type. Furthermore, human intracranial EEG recordings may be better suited than MEG studies to investigate the temporal dynamics of interregional phase interactions, and future studies can investigate the potentially rich, temporal phase coupling dynamics that might differ between imagery conditions.
Notably, one intracranial EEG study recording from human MTL and RSc (Foster et al., 2013) found increased theta phase coupling between the two regions during autobiographical memory, an effect that might be due to the increased dynamic imagery demands of the task. Taken together, these findings complement a previous hypothesis that theta synchronization between MTL/RSc and mPFC should accompany increased contextual associations (Aminoff, Kveraga, & Bar, 2013). Both the posterior MTL and RSc are thought to encode different viewpoints into a large-scale understanding of an environment (Vass & Epstein, 2013).
In terms of specific neural mechanisms, the finding of mPFC–MTL/RSc theta synchrony during dynamic imagery (i.e., movement of viewpoint) supports a recent model of spatial mental imagery. This model proposes that coherent spatial imagery results from a theta-rhythmic interaction between MTL and medial parietal areas, mediated by RSc (Byrne et al., 2007; Burgess et al., 2001). Within this model, pFC is hypothesized to control movements of viewpoint during mental exploration by modulating alternating temporal–parietal flows of information (Dhindsa et al., 2014).
Our findings illustrate the importance of mPFC–MTL interregional phase coupling in memory, irrespective of the presence or absence of theta power changes (Brincat & Miller, 2015; Watrous, Lee, et al., 2013). We provide further support to an expanding literature relating frequency-band-specific interregional phase coupling to a variety of cognitive processes (for reviews, see Fell & Axmacher, 2011; Jutras & Buffalo, 2010; Buzsáki, 2006; Fries, 2005). An important caveat to our findings is that our 4- to 7-Hz dynamic imagery effect is localized to the posterior MTL/RSc, not specifically to the hippocampus. Indeed, a lower frequency of ∼1–4 Hz in the human hippocampus has been related to the hippocampal theta rhythm in rodents (Jacobs, 2014; Watrous et al., 2013), and we had no prediction about how other frontal midline or hippocampal theta rhythms might couple with other frequency bands in other neocortical regions, such as the alpha/beta band, that are present during memory formation (Hanslmayr, Staresina, & Bowman, 2016). Future models can investigate interregional interactions and potential oscillatory multiplexing (Ekstrom & Watrous, 2014; Watrous & Ekstrom, 2014) to better explore the mechanisms underlying dynamic imagery and memory formation.
Hippocampal–mPFC theta phase locking is also commonly observed during rodent spatial exploration and correlates with behavioral performance (Benchenane et al., 2010; Hyman, Zilli, Paley, & Hasselmo, 2010; Sigurdsson, Stark, Karayiorgou, Gogos, & Gordon, 2010; Jones & Wilson, 2005; see Colgin, 2011, and Gordon, 2011, for reviews). Notably, grid cell firing in the entorhinal cortex is also associated with theta states, and grid-like processing has been observed in medial prefrontal, parietal, and temporal regions (Doeller, Barry, & Burgess, 2010). Recent fMRI studies found grid-like processing in the MTL also during mental navigation (Bellmund, Deuker, Navarro Schröder, & Doeller, 2016; Horner, Bisby, Zotow, Bush, & Burgess, 2016), which parallels our dynamic versus static imagery results and suggests that both theta rhythmicity and spatial cell firing might play a role in the mental exploration of imagined spaces.
Conclusions
Our findings suggest that theta oscillations in the mPFC and MTL could work in tandem, along with the RSc, to coordinate dynamic mental imagery during spatial working memory maintenance. Our results allow for the possibility that oscillatory interactions in the theta band between mPFC and MTL/RSc in humans could underlie the mental exploration of possible spatial trajectories and, more generally, mental simulation and fictive planning (Buzsáki & Moser, 2013; Byrne et al., 2007).
Acknowledgments
The research was supported by a Sir Henry Wellcome Postdoctoral Fellowship (WT101261MA) to R. K., Medical Research Council UK and Wellcome Trust grants to N. B., and an MRC Doctoral Training Grant (MR/K501086/1) to S. S. M. The authors thank Gareth Barnes for helpful discussion and Letty Manyande for help with scanning. The Wellcome Trust Centre for Neuroimaging is supported by core funding from the Wellcome Trust 091593/Z/10/Z.
Reprint requests should be sent to Raphael Kaplan, Wellcome Trust Centre for Neuroimaging, UCL Institute of Neurology, University College London, 12 Queen Square, London, United Kingdom WC1N 3BG, or via e-mail: raphael.kaplan.09@ucl.ac.uk, or Neil Burgess, UCL Institute of Cognitive Neuroscience, Alexandra House, 17 Queen Square, London, WC1N 3AZ United Kingdom, or via e-mail: n.burgess@ucl.ac.uk.
REFERENCES
Author notes
Authors contributed equally to this work.