During memory encoding, increased hippocampal activity—thought to reflect the binding of different types of information into unique episodes—has been shown to correlate with subsequent recollection of those episodes. Repetition priming—thought to induce more efficient perceptual processing of stimuli—is normally associated with decreased neocortical activity and is often assumed to reduce encoding into episodic memory. Here, we used fMRI to compare activity to primed and unprimed auditory words in the presence of distracting sounds as a function of whether participants subsequently recollected the word–sound associations or only had a feeling of familiarity with the word in a subsequent surprise recognition task. At the behavioral level, priming increased the incidence of subsequent recollection. At the neuronal level, priming reduced activity in the superior temporal gyrus (STG) but also reversed the traditional increase in encoding-related hippocampal activity associated with subsequent recollection relative to subsequent familiarity. To explain this interaction pattern, further analyses using dynamic causal modeling showed an increase in connectivity from left STG to left hippocampus specific to words that were later recollected. These findings show that successful episodic encoding is not determined solely by local hippocampal activity and emphasize instead the importance of increased functional neocortical–hippocampal coupling. Such coupling might be a better predictor of subsequent recollection than the direction of local hippocampal changes per se. We propose that one consequence of priming is to “free up” attentional resources from processing an item in a noisy context, thereby allowing greater attention to encoding of that context.
Episodic memory refers to recollection of contextual information associated with a prior episode and is often associated with an autonoetic consciousness that gives rise to subjective mental time travel (e.g., Eustache & Desgranges, 2008; Gardiner, 2001; Tulving, 2001). Sometimes, however, a stimulus may seem familiar, without any recollection of related contextual information, a situation related to noetic consciousness and possibly semantic memory (Gardiner, 2001; Tulving, 2001). The hippocampus is generally thought important for binding together the different types of information associated with episodic memories (for a review, see Davachi, 2006; Eichenbaum, 2004). This is consistent with functional neuroimaging studies that have reported increased hippocampal activity associated with encoding processes that lead to subsequent recollection, relative to those leading only to a feeling of familiarity (e.g., Uncapher & Rugg, 2005; Ranganath et al., 2004; Davachi, Mitchell, & Wagner, 2003). Here we report a new finding—whereby an increase in hippocampal activity is not associated with subsequent recollection—and demonstrate instead the importance of increased functional coupling between hippocampus and neocortex for successful episodic encoding.
We achieved this novel finding by combining an implicit repetition priming paradigm with a subsequent explicit recognition memory paradigm. Priming normally elicits reduced activity in the cortical regions associated with processing those stimuli, a reduction that is often attributed to more efficient processing (e.g., Schacter, Wig, & Stevens, 2007; Henson, 2003). We hypothesize here that when stimuli are presented in noisy environments, priming improves their perception and thereby also improves their binding with the concurrent context in episodic memory.
One might expect that if primed stimuli are more efficiently processed—that is, are less novel or “surprising”—then they are less likely to be encoded into episodic memory. Indeed, one previous study showed that conditions that increase priming can impair the encoding of explicit memories (Wagner, Maril, & Schacter, 2000; although see Stark, Gordon, & Stark, 2008). However, the visual word stimuli in that study were presented under relatively good perceptual conditions. Furthermore, Wagner et al. (2000) measured subsequent recognition memory using confidence judgments that do not distinguish recollection from feelings of familiarity in the absence of episodic retrieval (Mandler, 1980). Indeed, familiarity can be strong enough for highly confident judgments (Gardiner, 2001). In a recent behavioral study (Gagnepain, Lebreton, Desgranges, & Eustache, 2008), we used a remember/know/guess (R/K/G) paradigm (Gardiner, Ramponi, & Richardson-Klavehn, 1998; Tulving, 1985) to separate recollection and familiarity of auditory words in the presence of background sounds. In addition, participants had to justify their R responses by further recalling the associated background sound (from two possible alternatives). Our results showed that although prior repetition increased the probabilities of both subsequent recollection and familiarity, the amount of behavioral priming only correlated with the probability of subsequent recollection. This supports the above hypothesis; that is, although prior repetitions can increase levels of both recollection and familiarity, priming can also specifically aid the encoding of episodic memories by helping to bind an impoverished stimulus with its concurrent context.
However, the neuronal correlates of such relationship and especially the impact of neuronal priming on hippocampal processes are currently unknown. Here we adapted our previous paradigm (Gagnepain, Lebreton, et al., 2008) to fMRI and used dynamic causal modeling (DCM) (Friston, Harrison, & Penny, 2003) to explore effective connectivity (coupling) between the hippocampus and a region of the superior temporal gyrus (STG) that exhibits repetition suppression associated with priming of auditory words (Gagnepain, Chetelat, et al., 2008). The paradigm entails three distinct phases (Figure 1A): (a) an initial phase in which “prime words” were presented three times during a phoneme detection task; (b) a second phase, 24 hr later, when these primed words (together with unprimed words and pseudowords) were presented during a lexical decision task with concurrent sound backgrounds and during which the critical fMRI data were acquired; and (c) a final, surprise recognition test (after removal from the scanner) for words presented in the second phase (R/K/G paradigm).
Twenty-four right-handed French men aged 20 to 30 years (M = 23.5; SD = 2.9) were paid to take part in the study. Only men were recruited to match the population sample of our previous experiment (Gagnepain, Chetelat, et al., 2008). They had no reported history of neurological, medical, speech, or hearing disorders, had at least 14 years of schooling, and presented normal MRI structural images. The project was approved by the regional ethics committee, and all participants gave written consent. Two participants were excluded from the analyses because of excessive head motion as defined by outlying standard deviations for slice-wise differences across successive scans.
The items, which were recorded by a female voice, comprised 520 words taken from the LEXIQUE French database (www.lexique.org), 92 pseudowords, and 20 consonant sounds used during the first phoneme detection phase (see Procedure section). Six lists of 80 words were created according to the different experimental conditions (see Procedure section) plus one list of 40 words for the study and test training phases. All the lists were paired separately according to various criteria, including concreteness, French text corpus frequency, Web frequency, gender, stimulus duration, and number of phonemes (five to eight) and syllables (bi-/tri-syllabic). Twenty-four of the various possible combinations of the six word lists were selected in a pseudorandom way so that the experimental conditions assigned to the lists were counterbalanced across the participants.
Two hundred seventy-six sound contexts were selected from the various databases available on the Internet (Shafiro & Gygi, 2004). Three lists of 80 sounds were created in accordance with the experimental conditions plus one list of 36 sounds for the priming test training phase. Using audio processing software, these sounds were then divided into 2-sec representative temporal windows. The words and the pseudowords assigned to the auditory lexical decision task were perceptually degraded by adding these sound contexts. The three sound lists were counterbalanced across the word lists and participants so that each sound list was associated equally often with each word list and each experimental condition. This procedure thus led to the creation of 1440 different test stimulations for words and 240 for pseudowords. Particular care was taken to ensure that none of the word–sound associations involved any obvious semantic relations.
First Phase: Priming Words during a Phoneme Detection Task
After a short training period, the participants heard a total of 480 words and performed a phonological task designed to encourage detailed perceptual processing. After a 500-msec gap, each word was followed by a phoneme pronounced in the same voice as the target word and lasting approximately 300–600 msec. Participants were instructed to decide whether this phoneme was present in the preceding item. Positive (phoneme present) and negative (phoneme not present) trials were counterbalanced across words and participants, and as far as possible, target sounds were located equally often at the beginning, middle, and end of words. Two hundred forty of the 480 trials corresponded to the same 80 words presented three times. There was an ISI of 3000 msec after each participant's response. The phase was divided into three blocks of 160 trials: 80 prime words (presented once in each block) and 80 distractors (only presented in one block). All the words within a block were presented in a pseudorandom order. This three-block structure was not perceptible to the participants and was used purely for methodological purposes.
Second Phase: Test Phase of the Priming Task/Incidental Encoding Phase
Participants performed an auditory lexical decision task during a single fMRI session 24 ± 6 hr later. Two hundred forty items were presented in 240 different sound contexts: 80 primed words, 80 unprimed words, and 80 pseudowords. Participants had to decide, as quickly as possible, whether the items corresponded to a word belonging to the French language. They were also told to pay attention to the sound and were given the impression that they were taking part in a study of the neuronal mechanisms involved in the perception of speech that has been degraded by sound. Participants were also instructed to close their eyes to focus on the auditory task. For each trial, the items started without any cue and participants' RTs were measured from the onset of the words. All trial conditions (primed words, unprimed words, pseudowords) were presented according to an efficient stochastic design. The ISI varied between 3600 and 4400 msec (mean = 4000 msec, 1.82 repetition times, see below), ensuring that the hemodynamic response was sampled approximately every 200 msec over the trials. Items were presented using E-Prime software run by the IFIS System Manager, controlling stimulus delivery by synchronizing each trial with a series of TTL pulses produced during imaging acquisition. Items were delivered via an electrodynamic audio system (MR-CONFON, Magdeburg, Germany) ensuring attenuation of scanner noise by up to 45 dB.
Third Phase: Recognition Task
Fifteen minutes after being removed from the MRI scanner, participants performed a final recognition memory test on the 160 old words (80 primed and unprimed words from the second phase) and 80 new words, with an ISI of 2500 msec after each participant's response. In our previous behavioral study (Gagnepain, Lebreton, et al., 2008), we noticed that some participants had difficulties recalling the sound context associated with each word, which is not necessarily surprising given that the two different sounds used were incidental to the encoding. To address this problem, old words during the recognition task were presented with the same sound contexts as during the lexical decision task. If they had accurate memories of both the word and the sound context and if they mentally relived this stimulation, they had to reply “Yes, I remember” (R response). If, on the other hand, their positive response occurred in the absence of any mental reliving, that is, was based on a feeling of familiarity with the word that could not be associated to the sound (even if the sound itself might be familiar), they had to reply “Yes, I know” (K response). To increase the specificity of memory traces and hence unique recollection of episodes, we used as many sound backgrounds as words rather than just two sound backgrounds. Moreover, to ensure that R responses did not solely correspond to the recollection of the sound context, new words were presented with the (old) sound contexts associated with the pseudowords, and participants were informed of this fact. Lastly, if the participants had any doubts about their response, they were told to give an “I guess” judgment (G response; see Gardiner et al., 1998), and if they were sure they had not seen the word in the scanner, the instruction was to reply “No,” which was labeled as Miss (M) when it was an old word.
In a final debriefing, participants were asked whether, during the second lexical decision phase, they had noticed that any of the words had been used in the initial (phoneme detection) phase on the previous day. All participants noticed repetition of a few words, but only three participants reported noticing repetition of up to 20 words. Participants also said they did not notice any difference between these repeated words and the other words that they remembered during the subsequent memory task. Lastly, none of the participants reported using intentional encoding or retrieval strategies during the first two phases, saying that they were focused on the perceptually demanding main task.
fMRI Data Acquisition
All images were acquired using a Philips 3T system (Eindhoven, Netherlands). BOLD images were collected using a T2*-weighted, echo-planar sequence (64 × 64 × 34; 3.5 × 3.5 × 3.5 mm3; field of view = 224 mm; echo time = 35 msec; flip angle = 80°, repetition time = 2200 msec). The slices were acquired in an interleaved, ascending direction. The 446 functional volumes were collected during one functional session, where the first six volumes were discarded to allow for equilibration effects. T1-weighted structural images were also acquired (256 × 256 × 180, 1 × 1 × 1 mm3, echo time = 4.6 msec, flip angle = 20°, field of view = 256 mm, repetition time = 20 msec).
fMRI Data Processing
Data were analyzed using Statistical Parametric Mapping software (SPM5, Wellcome Department of Imaging Neuroscience, London, UK). During preprocessing, images were first corrected for slice acquisition temporal delay before being spatially realigned to correct for motion. Images were then normalized using the parameters derived from the nonlinear normalization of individual gray matter T1 images to the T1 template of the Montreal Neurological Institute and spatially smoothed using an 8-mm FWHM Gaussian kernel.
Conventional Activation Analyses
In the initial analysis, the resulting preprocessed time series in each voxel were high-pass filtered to 1/128 Hz and globally scaled by the mean over all voxels at each scan. Regressors within a general linear model (GLM) for each voxel were created by convolving a delta function at stimulus onset for each condition of interest with a canonical hemodynamic response function and its time and dispersion derivatives. For the basic test of repetition suppression, the GLM contained only three conditions: wordprimed, wordunprimed, and pseudoword, plus an additional regressor for incorrect lexical decisions. For tests of memory encoding effects, a fuller GLM was created with the following additional conditions for the words: Rprimed, Runprimed, Kprimed, Kunprimed, Gprimed, Gunprimed, Mprimed, and Munprimed. Note that because participants produced few G responses in primed and/or unprimed conditions, these conditions were not included in subsequent analyses, nor were incorrect responses or pseudowords. Further regressors of no interest were the six realignment parameters to account for motion artifacts. Maximum likelihood estimates of the GLM parameters were obtained using an AR(1) plus white noise model of the error. Linear contrasts of the canonical hemodynamic response function parameter estimates (activation maps) were then computed for repetition suppression (wordunprimed − wordprimed) and repetition enhancement (wordprimed − wordunprimed) as well as for the Repetition × Subsequent Recognition interaction of interest (Runprimed − Kunprimed > Rprimed − Kprimed). Note the five other possible interactions resulting from the combination of our six main conditions (Rprimed, Runprimed, Kprimed, Kunprimed, Mprimed, and Munprimed) were also computed and reported in Supplementary Table 3.
These contrast images were then entered into a series of one-sample t tests in a second-level analysis, with participants treated as a random effect. We used an initial voxel height threshold of puncorrected < .001 and cluster extent threshold of k = 5 but corrected for multiple comparisons on the height of the statistic using random field theory (pFWE < .05; FWE = family-wise error) within small volumes for prior ROIs. For the main effect of repetition suppression, the functional ROIs were defined by the thresholded statistical map for repetition suppression from the independent data of our previous study on spoken word priming (Gagnepain, Chetelat, et al., 2008). Given that the hippocampus has been implicated in recollection (see Introduction) whereas the parahippocampal gyrus (PhG), particularly the perirhinal cortex, has been implicated in familiarity (Eichenbaum, Yonelinas, & Ranganath, 2007; Henson, Cansino, Herron, Robb, & Rugg, 2003), we used separate anatomical ROIs for the hippocampus and PhG as defined by the automated anatomical labeling (AAL) atlas (Tzourio-Mazoyer et al., 2002). In addition, we also separated these two different memory structures according to left and right hemispheres given the dominance of the left hemisphere in spoken word processing (resulting in four different search volumes). Given that we had a single a priori hypotheses as to the role of priming with respect to recollection but not familiarity (i.e., hippocampus; see Introduction) and that we expected this effect to be left lateralized, we did not correct pFWE values in the left hippocampus for the number of ROIs. However, because we had no a priori hypotheses as to the pattern of activity that should be expected in the three other ROIs, we corrected pFWE values in the left and right PhG as well as in the right hippocampus for the number of ROIs.
Effective Connectivity Analyses
For analysis of functional coupling or effective connectivity, we used DCM (Friston et al., 2003). DCM entails defining a network of a few ROIs and the forward and backward connections between them. The neuronal dynamics within this network are based on a set of simple differential equations relating the activity in each region (a) to the activity of other regions (intrinsic connections), (b) to experimentally defined extrinsic inputs (“driving input”) to one or more of the regions, and (c) most importantly, to experimentally defined modulations (“modulatory input”) of the connectivity between regions. These neural dynamics are then mapped to the fMRI time series using a sophisticated hemodynamic model of the BOLD response. The neural (and hemodynamic) parameters of this DCM are estimated using approximate variational Bayesian techniques.
On the basis of our working hypotheses and the results of the activation analysis, we created a three-region DCM that contained a left STG region (which showed a main effect of repetition suppression), the left hippocampus, and the left PhG, both of which showed an interaction between repetition priming and subsequent recognition. The three regions were fully interconnected with both forward and backward connections. The driving input entered the left STG and was distinguished according to primed words, unprimed words, and pseudowords. The main interest was in the modulatory inputs, which applied to the forward connections from STG to hippocampus and from STG to PhG. These modulatory inputs were distinguished according to R, K, and M responses. Note that backward connections were not of interest because there were no interactions to explain in the STG and because we were interested into how repetition suppression modulates subsequent memory effect in the MTL and not the reverse. Nonetheless, when we did allow modulation of backward connections by R, K, and M, nothing was reliable (values close to 0 in all participants). The time series for each region and each participant was the first eigenvariate extracted from a sphere of 4-mm radius (volume of interest [VOI]), centered on the maximum of the above group activation analysis (group method). These time series were adjusted for the effects of no interest. Note that we did not include right hemisphere regions in our DCM because we did not find reliable repetition suppression in the right hemisphere, and the simple effects of recollection and priming did not survive correction in the right hippocampus (see below).
However, the maxima identified in the group method might not necessarily map onto the hippocampal and PhG ROIs in each individual. Furthermore, the closeness of these group maxima might lead to correlated time courses in the VOIs centered in these regions, given the spatial smoothness of the functional data. Therefore, to confirm the reliability of our DCM results, we also adopted a second method, individual method, for defining the VOIs using a combination of anatomical and functional criteria (as recommended by Stephan, Marshall, Penny, Friston, & Fink, 2007). In this method, the maxima for each participant (defined by the same contrast than the group analysis) (a) had to be located within the same individual anatomical structure (as defined by their normalized structural MRI; for individual coordinates, see Supplementary Table 4) as the group maximum (i.e., STG, PhG, or hippocampus), (b) had to be located within 2.5 times the FWHM of the smoothing kernel of the group maxima, and (c) had to survive p < .08 uncorrected (a more severe individual p threshold would have drastically reduced the number of participants). According to these criteria, 17 individual DCMs were performed.
The DCM.B matrix for the six coupling parameters of interest from each participant were entered into separate group level analyses for the group and individual methods to assess whether they significantly differed from zero using one-sample t tests, Bonferroni-correcting p values according to the number of coupling parameters. In addition, we tested for differences between the coupling parameters for each of the two forward connections, that is, from the left STG to the two left MTL regions, as a function of R/K/M responses.
Accuracy of the phoneme detection task during the first phase was close to ceiling. Performance of the lexical decision task during the second phase showed clear priming effects from the first phase in terms of both accuracy and RT: with a significantly higher proportion of correct responses, t(21) = 2.92, p < .01, in the primed condition (M = 0.78; SD = 0.09) than that in the unprimed condition (M = 0.71; SD = 0.09) and a significantly shorter RT for correct responses, t(21) = 4.21, p < .001, in the primed condition (M = 1507; SD = 83.9) than that in the unprimed condition (M = 1549; SD = 63.9).
Performance in the final recognition phase was measured by “Pr,” the difference in the proportion of “hits” (R, K, or G) given a correct lexical decision in the second phase and the proportion of “false alarms” (FA) on new words (R, K, or G). Pr was greater than zero for both R judgments, t(21) = 11.55, p < .001, and K judgments, t(21) = 5.8, p < .001, confirming that both R and K judgments were above the chance level but lower than 0 for the G responses, t(21) = −3.56, p < .01. This shows that participants mainly provided a G response on new words but not on old words presumably because they were confident of their R and K responses on old words. Paired t tests were performed to assess the effect of priming on subsequent proportions of R, K, and G responses (means shown in Figure 1B). They showed that, although priming increased the incidence of R hits, t(21) = 6.57, p < .001, it had no effect on K hits, t(21) = 0.88. Note that this difference in priming across R and K responses was reinforced by a significant two-way interaction, F(1, 21) = 6.26, p < .05. In addition, these tests showed a marginal trend for priming to produce fewer G responses, t(21) = −1.88, p = .074.
However, when recollection and familiarity (F) were estimated under independence assumptions, with F = K / (1 − R) (see Yonelinas, 2002), priming also increased the incidence of F responses, t(21) = 4.25, p < .001. Therefore, we calculated individual cumulative proportions of hits and FA for three points—R (R), R plus K (RK), and R plus K plus G (RKG)—and transformed them into Z values to produce Z-transformed receiving operator characteristics (Z-ROCs) for primed and unprimed conditions (see Supplementary Figure 1). Sensitivity measures (d′) for each point of the individual Z-ROCs were calculated (Z Hits − Z FAs). The d′ measure corresponds to the perpendicular distance between each point and the y = x chance line and can therefore be used to estimate the linearity and slope of the Z-ROCs. Although no significant differences were found between R and RK points for primed condition, t(21) = 0.04, or unprimed condition, t(21) = 0.97, a significant decrease was observed from RK to RKG points in the primed condition, t(21) = 3.9, p < .01, which was also close to significance in the unprimed condition, t(21) = 2.05, p < .06. These results indicate that (a) the linearity assumption made by both equal and unequal variance signal detection models (see Parks & Yonelinas, 2007; Wixted, 2007) does not hold for the present data and (b) priming does not simply increase memory strength as the most successful unequal variance signal detection model would predict, in that priming does not produce Z-ROCs with a slope < 1 between R and RK points (Squire, Wixted, & Clark, 2007; Wixted, 2007). These results imply that R/K judgments did not rely on a single memory strength but rather on at least two distinct sources of information, meaning than the effects of priming on both recollection (R) and familiarity (F; when scored under independence assumptions) are likely to have different causes.
Finally, lexical decision RTs in the second phase were analyzed according to a 2 (primed vs. unprimed) × 3 (subsequent R hit vs. K hit vs. Miss) ANOVA. Results showed a significant main effect of subsequent recognition, F(1, 21) = 5.3, p < .01 (and main effect of priming, as summarized earlier), but no interaction between repetition priming and subsequent recognition, F(1, 21) < 1 (see Figure 1B). The main effect of recognition reflected slower RTs for subsequent M responses relative to subsequent K hits, t(21) = −2.28, p < .05, and relative to subsequent R hits, t(21) = −2.99, p < .01, but importantly, the RTs of subsequent K and R hits did not differ significantly, t(21) = −0.9, p = .37. Paired t tests showed significant priming for all types of subsequent recognition: R, t(21) = −3.64, p < .01; K, t(21) = −2.38, p < .05; and M, t(21) = −2.47, p < .05.
In summary, prior presentation in the first phase (a) facilitated response times for all primed words in the second phase, regardless of how they were subsequently remembered in the third phase; (b) increased the probability that words in the third phase would be recollected as having been presented during the second phase; and (c) increased the probability that words would be judged as familiar, at least if R/K judgments are scored under independence assumptions, but these increases in recollection and possibly familiarity are unlikely to be based on the same increase in a single memory strength, given the results of the Z-ROC analyses.
We first explored the main effect of repetition priming. When thresholding the SPM for this contrast at p < .001, uncorrected, no voxels showed increased responses to primed relative to unprimed words (repetition enhancement). Several clusters showed reduced responses to primed words (repetition suppression, see Supplementary Table 1), however, including a cluster of nine voxels in the inferior part of left middle STG (x = −51, y = −27, z = +3, Tmax = 4.15; Figure 2A). Given our prior findings of repetition suppression in various areas of the bilateral STG (Gagnepain, Chetelat, et al., 2008), we defined a search volume on the basis of this previous thresholded repetition suppression map (i.e., bilateral STG ROI). The current left STG maximum survived correction for multiple comparisons within this restricted functional search volume (no maximum survived correction in the right hemisphere). The results of the other main effect—that of subsequent recognition—are reported in detail in Supplementary Table 2.
We then explored how the priming effect interacted with the type of subsequent memory. Because the pattern of interaction is orthogonal to the main effect of priming, it is possible to assess these interaction effects directly by extracting the activity in the above left STG maximum that showed repetition suppression. This maximum showed no evidence that its repetition suppression varied as a function of subsequent R versus K responses, F(1, 21) = 0.75, p = .48, R versus M responses, F(1, 21) = 0.98, p = .34, or K versus M responses, F(1, 21) = 1.07, p = .31. Furthermore, at the whole-brain level, none of the six possible pairwise directional interaction contrasts showed reliable effects in the STG (see Supplementary Table 3). However, there was evidence in the medial-temporal lobes (MTL) that the encoding-related activation associated with subsequent recollection (R > K) was greater for unprimed than primed trials (Figure 2A; a display of these MTL clusters on brain slices can be found in Supplementary Figure 2). Namely, there was a cluster of 16 voxels within the left MTL that included two submaxima, one in the left hippocampus (x = −27, y = −18, z = −12, Tmax = 3.8) and another in the left PhG (x = −24, y = −21, z = −24, Tmax = 4.6). The second cluster of seven voxels had a maximum in the right hippocampus (x = +27, y = −18, z = −9, Tmax = 5.2). Given our a priori hypothesis, we defined a search volume for the left hippocampus as defined by the AAL atlas (see Methods section). Although we did not have a priori hypotheses for the left and right PhG or the right hippocampus, we also included them as search volumes but corrected resulting pFWE values for the number of ROIs (effects has to survive pFWE = .05/3 = .0167). The three MTL maxima described above survived correction for multiple comparisons within their corresponding search volumes (note, however, that the maximum in the right hippocampus failed to survive correction for the number of ROIs, with pFWE = .02). A similar pattern of interaction, with greater encoding-related activity associated with subsequent recollection relative to items later missed (R > M) for unprimed than primed trials, survived small-volume correction in the left hippocampus (x = −21, y = −12, z = −12, Tmax = 3.95).
The patterns of activity in these MTL maxima are shown in Figure 2B (note that the absolute value of the BOLD impulse response estimates cannot be estimated efficiently in the present design; i.e., the “zero” is not important here but rather the relative differences between conditions). To further characterize in the interactions in these MTL regions, we tested the simple effects of priming (primed vs. unprimed) on subsequent R and subsequent K responses as well as the simple effect of subsequent memory (R vs. K vs. M) on primed and unprimed items. However, because these simple effect contrasts are not orthogonal to the interaction contrast, they cannot be performed on the response estimates extracted from the above MTL maxima without biasing the associated p value (Kriegeskorte, Simmons, Bellgowan, & Baker, 2009). To avoid such bias, the maxima of the SPMs for the simple effect contrasts must survive the same statistical threshold used for the interaction (i.e., pFWE < .05 within the left hippocampus and pFWE < .0167 within the three other ROIs). Here, we found that both the simple effects of repetition suppression for subsequent R responses (Rprimed < Runprimed) and of subsequent recognition for unprimed words (Runprimed > Munprimed) survived correction for the left hippocampal ROI. More precisely, the Rprimed < Runprimed simple effect was associated with a cluster of six voxels (x = −27, y = −18, z = −15, Tmax = 4.00), whereas the Runprimed > Munprimed was associated with two clusters of nine voxels each (x = −27, y = −24, z = −12, Tmax = 4.05; x = −18, y = −12, z = −12, Tmax = 3.86). Note that these three clusters largely overlapped with the clusters that showed the significant interaction between priming and subsequent recognition, including the maxima of the latter. We also note that the simple effect of subsequent recollection for unprimed items (Runprimed > Kunprimed) was close to surviving correction in three different maxima within the left hippocampal ROI (x = −18, y = −9, z = −12, Tmax = 3.49, pFWE = .08, p = .0011; x = −27, y = −21, z = −21, Tmax = 3.43, pFWE = .09, p = .0012; and x = −27, y = −18, z = −15, Tmax = 3.41, pFWE = .09, p = .0013). None of the simple effects survived correction in the three other ROIs.
Effective Connectivity Analysis
To investigate further the interaction between repetition suppression and memory encoding, we applied DCM to a fully connected network of the STG, hippocampal, and parahippocampal left hemisphere regions identified above (see Methods). Results of the group method are presented in this section, whereas the (similar) outcomes of the individual method (see Methods section) are presented in Supplementary Figure 3. Note than one participant in the group method analysis was excluded because of outlying DCM coupling parameters (see Supplementary material). We assumed that the driving input, which was a primed word, an unprimed word, or a pseudoword, arrived in the left STG. The MAP estimates for these driving inputs (DCM.C matrix), as expected from the activation pattern in STG, were reliably smaller for primed than unprimed words, t(21) = −4.2, p < .001 (see Supplementary Table 5). However, of particular interest was whether the parameters for the two forward connections, that is, from the left STG to the two left MTL regions, were modulated by R, K, or M responses (Figure 2C). We performed one-sample t tests to assess whether these six coupling parameters of interest significantly differed from zero and corrected p values for this number of comparisons. Results showed a significant increase in the positive coupling between the left STG and left hippocampus for R responses, t(20) = 3.85, pBonferroni-corrected. < .01, but not for K responses, t(20) = 1.9, pBonferroni-corrected = .41, or M responses, t(20) = −1.3, pBonferroni-corrected = .99. Note that this increase in modulatory extrinsic coupling of 0.031 for R responses (DCM.B matrix) was observed on top of a small positive intrinsic coupling (DCM.A matrix) between left STG and hippocampus (Supplementary Table 6), showing that subsequent R responses are associated with a reliable increase in STG–hippocampal effective connectivity. Furthermore, the modulatory coupling was reliably greater for R than for K, t(20) = 2.7, p < .05, and M, t(20) = 3.6, p < .01, responses. None of the other forward modulatory coupling parameters, for example, from STG to PhG, were significant (ts < 1.3). In sum, there was evidence of an increase in effective connectivity from left STG to left hippocampus only for words that were later recollected. Note that this main outcome was replicated using the Individual method (see Supplementary Figure 3).
As with our prior fMRI study (Gagnepain, Chetelat, et al., 2008), repetition priming of auditory words presented against a background sound was associated with reduced activity (repetition suppression) in the left STG. For the first time, however, we also found that this priming significantly modulated the increased MTL activity that is traditionally associated with episodic encoding processes that lead to subsequent recollection (relative to subsequent familiarity; e.g., Ranganath et al., 2004; Davachi et al., 2003). This interaction pattern was found across three MTL regions: left and right hippocampus, plus left PhG (Figure 2B). For unprimed items, we found the traditional increases associated with successful encoding (for items later recollected relative to those later missed plus a trend for items later recollected relative to those later judged familiar). However, this was not found for primed items; rather, we found that priming produced decreased MTL responses but only for items later recollected.
The presence of repetition suppression in the left hippocampus specific to items later recollected is a surprising result, given that priming increased the incidence of subsequent recollection. Although priming also increased the incidence of familiarity, at least when K responses were scored under independence assumptions (Yonelinas, 2002), this increase seems unlikely to be related to hippocampal activity, given that there were no reliable differences between primed and unprimed K responses in this region. We therefore hypothesized that the determinant of episodic encoding, relative to familiarity-based encoding, is not necessarily related to an increase in hippocampal local activity, as suggested by previous studies (e.g., Ranganath et al., 2004; Davachi et al., 2003; for a review Eichenbaum et al., 2007; Squire et al., 2007; Davachi, 2006), but rather a change in the functional coupling between the hippocampus and the neocortical regions associated with processing the stimuli that are later recollected. Using DCM, we confirmed this hypothesis in that subsequent recollection was associated with an increase in the effective connectivity from the STG region to the hippocampal region. In other words, repetition suppression in the left STG was transmitted to the left hippocampus specifically for stimuli that were subsequently recollected, explaining the decreased activity for such stimuli in the left hippocampus when they had been primed. Taking together, our findings suggest that the nature of subsequent memory (i.e., recollection vs. familiarity) does not depend solely on local hippocampal activity, but the increase of cortico-hippocampal connectivity may also be important or even necessary.
It is worth noting that a similar interaction was found in a region of left PhG. This is contrary to previous claims for a functional dissociation between hippocampus and surrounding PhG (particularly perirhinal cortex), with the latter being implicated in processes leading to subsequent familiarity rather than subsequent recollection (Diana, Yonelinas, & Ranganath, 2006; Ranganath et al., 2004; Davachi et al., 2003). Nonetheless, the peak coordinate in the left PhG was close to the intersection between the perirhinal and the parahippocampal cortices as well as to the hippocampus itself and thus could not be attributed confidently to one or the other structure. Given that we did not find reliable simple effects in the PhG nor changes in the functional coupling between STG and PhG, we focus on the claim that it is the coupling between hippocampus and neocortex that is important for binding stimuli with their context to enable subsequent recollection. Finally, note also that the present analyses focused on the left hemisphere, given that the effects were generally stronger on the left and that the left hemisphere normally shows a dominance over the right hemisphere for linguistic processing. Nonetheless, the interaction between priming and subsequent recollection was also found within the right hippocampus.
It is possible that the hippocampal activation pattern associated with subsequent memory might relate not to encoding processes during the lexical decision phase (Phase 2 in our paradigm) but rather to changes occurring during the initial presentation of prime items in the phoneme detection phase (our Phase 1). In other words, subsequent R hits (in our Phase 3) could be better encoded than subsequent K hits in Phase 1 rather than in Phase 2. However, if this were true, we should have observed a significant difference in the amount of behavioral priming (in Phase 2) as a function of subsequent recognition (in Phase 3) and a similar difference in the amount of repetition suppression in the STG. There was no evidence for either. Furthermore, because we instructed participants to only respond with an R judgment when they recollected the word and the sound with which it had been paired in the scanner (i.e., from the presentation of the word in the second rather than first phase), R judgments are likely to reflect encoding processes occurring in Phase 2 rather than Phase 1. Finally, if the primed words in Phase 2 that were subsequently recollected were those that were already hippocampus dependent from Phase 1, then these items should have been remembered explicitly during Phase 2; an occurrence that our debriefing questionnaire suggests was not frequent enough to explain our results (see Methods). In summary, although the incidence of both recollection (R response) and familiarity (K/ [1 − R]) in Phase 3 may increase after priming, the effects are likely to have different causes. K responses are likely to reflect the strength of a familiarity signal, irrespective of whether the source of that signal is Phase 1 or 2. This signal is likely to be stronger for primed items but does not seem to relate to hippocampal activity. R responses, on the other hand, do appear to involve hippocampal encoding-related activity that occurs during the lexical decision task (Phase 2), which is specifically sensitive to the effects of repetition priming.
There is some ambiguity surrounding the mapping of R and K responses to the theoretical constructs of recollection and familiarity (e.g., Wais, Mickes, & Wixted, 2008; Squire et al., 2007; Wixted, 2007). Indeed, it has been argued that R/K/G responses simply correspond to different response criterion along a single continuum of memory strength and so can be modeled via signal detection theory (e.g., Wixted, 2007; Dunn, 2004). In other words, R and K responses would be the product of strong and weaker memory traces, respectively, and might not be the expression of two distinct memory processes or systems underlying recollection and familiarity. However, three main findings suggest that such a one-dimensional model might not adequately explain the present data. First, the three points on Z-ROCS corresponding to R, RK, and RKG criteria did not fall on a straight line, differing reliably in their d′ values. This indicates that distribution of familiarity strength of old/new items is not Gaussian, a core assumption of signal detection theory (see Parks & Yonelinas, 2007). Second, the most successful unequal variance signal detection model predicts that increasing the strength of a memory trace, via priming, should also increase variance of the strength distribution, decreasing Z-ROCs slopes (Squire et al., 2007; Wixted, 2007). However, there was no evidence that the Z-ROCs slopes for primed and unprimed conditions differed. Third, one-dimensional models cannot explain the interaction pattern observed in the hippocampus, predicting instead that the strongest memory trace condition (Rprimed) should give rise to the greatest response in the hippocampus (see Squire et al., 2007). Note that these results do not, of course, rule out the appropriateness of one-dimensional signal detection models for other memory data; neither do the present results indicate whether recollection is a signal detection or a threshold process (Parks & Yonelinas, 2007; Rotello, Macmillan, & Reeder, 2004). Nonetheless, the present analyses suggest that R and K judgments here map onto psychologically and neurally distinct memory processes, as originally suggested by Tulving (1985).
Prima facie, the increased recollection associated with repetition priming is contradictory to the findings reported by Wagner et al. (2000). This study manipulated the lag between initial and repeated presentations of words and found that although shorter lags were associated with greater repetition priming and greater repetition suppression in the left pFC, they were also associated with reduced subsequent recognition memory. Wagner et al. interpreted this in terms of reduced encoding variability in the short lag condition (Martin, 1968). However, Stark et al. (2008) argued that although lag affects priming and subsequent recognition in opposite directions—that is, longer lag reduces priming but increases subsequent recognition—these effects are independent and occur in parallel. Furthermore, the neural changes in the pFC (and behavioral priming) in the study of Wagner et al. are somewhat difficult to interpret because their use of the same task during initial and repeated presentations of stimuli means that their repetition effects may reflect stimulus–response learning rather than more efficient perceptual processing (e.g., Horner & Henson, 2008, 2009; Race, Shanker, & Wagner, 2009; Dobbins, Schnyer, Verfaellie, & Schacter, 2004). Here, we were careful to use different tasks in the first and second phases, which means that repetition suppression (in STG) and behavioral priming are more likely to reflect more efficient perceptual processing.
To our knowledge, only one previous study has performed a direct comparison of episodic encoding and priming like ours (rather than an indirect comparison via another variable like lag). This is the study by Turk-Browne, Yi, and Chun (2006), who presented visual scenes twice during an indoor/outdoor judgment task and then examined subsequent recognition memory. They found that greater repetition priming and repetition suppression, in particular in the PhG and fusiform gyrus, were associated with better subsequent recognition. Although Turk-Browne et al. also used the same task during initial and repeated presentations, the regions involved in this study are less likely to reflect stimulus–response learning (Race et al., 2009; Horner & Henson, 2008). Turk-Browne et al. interpreted their findings as reflecting the fact that both implicit memory retrieval and explicit memory encoding processes benefit from selective attention. In other words, greater attention to a stimulus can increase both subsequent priming and subsequent recognition. We did not observe a modulation of repetition priming and STG repetition suppression by subsequent memory in the present study, suggesting that participants allocated similar attentional resources to subsequently remembered and subsequently forgotten words in the lexical decision task. Furthermore, neither the study of Wagner et al. (2000) nor the study of Turk-Browne et al. attempted to separate recollection from familiarity, for example, by using remember/know judgments.
Another important difference between previous studies (Turk-Browne et al., 2006; Wagner et al., 2000) and ours relates to the absence versus presence of a noisy background context (here environmental sounds). Indeed, we propose that the interaction pattern in the hippocampal activity and in the hippocampal–STG connectivity reflects how well the sound context was bound to words. A previous fMRI study suggested that a noisy environment might disrupt allocation of attentional resources to an item (e.g., Stevens, Hasher, Chiew, & Grady, 2008). Here, we propose a complementary mechanism, according to which item properties might change attentional resources allocated to its environment. Specifically, we propose that one consequence of priming is to “free up” attentional resources from processing an item in a noisy context, allowing greater attention to, and hence encoding of, that context. In the present experiment, although participants' main task was to make a lexical judgment as quickly as possible on the words, our instructions also encouraged them to pay attention to the sound when they could. If more attentional resources are available for the sound context because of word priming, hippocampal binding processes should be less effortful, hence decreasing the magnitude of hippocampal activity but increasing subsequent recollection at the same time. Although we do not have objective measures of context recall in the present study and that other episodic details than the sound context might have driven R responses, the behavioral results (e.g., effect of priming on recollection) are consistent with our previous behavioral study (Gagnepain, Lebreton, et al., 2008), which did have such an objective measurement of recollection. Although further investigation will be needed to test this hypothesis, it is consistent with recent data showing that encoding processes in the hippocampus are indeed sensitive to attentional demands (e.g., Uncapher & Rugg, 2008; Kensinger, Clarke, & Corkin, 2003). Note also that the same attentional hypothesis may also apply to the internal context generated by participants themselves (e.g., thoughts, emotions), which often forms the basis of R judgments.
We showed that priming elicited reduced activity (repetition suppression) in the left STG. For the first time, however, we demonstrated that priming also modulated the normal increase in hippocampal activity associated with subsequent recollection. Further analyses of effective connectivity revealed an increase in functional coupling from the left STG to the hippocampus for words latter recollected, explaining the decreased activity for such stimuli in the left hippocampus when they had been primed. Our findings demonstrate that although hippocampal local changes may be important for successful episodic encoding, they do not necessarily need to be increases in activity. Rather, subsequent recollection might be better predicted by increased connectivity between the cortex and the hippocampus, whatever direction of local activity change in the hippocampus. Nonetheless, it is important to point out that these findings do not discount previous findings of increased hippocampal activity associated with successful encoding. First, we observed a similar pattern for our unprimed items, which are comparable to the stimuli used in previous studies. Second, successful episodic encoding generally involves a recruitment of various neocortical regions in addition to MTL, which might have been positively coupled with increased hippocampal activity if this had been examined (see, e.g., Ranganath, Heller, Cohen, Brozinsky, & Rissman, 2005; Grady, McIntosh, & Craik, 2003). Moreover, studies generally assess conditions under which increased subsequent recollection (and hippocampal activity) is likely to be accompanied by increased neocortical activity as well (rather than the decreased neocortical activity found here). For instance, elaborating semantic associations between items during encoding is likely to increase neocortical recruitment that may in return trigger increased hippocampal activity (Addis & McAndrews, 2006). In summary, our findings suggest that successful episodic encoding may not be supported solely by local changes in the hippocampus but also depends on the nature of cortical inputs reaching the hippocampus. Our findings also suggest a new critical function of priming, which is to “free up” attentional resources from the processing of item information to allow greater encoding of contextual information and hence better episodic memory.
P. Gagnepain was supported by a PhD fellowship from the Délégation Générale pour l'Armement (DGA), the Fondation pour la Recherche Médicale (FRM), and the UK Medical Research Council (WBSE U.1055.05.012.00001.01). The authors would like to thank G. Perchey and J. Dayan for data acquisition, B. Landeau for technical support, and two anonymous reviewers for their helpful comments.
Reprint requests should be sent to Francis Eustache, Inserm-EPHE-Université de Caen/Basse Normandie, Unité U923, CHU Côte de Nacre, F-14033, Caen Cedex, France, or via e-mail: email@example.com.