Emerging evidence suggests that inhibitory control (IC) plays a pivotal role in science and maths counterintuitive reasoning by suppressing incorrect intuitive concepts, allowing correct counterintuitive concepts to come to mind. Neuroimaging studies have shown greater activation in the ventrolateral and dorsolateral pFCs when adults and adolescents reason about counterintuitive concepts, which has been interpreted as reflecting IC recruitment. However, the extent to which neural systems underlying IC support science and maths reasoning remains unexplored in children. This developmental stage is of particular importance, as many crucial counterintuitive concepts are learned in formal education in middle childhood. To address this gap, fMRI data were collected while fifty-six 7- to 10-year-olds completed counterintuitive science and math problems, plus IC tasks of interference control (Animal Size Stroop) and response inhibition (go/no-go). Univariate analysis showed large regional overlap in activation between counterintuitive reasoning and interference control, with more limited activation observed in the response inhibition task. Multivariate similarity analysis, which explores fine-scale patterns of activation across voxels, revealed neural activation similarities between (i) science and maths counterintuitive reasoning and interference control tasks in frontal, parietal, and temporal regions, and (ii) maths reasoning and response inhibition tasks in the precuneus/superior parietal lobule. Extending previous research in adults and adolescents, this evidence is consistent with the proposal that IC, specifically interference control, supports children's science and maths counterintuitive reasoning, although further research will be needed to demonstrate the similarities observed do not reflect more general multidemand processes.

There is an increasing need to support individuals' science, technology, engineering and mathematics (STEM) skills, not only for personal growth, career opportunities (e.g., Waite & McDonald, 2019; Walker & Zhu, 2013), and economic and technological advancements (e.g., Hanushek & Woessmann, 2008) but also for tackling global challenges, such as climate change and public health crises (e.g., Hotez, 2021). However, learning STEM skills can be challenging, in part due to the prevalence of counterintuitive concepts within the domains of science and mathematics (Mareschal, 2016). Examples of intuitive but incorrect beliefs are that it is warmer in the summer because the Earth is closer to the Sun, that heavier objects fall faster even in a vacuum, and that −3 is less than −8. Therefore, to support the development of STEM skills in the population, a better understanding of maths and science counterintuitive reasoning is crucial.

Recent evidence has suggested that inhibitory control, defined as the suppression of prepotent responses or interfering thoughts (Diamond, 2013; Aron, 2007; Nigg, 2000), supports counterintuitive reasoning. Experimental research exploring scientific and mathematical conceptual change in undergraduates (Shtulman & Valcarcel, 2012) and adolescents (Babai, Eidelman, & Stavy, 2012) have interpreted longer RTs when participants reason about counterintuitive concepts (vs. naive intuitive concepts) as evidence for the involvement of inhibitory control (Mareschal, 2016). The theory is that both the naive intuitive concept and the new counterintuitive concept coexist simultaneously, and inhibitory control suppresses the naive concept, so the counterintuitive concept can come to mind.

Individual differences and intervention studies provide further convergent evidence. Individual differences in inhibitory control have been associated with counterintuitive reasoning in science and maths in adolescents (Brookman-Byrne, Mareschal, Tolmie, & Dumontheil, 2018), young adults (Coulanges et al., 2021), and children (Dumontheil et al., 2023; Wilkinson et al., 2020; Mason, Zaccoletti, Carretti, Scrimin, & Diakidoy, 2019; Vosniadou et al., 2018). Moreover, in a large-scale randomized controlled trial, an intervention increasing children's awareness of the prevalence of counterintuitive concepts and encouraging them to stop and think about their responses in maths and science showed that training children to apply their inhibitory control skills for science and maths reasoning led to improvements in science and maths standardized tests in 9- to 10-year-olds, but not 7- to 8-year-olds (Roy et al., 2019). In addition, in smaller samples, this intervention led to improvements in science and maths counterintuitive reasoning in 7- to 8-year-olds but not 9- to 10-year-olds (Dumontheil et al., 2023; Wilkinson et al., 2020). Although there were age differences, these findings indicate that training children to use their inhibitory control skills can lead to improvements in counterintuitive reasoning, thus supporting the theory that inhibitory control is involved in counterintuitive reasoning.

fMRI research has also provided some evidence consistent with an involvement of inhibitory control in counterintuitive reasoning. It has been found that when adults and adolescents reason about maths or science counterintuitive concepts, greater increases in BOLD signal occur in the dorsolateral pFC (DLPFC), ventrolateral pFC (VLPFC) and ACC (Meier, Wambacher, Vogel, & Grabner, 2022; Allaire-Duquette et al., 2021; Potvin, Malenfant-Robichaud, Cormier, & Masson, 2020; Allaire-Duquette, Bélanger, Grabner, Koschutnig, & Masson, 2019; Brault Foisy, Potvin, Riopel, & Masson, 2015; Masson, Potvin, Riopel, & Foisy, 2014; Stavy, Goel, Critchley, & Dolan, 2006), although the exact loci of increased BOLD signal are inconsistent across studies (see Dumontheil, Brookman-Byrne, Tolmie, & Mareschal, 2022, for a review). These brain regions have been implicated in inhibitory control (e.g., Aron, Robbins, & Poldrack, 2014; Aron, 2007; MacDonald, Cohen, Stenger, & Carter, 2000), along with parietal cortex (for meta-analyses, see Gavazzi et al., 2023; Isherwood, Keuken, Bazin, & Forstmann, 2021; Gavazzi, Giovannelli, Currò, Mascalchi, & Viggiano, 2020; Huang, Su, & Ma, 2020; Guo, Schmitz, Mur, Ferreira, & Anderson, 2018; Hung, Gaillard, Yarmak, & Arsalidou, 2018; Zhang, Geng, & Lee, 2017). In addition, adults with greater scientific and mathematical expertise, compared with novices, have shown increased activation in brain regions associated with inhibitory control when reasoning about counterintuitive problems in their area of expertise. For example, male physics undergraduates, compared with humanities undergraduates, showed greater activation in the right VLPFC and left DLPFC when reasoning about falling objects (Brault Foisy et al., 2015) and in the left VLPFC and left DLPFC when reasoning about electrical circuit counterintuitive problems (Masson et al., 2014). Similar neuroimaging findings were observed in mathematicians compared with nonmathematicians when reasoning about maths-related counterintuitive problems (Meier et al., 2022). Furthermore, chemistry professors have been shown to demonstrate greater activation in the left VLPFC, left DLPFC, pre-SMA, and anterior insula when reasoning about counterintuitive chemistry problems, compared with intuitive ones (Potvin et al., 2020). Taken together, this evidence is consistent with the proposal that, compared with novices, experts' better counterintuitive reasoning skills rely in part on the recruitment of inhibitory control processes.

However, it is unclear the extent to which these patterns of neural activation observed in the pFC specifically reflect the recruitment of inhibitory control processes. Instead, these activations may reflect increased top–down attention and processing of the stimuli (Corbetta & Shulman, 2002) or the recruitment of multidemand processes (Crittenden, Mitchell, & Duncan, 2016; Duncan, 2010). To complicate this further, there is currently no consensus on whether inhibitory control is a unitary or diverse construct; however, it is broadly accepted that inhibitory control can be subdivided into interference control and response inhibition (Diamond, 2013). Interference control refers to the ability to suppress distracting/irrelevant information, memories, or semantic knowledge, whereas response inhibition refers to the ability to suppress a motor response (Diamond, 2013). To better understand whether counterintuitive reasoning in science and maths and inhibitory control tasks recruit similar patterns of neural activation, Dumontheil and colleagues (2022) had adolescents undergo fMRI while answering counterintuitive and noncounterintuitive (control) science and maths questions, alongside three inhibitory control localizer tasks, a simple go/no-go task, a complex go/no-go task (with a 1-back working memory load), and a numerical Stroop task. These tasks measure response inhibition, response inhibition with a working memory load, and interference control, respectively.

There are two complementary ways of analyzing fMRI data to understand whether similar regions of the brain show increased activated across tasks. First, univariate analysis can show mean regional overlap of increased activation. However, to better understand the similarity of patterns of activations across tasks, multivariate similarity analysis can be applied. This approach correlates activation across voxels across tasks within an individual, to quantify the similarity of neural activation across tasks within a brain region. In the study by Dumontheil and colleagues (2022), counterintuitive trials showed increased BOLD signal changes in the DLPFC, VLPFC, and parietal regions compared with control trials, consistent with previous work. Univariate analysis showed that these changes partially overlapped, with increased BOLD signal changes in the complex response inhibition task and interference control task. However, multivariate similarity analysis showed neural similarity between counterintuitive reasoning tasks and both complex go/no-go and numerical Stroop tasks in the right supramarginal gyrus only (Dumontheil et al., 2022). These results suggest that in adolescence, simple response inhibition may be less relevant for counterintuitive reasoning than when it is combined with a working memory load and that the pFC activation observed in previous studies may not necessarily reflect inhibitory control neural processes specifically but instead reflect broader cognitive control neural processes involved in a range of complex tasks (cf. multidemand system; Crittenden et al., 2016; Duncan, 2010). The findings further underscore the importance of using inhibitory control localizers for exploring neural similarities between inhibitory control and counterintuitive reasoning.

The relationship between counterintuitive reasoning and inhibitory control is of particular importance in childhood, as this developmental stage is where many counterintuitive concepts in science and maths first arise, as part of the primary school curriculum. Furthermore, there is an extensive body of literature showing behavioral differences in inhibitory control skills (Constantinidis & Luna, 2019; Lewis, Reeve, Kelly, & Johnson, 2017; Cragg, 2016; Klenberg, Närhi, Korkman, & Hokkanen, 2015; van der Laar, van den Wildenberg, van Boxtel, & van der Molen, 2014; Ordaz, Foran, Velanova, & Luna, 2013; Bedard et al., 2002), as well as differences in the neural correlates of inhibitory control, across development (Ordaz et al., 2013; Durston et al., 2002, 2006; Bunge, Dudukovic, Thomason, Vaidya, & Gabrieli, 2002; Tamm, Menon, & Reiss, 2002; see Kang, Hernández, Rahman, Voigt, & Malvaso, 2022, for a review). Studies have also shown increased positive associations between pFC activation and both interference control (Marsh et al., 2006; Adleman et al., 2002, referred to as self-regulation; Schroeter, Zysset, Wahl, & von Cramon, 2004) and response inhibition performance (Tamm et al., 2002) with age. Finally, there is evidence to suggest that at younger ages, executive function skills and their neural correlates may be less differentiated, but become more differentiated with age (e.g., Hartung, Engelhardt, Thibodeaux, Harden, & Tucker-Drob, 2020; Karr et al., 2018; Lee, Bull, & Ho, 2013, see Fiske & Holmboe, 2019, for a review). Collectively, this evidence suggests that the patterns of neural similarity between inhibitory control and counterintuitive reasoning may differ over the course of development.

Therefore, the present study aimed to investigate neural similarities when primary-school-aged children engaged in counterintuitive science and maths reasoning and two inhibitory control tasks, using univariate and multivariate approaches. It was hypothesized that overlapping neural activation would be observed in DLPFC, VLPFC, and parietal cortex (as per Dumontheil et al., 2022; Potvin et al., 2020) and that neural similarity would be present between counterintuitive reasoning tasks and both interference control and response inhibition tasks.

Participants

Fifty-six children aged between 7 and 10 years old (26 female, 30 male, Mage = 9.02 years, SD = 1.01 years) were recruited from Year 3 (n = 25) and Year 5 (n = 31) in 13 schools in England. The children were part of the UnLocke project (www.unlocke.org), which involved the development and assessment of an intervention to encourage children to “Stop & Think” before answering counterintuitive reasoning tasks (see Roy et al., 2019, for details of the randomized controlled trial and Dumontheil et al., 2023, for additional sample information). Twenty-three participants were recruited in October–November 2017 and 33 in March 2018. The original aim, within budgeting constraints, was to recruit a sample of 80 children to assess the impact of the Stop & Think intervention on neural activation. However, recruitment was constrained by the timings of the interventions and our target was not achieved. The final sample is, however, greater than the adolescent sample successfully used in a previous study (Dumontheil et al., 2022; n = 34) and was therefore appropriate to carry out univariate and multivariate fMRI analyses to compare the neural correlates of science and maths counterintuitive reasoning and inhibitory control. A power analysis carried out with G*Power 3.1.9.4 (Faul, Erdfelder, Lang, & Buchner, 2007) suggested that across 50 voxels (our minimum sample size for the definition of ROIs used in the multivariate analyses), a correlation of .334, corresponding to a Fisher's z score of 0.166 (our measure of similarity between tasks), can be detected with 80% power (α = .05). In turn, a sample of 40 participants can give 80% power to find a mean Fisher's z of 0.169 as being significantly greater than zero in a one-sample t test. Our previous study in adolescents reported similarity between maths and science counterintuitive reasoning with Fisher's z scores ranging between 0.22 and 0.28 and similarity between maths and science counterintuitive reasoning and inhibitory control tasks with scores ranging between 0.16 and 0.19. The current sample size is therefore sufficient to detect similar or larger similarity between tasks.

The percentage of children receiving free school meals (an indirect indicator of low household income) ranged between 0.6% and 25.0% across schools (M = 10.1%, SD = 8.2; note 13.6% of primary school pupils received free school meals in England in January 2018; Department for Education, 2018). Participants did not have any known developmental or neurological disorders. Written informed parental consent and verbal assent from each child was collected. Travel expenses were reimbursed, and each child received a small gift plus a digital image of their brain scans for participation. The study was approved by the University College London ethics committee.

Participant and Run Exclusion

Functional neuroimaging data were collected pre- and postintervention, approximately 12 weeks apart. Neural correlates of counterintuitive reasoning and inhibitory control did not prove sensitive enough to detect the effects of intervention with the size of this subsample, even though behavioral intervention effects were observed in much larger cognitive (n = 372; Dumontheil et al., 2023) and randomized controlled trial (n = 5437; Roy et al., 2019) samples. Therefore, for current purposes, data from both time points (T1: pre-intervention, T2: postintervention) were included to increase the power of the study.

Exclusions of participants based on head motion, performance, or unfinished task completion were implemented. Behavioral exclusions were applied if accuracy, RT, or no-response rates were outside 3.29 SDs of the mean. Task runs were excluded for excessive head motion (see the MRI Data Acquisition and Preprocessing section below). Reasons for not completing the tasks included being nervous/uncomfortable in the scanner environment, nausea, and wanting a break. At T2, 17 children did not return for the scanning session and one child attended but was unable to complete any tasks in the scanner due to discomfort when in the scanner. Note that there were no significant differences between those who returned at T2 compared to those who did not in age, sex, nor counterintuitive reasoning and inhibitory control task performance measures (all ps > .071). The final school year group breakdown, sample sizes for each task, and reasons for exclusion can be seen in Table 1.

Table 1.

Year 3 and 5 Sample Sizes for Each Task and Timepoint

TimepointYearReason for ExclusionScience nMaths nAnimal Size Stroop nPokémon Go/No-Go n
T1 Year 3   22 19 17 17 
Year 5   30 28 23 23 
Total   52 47 40 40 
  Not completed 
  Movement 10 10 
  Behavior 1 + (1)a (2)a 
T2 Year 3   12 10 
Year 5   23 23 20 22 
Total   35 33 28 30 
  Did not return at T2 17 17 17 17 
  Not completed 
  Movement 
  Behavior (2)a 2 + (1)a 
TimepointYearReason for ExclusionScience nMaths nAnimal Size Stroop nPokémon Go/No-Go n
T1 Year 3   22 19 17 17 
Year 5   30 28 23 23 
Total   52 47 40 40 
  Not completed 
  Movement 10 10 
  Behavior 1 + (1)a (2)a 
T2 Year 3   12 10 
Year 5   23 23 20 22 
Total   35 33 28 30 
  Did not return at T2 17 17 17 17 
  Not completed 
  Movement 
  Behavior (2)a 2 + (1)a 
a

Participant excluded both because of poor performance and excessive in-scanner movement.

Procedure

Participants completed the testing sessions (T1 and T2) at the Birkbeck-UCL Centre for Neuroimaging. Each fMRI session comprised four runs of science and maths counterintuitive reasoning tasks, and one run each of the Animal Size Stroop task and the Pokémon go/no-go task. Total scanning duration was 40–50 min.

MRI Tasks

Science and Maths

The counterintuitive reasoning task consisted of 40 (20 science, 20 maths) age-appropriate, teacher-approved science and maths counterintuitive questions identified during the preparation of the Stop & Think intervention based on the Key Stage 2 National Curriculum for England (Department for Education, 2013a, 2013b) and books highlighting concepts primary school children find counterintuitive (e.g., Ryan & Williams, 2007; Stavy & Tirosh, 1999). The test items largely differed across year groups to reflect age-appropriate syllabus knowledge. However, 25% of questions overlapped between the groups. On each trial, a statement and image appeared on the screen (Figure 1) and participants were asked to answer whether the statement was true or false by pressing a button using the index or middle finger of their right hand.

Figure 1.

Example science and maths counterintuitive reasoning stimuli (A) and examples of the trials structure including the interleaved baseline animal silhouette task (B). (A) Examples of the science and maths counterintuitive reasoning trials. Participants were presented with a statement designed to induce reasoning about counterintuitive problems in science and maths and asked to respond true or false via a button press. (B) Science and math trials were presented on screen for a maximum of 17 sec or until a response was given. In between counterintuitive reasoning trials, participants indicated whether silhouettes of animals faced left or right.

Figure 1.

Example science and maths counterintuitive reasoning stimuli (A) and examples of the trials structure including the interleaved baseline animal silhouette task (B). (A) Examples of the science and maths counterintuitive reasoning trials. Participants were presented with a statement designed to induce reasoning about counterintuitive problems in science and maths and asked to respond true or false via a button press. (B) Science and math trials were presented on screen for a maximum of 17 sec or until a response was given. In between counterintuitive reasoning trials, participants indicated whether silhouettes of animals faced left or right.

Close modal

Two runs of maths and two of science were completed in an alternating pattern counterbalanced across participants, with 10 counterintuitive reasoning trials per run. A response could be made after 1.5 sec of the stimulus presentation. After 14 sec, the response cues were outlined in a red border to prompt a response. A baseline task of black animal silhouettes facing either to the left or right (Figure 1B) was presented between counterintuitive trials for up to 20 sec or until the participants made a response. Participants were asked to press a button corresponding to the direction the animal was facing; the silhouette remained on the screen until participants made a response, up to a maximum of 1 sec, and the interstimuli interval was randomized between 0 and 200 msec, with a mean of 100 msec. This baseline task encouraged participants to disengage from the previous counterintuitive question and prevented boredom (see Dumontheil et al., 2022, for a similar approach). Although we used matched noncounterintuitive trials in a previous study in adolescents (Dumontheil et al., 2022), here, a baseline task was used for pragmatic reasons relating to the length of time that 7- to 10-year-old children can deliver high-quality data within the scanner and to limit the complexity of the design to test for intervention effects. There were four fixation blocks, presented after the second, fifth, seventh, and last trials, which lasted 10, 15, 10, and 5 sec, respectively.

Inhibitory Control

Interference control.

The Animal Size Stroop task is a modified age-appropriate version of the Stroop task (Morris, Farran, & Dumontheil, 2019; Merkley, Thompson, & Scerif, 2016). In this task, two well-known animals of different sizes were shown on each trial and participants were asked to press the button corresponding to the side of the “larger animal in real life.” On congruent trials, the biologically larger animal was shown as a larger image than the biologically smaller animal (Figure 2A). On incongruent trials, the biologically larger animal was shown as a smaller picture than the biologically smaller animal (Figure 2A). The Animal Size Stroop followed a block design, with alternating congruent and mixed (50% congruent, 50% incongruent) blocks of 12 trials. Congruent and incongruent trials in the mixed blocks were presented in a pseudorandom order, with no more than three congruent or incongruent trials in a row. The mixed blocks were expected to engage greater interference control than the congruent-only blocks. There were five blocks of each type, resulting in 60 congruent trials in congruent blocks, and 30 congruent and 30 incongruent trials in mixed blocks. Task blocks lasted 22.8 sec and were separated by instruction screens presented for 2 sec that stated, “Big animals are always big” (congruent block) or “Big animals are sometimes small” (mixed block). On each trial the stimuli remained on the screen for 1.7 sec or until the participant responded, a fixation cross was then shown until 1.9 sec after the start of the trial. There were four fixation blocks, occurring after the second, fifth, seventh, and last blocks and lasting 10, 15, 10, and 5 sec, respectively. Accuracy and RT were used as behavioral performance measures.

Figure 2.

Examples of trials from the Animal Size Stroop (A) and the Pokémon go/no-go tasks (B).

Figure 2.

Examples of trials from the Animal Size Stroop (A) and the Pokémon go/no-go tasks (B).

Close modal

The Animal Size Stroop task is a variation of the numerical Stroop task used by Dumontheil and colleagues (2022) and was used to maximize the age appropriateness of the materials. Both variants have the key design characteristic that the stimulus has two properties potentially driving incompatible responses (the size of the stimulus vs. the size of the animal in real life in the Animal Size Stroop / the digits presented vs. the quantity of digits presented in the numerical Stroop), and inhibitory control is required to generate responses according to the less-salient dimension.

Response inhibition.

A modified age-appropriate version of the go/no-go task was used to measured response inhibition. If a blue or green Pokémon was presented on screen, the participants were asked to press the button corresponding to the side the Pokémon was presented on. If an orange Pokémon was presented, participants were asked to withhold a response (Figure 2B). This task comprised six go and six mixed (50% go, 50% no-go) alternating blocks. Each block consisted of 16 trials, resulting in 96 go trials in go blocks, and 48 go trials and 48 no-go trials in mixed blocks, presented in a pseudorandom order, with no more than three go or no-go trials in a row. The mixed blocks were expected to engage greater response inhibition than the go-only block. Blocks were separated by 2-sec instruction screens showing a blue and a green square with ticks or a green square with a tick and an orange square with a cross. Stimuli were presented for 400 msec, and participants had a maximum of 1 sec to respond. Trials were separated by a variable interstimulus interval of 0–400 msec, leading to average intertrial interval of 1.2 sec. There were four fixation blocks, occurring after the second, seventh, eleventh, and final task blocks lasting 10, 15, 10, and 5 sec, respectively. In addition to accuracy and RT, the difference between the z scores of the hit rate and false alarms was calculated in mixed blocks to obtain a d′ score combining performance on go and no-go trials (Stanislaw & Todorov, 1999; Hautus, 1995).

MRI Data Acquisition and Preprocessing

A 1.5 Tesla Siemens Avanto MRI scanner with a 30-channel head coil was used to collect structural and functional images. BOLD signal data were acquired using the Centre for Magnetic Resonance Research multiband EPI sequence (Xu et al., 2013) 4× acceleration, leak block on (Cauley, Polimeni, Bhat, Wald, & Setsompop, 2014) with the following parameters: repetition time = 1000 msec, echo time = 45 msec, resolution = 3 × 3 × 3 mm3, 44 slices covering most of the cerebrum. Structural anatomical scans were collected using a T1-weigthed magnetization prepared rapid gradient echo sequence with 2× generalized auto calibrating partial parallel acquisition acceleration. The parameters were as follows: 176 slices, repetition time = 2730 msec, echo time = 3.57 msec, resolution = 1 × 1 × 1 mm3, duration of scan = 5.5 min. The structural scan was carried out between the counterintuitive reasoning task and the inhibitory control task runs.

MRI scans were preprocessed and analyzed using SPM12 (www.fil.ion.ucl/uk/spm/software/spm12/) in MATLAB R2022b/R2023a. Excessive head motion outliers were calculated. Using the six realignment parameter estimates (translations and rotations), framewise displacement was calculated as a scalar measure of head motion for each volume. Volumes with a framewise displacement greater than 0.9 mm were excluded from the general linear model estimation and modeled with a regressor of no interest. Runs with more than 15% of volumes excluded and/or those with a root-mean-square movement greater than 1.5 mm were excluded from the fMRI analysis (Siegel et al., 2014).

The first eight volumes were removed to allow for equilibrium effects. Functional images were realigned to the mean image using a second degree B-spline interpolation. The structural image was then bias-field corrected and co-registered to the mean-realigned functional image of the first counterintuitive task run. For the inhibitory control tasks, the functional images were co-registered to the co-registered structural scans and the same normalization parameters were applied. The co-registered structural image was then segmented based on Montreal Neurological Institute (MNI) registered International Consortium for Brain Mapping tissue probability maps. The realigned functional images were normalized using the spatial normalization parameters and smoothed using an 8-mm FWHM Gaussian kernel.

Behavioral Data Analysis

Behavioral data were analyzed using RStudio 2023.03.0, running R 4.3.0, using the lme4 package to run mixed linear model analyses and restricted maximum likelihood methods of estimation. Linear mixed models (LMMs) were chosen to better account for missing data points and increase power. Assumptions of LMMs were checked to ensure that the residuals were approximately normally distributed. To account for the within-subject nature of the models, participant was included as a random effect (random intercepts only) in the models. p Values and approximations for degrees of freedom were calculated for F-tests and follow-up pairwise t tests using Satterthwaite's method for computing denominator degrees of freedom. Satterthwaite's method has been shown to produce acceptable Type 1 errors rates with smaller samples (<144 observations; Luke, 2017). Estimated marginal means were obtained from pairwise comparisons calculated using the emmeans() function. p Values for these comparisons were adjusted with a Tukey test, to control for Type I errors and to better account for unequal group sizes.

In the science and maths counterintuitive reasoning task, LMMs were conducted on mean accuracy and RT (combining correct and incorrect trials) data with discipline, year, and timepoint as fixed effects and participant as a random effect. LMMs were also conducted on mean accuracy and RT for correct trials in the inhibitory control tasks, with trial type, year, and timepoint as fixed effects and participant as random effects. For all LMMs, effect sizes for the fixed effects are reported as standardized regression coefficients. These were calculated using the standardize_parameters() function, as part of the effectsize package. The function was implemented using the refit method and robust = FALSE argument.

To test whether there were significant associations between counterintuitive reasoning and inhibitory control performance, LMMs were carried out with science or maths accuracy as the dependent variable, timepoint and year and their interaction as fixed effects, subject as a random effect, and (i) Pokémon go/no-go d′, (ii) Animal Stroop incongruent RT (controlling for congruent in congruent block RT), or iii) Animal Stroop incongruent accuracy (controlling for congruent in congruent block accuracy) as covariates.

fMRI Data Analysis

First-level Analyses

Volumes acquired during each run were treated as separate time series. For each series, the variance in the BOLD signal was decomposed using general linear models. In the science and maths counterintuitive reasoning tasks, trials were modeled with a box-car regressor with individual RTs providing the duration of each trial. The implicit baseline included the fixation and animal silhouette task. Contrast images for the counterintuitive trials were calculated separately for science and maths. Inhibitory control tasks were modeled using a block design, including a box-car regression with duration of 2 sec representing the instruction screen at the start of each block. Block designs were used to increase the signal-to-noise ratio. For the Animal Size Stroop task, box-car regressors modeled the congruent, mixed, and fixation blocks, and first-level contrast images between mixed and congruent blocks were calculated. For the Pokémon go/no-go task, box-car regressors modeled go, mixed, and fixation blocks, and first-level contrast images between go-mixed and go blocks were calculated. All regressors were convolved with the standard canonical hemodynamic response function and, together with the separate regressors representing each censored volume and the session mean, comprised the full model for each run. A high-pass filter (cutoff = 128 sec) and an auto-regressive model of order one correction for serial autocorrelation were applied.

Univariate Second-level Analyses

Four two-sample t tests between year groups were conducted on the following contrasts: (i) maths counterintuitive reasoning, (ii) science counterintuitive reasoning, (iii) Animal Size Stroop mixed blocks > congruent blocks, and (iv) Pokémon go/no-go mixed blocks > go blocks. Where participants had data at both timepoints, contrast images were averaged using ImCalc. For the counterintuitive reasoning task, a participant × discipline × year flexible factorial design analysis was conducted, modeling the interaction between discipline and year, with participant as a main effect (to take into account the repeated-measures nature of the data).

For the second-level analyses, SPM maps were thresholded at p < .001 uncorrected at voxel-level and FWE-corrected p < .05 at the cluster level. In addition, voxels significant at whole-brain FWE-corrected p < .05 are reported. All coordinates given are in MNI space, and region labeling was performed using the Automatic Anatomical Labelling toolbox (Rolls, Huang, Lin, Feng, & Joliot, 2020).

Multivariate Similarity Analyses

ROIs were identified from the univariate analysis of the two inhibitory control tasks, using a threshold of p < .001 uncorrected and a minimum cluster size of 50 voxels. For large clusters, WFU_Pickatlas (Maldjian, Laurienti, & Burdette, 2004; Maldjian, Laurienti, Kraft, & Burdette, 2003) and Automatic Anatomical Labelling atlas (Tzourio-Mazoyer et al., 2002) were used to separate large regions into smaller ROIs based on anatomical labels. ROI MNI coordinates, cluster size, and center of mass were identified using MarsBaR (Brett, Anton, Valabregue, & Poline, 2002).

Multivariate correlation analysis was applied to determine whether, beyond spatial overlap, there was neural similarity between the inhibitory control and science and maths counterintuitive reasoning tasks. For each voxel in each cluster, the parameter estimates of each contrast of interest were extracted using MarsBaR. Kendall's tau correlations were calculated across the voxels of each cluster within participants for four task comparisons of interest: (1) Stroop versus science, (2) Stroop versus maths, (3) Pokémon versus science, and (4) Pokémon versus maths (n = 42, 41, 44, and 42, respectively), plus one reference comparison: maths versus science (n = 50). Note that only participants with neuroimaging data (either T1, T2, or an average across both timepoints) for both tasks could be included in each comparison. Kendall's tau correlations were calculated as the data did not meet assumptions for a normal distribution. Kendall's τ values were transformed into Pearson's r (r = sin(0.5 × π × τ)), which were then transformed into Fisher's z scores (z = 0.5 × ln ((1 + r)/(1 − r)); Walker, 2003) to allow further statistical analysis. The term similarity scores will be used to refer to these Fisher's z scores.

Intercepts from one-way ANOVAs on the similarity scores, with year group as a factor, were used to assess whether the similarity scores were significantly greater than zero using SPSS 29. Similarity was assessed between (i) science and maths counterintuitive reasoning and the Animal Size Stroop task separately in the 12 Animal Size Stroop ROIs, (ii) science and maths counterintuitive reasoning and the Pokémon task separately in the two Pokémon ROIs, and (iii) science and maths counterintuitive reasoning on all 14 ROIs. Multiple comparisons were adjusted for across all these analyses using the Benjamini-Hochberg method to adjust false discovery rate (FDR). This adjustment ranks the p values observed in a set of comparisons and identifies an upper bound for p values that fixes the expected proportion of false positives among all rejections of the null hypotheses to a criterion, 5% in this case (Benjamini & Hochberg, 1995).

We conducted 2 × 2 (Year × Discipline) mixed repeated-measures ANOVAs for the 12 ROIs derived from the Animal Size Stroop task and the two ROIs derived from the Pokémon go/no-go task, to assess whether the multivariate similarities of each inhibitory control task differed between science and maths and between year groups. Finally, to assess whether individual variability in neural similarity predicted counterintuitive reasoning behavioral performance, the science and maths similarity scores were averaged across Animal Size Stroop ROIs and correlated with science and maths accuracy on the counterintuitive reasoning task, respectively. As only one Pokémon go/no-go ROI showed significant similarity between math counterintuitive reasoning with the go/no-go task, the similarity score for this ROI was correlated with maths accuracy.

Behavioral Results

Science and Maths Counterintuitive Reasoning

Descriptive statistics for the science and maths counterintuitive reasoning tasks are reported in Table 2, estimated marginal means are reported in the following text, and effect sizes for the LMMs are presented in Table 3. LMMs showed overall higher accuracy in maths (M = 57.8%, SE = 2.0%) compared with science (M = 50.7%, SE = 1.7%), F(1, 111.5) = 15.22, p < .001, and significantly greater accuracy at T2 (M = 56.5%, SE = 2.0%) than T1 (M = 52.1%, SE = 1.6%), F(1, 135.7) = 5.12, p = .025, but no significant differences in accuracy between year groups, nor an interaction, were observed, Fs < 1.62, ps > .21. As mean accuracy was ∼50%, accuracy scores for each question were calculated to determine whether participants were consistently answering at chance or whether mean accuracy differed on a trial-by-trial basis (see Table A1 in the Appendix). The results show a range of mean accuracy scores, demonstrating that the children were engaged, and the questions varied in difficulty. Children responded faster at T2 (M = 6656 msec, SE = 181) than T1 (M = 7045 msec, SE = 158), F(1, 119.9) = 7.78, p = .006, but there was no significant difference between disciplines, F = 3.49, p = .064, or year groups, F = 0.81, p = .37, and no significant interactions, Fs < 1.41 and ps > .24.

Table 2.

Descriptive Statistics of Performance in the Science and Maths Counterintuitive Reasoning Tasks at T1 and T2

YearDisciplineTimepointAccuracy Range (%)Accuracy (%) M (SD)All Trials RT (msec) M (SD)
Year 3 Science T1 35–70 53.8 (11.6) 7,503 (1,240) 
T2 40–75 53.6 (11.8) 6,270 (1,134) 
Maths T1 35–80 56.9 (16.7) 7,709 (1,380) 
T2 50–80 62.1 (11.1) 6,813 (1,026) 
Year 5 Science T1 15–65 45.7 (11.4) 6,623 (976) 
T2 30–85 53.0 (13.0) 6,648 (1,552) 
Maths T1 20–80 54.1 (14.2) 7,140 (1,187) 
T2 35–95 61.8 (16.9) 6,630 (1,052) 
YearDisciplineTimepointAccuracy Range (%)Accuracy (%) M (SD)All Trials RT (msec) M (SD)
Year 3 Science T1 35–70 53.8 (11.6) 7,503 (1,240) 
T2 40–75 53.6 (11.8) 6,270 (1,134) 
Maths T1 35–80 56.9 (16.7) 7,709 (1,380) 
T2 50–80 62.1 (11.1) 6,813 (1,026) 
Year 5 Science T1 15–65 45.7 (11.4) 6,623 (976) 
T2 30–85 53.0 (13.0) 6,648 (1,552) 
Maths T1 20–80 54.1 (14.2) 7,140 (1,187) 
T2 35–95 61.8 (16.9) 6,630 (1,052) 
Table 3.

Effect Sizes (Standardized Regression Coefficients) for the LMMs on Behavioral Data for the Science and Maths Counterintuitive Reasoning Task and the Two Inhibitory Control Tasks

Task and Fixed EffectAccuracyRT
Std. Coefficient95% CIStd. Coefficient95% CI
Counterintuitive Reasoning Science and Maths
Intercept 0.19 −0.23, 0.62 0.35 −0.08, 0.77 
Discipline −0.35 −0.82, 0.13 −0.14 −0.51, 0.24 
Timepoint 0.19 −0.47, 0.84 −0.27 −0.79, 0.26 
Year −0.19 −0.74, 0.36 −0.20 −0.75, 0.34 
Discipline × Timepoint −0.10 −0.94, 0.74 −0.21 −0.87, 0.44 
Discipline × Year −0.16 −0.79, 0.46 −0.13 −0.62, 0.36 
Timepoint × Year 0.42 −0.37, 1.20 −0.13 −0.75, 0.50 
Discipline × Timepoint × Year 0.12 −1.15, 0.91 0.48 −0.32, 1.29 
Animal Size Stroop
Intercept 0.48 0.03, 0.93 −0.48 −0.83, −0.12 
Trial Type 1 (congruent in mixed block) −0.81 −1.34, −0.27 1.23 0.89, 1.58 
Trial Type 2 (incongruent in mixed block) −1.05 −1.58, −0.51 1.58 1.24, 1.93 
Timepoint −0.19 −0.88, 0.50 −0.17 −0.62, 0.28 
Year 0.08 −0.51, 0.68 −0.45 −0.92, −0.02 
Trial Type 1 × Timepoint 0.39 −0.56, 1.34 −0.09 −0.69, 0.52 
Trial Type 2 × Timepoint 0.32 −0.63, 1.27 −0.09 −0.69, 0.52 
Trial Type 1 × Year 0.24 −0.47, 0.94 −0.19 −0.64, 0.26 
Trial Type 2 × Year 0.30 −0.41, 1.01 −0.07 −0.52, 0.38 
Timepoint × Year 0.08 −0.77, 0.93 −0.14 −0.40, 0.69 
Trial Type 1 × Timepoint × Year −0.25 −1.42, 0.91 −0.02 −0.76, 0.73 
Trial Type 2 × Timepoint × Year −0.48 −1.65, 0.68 −0.32 −1.07, 0.42 
Pokémon Go/No-Go
Intercept 0.09 −0.36, 0.54 0.06 −0.33, 0.45 
Trial Type 1 (go in mixed block) 0.36 −0.17, 0.89 0.82 0.53, 1.11 
Trial Type 2 (no-go in mixed block) −0.64 −1.17, −0.11 – – 
Timepoint −0.78 −1.48, −0.08 −0.68 −1.09, −0.27 
Year 0.06 −0.53, 0.65 −0.60 −1.12, −0.08 
Trial Type 1 × Timepoint 0.55 −0.39, 1.48 0.63 0.12, 1.13 
Trial Type 2 × Timepoint 0.48 −0.46, 1.41 – – 
Trial Type 1 × Year −0.11 −0.81, 0.59 0.01 −0.37, 0.39 
Trial Type 2 × Year 0.13 −0.57, 0.83 – – 
Timepoint × Year 0.86 0.02, 1.71 0.57 0.08, 1.05 
Trial Type 1 × Timepoint × Year −0.63 −1.77, 0.51 −0.67 −1.29, −0.06 
Trial Type 2 × Timepoint × Year −0.55 −1.70, 0.59 – – 
Task and Fixed EffectAccuracyRT
Std. Coefficient95% CIStd. Coefficient95% CI
Counterintuitive Reasoning Science and Maths
Intercept 0.19 −0.23, 0.62 0.35 −0.08, 0.77 
Discipline −0.35 −0.82, 0.13 −0.14 −0.51, 0.24 
Timepoint 0.19 −0.47, 0.84 −0.27 −0.79, 0.26 
Year −0.19 −0.74, 0.36 −0.20 −0.75, 0.34 
Discipline × Timepoint −0.10 −0.94, 0.74 −0.21 −0.87, 0.44 
Discipline × Year −0.16 −0.79, 0.46 −0.13 −0.62, 0.36 
Timepoint × Year 0.42 −0.37, 1.20 −0.13 −0.75, 0.50 
Discipline × Timepoint × Year 0.12 −1.15, 0.91 0.48 −0.32, 1.29 
Animal Size Stroop
Intercept 0.48 0.03, 0.93 −0.48 −0.83, −0.12 
Trial Type 1 (congruent in mixed block) −0.81 −1.34, −0.27 1.23 0.89, 1.58 
Trial Type 2 (incongruent in mixed block) −1.05 −1.58, −0.51 1.58 1.24, 1.93 
Timepoint −0.19 −0.88, 0.50 −0.17 −0.62, 0.28 
Year 0.08 −0.51, 0.68 −0.45 −0.92, −0.02 
Trial Type 1 × Timepoint 0.39 −0.56, 1.34 −0.09 −0.69, 0.52 
Trial Type 2 × Timepoint 0.32 −0.63, 1.27 −0.09 −0.69, 0.52 
Trial Type 1 × Year 0.24 −0.47, 0.94 −0.19 −0.64, 0.26 
Trial Type 2 × Year 0.30 −0.41, 1.01 −0.07 −0.52, 0.38 
Timepoint × Year 0.08 −0.77, 0.93 −0.14 −0.40, 0.69 
Trial Type 1 × Timepoint × Year −0.25 −1.42, 0.91 −0.02 −0.76, 0.73 
Trial Type 2 × Timepoint × Year −0.48 −1.65, 0.68 −0.32 −1.07, 0.42 
Pokémon Go/No-Go
Intercept 0.09 −0.36, 0.54 0.06 −0.33, 0.45 
Trial Type 1 (go in mixed block) 0.36 −0.17, 0.89 0.82 0.53, 1.11 
Trial Type 2 (no-go in mixed block) −0.64 −1.17, −0.11 – – 
Timepoint −0.78 −1.48, −0.08 −0.68 −1.09, −0.27 
Year 0.06 −0.53, 0.65 −0.60 −1.12, −0.08 
Trial Type 1 × Timepoint 0.55 −0.39, 1.48 0.63 0.12, 1.13 
Trial Type 2 × Timepoint 0.48 −0.46, 1.41 – – 
Trial Type 1 × Year −0.11 −0.81, 0.59 0.01 −0.37, 0.39 
Trial Type 2 × Year 0.13 −0.57, 0.83 – – 
Timepoint × Year 0.86 0.02, 1.71 0.57 0.08, 1.05 
Trial Type 1 × Timepoint × Year −0.63 −1.77, 0.51 −0.67 −1.29, −0.06 
Trial Type 2 × Timepoint × Year −0.55 −1.70, 0.59 – – 

For Animal Size Stroop task, congruent trials in congruent blocks are the reference trial type. For the Pokémon go/no-go task, go trials in go blocks are the reference trial type. Note that the CIs of the standardized coefficients should be interpreted with caution, as they test individual coefficient significance, and are affected by random effects shrinkage and model complexity, reducing precision.

Interference Control

Accuracy significantly differed across Trial Type, F(2, 154.5) = 17.49, p < .001. Higher accuracy was observed in congruent trials in congruent blocks (M = 96.1%, SE = 0.88) than in congruent trials in mixed blocks (M = 92.4%, SE = 0.88, p = .002) and incongruent trials in mixed blocks (M = 90.4%, SE = 0.88, p < .001). However, congruent and incongruent trials in the mixed block did not significantly differ, p = .10. No significant differences in accuracy were observed between Timepoints or Year groups, and there were no significant interactions, all Fs < 0.76, ps > .39. As expected, Animal Size Stroop RT also differed across trial type, F(2, 152.8) = 124.39, p < .001; mean RT was lower in congruent trials of congruent blocks (M = 749 msec, SE = 15, p < .001) than on congruent trials of mixed blocks (M = 896 msec, SE = 14, p < .001) and highest in incongruent trials of mixed blocks (M = 940 msec, SE = 14, p < .002). Children were quicker at T2 (M = 847 msec, SE = 15) than T1 (M = 876 msec, SE = 13), F(1, 167.6) = 6.16, p = 0.014, and Year 5 children responded more quickly (M = 826 msec, SE = 16) than Year 3 children (M = 897 msec, SE = 20), F(1, 43.2) = 7.46, p = .009, but there were no significant interactions, Fs < 0.88 and ps > .42.

Response Inhibition

In the Pokémon go/no-go task, accuracy differed between Trial Types, F(2, 155.1) = 19.22, p < .001; accuracy was greater on go trials in mixed blocks (M = 86.8%, SE = 1.5) than in go trials in the go blocks (M = 82.1%, SE = 1.5%) and lowest in no-go trials in mixed block (M = 76.7%, SE = 1.5%, all pairwise comparisons ps < .012). There were no significant differences between Year groups, or Timepoints, and no interactions, Fs < 3.07 and ps > .081. Participants responded more quickly on go trials in go blocks (M = 510 msec, SE = 8) than mixed blocks (M = 574 msec, SE = 8), F(1, 90.7) = 156.02, p < .001, and more quickly at T2 (M = 534 msec, SE = 9) than T1 (M = 551 msec, SE = 8), F(1, 103.9) = 7.22, p = .008. No significant difference was observed between year groups, F(1, 44.7) = 3.94, p = .053. There were significant Trial Type × Year, F(1, 90.7) = 4.36, p = .039, and a Trial Type × Timepoint × Year interactions, F(1, 90.7) = 4.65, p = .034. Follow-up analyses showed that there was a greater difference in go trials RT between go and mixed blocks in Year 3 children (Mdiff = 74.6 msec, p < .001) than Year 5 children (Mdiff = 53.2 msec, p < .001), and that Year 3 children showed a decrease in RT between T1 and T2 in go trials in go blocks (p = .001), but not go trials in mixed blocks (p = .789), whereas Year 5 children did not show significant differences in RT over time, in either trial type (ps > .22; Figure 3). Analysis of d′ in mixed blocks showed no significant differences between year groups nor timepoints, with no significant interaction (Fs < 1.75, ps > .19).

Figure 3.

Estimated mean RT for correct go trials of the Pokémon go/no-go task in Year 3 (left) and Year 5 children (right) at the two timepoints. Error bars represent 95% CI.

Figure 3.

Estimated mean RT for correct go trials of the Pokémon go/no-go task in Year 3 (left) and Year 5 children (right) at the two timepoints. Error bars represent 95% CI.

Close modal

Inhibitory Control Measures as Predictors of Counterintuitive Reasoning Accuracy

Inhibitory control measures did not significantly predict counterintuitive reasoning accuracy in maths nor science. For the science LMMs, all Fs < 1.90, ps > .173. For the maths LMMs, all Fs < 0.45, ps > .504.

Neuroimaging Results

Science and Maths Counterintuitive Reasoning Tasks

Science and maths counterintuitive reasoning trials were associated with increases in activation in a broad network of frontal, parietal, temporal, and occipital regions (Figure 4A and B; see Tables A2 and A3 in the Appendix for details). Activation in the science and maths counterintuitive reasoning contrasts broadly overlapped; however, maths counterintuitive reasoning was associated with greater activation in right superior pFC and extended further in left superior pFC (Figure 5A), and there were additional differences between disciplines in the intensity of the BOLD signal. Compared with science, maths counterintuitive reasoning was associated with greater activation intensity in bilateral parietal cortices, including bilateral superior and inferior gyri (Brodmann area [BA] 40), as well as bilateral precuneus (BA 7), bilateral inferior temporal gyri (BA 37/20), a cluster across bilateral superior and middle frontal gyri (BA 6/8/45/46), extending into bilateral SMA on the medial wall, the right frontal operculum and the ACC (BA 32) and clusters in the precentral gyri (BA 6), extending to the inferior frontal gyrus (BA 44), and in the cerebellum (Table A4). The opposite contrast showed greater activation intensity in science counterintuitive reasoning than maths in the inferior and middle occipital gyri (BA 18/19) extending to posterior middle temporal gyri (BA 20), along the length of the fusiform gyri (BA 37) and into anterior hippocampi, as well as in anterior middle temporal gyri (BA 38) extending into temporal poles (BA 48/22) and the left orbital frontal gyrus (BA 47; Table A4).

Figure 4.

Increases in BOLD signal during (A) maths and (B) science counterintuitive reasoning, (C) Animal Size Stroop task mixed > congruent blocks and (D) Pokémon go/no-go mixed > go blocks (collapsed across year groups). Contrasts thresholded at p < .001 uncorrected at the voxel level, and pFWE < .05 at the cluster level. Color bars indicate t values.

Figure 4.

Increases in BOLD signal during (A) maths and (B) science counterintuitive reasoning, (C) Animal Size Stroop task mixed > congruent blocks and (D) Pokémon go/no-go mixed > go blocks (collapsed across year groups). Contrasts thresholded at p < .001 uncorrected at the voxel level, and pFWE < .05 at the cluster level. Color bars indicate t values.

Close modal
Figure 5.

Horizontal brain slices showing regions of overlapping activation across the contrasts of interest: (A) maths versus science counterintuitive reasoning; (B) inhibitory control tasks; (C) maths counterintuitive reasoning and Animal Size Stroop; (D) science counterintuitive reasoning and Animal Size Stroop; (E) maths counterintuitive reasoning and Pokémon go/no-go; (F) science counterintuitive reasoning and Pokémon go/no-go. Contrasts thresholded at p < .001 uncorrected at the voxel level and clusters with voxels > 50 are shown. In E and F, only the slices with changes in BOLD signal in the Pokémon go/no-go > go contrast are shown.

Figure 5.

Horizontal brain slices showing regions of overlapping activation across the contrasts of interest: (A) maths versus science counterintuitive reasoning; (B) inhibitory control tasks; (C) maths counterintuitive reasoning and Animal Size Stroop; (D) science counterintuitive reasoning and Animal Size Stroop; (E) maths counterintuitive reasoning and Pokémon go/no-go; (F) science counterintuitive reasoning and Pokémon go/no-go. Contrasts thresholded at p < .001 uncorrected at the voxel level and clusters with voxels > 50 are shown. In E and F, only the slices with changes in BOLD signal in the Pokémon go/no-go > go contrast are shown.

Close modal

A flexible factorial analysis showed a significant Discipline × Year interaction in two clusters: the right inferior frontal gyrus (BA 45 [39, 38, 8]) and ACC (BA 24 [3, 26, 17]). In both these clusters, greater activation was observed in Year 5 than Year 3 in maths, and greater activation in Year 3 than Year 5 in science.

Inhibitory Control

The Animal Size Stroop task was used as an interference control task. Comparing mixed incongruent/congruent blocks to congruent blocks showed large clusters of increased activation in broad bilateral occipital and parietal regions (Figure 4C, Table A5). More specifically, in the superior parietal lobule, inferior parietal lobule and precuneus, as well as the fusiform gyrus, superior and inferior occipital gyri, and some of the calcarine gyrus. Additional clusters were observed in bilateral superior and inferior frontal gyri and left precentral gyrus. The Pokémon go/no-go task was used as a measure of response inhibition. Comparing mixed go/no-go blocks to go blocks showed increased activation in the right superior parietal gyrus extending to the precuneus and a cluster in the right superior frontal gyrus (Figure 4D, Table A5). No significant differences in activation were observed between Year groups in either inhibitory control task.

Inhibitory Control ROIs

Clusters from the Pokémon and Animal Size Stroop univariate analysis were used as ROIs in the subsequent multivariate pattern analysis, using a threshold of p < .001 uncorrected at the voxel level and a minimum cluster size of 50 voxels. Two clusters from the Pokémon and eight clusters from the Animal Size Stroop univariate analysis were identified (Table 3). One large Animal Size Stroop cluster was split into six clusters using left and right occipital, parietal, and temporal lobes regions from WFUPickAtlas as masks and MarsBaR. This resulted in separate left superior and inferior temporal clusters but a single right temporal cluster. The large cluster also extended to the top of the superior cerebellum; however, this region was not included as it was likely an artifact from smoothing. Note that two clusters in the right thalamus and right reticular formation extending into the caudate nucleus were excluded from a subsequent analysis, as this article's hypotheses focused on cortical regions. The resulting 12 Animal Stroop clusters are listed in Table 3. Note that there is overlap between the Pokémon precuneus/superior parietal cluster and the Animal Size Stroop parietal cluster and overlap between the two task ROIs in the superior frontal gyrus (see Figure 5B; Table 4).

Table 4.

ROIs Derived from the Univariate Analysis of the Animal Size Stroop Mixed > Congruent Blocks and Pokémon Go/No-Go Mixed > Go Blocks Contrasts

Brain RegionL/RBAMNIak
xyz
Animal Size Stroop Task
Superior frontal gyrus 27 56 178 
Inferior frontal gyrus/insula 48 37 31 14 55 
Precentral gyrus/inferior frontal gyrus 44 43 31 94 
Superior frontal gyrus −24 −1 59 108 
Precentral gyrus/inferior frontal gyrus 44 −37 30 98 
Parietal lobeb 28 −57 45 1439 
Parietal lobeb −26 −56 46 1012 
Inferior and middle temporal gyrib 37 42 −59 966 
Superior temporal gyrusb 39 −36 −71 19 166 
Inferior temporal gyrusb 37 −42 −55 −11 439 
Occipital lobeb 19 32 −80 −1 1511 
Occipital lobeb 18 −30 −82 −1 1649 
Pokémon Go/No-Go Task
Precuneus/superior parietal gyrus 14 −61 54 225 
Superior frontal gyrus 22 66 76 
Brain RegionL/RBAMNIak
xyz
Animal Size Stroop Task
Superior frontal gyrus 27 56 178 
Inferior frontal gyrus/insula 48 37 31 14 55 
Precentral gyrus/inferior frontal gyrus 44 43 31 94 
Superior frontal gyrus −24 −1 59 108 
Precentral gyrus/inferior frontal gyrus 44 −37 30 98 
Parietal lobeb 28 −57 45 1439 
Parietal lobeb −26 −56 46 1012 
Inferior and middle temporal gyrib 37 42 −59 966 
Superior temporal gyrusb 39 −36 −71 19 166 
Inferior temporal gyrusb 37 −42 −55 −11 439 
Occipital lobeb 19 32 −80 −1 1511 
Occipital lobeb 18 −30 −82 −1 1649 
Pokémon Go/No-Go Task
Precuneus/superior parietal gyrus 14 −61 54 225 
Superior frontal gyrus 22 66 76 
a

Coordinates of the center of mass of each cluster obtained from MarsBaR.

b

Regions derived from a single large cluster.

Multivariate Similarity Analysis

Significant multivariate similarity (mean similarity scores greater than zero) was observed between the Animal Size Stroop and maths counterintuitive reasoning tasks in all 12 clusters (ηp2 = .205–.624), and with the science counterintuitive reasoning task in 11/12 clusters (ηp2 = .163–.767; Figure 6A). No significant multivariate similarity was observed between the Pokémon go/no-go ROIs and science counterintuitive reasoning tasks, but significant multivariate similarity was observed with maths counterintuitive reasoning in the right precuneus/superior parietal gyrus ROI only (ηp2 = .197; Figure 6B). For comparison, multivariate similarly was explored between the science and maths counterintuitive reasoning tasks. This showed significant similarity in all ROIs, with a higher magnitude than those observed between counterintuitive reasoning and inhibitory control tasks (ηp2 = .740–.950; Figure 6A and B).

Figure 6.

Results of multivariate similarity analysis of patterns of activation across voxels between either (A) the Animal Size Stroop mixed > congruent blocks contrast or (B) the Pokémon go/no-go mixed > go blocks contrasts and the science and maths counterintuitive reasoning contrasts. aMultivariate similarity scores are the estimated marginal mean Fisher's z scores of the correlations of activation between contrasts, across voxels in individual participants. Error bars: 95% CI. Stronger color shades are used to highlight cases where mean multivariate similarity was significantly greater than zero. p Values are FDR adjusted for multiple comparisons across the 14 ROIs and three comparisons. L = left; R = right; IFG = inferior frontal gyrus; SFG = superior frontal gyrus; SPL = superior parietal lobe.

Figure 6.

Results of multivariate similarity analysis of patterns of activation across voxels between either (A) the Animal Size Stroop mixed > congruent blocks contrast or (B) the Pokémon go/no-go mixed > go blocks contrasts and the science and maths counterintuitive reasoning contrasts. aMultivariate similarity scores are the estimated marginal mean Fisher's z scores of the correlations of activation between contrasts, across voxels in individual participants. Error bars: 95% CI. Stronger color shades are used to highlight cases where mean multivariate similarity was significantly greater than zero. p Values are FDR adjusted for multiple comparisons across the 14 ROIs and three comparisons. L = left; R = right; IFG = inferior frontal gyrus; SFG = superior frontal gyrus; SPL = superior parietal lobe.

Close modal

Mixed Discipline × Year group ANOVAs conducted on the similarity scores showed significant differences between disciplines in three Animal Size Stroop ROIs (Figure 6). The left superior frontal gyrus cluster showed greater similarity scores between the Animal Size Stroop task and maths (M = 0.272) than science counterintuitive reasoning (M = 0.069), F(1, 39) = 8.06, pFDR = .033, ηp2 = .171. Reversely, the right inferior/middle temporal gyri ROI and the left inferior temporal gyrus ROI showed greater similarity with science than maths (right hemisphere: F(1, 39) = 11.68, pFDR = .010, ηp2 = .230, science: M = 0.475, maths: M = 0.360; left hemisphere: F(1, 39) = 11.89, pFDR = .010, ηp2 = .234, science: M = 0.547, maths: M = 0.424). Note that the right superior frontal gyrus and left parietal lobe clusters also showed significant differences between disciplines, but these did not survive FDR adjustments (Figure 6). No other significant differences between disciplines in any other ROIs were detected (pFDRs > .126), and no significant year group differences nor Discipline × Year interactions were identified (pFDRs > .646).

Finally, Animal Size Stroop and math counterintuitive reasoning similarity scores (averaged across ROIs) were found to positively correlate with maths counterintuitive reasoning accuracy, r(39) = .40, p = .009. No similar association was found for science, r(39) = .24, p = .881, nor between Pokémon go/no-go and math counterintuitive reasoning similarity score in the superior parietal lobe/precuneus ROI and maths counterintuitive reasoning accuracy, r(40) = .15, p = .330.

Multivariate similarity analysis of fMRI data was used to investigate the role of inhibitory control in science and maths counterintuitive reasoning in primary-school-aged children. We found significant neural similarity between an interference control task and both maths and science counterintuitive reasoning in almost all brain regions that showed increased activation during the interference control task. This included DLPFC, VLPFC, and parietal cortex, as per our hypothesis, plus temporal and occipital regions. In brain areas showing increased activation during response inhibition, significant neural similarity was observed between the response inhibition task and maths counterintuitive reasoning in the right precuneus/superior parietal gyrus only. These findings suggest that in children, science and maths counterintuitive reasoning and inhibitory control, specifically interference control, exhibit similar neural activation. This is consistent with the hypothesis that inhibitory control plays a role in counterintuitive reasoning derived from previous research in adolescents and adults.

Science and Maths Counterintuitive Reasoning Tasks

Behavioral analysis of counterintuitive reasoning task accuracy suggested the difficulty of the tasks were well matched across year groups, as no significant differences in performance across year groups were observed. There was an improvement in performance from T1 to T2, likely reflecting school-based science and maths learning over the 12-week period between scans, as well as possible practice effects. This matches results from a larger cohort (which partially includes this neuroimaging cohort), showing improved performance in a different science and maths counterintuitive reasoning task over a 12-week period (Dumontheil et al., 2023). In the current study, children were significantly slower but more accurate in maths compared with science trials, suggesting that slowing down benefits maths counterintuitive reasoning accuracy (or that children who are better at counterintuitive reasoning take longer to answer). This idea supports the theory that during counterintuitive reasoning, additional cognitive processes are required to suppress the naive intuitive concepts, and concurs with behavioral studies that found slower responses resulted in better performance in counterintuitive reasoning tasks (e.g., Babai et al., 2012).

We observed increased BOLD signal in the science and maths counterintuitive reasoning tasks in large clusters in bilateral occipital, temporal, and parietal areas, extending to the pFC. This concurs with previous research exploring the neural correlates of science and maths counterintuitive reasoning in adults (Meier et al., 2022; Allaire-Duquette et al., 2021; Brault Foisy et al., 2015; Masson et al., 2014; Stavy et al., 2006) and adolescents (Dumontheil et al., 2022; Allaire-Duquette et al., 2019). However, in the current study, much broader regions of increased activation were detected. It is possible that children recruit broader brain regions than adolescents or adults when reasoning about counterintuitive concepts. Alternatively, the broader activation more plausibly reflects the fact that in the present study counterintuitive trials were not compared with control noncounterintuitive trials; thus, this activation also likely reflects the cognitive processing of text, images, and broader reasoning recruited in the task. This control condition was not included in the current study for pragmatic reasons. Future work may wish to investigate the neural correlates of maths and science counterintuitive reasoning using matched noncounterintuitive control trials (as done by, e.g., Meier et al., 2022; Dumontheil et al., 2022; Allaire-Duquette et al., 2019, 2021) to better isolate the specific neural signature of counterintuitive reasoning in children.

Differences in BOLD signal between disciplines were also observed. Although a previous study in adolescents did not find differences between science and maths counterintuitive reasoning activation when compared with noncounterintuitive trials (Dumontheil et al., 2022), in the current study, counterintuitive reasoning in maths compared with science led to activation extending further into the superior parietal cortex and superior frontal cortex, in particular in the right hemisphere, as well as greater increases in BOLD signal in bilateral parietal cortex and both lateral and medial aspects of the pFC. In contrast, counterintuitive science reasoning in science trials, compared with maths trials, was associated with greater increases in BOLD signal in occipital and temporal regions. The stimuli varied in many ways across science subjects and maths topics, with the use of drawings, text, and symbols. Differences between maths and science stimuli may have driven some of the different brain activity patterns observed.

Studies of the neural correlates of maths in children tend to focus on arithmetic and suggest a shift from pFC to posterior parietal cortex and then angular gyrus activation, when children move from calculation to retrieval strategies (Peters & De Smedt, 2018). However, few studies have investigated neural correlates of mathematics across a broader range of subjects, as was used here, that is, arithmetic, as well as geometry and graphs. Amalric and Dehaene (2016) found that when adult professional mathematicians listened to maths statements across a range of topics (analysis, algebra, topology, and geometry) and judged their accuracy, compared with nonmaths statements, they recruited a consistent network of bilateral intraparietal, inferior temporal, and dorsal pFC regions. Frontal and parietal cortical regions have therefore been implicated in mathematical reasoning across ages. In science, in contrast, distinct neural correlates have been found for different aspects of physics (Mason & Just, 2016), suggesting that science may draw on more domain-specific skills. Scientific reasoning has been associated with activation in lateral prefrontal areas supporting diverse executive functions, but also with activation of the medial temporal lobes, which Nenciovici, Allaire-Duquette, and Masson (2019) argue could reflect the recruitment of declarative memory processes, and influence of prior knowledge, on scientific reasoning. However, there is a lack of neuroimaging research on the development of the neural correlates of science reasoning in mid-childhood that the present results could be compared with.

Differences in BOLD signal across disciplines were also observed as an interaction with year group. Although there was no difference in BOLD signal changes during counterintuitive reasoning as a whole between year groups, significant Discipline × Year interactions were found in the right inferior frontal gyrus and ACC, with greater activation in Year 5 than Year 3 in maths and greater activation in Year 3 than Year 5 in science. This is of interest considering the right inferior frontal gyrus (IFG) is thought to be involved in inhibitory control (Aron et al., 2014; Aron, 2007) and the ACC in error detection (e.g., Carter et al., 1998). It is possible that these differences reflect the fact that different problems were presented to Year 3 and Year 5 children. Despite these trials being designed based on the Key Stage 2 maths and science U.K. national curriculum, we could not control for the children's individual school syllabus and whether the concepts had been learned recently or potentially many months previously. For example, Year 5 children could have recently learned about certain mathematical counterintuitive concepts present in the current study and were thus better able to identify the trials as counterintuitive and recruit inhibitory control to answer correctly. Future similar work may wish to collect data on the school syllabus to account for this possibility.

Inhibitory Control

Interference Control

As expected, incongruent trials were associated with lower accuracy and longer RTs in the Animal Size Stroop task. There was also a mixing cost on congruent trials, with lower accuracy and longer RT in congruent trials in mixed blocks compared with congruent-only blocks. There was no significant difference in accuracy between year groups. Year 5 children responded overall faster than Year 3 children; however, this was not specific to incongruent trials, implying we did not find evidence of greater efficiency in inhibitory control skills in the older children. This age group difference may instead reflect maturation of processing speed or motor responses.

In the interference control task, the contrast mixed > congruent showed increased activation in large occipital and parietal areas and small clusters in frontal regions. These findings in children broadly fit with results of individual studies and meta-analyses investigating the neural correlates of interference control in adults and adolescents using variations of the Stroop task, which report increased activation in parietal and occipital cortex, as well as VLPFC and DLPFC (Dumontheil et al., 2022; Isherwood et al., 2021; Huang et al., 2020; Hung et al., 2018; Zhang et al., 2017; Xu, Xu, & Yang, 2016; Cieslik, Mueller, Eickhoff, Langner, & Eickhoff, 2015; Laird et al., 2005). Although studies exploring the neural correlates of interference control in typically developing children are limited (see Kang et al., 2022, for a review), broad occipital and parietal activation has been observed in another fMRI study exploring the (color) Stroop effect in children of this age, suggesting this broad pattern of increased activation could reflect inhibitory control in children (Adleman et al., 2002).

Response Inhibition

RTs for the response inhibition were slower, but accuracy higher in go trials of the mixed blocks than the pure go blocks, suggesting that because of the possibility of no-go trials in mixed blocks, participants slowed down, leading to higher accuracy in the left/right response in go trials. Whether because of development or practice effects, Year 3 children responded faster on go trials in go blocks at T2 than T1. As expected, accuracy was lowest in the no-go trials in the mixed block.

The contrast go/no-go > go blocks showed greater activation in the right superior parietal/precuneus and right superior frontal gyrus only. DLPFC and parietal cortex, among other regions, have been observed in meta-analyses exploring action withholding tasks (e.g., go/no-go tasks) in adults (Gavazzi et al., 2020; Guo et al., 2018; Cieslik et al., 2015; Criaud & Boulinguez, 2013). However, our findings do not concur with the prominent theory that the right IFG is central to response inhibition (Aron et al., 2014; Aron, 2007, 2011). One interpretation is that the right IFG is associated with response inhibition in adults, but less consistently in children (Suskauer et al., 2008; Bunge et al., 2002; Durston et al., 2002; see Kang et al., 2022). Interestingly, Bunge and colleagues (2002) found that 8- to 12-year-old children with better no-go performance showed greater bilateral inferior parietal activation, whereas children with worse no-go performance showed greater bilateral DLPFC and left VLPFC activation. Neither group showed increased activation in the right VLPFC, which was observed in adults. Therefore, posterior activation may be more involved in inhibitory control in children (Bunge et al., 2002).

Lack of significant effects can also be due to a lack of power. The current study used a block design, which is thought to be more sensitive to changes in the BOLD signal than an event-related design (Bunge et al., 2002). However, event-related designs may be more sensitive to transient activation associated with response inhibition on no-go trials specifically. This was also a limitation of the study by Dumontheil and colleagues (2022), which did not find increases in activation in simple mixed go/no-go blocks versus pure go blocks in adolescents.

In the current study, univariate activation in the Animal Size Stroop and Pokémon go/no-go task partly overlapped in right superior parietal cortex and right superior frontal gyrus. A previous study similarly found limited overlap of activation between interference control and response inhibition in 8- to 12-year-old children, compared with much more widespread overlap in adults, strongest in the right VLPFC (Bunge et al., 2002). However, further research, for example, contrasting block and event-related designs, is needed to assess the extent to which these two tasks have similar neural signatures in children or whether these differences reflect methodological differences.

Science and Maths Counterintuitive Reasoning and Inhibitory Control

Contrary to a previous study in a larger sample (Wilkinson et al., 2020), here, we did not find behavioral evidence for a role of inhibitory control in counterintuitive reasoning. This is likely due to the limited sample size in the current study. In contrast, univariate and multivariate analysis of the neuroimaging data suggest a role of inhibitory control in counterintuitive reasoning in science and maths in this age group.

Most of the regions identified in the interference control task showed univariate overlap with the activation observed during science and maths counterintuitive reasoning, although the occipital and parietal interference clusters were slightly broader than those observed during counterintuitive reasoning. In line with the broader frontal and parietal activation observed during maths than science counterintuitive reasoning, maths also showed greater univariate activation overlap with interference control in these regions. This univariate overlap includes regions that also showed overlap between science and maths counterintuitive reasoning and an interference control task (numerical Stroop) in adolescents, specifically in the right middle frontal and right parietal lobe (Dumontheil et al., 2022); however, the overlap in children appears much broader. In contrast, limited univariate overlap in activation was observed between the frontal and parietal clusters identified in the response inhibition task and science and maths counterintuitive reasoning. The adolescent study (Dumontheil et al., 2022) did not observe increased activation in the simple go/no-go task, potentially due to a lack of sensitivity in the task, making comparisons to the current study challenging.

Although overlap of activation was observed in the univariate analyses, these could still reflect recruitment of different neural populations. Multivariate similarity analyses were therefore carried out to complement the univariate analyses. Significant multivariate similarity was observed between interference control and counterintuitive reasoning tasks in (i) all 12 clusters showing increased BOLD signal in the interference control task for maths and (ii) all clusters, except the left superior frontal gyrus, for science. Multivariate similarity was significantly higher for maths than science in the left superior frontal gyrus and significantly stronger for science than maths in temporal clusters bilaterally. These results broadly fit with the fact that there was more superior frontal gyrus activation for maths than science and more temporal activation for science than maths counterintuitive reasoning in the univariate analyses. As per our hypothesis, neural similarity was observed in parietal cortex, DLPFC, and VLPFC, including the right IFG, although the cluster in BA 44 was more superior than VLPFC activation observed in previous studies in BA 47 when comparing counterintuitive and intuitive trials (Stavy & Babai, 2010) or in BA 45 or 47 when comparing counterintuitive reasoning in experts versus novices (Allaire-Duquette et al., 2019; Brault Foisy et al., 2015; Masson et al., 2014). In addition, significant neural similarity was observed in occipital and temporal regions, likely reflecting visual and memory retrieval processes required in both counterintuitive reasoning and the Animal Size Stroop incongruent > congruent contrasts. These findings suggest that counterintuitive reasoning and interference control recruit partially overlapping neural populations.

Significant neural similarity was observed between counterintuitive reasoning and the response inhibition task in the left precuneus/superior parietal gyrus cluster only, and only for maths. These regions have been associated with functions such as visuomotor processing, manipulation of information in working memory, and attention (Alahmadi, 2021; Cavanna & Trimble, 2006), processes likely involved in both the response inhibition and maths counterintuitive reasoning trials. However, no significant similarity was observed in the right superior frontal gyrus cluster, a region thought to be involved in impulse control and motor urgency (Hu, Ide, Zhang, & Li, 2016), suggesting that the overlap between response inhibition and maths counterintuitive reasoning may reflect working memory more than inhibitory control processes. As only limited increased activation was observed in the response inhibition task, further work is needed to confirm these results.

To date, one other study has used multivariate similarity analysis to determine neural similarity between inhibitory control and counterintuitive reasoning tasks, a study conducted in adolescents (Dumontheil et al., 2022). Interestingly, Dumontheil and colleagues (2022) showed neural similarity between maths and science counterintuitive reasoning (combined) with interference control (numerical Stroop) and response inhibition (complex go/no-go) in the supramarginal gyrus/intraparietal sulcus only. The researchers concluded that these results suggested that working memory, attention, or mental imagery may be more relevant to counterintuitive reasoning than inhibitory control. These differences in findings between adolescents and children may be methodological, for example, the slightly different methods for creating ROIs, or the age-appropriate variations in inhibitory control tasks used; however, these tasks were designed to isolate inhibitory control processes using tight contrasts, regardless of variations in content. Alternatively, the differences may be developmental. Children may use broader regions to elicit inhibitory control and reasoning processes than adolescents. One might speculate that repetition of facts for formal qualifications during adolescence may lead to a stronger dependency on memory retrieval for counterintuitive reasoning in this age group, as opposed to inhibitory control; whereas children are likely to have been introduced to counterintuitive concepts more recently, without intense repetition, leading to a greater reliance on inhibitory control and broader recruitment of frontal and parietal regions. Another possible reason for the greater neural similarity is that interference control may be recruited in childhood for science and maths reasoning more broadly, beyond reasoning about counterintuitive problems specifically. Indeed, Cragg, Keeble, Richardson, Roome, and Gilmore (2017) found that a numerical interference control task was associated with overall arithmetic factual knowledge and procedural skills, across age groups.

Although the Stop & Think intervention was designed on the premise that training children to use their inhibitory control skills would be most effective when implemented in a specific context (science and maths), here, we used domain-general tests of inhibitory control. It is possible that greater univariate overlap and multivariate similarity would have been observed if the inhibitory control tasks used numerical and science stimuli. However, in our previous study in adolescents, the interference control task used numerical stimuli and only showed limited similarity with the science and maths counterintuitive reasoning task. Future work could use domain-general and domain-specific inhibitory control tasks, or tasks with more varied stimuli.

The multivariate similarity analyses revealed some differences between disciplines. Neural similarity between the interference control task and counterintuitive reasoning was observed in the superior frontal gyrus for maths trials, but not science trials, which could reflect greater working memory and spatial processing required in maths (El-Baba & Schury, 2024; du Boisgueheneuc et al., 2006) as compared with science. Conversely, significantly greater neural similarity was observed between interference control and science trials, as compared with maths trials in bilateral (predominantly inferior) temporal gyri. We speculate that this reflects an increased semantic memory processing and visual perception load (Patel, Biso, & Fowler, 2024) in science trials compared with maths. Again, these differences may in part relate to differences in the visual representation of the maths and science problems. Complementing the univariate analyses, these results indicated that science and maths counterintuitive reasoning show many similarities, but also some differences in the neural engagement.

Interestingly, when assessing whether individual differences in neural similarity between science and maths reasoning tasks and inhibitory control tasks associated with counterintuitive reasoning accuracy, we observed a significant positive correlation between maths counterintuitive reasoning and interference control tasks similarity scores, averaged across ROIs, and maths counterintuitive reasoning accuracy. This finding provides further evidence for a role of interference control in maths counterintuitive reasoning. No similar association was observed for science or with the response inhibition task similarity score. This may imply a lesser role of interference control in science counterintuitive reasoning. Alternatively, the fact that maths accuracy was overall higher and more variable across participants than science accuracy may reflect greater individual differences in successful counterintuitive reasoning and use of interference control by children in maths than science, which may have resulted in the observed correlation.

Taken together, the univariate and multivariate findings from this study suggest that inhibitory control plays a role in science and maths counterintuitive reasoning in childhood, and the current evidence suggests that interference control may play a greater role than response inhibition. The findings point to a putative neural mechanism behind inhibitory control interventions, which promote science and maths academic outcomes. For example, the Stop & Think intervention (Dumontheil et al., 2023; Gauthier, Porayska-Pomsta, Dumontheil, Mayer, & Mareschal, 2022; Gauthier, Porayska-Pomsta, Mayer, et al., 2022; Wilkinson et al., 2020) could allow children to resolve the interference between intuitive and correct conceptual understandings and lead to more accurate responses. Future work may wish to investigate whether interference control interventions within maths and science domains can improve maths and science educational outcomes in this age group and whether domain-specific response inhibition interventions can produce similar results.

Conclusion

Neural similarity between counterintuitive reasoning and inhibitory control tasks was investigated in primary-school-aged children. Multivariate neural similarity was observed in a broad network of brain regions between maths and science counterintuitive reasoning tasks and interference control tasks, and to a much lesser extent with response inhibition tasks. These findings provide evidence for the theory that inhibitory control plays a role in counterintuitive reasoning in children and may have implications for understanding the mechanisms behind inhibitory control interventions designed to improve maths and science academic outcomes.

Table A1.

Accuracy in Individual Counterintuitive Maths and Science Problems (Mean, Minimum, Maximum, and Histograms)

Accuracy in Individual Counterintuitive Maths and Science Problems (Mean, Minimum, Maximum, and Histograms)
Accuracy in Individual Counterintuitive Maths and Science Problems (Mean, Minimum, Maximum, and Histograms)
ProblemAccuracy (M %)
T1T2
Year 3Year 5Year 3Year 5
ScienceMathsScienceMathsScienceMathsScienceMaths
36 63 68 68 54 82 58 87 
48 50 38 52 62 73 29 37 
41 84 32 36 31 27 38 67 
50 68 11 46 38 36 13 67 
63 58 41 48 54 45 50 50 
18 26 57 36 46 55 71 29 
71 21 38 64 77 36 46 71 
59 58 43 36 77 82 38 33 
43 44 38 14 33 82 67 17 
10 58 37 24 46 31 55 25 67 
11 68 100 82 100 62 82 83 100 
12 52 58 59 92 62 64 71 96 
13 52 47 17 46 50 64 38 50 
14 92 39 62 64 92 27 67 83 
15 23 50 75 32 38 64 75 54 
16 86 95 54 64 92 82 71 75 
17 32 53 46 48 38 82 42 46 
18 32 53 64 72 46 64 63 87 
19 52 68 10 44 46 36 17 42 
20 61 74 52 76 54 82 63 63 
Min. 18 21 10 14 31 27 13 17 
Max. 92 100 82 100 92 82 83 100 
ProblemAccuracy (M %)
T1T2
Year 3Year 5Year 3Year 5
ScienceMathsScienceMathsScienceMathsScienceMaths
36 63 68 68 54 82 58 87 
48 50 38 52 62 73 29 37 
41 84 32 36 31 27 38 67 
50 68 11 46 38 36 13 67 
63 58 41 48 54 45 50 50 
18 26 57 36 46 55 71 29 
71 21 38 64 77 36 46 71 
59 58 43 36 77 82 38 33 
43 44 38 14 33 82 67 17 
10 58 37 24 46 31 55 25 67 
11 68 100 82 100 62 82 83 100 
12 52 58 59 92 62 64 71 96 
13 52 47 17 46 50 64 38 50 
14 92 39 62 64 92 27 67 83 
15 23 50 75 32 38 64 75 54 
16 86 95 54 64 92 82 71 75 
17 32 53 46 48 38 82 42 46 
18 32 53 64 72 46 64 63 87 
19 52 68 10 44 46 36 17 42 
20 61 74 52 76 54 82 63 63 
Min. 18 21 10 14 31 27 13 17 
Max. 92 100 82 100 92 82 83 100 

Table A2.

Brain Regions Showing an Increase in BOLD Signal during Maths Counterintuitive Reasoning Trials

Brain RegionL/RBAkMNIt
xyz
Lingual gyrus 18 253780 −27 −85 −13 16.2* 
Inferior occipital gyrus 37   39 −64 −13 14.8* 
Superior parietal gyrus   −21 −64 47 14.7* 
Inferior occipital gyrus 37   39 −64 −13 14.8* 
Calcarine sulcus 18   −15 −94 −4 14.7* 
Fusiform gyrus 19   −36 −67 −13 14.5* 
Hippocampus (extending left ACC) 27   −18 −31 −1 14.2* 
Inferior temporal gyrus 37   −48 −58 −10 14.1* 
Fusiform gyrus 19   27 −76 −10 14.0* 
Middle occipital gyrus 18   −33 −91 11 13.9* 
Inferior frontal gyrus (opercularis) 44   −42 11 29 13.8* 
Middle temporal gyrus 22   −54 −37 13.6* 
Insula 47   −30 26 −1 12.8* 
Middle occipital gyrus 19   30 −82 20 12.4* 
Inferior frontal gyrus (triangularis) 45   48 32 20 11.0* 
Cerebellum –   −55 −34 10.3* 
Superior temporal lobe 22   57 −28 10.2* 
Supplementary motor area   −13 14 56 10.1* 
Inferior frontal gyrus (opercularis) 48/44   48 14 29 9.4* 
Cerebellum –   −21 −40 −40 8.2* 
Superior frontal gyrus   27 56 8.0* 
Inferior orbital frontal gyrus 38   27 26 −25 6.4* 
Cerebellum –   −45 −43 −37 5.9* 
Brain RegionL/RBAkMNIt
xyz
Lingual gyrus 18 253780 −27 −85 −13 16.2* 
Inferior occipital gyrus 37   39 −64 −13 14.8* 
Superior parietal gyrus   −21 −64 47 14.7* 
Inferior occipital gyrus 37   39 −64 −13 14.8* 
Calcarine sulcus 18   −15 −94 −4 14.7* 
Fusiform gyrus 19   −36 −67 −13 14.5* 
Hippocampus (extending left ACC) 27   −18 −31 −1 14.2* 
Inferior temporal gyrus 37   −48 −58 −10 14.1* 
Fusiform gyrus 19   27 −76 −10 14.0* 
Middle occipital gyrus 18   −33 −91 11 13.9* 
Inferior frontal gyrus (opercularis) 44   −42 11 29 13.8* 
Middle temporal gyrus 22   −54 −37 13.6* 
Insula 47   −30 26 −1 12.8* 
Middle occipital gyrus 19   30 −82 20 12.4* 
Inferior frontal gyrus (triangularis) 45   48 32 20 11.0* 
Cerebellum –   −55 −34 10.3* 
Superior temporal lobe 22   57 −28 10.2* 
Supplementary motor area   −13 14 56 10.1* 
Inferior frontal gyrus (opercularis) 48/44   48 14 29 9.4* 
Cerebellum –   −21 −40 −40 8.2* 
Superior frontal gyrus   27 56 8.0* 
Inferior orbital frontal gyrus 38   27 26 −25 6.4* 
Cerebellum –   −45 −43 −37 5.9* 

Contrast thresholded at p < .001 uncorrected at the voxel level and pFWE < .05 at the cluster level. Voxels significant at pFWE < .05 at the voxel level are indicated with *.

Table A3.

Brain Regions Showing an Increase in BOLD Signal during Science Counterintuitive Reasoning Trials

Brain RegionL/RBAkMNIt
xyz
Thalamus 27 232180 −18 −31 18.5* 
Fusiform gyrus 19   27 −79 −7 18.4* 
Inferior occipital gyrus 37   36 −64 −10 18.2* 
Inferior occipital gyrus 18   −27 −82 −10 17.6* 
Fusiform gyrus 37   36 −49 −13 17.0* 
Calcarine sulcus 17   12 −88 −1 16.5* 
Fusiform gyrus 37   −36 −40 −19 16.4* 
Middle occipital gyrus 17   −9 103 11 15.3* 
Middle temporal gyrus 22   −54 −37 14.5* 
Middle occipital gyrus 19   30 −88 20 14.2* 
Superior occipital gyrus 18   −15 −94 −4 14.2* 
Thalamus –   21 −28 −1 18.4* 
Superior parietal lobule (744) −24 −64 41 13.0* 
Superior parietal lobule (568) 27 −64 44 10.9* 
Inferior frontal gyrus 48 (2396) −51 20 17 12.2* 
Anterior insula/frontal operculum     −30 26 −1 12.0* 
Inferior frontal gyrus 44   −42 11 29 11.5* 
Precentral gyrus   −42 44 6.7* 
Anterior insula/frontal operculum   (1634) 33 26 −1 12.0* 
Inferior frontal gyrus 44   42 11 32 9.4* 
Inferior frontal gyrus 45   60 32 20 9.2* 
Pre-SMA (1132) −6 17 53 9.3* 
Middle cingulate gyrus 32   23 44 6.8* 
Middle cingulate gyrus 32   12 26 32 7.9* 
Medial superior frontal gyrus   41 47 4.2* 
Cerebellum – 112 −55 −34 10.4* 
Brain RegionL/RBAkMNIt
xyz
Thalamus 27 232180 −18 −31 18.5* 
Fusiform gyrus 19   27 −79 −7 18.4* 
Inferior occipital gyrus 37   36 −64 −10 18.2* 
Inferior occipital gyrus 18   −27 −82 −10 17.6* 
Fusiform gyrus 37   36 −49 −13 17.0* 
Calcarine sulcus 17   12 −88 −1 16.5* 
Fusiform gyrus 37   −36 −40 −19 16.4* 
Middle occipital gyrus 17   −9 103 11 15.3* 
Middle temporal gyrus 22   −54 −37 14.5* 
Middle occipital gyrus 19   30 −88 20 14.2* 
Superior occipital gyrus 18   −15 −94 −4 14.2* 
Thalamus –   21 −28 −1 18.4* 
Superior parietal lobule (744) −24 −64 41 13.0* 
Superior parietal lobule (568) 27 −64 44 10.9* 
Inferior frontal gyrus 48 (2396) −51 20 17 12.2* 
Anterior insula/frontal operculum     −30 26 −1 12.0* 
Inferior frontal gyrus 44   −42 11 29 11.5* 
Precentral gyrus   −42 44 6.7* 
Anterior insula/frontal operculum   (1634) 33 26 −1 12.0* 
Inferior frontal gyrus 44   42 11 32 9.4* 
Inferior frontal gyrus 45   60 32 20 9.2* 
Pre-SMA (1132) −6 17 53 9.3* 
Middle cingulate gyrus 32   23 44 6.8* 
Middle cingulate gyrus 32   12 26 32 7.9* 
Medial superior frontal gyrus   41 47 4.2* 
Cerebellum – 112 −55 −34 10.4* 

Contrast thresholded at p < .001 uncorrected at the voxel level and pFWE < .05 at the cluster level. Voxels significant at pFWE < .05 at the voxel level are indicated with *. Due to the broad network of activation, WFU_Pickatlas was used to create frontal+ limbic and parietal masks to identify subpeaks, shown with k values in brackets.

Table A4.

Brain Regions Showing Differences in Changes in BOLD Signal in Maths versus Science Counterintuitive Reasoning

Brain RegionL/RBAkMNIt
xyz
Maths > Science
Superior/inferior parietal gyri 2661 45 −37 56 9.99 
Precuneus   −12 −70 62 7.24* 
Precuneus   −67 56 6.90* 
Inferior parietal gyrus 40   −36 −43 44 6.50* 
Angular gyrus   33 −58 44 5.87* 
Inferior/middle temporal gyri 37 304 63 −55 −7 8.20* 
Middle frontal gyrus 1077 27 53 6.76* 
Superior medial frontal gyrus   29 47 6.28* 
Inferior frontal gyrus 44   51 11 23 6.18* 
Cerebellum – 979 −33 −49 −37 6.59* 
Inferior temporal gyrus 20/37   −57 −37 −16 6.58* 
Inferior parietal gyrus 40   −36 −43 44 6.50* 
Middle frontal gyrus 45 244 45 41 23 5.78* 
Superior parietal gyrus   −30 −67 59 5.45* 
Middle frontal gyrus 329 −27 59 5.37* 
Superior frontal gyrus   −27 65 5.29* 
Precentral gyrus 44 322 −54 11 38 5.21* 
Middle frontal gyrus 45   −42 53 5.20 
Middle frontal gyrus 45 141 −42 53 5.06 
ACC 24 214 11 29 5.05 
Thalamus –   −10 14 4.77 
Cerebellum – 149 24 −67 −28 4.75 
Science > Maths
Fusiform gyrus 37/19 6791 30 −46 −13 10.5* 
Fusiform/middle occipital 19   27 −70 −10 10.0* 
Superior occipital gyrus 18   21 −97 23 9.4* 
Fusiform gyrus 37   −36 −40 −16 10.4* 
Inferior temporal gyrus 20   −39 −22 −22 8.4* 
Superior occipital gyrus 17/18   −12 −106 11 9.5* 
Parahippocampal gyrus 28   −21 −4 −22 8.4* 
Superior temporal pole gyrus 38   −33 −25 5.6* 
Parahippocampal gyrus 28   21 −4 −22 7.0* 
Middle occipital gyrus 19/39   −45 −79 −1 6.7* 
Middle temporal pole 36/38   30 −34 6.4* 
Fusiform gyrus 20   33 −4 −34 6.2* 
Middle temporal gyrus 37/39   −42 −61 20 6.4* 
Middle temporal gyrus 20   −54 −10 −13 6.1* 
Superior medial frontal gyrus 104 −3 56 47 5.8* 
Middle frontal orbital gyrus 47   −33 35 −13 5.7* 
Superior temporal pole 38   −45 14 −19 5.3* 
Middle temporal gyrus 48   51 −16 −10 5.3* 
Brain RegionL/RBAkMNIt
xyz
Maths > Science
Superior/inferior parietal gyri 2661 45 −37 56 9.99 
Precuneus   −12 −70 62 7.24* 
Precuneus   −67 56 6.90* 
Inferior parietal gyrus 40   −36 −43 44 6.50* 
Angular gyrus   33 −58 44 5.87* 
Inferior/middle temporal gyri 37 304 63 −55 −7 8.20* 
Middle frontal gyrus 1077 27 53 6.76* 
Superior medial frontal gyrus   29 47 6.28* 
Inferior frontal gyrus 44   51 11 23 6.18* 
Cerebellum – 979 −33 −49 −37 6.59* 
Inferior temporal gyrus 20/37   −57 −37 −16 6.58* 
Inferior parietal gyrus 40   −36 −43 44 6.50* 
Middle frontal gyrus 45 244 45 41 23 5.78* 
Superior parietal gyrus   −30 −67 59 5.45* 
Middle frontal gyrus 329 −27 59 5.37* 
Superior frontal gyrus   −27 65 5.29* 
Precentral gyrus 44 322 −54 11 38 5.21* 
Middle frontal gyrus 45   −42 53 5.20 
Middle frontal gyrus 45 141 −42 53 5.06 
ACC 24 214 11 29 5.05 
Thalamus –   −10 14 4.77 
Cerebellum – 149 24 −67 −28 4.75 
Science > Maths
Fusiform gyrus 37/19 6791 30 −46 −13 10.5* 
Fusiform/middle occipital 19   27 −70 −10 10.0* 
Superior occipital gyrus 18   21 −97 23 9.4* 
Fusiform gyrus 37   −36 −40 −16 10.4* 
Inferior temporal gyrus 20   −39 −22 −22 8.4* 
Superior occipital gyrus 17/18   −12 −106 11 9.5* 
Parahippocampal gyrus 28   −21 −4 −22 8.4* 
Superior temporal pole gyrus 38   −33 −25 5.6* 
Parahippocampal gyrus 28   21 −4 −22 7.0* 
Middle occipital gyrus 19/39   −45 −79 −1 6.7* 
Middle temporal pole 36/38   30 −34 6.4* 
Fusiform gyrus 20   33 −4 −34 6.2* 
Middle temporal gyrus 37/39   −42 −61 20 6.4* 
Middle temporal gyrus 20   −54 −10 −13 6.1* 
Superior medial frontal gyrus 104 −3 56 47 5.8* 
Middle frontal orbital gyrus 47   −33 35 −13 5.7* 
Superior temporal pole 38   −45 14 −19 5.3* 
Middle temporal gyrus 48   51 −16 −10 5.3* 

Contrasts thresholded at p < .001 uncorrected at the voxel level and pFWE < .05 at the cluster level. Voxels significant at pFWE < .05 at the voxel level are indicated with *.

Table A5.

Brain Regions Showing an Increase in BOLD Signal in the Animal Size Stroop Mixed > Congruent Contrast and Pokémon Go/No-Go > Go Contrast

Brain RegionL/RBAkMNIt
xyz
Animal Size Stroop Mixed Incongruent + Congruent Block > Congruent Block
Superior occipital gyrus 8567 24 −61 38 9.33* 
Middle occipital gyrus 18   −33 −91 −4 8.54* 
Inferior occipital gyrus 19   −39 −85 −4 8.46* 
Fusiform gyrus 19   39 −73 13 8.33* 
Middle occipital/middle temporal gyri 39   42 −73 20 8.31* 
Superior parietal gyrus   −24 −64 41 8.24* 
Inferior occipital gyrus 37   39 −64 10 8.23* 
Inferior parietal gyrus   −24 −49 44 7.98* 
Fusiform gyrus 37   36 −46 19 7.87* 
Middle occipital gyrus 19   33 −70 32 7.76* 
Superior parietal gyrus   −18 −67 50 7.69* 
Middle occipital gyrus 19   −27 −73 26 6.84* 
Reticular formation – 52 −34 −4 5.34 
Precentral gyrus/inferior frontal gyrus 44/6 94 42 32 5.16 
Superior frontal gyrus 178 24 53 4.89 
Thalamus – 65 21 −28 4.82 
Precentral gyrus/inferior frontal gyrus 44/6 98 −42 32 4.78 
Superior frontal gyrus 108 −21 74 4.23 
Precentral gyrus – −30 −4 47 3.78 
Inferior frontal gyrus/insula 45 55 36 32 11 4.41 
Pokémon Mixed Go/No-Go Block > Go Block
Superior frontal gyrus 76 18 74 4.67 
Precuneus/superior parietal gyrus 7/5 225 21 −61 53 4.37 
Brain RegionL/RBAkMNIt
xyz
Animal Size Stroop Mixed Incongruent + Congruent Block > Congruent Block
Superior occipital gyrus 8567 24 −61 38 9.33* 
Middle occipital gyrus 18   −33 −91 −4 8.54* 
Inferior occipital gyrus 19   −39 −85 −4 8.46* 
Fusiform gyrus 19   39 −73 13 8.33* 
Middle occipital/middle temporal gyri 39   42 −73 20 8.31* 
Superior parietal gyrus   −24 −64 41 8.24* 
Inferior occipital gyrus 37   39 −64 10 8.23* 
Inferior parietal gyrus   −24 −49 44 7.98* 
Fusiform gyrus 37   36 −46 19 7.87* 
Middle occipital gyrus 19   33 −70 32 7.76* 
Superior parietal gyrus   −18 −67 50 7.69* 
Middle occipital gyrus 19   −27 −73 26 6.84* 
Reticular formation – 52 −34 −4 5.34 
Precentral gyrus/inferior frontal gyrus 44/6 94 42 32 5.16 
Superior frontal gyrus 178 24 53 4.89 
Thalamus – 65 21 −28 4.82 
Precentral gyrus/inferior frontal gyrus 44/6 98 −42 32 4.78 
Superior frontal gyrus 108 −21 74 4.23 
Precentral gyrus – −30 −4 47 3.78 
Inferior frontal gyrus/insula 45 55 36 32 11 4.41 
Pokémon Mixed Go/No-Go Block > Go Block
Superior frontal gyrus 76 18 74 4.67 
Precuneus/superior parietal gyrus 7/5 225 21 −61 53 4.37 

Contrasts thresholded at p < .001 uncorrected at the voxel level and clusters > 50 voxels. Voxels significant at pFWE < .05 at the cluster level are indicated with †. Voxels significant at pFWE < .05 at the voxel level are indicated with *.

Corresponding author: Lucy R. J. Palmer, Department of Psychological Sciences, Birkbeck, University of London, Malet Street, London, WC1E 7HX, UK, e-mail: [email protected].

Data are available through the Birkbeck Research Data Repository.

Lucy R. J. Palmer: Formal analysis; Writing—Original draft; Writing—Review & editing. Dilini K. Sumanapala: Data curation; Formal analysis. Denis Mareschal: Conceptualization; Funding acquisition; Project administration; Supervision; Writing—Review & editing. Iroise Dumontheil: Conceptualization; Data curation; Formal analysis; Funding acquisition; Supervision; Writing—Original draft; Writing—Review & editing. The UnLocke Team: Conceptualization; Data curation; Funding acquisition; Project administration.

Funding was provided by Birkbeck College and a joint award from the Education Endowment Foundation (grant number: N - 1343) and The Wellcome Trust.

Retrospective analysis of the citations in every article published in this journal from 2010 to 2021 reveals a persistent pattern of gender imbalance: Although the proportions of authorship teams (categorized by estimated gender identification of first author/last author) publishing in the Journal of Cognitive Neuroscience (JoCN) during this period were M(an)/M = .407, W(oman)/M = .32, M/W = .115, and W/W = .159, the comparable proportions for the articles that these authorship teams cited were M/M = .549, W/M = .257, M/W = .109, and W/W = .085 (Postle and Fulvio, JoCN, 34:1, pp. 1–3). Consequently, JoCN encourages all authors to consider gender balance explicitly when selecting which articles to cite and gives them the opportunity to report their article's gender citation balance. The authors of this paper report its proportions of citations by gender category to be: M/M = .365; W/M = .349; M/W = .111; W/W = .175.

Adleman
,
N. E.
,
Menon
,
V.
,
Blasey
,
C. M.
,
White
,
C. D.
,
Warsofsky
,
I. S.
,
Glover
,
G. H.
, et al
(
2002
).
A developmental fMRI study of the Stroop color-word task
.
Neuroimage
,
16
,
61
75
. ,
[PubMed]
Alahmadi
,
A. A. S.
(
2021
).
Investigating the sub-regions of the superior parietal cortex using functional magnetic resonance imaging connectivity
.
Insights into Imaging
,
12
,
47
. ,
[PubMed]
Allaire-Duquette
,
G.
,
Bélanger
,
M.
,
Grabner
,
R. H.
,
Koschutnig
,
K.
, &
Masson
,
S.
(
2019
).
Individual differences in science competence among students are associated with ventrolateral prefrontal cortex activation
.
Journal of Neuroscience Research
,
97
,
1163
1178
. ,
[PubMed]
Allaire-Duquette
,
G.
,
Brault Foisy
,
L.-M.
,
Potvin
,
P.
,
Riopel
,
M.
,
Larose
,
M.
, &
Masson
,
S.
(
2021
).
An fMRI study of scientists with a Ph.D. in physics confronted with naive ideas in science
.
NPJ Science of Learning
,
6
,
11
. ,
[PubMed]
Amalric
,
M.
, &
Dehaene
,
S.
(
2016
).
Origins of the brain networks for advanced mathematics in expert mathematicians
.
Proceedings of the National Academy of Sciences, U.S.A.
,
113
,
4909
4917
. ,
[PubMed]
Aron
,
A. R.
(
2007
).
The neural basis of inhibition in cognitive control
.
Neuroscientist
,
13
,
214
228
. ,
[PubMed]
Aron
,
A. R.
(
2011
).
From reactive to proactive and selective control: Developing a richer model for stopping inappropriate responses
.
Biological Psychiatry
,
69
,
e55
e68
. ,
[PubMed]
Aron
,
A. R.
,
Robbins
,
T. W.
, &
Poldrack
,
R. A.
(
2014
).
Inhibition and the right inferior frontal cortex: One decade on
.
Trends in Cognitive Sciences
,
18
,
177
185
. ,
[PubMed]
Babai
,
R.
,
Eidelman
,
R. R.
, &
Stavy
,
R.
(
2012
).
Preactivation of inhibitory control mechanisms hinders intuitive reasoning
.
International Journal of Science and Mathematics Education
,
10
,
763
775
.
Bedard
,
A.-C.
,
Nichols
,
S.
,
Barbosa
,
J. A.
,
Schachar
,
R.
,
Logan
,
G. D.
, &
Tannock
,
R.
(
2002
).
The development of selective inhibitory control across the life span
.
Developmental Neuropsychology
,
21
,
93
111
. ,
[PubMed]
Benjamini
,
Y.
, &
Hochberg
,
Y.
(
1995
).
Controlling the false discovery rate: A practical and powerful approach to multiple testing
.
Journal of the Royal Statistical Society: Series B (Methodological)
,
57
,
289
300
.
Brault Foisy
,
L.-M.
,
Potvin
,
P.
,
Riopel
,
M.
, &
Masson
,
S.
(
2015
).
Is inhibition involved in overcoming a common physics misconception in mechanics?
Trends in Neuroscience and Education
,
4
,
26
36
.
Brett
,
M.
,
Anton
,
J.-L.
,
Valabregue
,
R.
, &
Poline
,
J.-B.
(
2002
).
Region of interest analysis using an SPM toolbox [abstract]
.
Presented at the 8th International Conference on Functional Mapping of the Human Brain, June 2–6
(
Vol. 16
).
Sendai, Japan
.
Brookman-Byrne
,
A.
,
Mareschal
,
D.
,
Tolmie
,
A. K.
, &
Dumontheil
,
I.
(
2018
).
Inhibitory control and counterintuitive science and maths reasoning in adolescence
.
PLoS One
,
13
,
e0198973
. ,
[PubMed]
Bunge
,
S. A.
,
Dudukovic
,
N. M.
,
Thomason
,
M. E.
,
Vaidya
,
C. J.
, &
Gabrieli
,
J. D. E.
(
2002
).
Immature frontal lobe contributions to cognitive control in children: Evidence from fMRI
.
Neuron
,
33
,
301
311
. ,
[PubMed]
Carter
,
C. S.
,
Braver
,
T. S.
,
Barch
,
D. M.
,
Botvinick
,
M. M.
,
Noll
,
D.
, &
Cohen
,
J. D.
(
1998
).
Anterior cingulate cortex, error detection, and the online monitoring of performance
.
Science
,
280
,
747
749
. ,
[PubMed]
Cauley
,
S. F.
,
Polimeni
,
J. R.
,
Bhat
,
H.
,
Wald
,
L. L.
, &
Setsompop
,
K.
(
2014
).
Interslice leakage artifact reduction technique for simultaneous multislice acquisitions
.
Magnetic Resonance in Medicine
,
72
,
93
102
. ,
[PubMed]
Cavanna
,
A. E.
, &
Trimble
,
M. R.
(
2006
).
The precuneus: A review of its functional anatomy and behavioural correlates
.
Brain
,
129
,
564
583
. ,
[PubMed]
Cieslik
,
E. C.
,
Mueller
,
V. I.
,
Eickhoff
,
C. R.
,
Langner
,
R.
, &
Eickhoff
,
S. B.
(
2015
).
Three key regions for supervisory attentional control: Evidence from neuroimaging meta-analyses
.
Neuroscience & Biobehavioral Reviews
,
48
,
22
34
. ,
[PubMed]
Constantinidis
,
C.
, &
Luna
,
B.
(
2019
).
Neural substrates of inhibitory control maturation in adolescence
.
Trends in Neurosciences
,
42
,
604
616
. ,
[PubMed]
Corbetta
,
M.
, &
Shulman
,
G. L.
(
2002
).
Control of goal-directed and stimulus-driven attention in the brain
.
Nature Reviews Neuroscience
,
3
,
201
215
. ,
[PubMed]
Coulanges
,
L.
,
Abreu-Mendoza
,
R. A.
,
Varma
,
S.
,
Uncapher
,
M. R.
,
Gazzaley
,
A.
,
Anguera
,
J.
, et al
(
2021
).
Linking inhibitory control to math achievement via comparison of conflicting decimal numbers
.
Cognition
,
214
,
104767
. ,
[PubMed]
Cragg
,
L.
(
2016
).
The development of stimulus and response interference control in midchildhood
.
Developmental Psychology
,
52
,
242
252
. ,
[PubMed]
Cragg
,
L.
,
Keeble
,
S.
,
Richardson
,
S.
,
Roome
,
H. E.
, &
Gilmore
,
C.
(
2017
).
Direct and indirect influences of executive functions on mathematics achievement
.
Cognition
,
162
,
12
26
. ,
[PubMed]
Criaud
,
M.
, &
Boulinguez
,
P.
(
2013
).
Have we been asking the right questions when assessing response inhibition in go/no-go tasks with fMRI? A meta-analysis and critical review
.
Neuroscience & Biobehavioral Reviews
,
37
,
11
23
. ,
[PubMed]
Crittenden
,
B. M.
,
Mitchell
,
D. J.
, &
Duncan
,
J.
(
2016
).
Task encoding across the multiple demand cortex is consistent with a frontoparietal and cingulo-opercular dual networks distinction
.
Journal of Neuroscience
,
36
,
6147
6155
. ,
[PubMed]
Department for Education
. (
2013b
).
Science programmes of study: Key stage 2
. https://assets.publishing.service.gov.uk/media/5a806ebd40f0b62305b8b1fa/PRIMARY_national_curriculum_-_Science.pdf
Department for Education
. (
2018
).
Schools, pupils and their characteristics: January 2018
. https://assets.publishing.service.gov.uk/media/5b31eb0840f0b67f7f306124/Schools_Pupils_and_their_Characteristics_2018_Main_Text.pdf
Diamond
,
A.
(
2013
).
Executive functions
.
Annual Review of Psychology
,
64
,
135
168
. ,
[PubMed]
du Boisgueheneuc
,
F.
,
Levy
,
R.
,
Volle
,
E.
,
Seassau
,
M.
,
Duffau
,
H.
,
Kinkingnehun
,
S.
, et al
(
2006
).
Functions of the left superior frontal gyrus in humans: A lesion study
.
Brain
,
129
,
3315
3328
. ,
[PubMed]
Dumontheil
,
I.
,
Brookman-Byrne
,
A.
,
Tolmie
,
A. K.
, &
Mareschal
,
D.
(
2022
).
Neural and cognitive underpinnings of counterintuitive science and math reasoning in adolescence
.
Journal of Cognitive Neuroscience
,
34
,
1205
1229
. ,
[PubMed]
Dumontheil
,
I.
,
Wilkinson
,
H. R.
,
Farran
,
E. K.
,
Smid
,
C.
,
Modhvadia
,
R.
,
Mareschal
,
D.
, et al
(
2023
).
How do executive functions influence children's reasoning about counterintuitive concepts in mathematics and science?
Journal of Cognitive Enhancement
,
7
,
257
275
. ,
[PubMed]
Duncan
,
J.
(
2010
).
The multiple-demand (MD) system of the primate brain: Mental programs for intelligent behaviour
.
Trends in Cognitive Sciences
,
14
,
172
179
. ,
[PubMed]
Durston
,
S.
,
Davidson
,
M. C.
,
Tottenham
,
N.
,
Galvan
,
A.
,
Spicer
,
J.
,
Fossella
,
J. A.
, et al
(
2006
).
A shift from diffuse to focal cortical activity with development
.
Developmental Science
,
9
,
1
8
. ,
[PubMed]
Durston
,
S.
,
Thomas
,
K. M.
,
Yang
,
Y.
,
Uluğ
,
A. M.
,
Zimmerman
,
R. D.
, &
Casey
,
B. J.
(
2002
).
A neural basis for the development of inhibitory control
.
Developmental Science
,
5
,
F9
F16
.
El-Baba
,
R. M.
, &
Schury
,
M. P.
(
2024
).
Neuroanatomy, frontal cortex
. In
StatPearls
.
StatPearls Publishing
. https://www.ncbi.nlm.nih.gov/books/NBK554483/
Faul
,
F.
,
Erdfelder
,
E.
,
Lang
,
A.-G.
, &
Buchner
,
A.
(
2007
).
G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences
.
Behavior Research Methods
,
39
,
175
191
. ,
[PubMed]
Fiske
,
A.
, &
Holmboe
,
K.
(
2019
).
Neural substrates of early executive function development
.
Developmental Review
,
52
,
42
62
. ,
[PubMed]
Gauthier
,
A.
,
Porayska-Pomsta
,
K.
,
Dumontheil
,
I.
,
Mayer
,
S.
, &
Mareschal
,
D.
(
2022
).
Manipulating interface design features affects children's stop-and-think behaviors in a counterintuitive-problem game
.
ACM Transactions on Computer–Human Interaction
,
29
,
1
21
.
Gauthier
,
A.
,
Porayska-Pomsta
,
K.
,
Mayer
,
S.
,
Dumontheil
,
I.
,
Farran
,
E. K.
,
Bell
,
D.
, et al
(
2022
).
Redesigning learning games for different learning contexts: Applying a serious game design framework to redesign Stop & Think
.
International Journal of Child–Computer Interaction
,
33
,
100503
.
Gavazzi
,
G.
,
Giovannelli
,
F.
,
Currò
,
T.
,
Mascalchi
,
M.
, &
Viggiano
,
M. P.
(
2020
).
Contiguity of proactive and reactive inhibitory brain areas: A cognitive model based on ALE meta-analyses
.
Brain Imaging and Behavior
,
15
,
2199
2214
. ,
[PubMed]
Gavazzi
,
G.
,
Giovannelli
,
F.
,
Noferini
,
C.
,
Cincotta
,
M.
,
Cavaliere
,
C.
,
Salvatore
,
M.
, et al
(
2023
).
Subregional prefrontal cortex recruitment as a function of inhibitory demand: An fMRI metanalysis
.
Neuroscience & Biobehavioral Reviews
,
152
,
105285
. ,
[PubMed]
Guo
,
Y.
,
Schmitz
,
T. W.
,
Mur
,
M.
,
Ferreira
,
C. S.
, &
Anderson
,
M. C.
(
2018
).
A supramodal role of the basal ganglia in memory and motor inhibition: Meta-analytic evidence
.
Neuropsychologia
,
108
,
117
134
. ,
[PubMed]
Hanushek
,
E. A.
, &
Woessmann
,
L.
(
2008
).
The role of cognitive skills in economic development
.
Journal of Economic Literature
,
46
,
607
668
.
Hartung
,
J.
,
Engelhardt
,
L. E.
,
Thibodeaux
,
M. L.
,
Harden
,
K. P.
, &
Tucker-Drob
,
E. M.
(
2020
).
Developmental transformations in the structure of executive functions
.
Journal of Experimental Child Psychology
,
189
,
104681
. ,
[PubMed]
Hautus
,
M. J.
(
1995
).
Corrections for extreme proportions and their biasing effects on estimated values of d
.
Behavior Research Methods, Instruments, & Computers
,
27
,
46
51
.
Hotez
,
P. J.
(
2021
).
Anti-science kills: From Soviet embrace of pseudoscience to accelerated attacks on US biomedicine
.
PLoS Biology
,
19
,
e3001068
. ,
[PubMed]
Hu
,
S.
,
Ide
,
J. S.
,
Zhang
,
S.
, &
Li
,
C.-S. R.
(
2016
).
The right superior frontal gyrus and individual variation in proactive control of impulsive response
.
Journal of Neuroscience
,
36
,
12688
12696
. ,
[PubMed]
Huang
,
Y.
,
Su
,
L.
, &
Ma
,
Q.
(
2020
).
The Stroop effect: An activation likelihood estimation meta-analysis in healthy young adults
.
Neuroscience Letters
,
716
,
134683
. ,
[PubMed]
Hung
,
Y.
,
Gaillard
,
S. L.
,
Yarmak
,
P.
, &
Arsalidou
,
M.
(
2018
).
Dissociations of cognitive inhibition, response inhibition, and emotional interference: Voxelwise ALE meta-analyses of fMRI studies
.
Human Brain Mapping
,
39
,
4065
4082
. ,
[PubMed]
Isherwood
,
S. J. S.
,
Keuken
,
M. C.
,
Bazin
,
P. L.
, &
Forstmann
,
B. U.
(
2021
).
Cortical and subcortical contributions to interference resolution and inhibition—An fMRI ALE meta-analysis
.
Neuroscience & Biobehavioral Reviews
,
129
,
245
260
. ,
[PubMed]
Kang
,
W.
,
Hernández
,
S. P.
,
Rahman
,
M. S.
,
Voigt
,
K.
, &
Malvaso
,
A.
(
2022
).
Inhibitory control development: A network neuroscience perspective
.
Frontiers in Psychology
,
13
,
651547
. ,
[PubMed]
Karr
,
J. E.
,
Areshenkoff
,
C. N.
,
Rast
,
P.
,
Hofer
,
S. M.
,
Iverson
,
G. L.
, &
Garcia-Barrera
,
M. A.
(
2018
).
The unity and diversity of executive functions: A systematic review and re-analysis of latent variable studies
.
Psychological Bulletin
,
144
,
1147
1185
. ,
[PubMed]
Klenberg
,
L.
,
Närhi
,
V.
,
Korkman
,
M.
, &
Hokkanen
,
L.
(
2015
).
Examining methodological variation in response inhibition: The effects of outcome measures and task characteristics on age-related differences
.
Child Neuropsychology
,
21
,
586
602
. ,
[PubMed]
Laird
,
A. R.
,
McMillan
,
K. M.
,
Lancaster
,
J. L.
,
Kochunov
,
P.
,
Turkeltaub
,
P. E.
,
Pardo
,
J. V.
, et al
(
2005
).
A comparison of label-based review and ALE meta-analysis in the Stroop task
.
Human Brain Mapping
,
25
,
6
21
. ,
[PubMed]
Lee
,
K.
,
Bull
,
R.
, &
Ho
,
R. M. H.
(
2013
).
Developmental changes in executive functioning
.
Child Development
,
84
,
1933
1953
. ,
[PubMed]
Lewis
,
F. C.
,
Reeve
,
R. A.
,
Kelly
,
S. P.
, &
Johnson
,
K. A.
(
2017
).
Evidence of substantial development of inhibitory control and sustained attention between 6 and 8 years of age on an unpredictable go/no-go task
.
Journal of Experimental Child Psychology
,
157
,
66
80
. ,
[PubMed]
Luke
,
S. G.
(
2017
).
Evaluating significance in linear mixed-effects models in R
.
Behavior Research Methods
,
49
,
1494
1502
. ,
[PubMed]
MacDonald
,
A. W.
, III
,
Cohen
,
J. D.
,
Stenger
,
V. A.
, &
Carter
,
C. S.
(
2000
).
Dissociating the role of the dorsolateral prefrontal and anterior cingulate cortex in cognitive control
.
Science
,
288
,
1835
1838
. ,
[PubMed]
Maldjian
,
J. A.
,
Laurienti
,
P. J.
, &
Burdette
,
J. H.
(
2004
).
Precentral gyrus discrepancy in electronic versions of the Talairach atlas
.
Neuroimage
,
21
,
450
455
. ,
[PubMed]
Maldjian
,
J. A.
,
Laurienti
,
P. J.
,
Kraft
,
R. A.
, &
Burdette
,
J. H.
(
2003
).
An automated method for neuroanatomic and cytoarchitectonic atlas-based interrogation of fMRI data sets
.
Neuroimage
,
19
,
1233
1239
. ,
[PubMed]
Mareschal
,
D.
(
2016
).
The neuroscience of conceptual learning in science and mathematics
.
Current Opinion in Behavioral Sciences
,
10
,
114
118
.
Marsh
,
R.
,
Zhu
,
H.
,
Schultz
,
R. T.
,
Quackenbush
,
G.
,
Royal
,
J.
,
Skudlarski
,
P.
, et al
(
2006
).
A developmental fMRI study of self-regulatory control
.
Human Brain Mapping
,
27
,
848
863
. ,
[PubMed]
Mason
,
L.
,
Zaccoletti
,
S.
,
Carretti
,
B.
,
Scrimin
,
S.
, &
Diakidoy
,
I.-A. N.
(
2019
).
The role of inhibition in conceptual learning from refutation and standard expository texts
.
International Journal of Science and Mathematics Education
,
17
,
483
501
.
Mason
,
R. A.
, &
Just
,
M. A.
(
2016
).
Neural representations of physics concepts
.
Psychological Science
,
27
,
904
913
. ,
[PubMed]
Masson
,
S.
,
Potvin
,
P.
,
Riopel
,
M.
, &
Foisy
,
L.-M. B.
(
2014
).
Differences in brain activation between novices and experts in science during a task involving a common misconception in electricity
.
Mind, Brain, and Education
,
8
,
44
55
.
Meier
,
M. A.
,
Wambacher
,
D.
,
Vogel
,
S. E.
, &
Grabner
,
R. H.
(
2022
).
Interference between naïve and scientific theories in mathematics and science: An fMRI study comparing mathematicians and non-mathematicians
.
Trends in Neuroscience and Education
,
29
,
100194
. ,
[PubMed]
Merkley
,
R.
,
Thompson
,
J.
, &
Scerif
,
G.
(
2016
).
Of huge mice and tiny elephants: Exploring the relationship between inhibitory processes and preschool math skills
.
Frontiers in Psychology
,
6
,
1903
. ,
[PubMed]
Morris
,
S.
,
Farran
,
E. K.
, &
Dumontheil
,
I.
(
2019
).
Field independence associates with mathematics and science performance in 5- to 10-year-olds after accounting for domain-general factors
.
Mind, Brain, and Education
,
13
,
268
278
.
Nenciovici
,
L.
,
Allaire-Duquette
,
G.
, &
Masson
,
S.
(
2019
).
Brain activations associated with scientific reasoning: A literature review
.
Cognitive Processing
,
20
,
139
161
. ,
[PubMed]
Nigg
,
J. T.
(
2000
).
On inhibition/disinhibition in developmental psychopathology: Views from cognitive and personality psychology and a working inhibition taxonomy
.
Psychological Bulletin
,
126
,
220
246
. ,
[PubMed]
Ordaz
,
S. J.
,
Foran
,
W.
,
Velanova
,
K.
, &
Luna
,
B.
(
2013
).
Longitudinal growth curves of brain function underlying inhibitory control through adolescence
.
Journal of Neuroscience
,
33
,
18109
18124
. ,
[PubMed]
Patel
,
A.
,
Biso
,
G. M. N. R.
, &
Fowler
,
J. B.
(
2024
).
Neuroanatomy, temporal lobe
. In
StatPearls
.
StatPearls Publishing
. https://www.ncbi.nlm.nih.gov/books/NBK519512/
Peters
,
L.
, &
De Smedt
,
B.
(
2018
).
Arithmetic in the developing brain: A review of brain imaging studies
.
Developmental Cognitive Neuroscience
,
30
,
265
279
. ,
[PubMed]
Potvin
,
P.
,
Malenfant-Robichaud
,
G.
,
Cormier
,
C.
, &
Masson
,
S.
(
2020
).
Coexistence of misconceptions and scientific conceptions in chemistry professors: A mental chronometry and fMRI study
.
Frontiers in Education
,
5
,
542458
.
Rolls
,
E. T.
,
Huang
,
C.-C.
,
Lin
,
C.-P.
,
Feng
,
J.
, &
Joliot
,
M.
(
2020
).
Automated anatomical labelling atlas 3
.
Neuroimage
,
206
,
116189
. ,
[PubMed]
Roy
,
P.
,
Rutt
,
S.
,
Easton
,
C.
,
Sims
,
D.
,
Bradshaw
,
S.
, &
McNamara
,
S.
(
2019
).
Stop and think: Learning counterintuitive concepts evaluation report
. .
Ryan
,
J.
, &
Williams
,
J.
(
2007
).
Children's mathematics 4–15: Learning from errors and misconceptions
.
Open University Press
.
Schroeter
,
M. L.
,
Zysset
,
S.
,
Wahl
,
M.
, &
von Cramon
,
D. Y.
(
2004
).
Prefrontal activation due to Stroop interference increases during development—An event-related fNIRS study
.
Neuroimage
,
23
,
1317
1325
. ,
[PubMed]
Shtulman
,
A.
, &
Valcarcel
,
J.
(
2012
).
Scientific knowledge suppresses but does not supplant earlier intuitions
.
Cognition
,
124
,
209
215
. ,
[PubMed]
Siegel
,
J. S.
,
Power
,
J. D.
,
Dubis
,
J. W.
,
Vogel
,
A. C.
,
Church
,
J. A.
,
Schlaggar
,
B. L.
, et al
(
2014
).
Statistical improvements in functional magnetic resonance imaging analyses produced by censoring high-motion data points
.
Human Brain Mapping
,
35
,
1981
1996
. ,
[PubMed]
Stanislaw
,
H.
, &
Todorov
,
N.
(
1999
).
Calculation of signal detection theory measures
.
Behavior Research Methods, Instruments, & Computers
,
31
,
137
149
. ,
[PubMed]
Stavy
,
R.
, &
Babai
,
R.
(
2010
).
Overcoming intuitive interference in mathematics: Insights from behavioral, brain imaging and intervention studies
.
ZDM – Mathematics Education
,
42
,
621
633
.
Stavy
,
R.
,
Goel
,
V.
,
Critchley
,
H.
, &
Dolan
,
R.
(
2006
).
Intuitive interference in quantitative reasoning
.
Brain Research
,
1073–1074
,
383
388
. ,
[PubMed]
Stavy
,
R.
, &
Tirosh
,
D.
(
1999
).
How students (mis-)understand science and mathematics: Intuitive rules
.
Teachers College Press
.
Suskauer
,
S. J.
,
Simmonds
,
D. J.
,
Fotedar
,
S.
,
Blankner
,
J. G.
,
Pekar
,
J. J.
,
Denckla
,
M. B.
, et al
(
2008
).
Functional magnetic resonance imaging evidence for abnormalities in response selection in attention deficit hyperactivity disorder: Differences in activation associated with response inhibition but not habitual motor response
.
Journal of Cognitive Neuroscience
,
20
,
478
493
. ,
[PubMed]
Tamm
,
L.
,
Menon
,
V.
, &
Reiss
,
A. L.
(
2002
).
Maturation of brain function associated with response inhibition
.
Journal of the American Academy of Child and Adolescent Psychiatry
,
41
,
1231
1238
. ,
[PubMed]
Tzourio-Mazoyer
,
N.
,
Landeau
,
B.
,
Papathanassiou
,
D.
,
Crivello
,
F.
,
Etard
,
O.
,
Delcroix
,
N.
, et al
(
2002
).
Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain
.
Neuroimage
,
15
,
273
289
. ,
[PubMed]
van der Laar
,
M. C.
,
van den Wildenberg
,
W. P. M.
,
van Boxtel
,
G. J. M.
, &
van der Molen
,
M. W.
(
2014
).
Development of response activation and inhibition in a selective stop-signal task
.
Biological Psychology
,
102
,
54
67
. ,
[PubMed]
Vosniadou
,
S.
,
Pnevmatikos
,
D.
,
Makris
,
N.
,
Lepenioti
,
D.
,
Eikospentaki
,
K.
,
Chountala
,
A.
, et al
(
2018
).
The recruitment of shifting and inhibition in on-line science and mathematics tasks
.
Cognitive Science
,
42
,
1860
1886
. ,
[PubMed]
Waite
,
A. M.
, &
McDonald
,
K. S.
(
2019
).
Exploring challenges and solutions facing STEM careers in the 21st century: A human resource development perspective
.
Advances in Developing Human Resources
,
21
,
3
15
.
Walker
,
D. A.
(
2003
).
JMASM9: Converting Kendall's tau for correlational or meta-analytic analyses
.
Journal of Modern Applied Statistical Methods
,
2
,
525
530
.
Walker
,
I.
, &
Zhu
,
Y.
(
2013
).
The benefit of STEM skills to individuals, society, and the economy
.
Report to Royal Society's Vision for Science and Mathematics
,
1
38
.
Wilkinson
,
H. R.
,
Smid
,
C.
,
Morris
,
S.
,
Farran
,
E. K.
,
Dumontheil
,
I.
,
Mayer
,
S.
, et al
(
2020
).
Domain-specific inhibitory control training to improve children's learning of counterintuitive concepts in mathematics and science
.
Journal of Cognitive Enhancement
,
4
,
296
314
. ,
[PubMed]
Xu
,
J.
,
Moeller
,
S.
,
Auerbach
,
E. J.
,
Strupp
,
J.
,
Smith
,
S. M.
,
Feinberg
,
D. A.
, et al
(
2013
).
Evaluation of slice accelerations using multiband echo planar imaging at 3 T
.
Neuroimage
,
83
,
991
1001
. ,
[PubMed]
Xu
,
M.
,
Xu
,
G.
, &
Yang
,
Y.
(
2016
).
Neural systems underlying emotional and non-emotional interference processing: An ALE meta-analysis of functional neuroimaging studies
.
Frontiers in Behavioral Neuroscience
,
10
,
220
. ,
[PubMed]
Zhang
,
R.
,
Geng
,
X.
, &
Lee
,
T. M. C.
(
2017
).
Large-scale functional neural network correlates of response inhibition: An fMRI meta-analysis
.
Brain Structure and Function
,
222
,
3973
3990
. ,
[PubMed]

Author notes

*

Denis Mareschal is the PI on the UnLocke project.

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. For a full description of the license, please visit https://creativecommons.org/licenses/by/4.0/legalcode.