key: cord-0298757-o7zxcgom authors: Paulus, J.P.; Vignali, C.; Coutanche, M.N. title: Schema support for forming inferences in the human brain date: 2021-11-16 journal: bioRxiv DOI: 10.1101/2021.11.15.466820 sha: f7d681718710e6f8a4428b7d9f31b3d6a4c4bcf5 doc_id: 298757 cord_uid: o7zxcgom Associative inference, the process of drawing novel links between existing knowledge to rapidly integrate associated information, is supported by the hippocampus and neocortex. Within the neocortex, the medial prefrontal cortex (mPFC) has been implicated in the rapid cortical learning of new information that is congruent with an existing framework of knowledge, or schema. How the brain integrates associations to form inferences, specifically how inferences are represented, is not well understood. In this study, we investigate how the brain uses schemas to facilitate memory integration in an associative inference paradigm (A-B-C-D). We conducted two event-related fMRI experiments in which participants retrieved previously learned direct (AB, BC, CD) and inferred (AC, AD) associations between word pairs for items that are schema congruent or incongruent. Additionally, we investigated how two factors known to affect memory, a delay with sleep, and reward, modulate the neural integration of associations within, and between, schema. Schema congruency was found to benefit the integration of associates, but only when retrieval immediately follows learning. RSA revealed that neural patterns of inferred pairs (AC) in the PHc, mPFC, and posHPC were more similar to their constituents (AB and BC) when the items were schema congruent, suggesting that schema facilitates the assimilation of paired items into a single inferred unit containing all associated elements. Furthermore, a delay with sleep, but not reward, impacted the assimilation of inferred pairs. Our findings reveal that the neural representations of overlapping associations are integrated into novel representations through the support of memory schema. Significance Statement Our ability to draw novel links between pieces of existing knowledge allows us to understand how information in memory is related. Existing knowledge (memory ‘schema’) can facilitate learning, and then integration, of new related information. We ask how the human brain uses schema to form links between related pieces of information, and how sleep and reward affect this process. Our results suggest that memory schema helps pieces of knowledge become a single inferred unit in the brain’s memory systems. A delay with sleep between learning and retrieval, though not reward, is important for how schema achieve this. The brain's ability to process and store the information it receives allows us to accumulate knowledge that informs our immediate and future behavior. To use this knowledge flexibly, we must often draw novel links between existing pieces of knowledge. For instance, learning that Boston is North of New York, and that New York is North of Philadelphia, allows us to know that Boston is North of Philadelphia, without needing it to be explicitly taught. Such "associative inferences" (Eichenbaum, 2000; Gluck & Myers, 1993) are possibly supported by the hippocampus (Greene et al., 2006; Heckers et al., 2004) and neocortical regions (Schlichting & Preston, 2015; Spalding et al., 2018) . The integration of new information into memory is supported by having a framework of related knowledge, or 'memory schema' (e.g., Bruett et al., 2018) . The existence of a memory schema can promote rapid cortical learning (Tse et al., 2007; McClelland, 2013) , likely supported by the medial prefrontal cortex (mPFC; van Kesteren et al., 2010a Kesteren et al., , 2010b Kesteren et al., , 2013 Bein et al., 2014; Brod et al., 2015; Reggev et al., 2016; van der Linden et al., 2017) . In this study, we ask how the brain uses schema to successfully infer relationships across items. During an fMRI scan, participants retrieved word pairs (half schema-congruent, half schema-incongruent) that had been directly learned, or had to be inferred. As inferred pairs are successfully integrated into memory, their neural patterns will overlap with the neural representations of the already learned underlying word pairs. If schemas draw on shared preexisting knowledge, congruent pairs should show evidence of greater memory integration, especially in cortical regions. A second experiment examines how memory integration is modulated by two important factors: sleep (Ren & Coutanche, 2021; Durrant et al., 2015; Tse et al., 2007; Lewis & Durrant, 2011) and reward (Al-Imari & Gerlai, 2008; Wolosin et al., 2012) . Based on observed neuronal replay during sleep, believed to help transfer information from hippocampal to neocortical regions (Bendor & Wilson 2012; Born et al., 2006; Walker, 2008) , we hypothesized that a delay with sleep between encoding and retrieval would lead to relatively stronger memory performance and neural integration for schema-driven inferences, relative to schema-incongruent pairs. Additionally, because dopaminergic pathways increase hippocampal activity to promote encoding (Bethus et al., 2010; O'Carroll et al., 2006) , we hypothesized that reward would enhance memory consolidation, particularly through a hippocampal-mPFC circuit (Preston & Eichenbaum, 2013; Schlichting & Preston, 2016) . The role of schema in reward is relatively understudied, so no prior hypotheses exist regarding the role played by reward in modifying schema-driven inferences. Either way, the findings will therefore establish an understanding of their relationship. Participants were native English speakers with normal or corrected-to-normal vision and no learning or attention disorders. For experiment 1, participants were recruited until 22 usable participants' datasets were available for analysis. Data from 28 participants were collected to achieve the 22-subject number, after excluding 6 from analyses based on their poor behavioral accuracy (< 60% on the immediate recognition task for directly learned pairs). The remaining 22 participants (16 females; mean [M] age = 23; standard deviation (SD) = 2.9) were included in all analyses. In experiment 2, data were collected from an independent sample of 20 healthy adults (15 females; M age = 22.3; SD = 3.1). Following the emergence of COVID-19 cases and the resulting changes in regulations imposed by the University of Pittsburgh, data collection was interrupted before reaching the goal of 22 participants. Four individuals were excluded from analyses due to poor retrieval accuracy (<60%) and one individual was excluded due to missing anatomical data, resulting in 15 suitable participants for further analysis. Participants were compensated for their participation in the study. All procedures were approved by the Institutional Review Board of the University of Pittsburgh. To study the effect of schema congruency on the retrieval of word pairs that were directly or indirectly associated in the learning phase, we used a 2 (schema congruent vs. schema incongruent) x 2 (directly learned vs. inferred) within-subjects study design. We adapted and expanded 28 sets of words (32 sets for experiment 2) from Zhang et al. (2018) , with each set containing four nouns (E.g., surgeon -school -sailor -cottage). In line with previous memory studies investigating associative inference (van der Linden et al., 2017; Shohamy & Wagner, 2008; Spalding et al., 2018; Zhang et al., 2018) , each of the four concepts in a set represents a component of the AB-AC inference paradigm (A -B -C -D). This design allows us to test a participant's memory of the word pairs that were directly learned in the learning phase (AB, BC, CD; direct pairs), as well as word pairs that were indirectly learned (associated by several links: AC, AD; inferred pairs). In half of the sets, the four words were strongly thematically related within a schema (e.g., teacher -classroom -student -dormitory), while the remaining sets were incongruent to a single schema (e.g., surgeon -school -sailor -cottage). A and C represented people, and B and D represented locations. For experiment 2, two additional sets were added to the congruent and incongruent conditions, resulting in 32 total sets (16 congruent, 16 incongruent), in order to increase statistical power for the additional reward factor (half associated with a reward; details below). Words within a congruent set were selected based on their semantic relatedness; calculated through Latent Semantic Analysis (LSA; Landauer & Dumais, 1997) . LSA can be used to measure the semantic association between two words by observing their concurrence in a large corpus of texts (Landauer & Dumais, 1997) . Higher LSA values correlate with a stronger semantic relationship between the two words. Given the nature of our forced-choice training task, each set additionally had six location foils and three people foils that were either congruent or incongruent to the schema, depending on the congruence condition of the set. While the foils for the incongruent sets did not share a schema with that set's A, B, C, or D, they shared a schema with words seen in one of the congruent sets. This design prevented participants from selecting the correct pairs of words during the forced-choice tasks simply based on their knowledge that the foil did not belong to any schema they had encountered during the training. The experimental procedure was amended from a recent behavioral study of memory performance based on schema congruency (Zhang et al., 2018) . All task procedures were created in Matlab 2017a using the Psychophysics Toolbox Version 3 (PTB-3) package for presenting stimuli on-screen. Prior to any participation, participants were taken through the informed consent process, after which they completed a brief demographic questionnaire. Participants were initially trained on a computer to learn word pairs (AB, BC, CD) using a forced-choice associative inference task until they successfully reached criterion (more details below; Fig. 1A ). Memory for the word pairs that were explicitly taught (AB, BC, CD) and (untaught) inferred pairs (A-C, A-D) were subsequently tested in a similar task (Fig. 1C ). Following the retrieval task, participants performed a localizer in which they rated how likely two objects (belonging to one of the schemas from the study) are found in the same location (experiment 1) or a monetary incentive delay (MID) task (adapted from Knutson et al., 2001) . These localizers were not used in analyses of this experiment. After the scan, participants completed several memory and sleep related self-report questionnaires. During experiment 1, participants were trained to learn associated word pairs through a forced-choice paradigm, though they were never explicitly told that they would be subsequently tested on their memory of the word pairs. Participants were presented with a cue word (representing A, B, or C from a set) at the top of the screen. Two additional words, the associated word (target) and a foil, were shown on the bottom left and right of the screen (3 sec). The positions for target items and foils for each presentation were randomized so that the target item was presented equally on each side of the screen across trials. Participants were instructed that they had three seconds to select the word that was associated with the cue. Following the participant's response, "Correct" or "Incorrect" was displayed in the center of the screen for 1 second to indicate whether they had selected the correct word. If no response was recorded in the allotted 3 seconds, the screen displayed "Too Late" and proceeded to the next item, without showing the correct choice. As participants were blind to the correct associations on their first encounter with a certain cue word, they would have to randomly choose one of the two choice words (50% chance of successfully selecting the target word) and rely on feedback (1 sec) to learn and inform their response for subsequent trials. The task was divided into four sections, each containing a different subset of the word pairs. Once all of the words within a section were shown once, the word pairs were shown again in the same order to maximize the amount of time between presentations. In all repeat instances of a given item, the foil was always the same. Pairs were removed from the pool after three consecutive correct trials, thereby reducing the number of pairs presented until all pairs were learned to this criterion. Once all four sections were completed, the entire process was repeated for two more cycles to ensure that the word pairs had been successfully learned. While the cue word was paired with the same target word across the three cycles, the foils were unique to that cycle. Experiment 2 followed the same training protocol with the addition of reward cues. To investigate the influence of reward, certain word pairs were associated with a possible future monetary reward. Participants were instructed that correctly learning rewarded items would lead to greater reward later in the study. A symbol indicating the reward potential for each item was presented at the center of the screen before every word pair (1 sec). A picture of a half dollar coin was used to indicate that the following item would be rewarded, while unrewarded items were preceded by a gray circle of the same size with a line through it (Fig. 1B) . Due to the increase in sets from experiment 1, the training was divided into 8 sections (rather than the 4 in Experiment 1). The retrieval phase took place immediately after the training phase (experiment 1) or approximately 24 hours later (experiment 2). Although the retrieval phase was designed to be as similar to the training phase as possible, some changes were made to adapt the task for fMRI. A rapid event-related (rER) design was used, with OptSeq2 providing the presentation sequence to optimize extraction of the signal (Dale, 1999) . Within stimulus presentations, a jittered fixation cross (0-10 sec) was shown to provide a buffer between groups of stimuli, allowing us to isolate the hemodynamic response function associated with each stimuli type. In the retrieval task, participants were tested on their memory of the word pairs they had learned in the training. As in the training, the cue word was displayed at the top of the screen. Participants selected which of the two words on the bottom of the screen had been associated with the cue during training, using two buttons on an fMRI compatible response glove. The location of the target word and foil was randomized to appear on the left and right side of the screen equally. Unlike the training task, participants only had 2 seconds (from 3 secs in training) to respond before the trial was marked as incorrect. Participants were instructed to respond as quickly and as accurately as possible and that no feedback would be provided. In addition to being tested on the directly learned pairs from the training (AB, BC, CD), participants would sometimes have to select target words that were indirectly associated (inferred; AC, AD). The order of presentation was randomized across runs; however, inferred pairs were always presented before direct pairs so that retrieval of direct pairs would not interfere by reinforcing these relations. The retrieval task was separated into four functional runs. The first run contained half of the word pairs, and the second run contained the other half. Runs 3 and 4 consisted of the word pairs from runs 1 and 2 respectively, but with different foils (selected from the training phase), so that each word pair was tested twice in the retrieval phase. Each run had an equal number of schema congruent and incongruent trials. In experiment 2, the items that had been associated with a reward cue during the training were now eligible for additional monetary compensation based on accuracy on those items. Participants were instructed that each rewarded pair would be associated with a $0.50 reward. Since we were interested in the effect of reward during learning, the reward status of each item (previously reward-cued versus not) was not indicated during the recognition task. We used Freesurfer to create anatomically-defined regions of interest (ROIs) for each participant. We created ROIs for the mPFC, a neocortical region found to be involved in the processing of schema congruent information, as well as the hippocampus (van Kesteren et al., 2010 (van Kesteren et al., , 2012 Spalding et al., 2018; Schlichting & Preston, 2014 ). An additional neocortical region, the angular gyrus (AG), implicated in schematic memory was included in our ROI selection (Gilboa & Marlatte, 2017; Wagner et al., 2015) . Following literature that has found functional differences between the anterior and posterior hippocampus (Bein et al., 2019) , we manually divided the hippocampus into two halves along the coronal axis using AFNI (Cox, 1996) . To investigate the role of reward on rapid memory integration, we created ROIs for the nucleus accumbens (NAc) and substantia nigra (SN; Knutson et al., 2001; Adcock et al., 2006) . fMRI Data The scanning parameters were identical for both experiments. Data were collected on a 3.0T Siemens Magnetom Verio equipped with a 32-channel coil. For blood-oxygen level dependent (BOLD) fMRI images, T2-weighted scans were acquired using the following parameters: repetition time (TR) = 2000 ms, echo time (TE) = 30 ms, 2mm slice thickness with 69 slices per run, 2x2x2 mm voxel size, field of view = 212x212 mm, flip angle = 79º. For anatomical images, taken after functional scans, T1-weighted scans were taken from the sagittal plane at 1mm isotropic resolution using an MPRAGE scan with: TR = 2300 ms, TE = 1.97 ms, inversion time (TI) = 900 ms, slice thickness = 1 mm, resolution size = 1 mm isotropic, flip angle = 9 degrees, field of view = 256x256 mm. All preprocessing was conducted using AFNI (Cox, 1996) . Functional data were slicetime corrected, aligned to a base volume, and scaled to have a mean signal of 100 with lower and upper bounds of 0 and 200. Visual inspection of the motion time series confirmed that all participants moved no more than 1.5 mm once in any dimension from one volume to the next. Multi-voxel pattern analysis (MVPA) was conducted using a two-fold cross-validation structure, in order to investigate the information within patterns of activation for directly learned and inferred associations within ROIs. For a given condition, beta coefficients for each item in runs 1 and 3, and then runs 2 and 4, were determined using a generalized linear model. The beta coefficients were then used to train a Gaussian Naïve Bayes (GNB) model. To balance the number of trials from each condition, a randomized subset of directly learned trials were selected to match the smaller number of inferred trials. This process was repeated 100 times in both directions (training on items from runs 1 and 3, and then on items from runs 2 and 4) for each ROI and participant. Accuracy for the model was calculated for each fold within an iteration, and averaged after completion of all 100 iterations. Group-level tests determined if classifier accuracy across participants differed significantly from chance. Six participants were removed due to low retrieval accuracy (<60%) on the directly learned pairs. The remaining 22 participants demonstrated above-chance retrieval performance for direct (p < .001, t(21) = 17.92) and inferred (p = .008, t(21) = 2.92) pairs. As expected, participants performed significantly better (p < .001, t(21) = 5.86) on word pairs they had directly learned (M = .83; sd = .09) than on inferred pairs (M = .61; sd = .18), including for schema-congruent (p < .001, t(21) = 5.59) and incongruent (p < .001, t(21) = 5.68) separately (Fig. 3) . The direct and inferred pairs showed a significant difference in responsetimes (p < .001, t(21) = -7.70) with faster RTs for direct (M = 1.29; sd = .11) than inferred (M = 1.49; sd = .17), in both the congruent (p < .001, t(21) = -7.03) and incongruent (p < .001, t(21) = -7.33) conditions. For more details on behavioral performance, see Table 2 . In order to check whether rapid memory integration was successful in the schema congruent and incongruent items, we tested whether group accuracies in the inferred pairs exceeded chance. Both the congruent inferred (p = .001, t(21) = 3.74) and incongruent inferred (p = .047, t(21) = 2.11) pairs were significantly above chance. Accuracy did not differ between congruent and incongruent inferred pairs (p = .171, t(21) = 1.42). Incongruent inferred RTs were significantly slower than congruent inferred RTs (p < .001, t(21) = -3.89). fMRI Results We first trained a GNB classifier to distinguish between directly learned versus inferred pairs. We found significant classification (Fig. 4) We next examined whether neural patterns for the directly learned AB and BC pairs were reactivated during the retrieval of the linked AC pairs. We correlated the patterns for each set's AB pair with the same set's AC pair. Similarly, we also correlated the BC pair with the AC pair. In a linear mixed effects (LME) regression model for each ROI, schema-congruency was a significant predictor (Fig. 6A) In order to see whether the higher correlations in congruent pairs are simply a result of semantic similarity within the same schema (e.g., due to a shared topic), we correlated AB with BC and found that incongruent sets were more highly correlated than congruent sets in all three regions (PHc: p < .001, t(21) = -10.70; mPFC: p < .001, t(21) = -10.52; posterior hippocampus: p < .001, t(21) = -16.82), suggesting that the higher pattern reactivation of the constituent pairs during retrieval is not simply due to overlapping semantic content (Fig. 6 ). Overall, participants recorded a mean retrieval accuracy of 67.83% on all directly learned trials, indicating that items were directly learned to criterion on day one. Three-factor repeated measures analysis of variance (ANOVA) between congruency, reward, and inference indicated no main effect of congruency on retrieval performance (F(1,14) = 0.012, p = 0.92). However, the same analysis showed a significant main effect of inference on retrieval accuracy (F(1,14) = 185.264, p < .001) as well as a significant main effect of reward (F(1,14) = 6.017, p = 0.028). No significant interaction was observed between any of our three factors. No differences were observed in retrieval performance between congruent and incongruent directly learned items during retrieval (p = 0.991, t(14) = 0.012), which suggests that any difference between congruent and incongruent accuracies found elsewhere are not due to differing degrees of learning. Significant differences were found between rewarded and unrewarded retrieval accuracies (p = 0.010, t(14) = -3.005, Table 3 Finally, congruent directly learned and inferred pairs showed a significant difference in retrieval accuracy (p < 0.001, t(14) = 9.222); this results was replicated for incongruent pairs (p < 0.001, t(14) = 8.878). Both congruent (p < 0.001, t(14) = 4.908) and incongruent (p = 0.0015, t(14) = 3.932) linked pairs were retrieved at levels above chance, which is predictive of memory integration. Similarly, response times for directly learned incongruent items were significantly faster than incongruent inferred (p < .001, t(14) = -5.51). No significant difference was recorded between congruent inferred and incongruent inferred items (p = .397, t(14) = -.874). In further analysis, response times were broken down into direct/inferred and then rewarded/unrewarded groups (Fig. 8) . Here we found evidence that in all conditions, reward led to faster response times (p < 0.001). However when collapsing across congruency and connection, no significant difference was observed between rewarded and unrewarded items (p = 0.879, t(14) = -0.155). fMRI Results The primary analysis tested for differences in pattern activation for ROIs during retrieval of directly learned and inferred items. Here, we observed differences in patterns for inferred and (M = .520, t(14) = .926, p = .370) were not significantly above-chance (Fig. 11 ). An additional paired t-test of classifier accuracies for direct/inferred patterns within ROIs found significantly different accuracies for congruent and incongruent contrasts within the AG (t(14) = -2.60, p <.05) (Fig. 11 ). Colors correspond to Figure 2 Finally, MVPA was performed for rewarded vs. unrewarded items within directly learned and inferred pairs. Two-way repeated measures ANOVA was run between ROI and reward and found a main effect of ROI (F(6,84) = 6.662, p < 0.001, r =0.322), but no main effect of reward (F(1,14) = 0.136, p = 0.718, r = 0.010). Indicating that differences in classification for this contrast appeared due to differences between ROIs but not due to the influence of reward. While no effects were seen in the current study, additional investigation with a larger subject pool may have the power required to find effects of reward. Suggesting that correlations between linked pairs and their constituents were more similar for congruent sets than incongruent sets in all ROIs (see Fig. 13 ). influenced by schema congruency in the mPFC (Fig. 14) . Finally, a two-way repeated measure mixed model ANOVA was conducted between MVPA results of directly learned vs. linked contrasts for experiment 1 and experiment 2, collapsed across reward, in order to determine if an interaction existed between rest and congruency. We found an interaction between rest and congruency (F(1,33) = 7.466, p = 0.010, r = 0.185). Using an associative inference task, we investigated how memory schema affects memory integration. After a night of sleep, the neural representations of schema-congruent inferred pairs were more similar to their constituent pairings (AB, BC), than schema-incongruent inferences, across regions of the memory network. Successfully integrated inferred pairs demonstrated higher pattern similarity in the mPFC with their directly learned constituents than did unsuccessfully integrated pairs, for congruent schema, but only after a night of sleep. These results suggest that delay between encoding and retrieval, paired with a delay with sleep, affects the influence of schema congruency on the brain's memory network. Behavioral measures confirmed that participants were able to successfully integrate both schema congruent and incongruent information regardless of the delay between encoding and retrieval. When retrieval immediately followed encoding, the accuracy of inferred pairs did not differ between congruent and incongruent sets; however, schema congruent inferred pairs were retrieved faster, suggesting that congruency may begin to have a beneficial effect on the integration of inferred pairs. Although not entirely conclusive due to lack of statistical significance in accuracy, these RT findings provide some support for prior results that associative pairs are better integrated when congruent with a schema (Zhang et al., 2018) . When retrieval occurred the day following encoding, participants showed no difference in either response time or accuracy based on schema congruency. One possible explanation for this discrepancy could come from the experimental design of the training phase. In this study, the training task was modeled after the retrieval task, which itself was modified to be compatible for fMRI. Unlike the training phase used by Zhang et al. (2018) in which participants were initially shown the direct pairs with no foils and subsequently given a cued-recall test where they typed the target word, our training phase involved a forced-choice paradigm where the target word was always accompanied by a foil. It is possible that the presence of a foil during learning led to increased interference with the target word, which is consistent with the generally higher behavioral performance observed in Zhang et al. despite using similar stimuli and retrieval tasks. Such interference would be especially detrimental for congruent pairs as they share semantic relationships, and could explain why we did not find such a strong effect of congruency. Building on literature suggesting that associated items are represented more similarly than unrelated items during encoding (Schapiro et al., 2012) , we investigated whether neural patterns for inferred pairs are represented similarly to the underlying learned pairs during retrieval. Additionally, we investigated whether factors that are known to improve memory performance such as sleep (Landmann et al., 2014, Stickgold and Walker, 2013) and rewardmediated motivation (Adcock et al., 2007 , Knutson et al., 2001 would change the effect of congruency on memory integration. Using machine learning to decode multivariate patterns of activity, we identified regions that represented directly learned pairs differently from inferred pairs. In line with research suggesting that hippocampal and neocortical (Schlichting and Preston, 2014; Spalding et al., 2018) involvement is necessary for associative inference, we learned and inferred pairs were represented differently in the posHPC and mPFC. Additionally, the AG, which serves as a semantic hub for schema congruent information (Gilboa and Marlatte, 2017; Wagner et al., 2015) , showed a similar effect. These findings were further supported by results in a second experiment, in which the mPFC and AG were again identified as representing direct and inferred pairs separately. The posHPC was no longer significant in this second experiment, possibly reflecting hippocampal disengagement as information is increasingly consolidated in the mPFC (van Kesteren et al., 2010a) . Examining the role of schema congruency, surprisingly, the first experiment did not identify distinct neural representations for direct and inferred pairs, despite the benefits of schema congruency on memory (Zhang et al., 2018) . Our second experiment revealed a difference in neural representations in the mPFC and AG between congruent and incongruent sets, with the latter showing increased pattern distinctiveness. A mixed-model repeated measures ANOVA between classifier accuracies from experiments 1 and 2 revealed a significant interaction between delay with sleep and congruency. These findings suggest that delay with sleep was necessary for the beneficial effect of schema congruency to be apparent. The final examined modulation included in experiment 2 set out to investigate the influence of reward on memory integration and its interaction with schema. Behavioral data did not support our original hypothesis that reward would benefit both schema congruent and incongruent sets. Interestingly, retrieval performance of schema-congruent (but not schemaincongruent) pairs was hindered for rewarded word pairs that were congruent with a schema when they were associated with reward, while incongruent word pairs retrieval was not influenced by reward. One possible explanation is that reward led to increased interference between learned pairs and the foils of congruent sets. Evidence from prior literature suggests that reward reduces behavioral interference (Krebs et al., 2013) , however disruptive effects of dopamine in the mPFC during working memory tasks have been observed in mice (Valentim Jr. et al., 2009) , though reward has also reduced behavioral interference as well (Krebs et al., 2013) . Increases in dopamine in the mPFC during reward-associated learning may have caused the behavioral detriment observed in this study. Additionally, increased activity of the mPFC during encoding and retrieval of schema congruent, but not incongruent, information (Brod et al., 2015 , Ghosh et al., 2014 could explain why this was not observed in schema incongruent sets, which has been found to rely on greater hippocampal activity (van Kesteren et al., 2010a) . Further investigation is required in order to fully understand the effect of reward on schematic memory. These early results in the examination of reward and schema suggest the two may not interact in this context, but further work is warranted to understand if this applies across all tasks and learned material. RSA analysis that compared the neural representations of inferred pairs AC and their constituents (AB and BC) revealed greater pattern similarity for congruent than incongruent pairs in the PHc, mPFC, and posHPC. Subsequent analyses revealed that these results were not driven simply by shared semantic content between schema congruent pairs. These findings suggest that schema facilitates the assimilation of paired items into a single inferred unit containing all associated elements (Burton et al., 2016) . In experiment 2, we found similar results, indicating that direct and inferred pairs were more similar for schema congruent pairs in the PHc, AG, mPFC, antHPC, and posHPC. Further analyses between both experiments revealed that a delay with sleep between encoding and retrieval increased the assimilation of inferred pairs. After a night of sleep, successfully integrated pairs were represented more similarly to their constituents than were unsuccessfully integrated pairs, in the mPFC, when the words were congruent with a schema. In incongruent sets, however, successful integration had no effect on pattern similarity. These findings expand on the prior literature of associative inference that has shown that reactivation of directly learned patterns during encoding of new events (i.e reactivation of AB when learning BC) predicts subsequent memory performance (Kuhl et al., 2010; . Our results suggest that the mPFC, which is heavily involved in schema (Brod et al., 2015) , gradually shifts the representations of inferred pairs during retrieval to become more similar to direct pairs. Reward-motivated learning: Mesolimbic activation precedes memory formation Sight of conspecifics as reward in associative learning in zebrafish (Danio rerio) Prior knowledge influences on hippocampus and medial prefrontal cortex interactions in subsequent memory The role of prior knowledge in incremental associative learning: An empirical and computational approach Biasing the content of hippocampal replay during sleep Dopamine and Memory: Modulation of the Persistence of Memory for Novel Hippocampal NMDA Receptor-Dependent Paired Associates Sleep to remember Differences in the neural signature of remembering schema-congruent and schema-incongruent events Expertise Moderates Incidentally Learned Associations Between Words and Images Associative independence revisited: Competition between conflicting associations can be resolved or even reversed in one trial AFNI: Software for Analysis and Visualization of Functional Magnetic Resonance Neuroimages Optimal experimental design for event-related fMRI Schema-conformant memories are preferentially consolidated during REM sleep A cortical-hippocampal system for declarative memory Schema Representation in Patients with Ventromedial PFC Lesions Neurobiology of Schemas and Schema-Mediated Memory Hippocampal mediation of stimulus representation: A computational theory An fMRI Analysis of the Human Hippocampus: Inference, Context, and Task Awareness Hippocampal activation during transitive inference in humans Anticipation of Increasing Monetary Reward Selectively Recruits Nucleus Accumbens Reward Associations Reduce Behavioral Interference by Changing the Temporal Dynamics of Conflict Processing Resistance to forgetting associated with hippocampus-mediated reactivation during new learning A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge The reorganisation of memory during sleep Overlapping memory replay during sleep builds cognitive schemata Angular Gyrus Involvement at Encoding and Retrieval Is Associated with Durable But Less Specific Memories Incorporating rapid neocortical learning of new schema-consistent information into complementary learning systems theory Dopaminergic modulation of the persistence of one-trial hippocampus-dependent memory Interplay of hippocampus and prefrontal cortex in memory Distinct Neural Suppression and Encoding Effects for Conceptual Novelty and Familiarity Sleep reduces the semantic coherence of memory recall: An application of latent semantic analysis to investigate memory reconstruction D1 dopamine and NMDA receptors interactions in the medial prefrontal cortex: Modulation of spatial working memory in rats Shaping of object representations in the human medial temporal lobe based on temporal regularities Memory reactivation during rest supports upcoming learning of related content Memory integration: Neural mechanisms and implications for behavior Hippocampal-medial prefrontal circuit supports memory updating during learning and post-encoding rest Integrating Memories in the Human Brain: Hippocampal-Midbrain Encoding of Overlapping Events Ventromedial Prefrontal Cortex Is Necessary for Normal Associative Inference and Memory Integration Sleep-dependent memory triage: Evolving generalization through selective processing Schemas and Memory Consolidation Persistent schema-dependent hippocampal-neocortical connectivity during memory encoding and postencoding rest in humans Retrieval of Associative Information Congruent with Prior Knowledge Is Related to Increased Medial Prefrontal Activity and Connectivity Differential roles for medial prefrontal and medial temporal cortices in schemadependent encoding: From congruent to incongruent How schema and novelty augment memory formation Schematic memory components converge within angular gyrus during retrieval. ELife, 4, e09668 Sleep-dependent memory processing Reward Modulation of Hippocampal Subfield Activation during Successful Associative Encoding and Retrieval Hippocampal and ventral medial prefrontal activation during retrieval-mediated learning supports novel inference Fast Memory Integration Facilitated by Schema Consistency