In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. Neuroimaging contributions to the understanding of discourse processes Robert A. Mason and Marcel Adam Just Center for Cognitive Brain Imaging Department of Psychology Carnegie Mellon University Pittsburgh, PA Neuroimaging research is providing new types of information and insight about the cortical activity underlying discourse processing. Knowing the intensity and location of the brain activity during discourse comprehension adds significantly to the information provided by behavioral measures alone. The combination of neuroimaging data and behaviorally based discourse theories indicate that discourse processing is underpinned by a system of several distinguishable cortical networks that are activated for discourse processing, above and beyond the activation evoked by comprehension at the word and sentence level. Whereas the multiplicities of the processes in discourse comprehension are sometimes seen as a drawback to behavioral experiments, it is something of a benefit in neuroimaging research. Controlled neuroimaging experiments, with their multidimensional measures, can help determine when each of these components contributes to discourse processing. By making some assumptions about the cortical regions/network that underlie this processing, we can begin to determine when an area becomes activated and to what degree it is activated as a function of the discourse properties. In this chapter, we describe some key action), are relatively new to the discourse processing neuroimaging studies of discourse processing, and theory (although comprehending the motivations of observe some systematic patterns of results that apply characters in a story is probably an ancient skill). At the across the described studies. To foreshadow these same time, there is uncertainty about the reality of these patterns, we list here five specialized networks we networks and about their anatomical location. Moreover, believe to be involved in discourse processing. these networks must function in interaction with somewhat lower level comprehension processes that Parallel Networks of Discourse operate at the lexical and sentence level (for a review see: Bookheimer 2002; Gernsbacher & Kaschak, 2003). 1. A coarse semantic processing network Despite these cautions, it seems useful to consider the (right middle and superior temporal) recent research with some framework in mind, and we 2. A coherence monitoring network suggest that these five networks/discourse functions (bilateral dorsolateral prefrontal) provide an initial attempt at such a framework. As we 3. A text integration network (left inferior proceed through the chapter we will elaborate on the frontal-left anterior temporal) characteristics of these networks as they are illuminated 4. A network for interpreting a protagonist’s by the various studies. At the conclusion of the chapter or agent’s perspective (bilateral medial we will expand on a theoretical framework based on frontal/posterior right temporal/parietal) these networks. 5. A spatial imagery network (left dominant, Although the focus of this paper is decidedly not on bilateral intraparietal sulcus) “where” in the brain discourse processing occurs, a brief overview of which areas of the brain play a role in This list should be treated with caution and with discourse comprehension is useful. In almost every excitement. It is exciting that some of the discourse processing task, the traditional left hemisphere components of discourse processing revealed by language network activates in a contrast with a fixation neuroimaging research, like protagonist perspective baseline condition. This traditional left hemisphere monitoring (here we use protagonist as a shorthand language network includes the left hemisphere inferior for any agent in the story capable of intentional frontal gyrus, the superior and middle temporal gyrus, In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. potions of the inferior temporal gyrus and the angular lower levels of language processing. Both gyrus. In addition to this basic language processing circumstances would make the deficit less apparent to network, we expected additional discourse networks clinicians. In recent years, however, more sensitive to activate during discourse processing. An overview neuropsychological investigations have detected of the Parallel Networks of Discourse and a rough impaired discourse functioning at several levels and have sketch of the cortical regions in which they are found such impairments to be correlated with right localized are shown in Figure 1. This schematic hemisphere damage. Unlike the situation with Broca’s representation is shown via surface projection on a and Wernicke’s patients, consistent focal lesions have rendered brain. It is not intended to depict an not been found in these discourse aphasics. Various exhaustive account of discourse processing networks patients have lesions in right hemisphere homologues of but instead highlight some key areas. It is likely that Broca’s area (e.g., the inferior frontal gyrus), Wernicke’s the networks are differentially engaged in the area (e.g., posterior-superior and middle temporal gyrus) research presented here and the localization of the as well as the right hemisphere angular gyrus, peak activation for a specific task could be expected dorsolateral prefrontal cortex and the medial frontal to vary somewhat within an anatomical region. gyrus. Additionally, several other populations, such as Discourse theories become critical in developing individuals with autism (Dennis, 2001) and Alzheimer’s this understanding of the cortical discourse patients (Papagno, 2001), with non-focal lesions or no processing network. In addition, neuroimaging lesions at all have difficulty with discourse processing. research has led to the development of several new Thus, unlike the case for lower-level language discourse theories such as the coarse coding theory of processing research, neuropsychological studies of right hemisphere processing (Beeman, 1998), the discourse processing in brain-damaged and other special dynamic recruitment of networks in response to text populations did not provide much information about constraints (Ferstl, Rinck & von Cramon, 2005; brain function other than a generalized notion that the Mason and Just, 2004; Xu et al., 2005), a Theory of right hemisphere was somehow involved in discourse Mind system responsible for awareness of different processing. perspectives (Gallagher and Frith, 2003), and the It was not until the late 1980’s that researchers spillover of processing to other differential began using brain imaging as a technique to investigate specialized networks in response to capacity cognitive process. At the time neuroimaging was utilization (Just et al., 1996). The cortical activation primarily based on Positron Emission Tomography and these new theories are based on, or at least (PET). One of the earliest neuroimaging investigations consistent with, traditional discourse theories such as of a “discourse variable” was a PET study of metaphor Kintsch’s Construction-Integration framework comprehension conducted by Bottini et al., (1994). (1988), Gernsbacher’s Structure Building Framework Subjects listened to either literal or metaphorical (1990), Myers and O’Brien’s Resonance model sentences, including such examples as: (1998), Giora’s Graded Salience Hypothesis (1997) plausible: “The investors were squirrels collecting and van den Broek’s Landscape architecture (1996) nuts” as well as others. implausible: “The investors were trams.” The Beginnings of Brain Imaging in Discourse The sentences containing metaphors elicited more Comprehension activation in the right hemisphere, particularly the right Dating back to Broca's and Wernicke’s findings inferior frontal gyrus and right posterior temporal cortex. on brain-damaged patients with specific language The increased involvement of the right hemisphere may deficits in the late 1800’s, psychologists have had have been the result of an inference process that some idea of the brain’s functioning as a language combined world knowledge with the contents of the processing mechanism. There is however, no sentence to resolve the ambiguity. This early finding of similarly well-known case of a patient with a deficit right hemisphere involvement in metaphor in discourse processing abilities. One reason for this comprehension, combined with neuropsychological and lack may be the difficulty in defining what is meant visual field presentation data, contributed to the by a discourse processing deficit. Another reason for hypothesis that the right hemisphere is critically the lack of reports on patients with a discourse- involved in metaphor comprehension and in other facets processing deficit is that many patients with such of discourse processing. This view is still widely held deficits are either still able to function well in today although much refined. The right hemisphere everyday life or they also have severe deficits at coarse-coding hypothesis for example (described more 2 In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. Figure 1. A Schematic representation of the Parallel Networks of Discourse. Shaded regions represent surface rendered anatomical regions as described in the text. A rough localization within anatomical regions are represented by colored ellipses. 3 In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. fully in the lateralization section) developed by appropriate experimental paradigms within the Beeman and colleagues (Beeman, 1993; Beeman, constraints of brain scanning methodology. A brief 1998; Beeman et al., 1994; Brownell et al., 1986) is a consideration of imaging methodology will make this broader theory which encompasses the right point more clear. Imaging of brain activity using PET hemisphere discourse view. (Positron Emission Tomography) requires uptake of a Nichelli, et al. (1995) was among the first to tracer substance into the blood stream and into the brain visually present passages consisting of multiple during task performance. The relative insensitivity to sentences (Aesop’s Fables) passages in a the tracer requires long sampling intervals, such that neuroimaging study. (Another early auditory study brain activity can be measured only over many tens of was reported by Mazoyer et al., 1993.) Participants in seconds. As a result, PET studies have to utilize a Nichelli’s PET study were asked to monitor either “blocked” or “epoch” design in which many trials or semantic details, syntactic details or the moral of the items of the same type (i.e. constituting the same fable. Nichelli et al. concluded that the extra experimental condition) are presented in a sequence or activation observed in the right hemisphere during “block,” and brain activity is measured during the moral monitoring and not in the other conditions was processing of the entire block of stimuli. The activity is due to the drawing of an inference about the passage. then contrasted with other blocks (experimental Although there may have been some alternative conditions) in which a different set of processes occur. accounts offered for this finding, this early text The activity in the experimental conditions is typically comprehension study advanced the view that there estimated with respect to a baseline task, usually a was something special about the role of the right simpler task that is believed to be common to the two or hemisphere in discourse processing. In the decade more experimental conditions. The resulting image is since these early discourse studies, considerable then “subtracted” from the experimental blocks so that advancement has been made in both methods and the non-baseline processing can be isolated. Even with theory. some shortcomings of the subtraction paradigm There were several possible reasons for the (Newman et al., 2001) this methodology was very scarcity of neuroimaging research at the discourse fruitful for the early days of imaging. At the very least it processing level, some of which remain as problems: served to illuminate the extent of various networks (1) neuroimaging research into language processing engaged in many cognitive processes. But the nature of at any level is fairly new, (2) discourse processing is PET imaging led to some concessions in experimental a broad field without a tradition for concern about design. Because PET is less sensitive and requires a neural mechanisms, (3) many of the key research radioactive tracer, most mainstream functional questions that have been asked in the discourse neuroimaging has turned to functional Magnetic processing area are not easily addressed with Resonance Imaging, or fMRI. neuroimaging (due to the sluggish nature of the The underlying assumption of fMRI is that it is a hemodynamic response and the low temporal measure of neuronal activity, which should increase in sampling rate), (4) several of the frequently-used some area when a cognitive process makes use of paradigms used to investigate discourse processing substrate in that area. This increase in neuronal activity are not easily adaptable to the scanner environment results in an increase in local blood flow and volume. (e.g., talk aloud protocols, naming), and (5) The oxygen content is then elevated in cortical areas neuropsychological research on discourse processing which are being used. This will result in an increase in deficits is limited. Recent advancements in the MR signal, which is affected by the ratio of technology and knowledge about cortical function deoxygenated hemoglobin to oxygenated hemoglobin have alleviated many of these constraining factors. (Cohen & Bookheimer 1994). The release of some of these constraints has opened A key advantage of the fMRI methodology is that a many exciting new possibilities for the investigation significantly smaller temporal window can be used in the of the neural substrate underlying discourse measurement of brain activity than had been possible processing. with PET. In some cases, the cognitive processing that we would like to measure is very short in duration and, Specialized imaging paradigms for discourse particularly with language processing, it occurs on the processing research order of tens of milliseconds. There is obvious difficulty Perhaps the largest factor constraining discourse when trying to measure a rapid cognitive process with a imaging research is the challenge of developing very slow measurement such as PET. In fact, PET 4 In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. doesn’t really allow a consideration of the moment blank interval appears (about 7 to 14 seconds, hence the by moment changes in activation. It results in a term slow-paced) between the items, sufficient to allow single average image from a large temporal window. the hemodynamic response to return to a baseline level fMRI can be used to acquire an image in many (Mason, et al., 2003, Mason & Just, 2004) so that the consecutive temporal windows. The exact temporal activations associated with individual sentences are window (termed TR, for “time for repetition”) is separable. The development of the slow-paced and fast- related to the strength of the magnet, the amount of paced event-related experimental designs for fMRI cortex being imaged, the criteria for signal to noise allowed for imaging researchers to both randomize and, critically, the amount of time in which it takes presentation of items as well as isolate specific cognitive the protons in the imaged substance to return to processing to single sentences as opposed to examining baseline after the introduction of a radio frequency gross levels of processing differences. pulse that causes them to tilt. Even though the The Mason and Just study (2004) provides an temporal window in imaging varies across example of how the event-related approach can be used experiments, the range is roughly between 1 second to both randomize items as well as isolate specific types (e.g., Just et al., 2004; Mason, et al., 2003) and 3 of cognitive processing. The study examined causal seconds (e.g., Martin and Weisberg, 2003; Robertson, inferencing in the comprehension of two-sentence et al. 2000; St. George et al., 1999). passages, drawn from previous stimulus materials Although early fMRI studies of discourse (Keenan, Baillet, & Brown, 1984; Myers, Shinjo, & comprehension used a block design, they took Duffy, 1987). The critical sentence in each passage was advantage of the more rapid temporal window and the second one, and the experimental manipulation was the ability to compare conditions without subtracting the degree of causal relatedness to the preceding out lower levels of cognitive processing. For sentence. The second sentence was followed by a example, Robertson et al. (2000) presented readers fixation point so that the activation could return to with blocks of sentences which either contained an baseline before the next passage. Using this technique, it indefinite article or a definite article. They found was possible to identify the time interval of cortical greater right hemisphere activation for the lists of activation that corresponded to the processing of the sentences that used definite articles rather than critical sentence. The processing was expected to differ indefinite articles. The definite article sentences among the experimental conditions, and this expectation were assumed to lead to more coherent discourse was confirmed, as described later in the chapter. than the indefinite article sentences. They concluded Even the relatively rapid 1 second temporal that the additional right hemisphere activation sampling rate used by Mason et al. (2003) seems slow reflected processes used to anaphorically relate the relative to the time measures and effect sizes that are nouns in a text. Ferstl and von Cramon (2001) typical of behavioral studies of discourse compared pairs of sentences that were coherent or comprehension. For example, the reading time incoherent as well as cohesive or incohesive. The difference between reading a sentence with an indefinite coherence manipulation resulted in activation in the article and the definite article may be only a matter of left frontal gyrus. The cohesion manipulation tens of milliseconds. Nevertheless, even small involved adding lexical connectives to the pairs of differences in processing time can produce measurable sentences to make them easier to understand as a differences in the brain activity that has been aggregated single unit. Unlike Robertson et al. they did not find over one or more seconds, as several studies (e.g., any additional right hemisphere activation. In both Robertson et al., 2000) have shown. Although averaging the Robertson et al. and Ferstl and von Cramon over 1 or more sec can be viewed as a weakness of studies, the researcher utilized sentences or pairs of fMRI, there is a sense in which it conveys an advantage, sentences in which a similar type of discourse level because it is often difficult to isolate the cognitive processing could be assumed to occur in all items of process of interest to a specific temporal window during one type and not the items of another type. the reading of a passage. For example, McKoon and Recently the development of slow-paced event- Ratcliff (1986, 1989, & 1992) have shown that related fMRI has enabled the randomization of items predictive inferences could be drawn either on the first within experiments (Buckner, 1996), and more sentence at which it was possible to draw the inference importantly, has enabled the measurement of brain or in the subsequent sentence. Consider how Robertson, activity during the comprehension of individual et al., (2000), Ferstl and von Cramon (2001), and Mason sentences. In a slow-paced event-related fMRI, a and Just (2004) dealt with this issue. In all three cases, 5 In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. they used well constrained materials of either a single passages for the majority of subjects. For example, if the sentence or sentence pairs to maximize the generation of a bridging inference is being examined, the probability that the effect would occur in a specific inference must be generated at a similar point within the temporal window. text across stimuli and across readers to be measured. In modern discourse research, the trend has been The limitation on number of passages and subjects towards much longer and more naturalistic passages. constrains the ability to average over a large set of This creates an added methodological burden. Every passages. second of an imaging session is precious, because a Even with the constraints on experimental design, participant will lie still for only so long. Although the imaging still has the benefit of examining how the background and introductory sections of these longer, network as a whole functions. This is true even in those naturalistic passages are critical, the expectation is cases in which a process, such as inferencing, might not that cognitive processing during this context-setting occur at a specific temporal point. Thus, it is possible to period is similar across conditions. The data acquired see the cognitive workload required for generating an during this temporal window would either have to be inference with imaging even if it cannot be determined discarded or treated as an additional factor in the behaviorally whether or when an inference was drawn. design. Similarly, if the critical sentence does not fMRI can thus be sensitive to processes that have only include the expected cortical processing (perhaps the small effects on behavioral measures, while at the same processing is delayed to a post-target sentence) then time capturing the qualitative variation and parallel the likelihood of finding cortical evidence of a nature of the processes underlying discourse cognitive process is greatly reduced given that often comprehension to which response times are insensitive. there are as few as ten to twenty items per condition in neuroimaging experiments. New Perspectives on Text Integration Perhaps the largest advantage that imaging Text attributes at the discourse level enter into research has over behavioral research is that it combinations with other information to allow a reader to provides a fairly direct measure of the processing weave individual sentences into an integrated narrative activity in each of the neural networks underpinning structure. The resulting conceptual structure incorporates discourse comprehension. Measuring cognitive pragmatic information and connects the text with the workload in most purely behavioral studies is reader’s world knowledge. This discourse process difficult, frequently necessitating the use of extends beyond strictly linguistic information. For secondary tasks as means of measuring processing example, discourse comprehension requires that the load. fMRI studies allow the collection of button reader generate inferences and extract meaning that is pressing data and response times, but many other not explicitly encoded in the text. Readers must make data collection methods are problematic. inferences in order to integrate sentences in a coherent Vocalization responses such as naming times and fashion, filling in what is absent from or ambiguous in talk-aloud protocols are more difficult to acquire. the text. Several researchers have tried to describe the Although, noise-canceling technologies have properties of the internal representation of discourse. facilitated the extraction of voice responses in the One of the most influential was the situation model as noisy scanning environment, the head motion developed by van Dijk and Kintsch (1983). The situation induced by jaw movements can render data unusable. model arises from linguistic processing of the text itself The normal scanning procedure also makes it (lexical access, syntactic processing, and construction of difficult to collect discourse recall data from a a propositional based micro-structure) and an interaction scanning session. As with voice responses, these with non-linguistic cognitive processes. The situation problems can be overcome with some effort, but the model is a result of this interaction created by problems have not been systematically solved so far. connecting the text with knowledge derived from the In summary, brain imaging methodology reader’s long-term memory, and involves additional imposes some constraints on discourse processing demands upon attention (e.g., the ability to shift points experimental design. Passages must be designed of view and parse sequences of events), working carefully to maximize the chance of finding how a memory (the ability to retain longer term, anaphoric complex cortical network functions in discourse references), and the contribution of visual imagery, comprehension. Specific process must be temporally empathy, and emotional knowledge. localized to a specific point in the text. Moreover, the Some early attempts at imaging investigation of same process must occur across the majority of discourse processing were designed to determine the 6 In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. neural underpinnings of the construction of a First, late in the evening, chain your dog to his situation model. An fMRI study conducted by St. doghouse, build a small bonfire and let it burn George and colleagues (1999) presented syntactically overnight. (Support 2nd - #1) well-structured paragraphs which were They are insatiably attracted to heat, become uninterpretable by virtue of never specifying the enamored of the fire, leave your dog, jump into referent of the text. The paragraphs were similar to the flames, and die. (Support 2nd- #2) those previously used by Bransford and Johnson (1972.) These paragraphs were either preceded or Fleas will be eliminated from your dog or his not preceded by a title that disclosed the referent. For doghouse with the use of a bonfire. True or example, these paragraphs contained sentences such false? as “Typically, success requires that you start with your left leg, and make sure that it is securely in 'Topic last' condition place. Then swing your body high into the air.” First, late in the evening, chain your dog to his Without knowledge of the referent, it is difficult if doghouse, build a small bonfire and let it burn not impossible to understand the passage. But with overnight. (Support 1st - #1) foreknowledge provided by a title (“Riding a horse”) They are insatiably attracted to heat, become all of the sentences become interpretable. enamored of the fire, leave your dog, jump into The fMRI results revealed that the left the flames, and die. (Support 1st - #2) hemisphere, as a whole, exhibited no effect of whether the paragraph was presented along with the This is a totally guaranteed method to title, while the right hemisphere revealed completely eliminate a flea infestation on your significantly greater involvement during the dog or around his doghouse. (Topic last) presentation of the untitled paragraphs. More Fleas will be eliminated from your dog or his specifically, the left middle and superior temporal doghouse with the use of a bonfire. True or sulci became more active during the processing of the false? titled paragraphs, and conversely, the right middle and superior temporal sulci became more active This paradigm made it possible to measure the brain during the processing of the untitled paragraphs. activation associated with the comprehension of each of These results support the idea that the right the three sentences in each paragraph separately. The hemisphere is concerned with the mapping of results revealed differential effects in the two information into a text representation, as discussed hemispheres. The right temporal cortex revealed greater below. In addition, these results suggest that the involvement during the processing of topic sentences, processing roles of the two hemispheres are, in fact, regardless of their location within the paragraph. In distinguishable. contrast, the left temporal cortex was sensitive to the Tomitch et al. (2004) attempted to investigate location of the topic sentence. the differential processing of the left and right Tomitch et al. cited Gernsbacher’s (1990) Structure hemisphere during text integration using fMRI. They Building Framework (SBF) to account for the processing manipulated the serial position of the topic sentence underlying the cortical activation. According to SBF, in short, three-sentence paragraphs. The topic discourse comprehension builds cohesive mental sentence contained a unifying super-ordinate theme, representations using three general processes: laying the while the supporting sentences in the paragraph foundation, mapping incoming information to previous instantiated that thematic concept. The serial position information, and initiating a new substructure if the manipulation consisted of varying the position of the incoming information is not adequately coherent with topic sentence in the paragraph, placing it either in previous information. SBF states that the first step in the first position - topic 1st or in the third position- building a mental representation of the text is to lay a topic last, as shown below. foundation to which subsequent information presented in the text can be attached. Presumably this first stage must 'Topic first' condition occur across all passages regardless of the order of the This is a totally guaranteed method to topic sentence; the consistent left temporal activation on completely eliminate a flea infestation on the first sentence suggests that the left temporal region is your dog or around his doghouse. (Topic involved in laying the foundation of the text first) representation. 7 In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. A second prediction SBF makes is based on encounter inconsistent emotional or temporal more involvement of the “shifting” process in information in a passage. They suggested the contrast of paragraphs whose topic sentence is in the final consistent and inconsistent information that should be position. When the topic sentence in the final coded at the situation model level would allow them to position, increased shifting is, therefore, expected to examine cortical networks specialized for situational result in higher activation levels. This response was level text representation. They concluded that activation also observed in the left temporal region. in the frontal cortex indicated a specialization for The right temporal region was sensitive to the building and maintaining a situation model presence of a topic sentence but not to its location. representation. The specific region within the This suggests that the right temporal region 1) is frontomedial cortex varied as a function of the type of sensitive to whether a sentence is a potential information and whether it was involved in noticing or statement of the topic and 2) performs additional resolving the inconsistency. First, the ventral portion of processing on the potential topic sentence. The right the medial frontal cortex was involved in detecting hemisphere may then be responsible for using the inconsistent emotional information. In contrast, the information forwarded by the left to “fill in” or map dorsomedial frontal cortex was active in the processing information onto the text representation built by the of emotional consistencies suggesting that a protagonist left and to connect it with the participant’s world interpreter network was engaged during the attempted knowledge. This right temporal activation “mapping” resolution of the inconsistent information. The activation was also found by Robertson et al. (2000) chronological inconsistency activation was located much for both indefinite and definite articles; in contrast, more anteriorly in the orbital portion of IFG and the the less coherent, indefinite article texts also resulted frontopolar region. Interestingly, the right anterior in additional right frontal activation than the definite temporal lobe was more active during the processing of article texts. inconsistent texts regardless of the text type, suggesting Several other cortical regions have been found to that as text processing became more difficult, the play a role in tasks that require structure building. specialized text integration network spilled over into the Partiot et al. (1996) investigated script processing and right hemisphere. found bilateral precuneus/posterior cingulate regions To summarize, while there is still a significant to be activated, along with bilateral medial parietal amount of research to be done to clarify the cortex, during processing of event sequences. contributions of the left and right hemisphere in text Maguire et al. (1999), using Bransford and Johnson integration, neuroimaging studies such as those outlined (1972) passages with or without titles, also found here are making good progress. Although it must be true these same areas to be active in linking textual that the two hemispheres work together to accomplish information with subjects’ prior knowledge. These such a complex function as comprehension, it does operations must be central to construction of a appear as though they are involved in different aspects of situation model, connecting the narrative text with discourse processing. It seems as though the knowledge about the real world. identification of the main idea and the building of the Narrative-specific activations have also been text representation is separable from the mapping of that found in the temporo-parieto-occipital junction, information onto the discourse structure with both angular gyrus, and superior temporal sulcus. A hemispheres working in an interactive manner in order prevailing view has been that the angular gyrus plays to construct a coherent representation of the text. a key role in grapheme to phoneme translation. But Furthermore, the type of the information within the text this region is in fact multifunctional, and has been plays a role in which areas are responsible for building implicated in a variety of cognitive processes— and maintaining a representation of the text. attention, semantic association, problem solving, and Propositional level information may be processed by left mental imagery (Cabeza and Nyberg, 2000) that are frontal and left temporal areas as long as resources are likely to be engaged in the narrative context. For available. Situational model information results in an example, the angular gyrus is activated when subjects engagement of the medial frontal cortex. Specific areas visualize a scene derived from a written text (Mellet within the cortex may be dependent upon the nature of et al., 2002)—precisely the sort of mental model the input, for example, emotional information related to representation required during narrative processing. protagonist interpreter should activate more dorsal Recently, Ferstl, Rinck and von Cramon (2005) frontal regions. examined passages in which the reader might 8 In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. New Perspectives on Inference Processing primed in both left visual field-Right Hemisphere and Often, the links between events in a story are not right visual field-Left Hemisphere at a later time. This explicitly expressed and the reader must connect finding suggests that the right hemisphere is particularly them by generating linking inferences and integrating involved in inference processing. them with the presented information. Almost every There have been few neuroimaging studies of text requires a reader to draw on a rich store of shared inference processing. One question that has been knowledge about the world. There is a strong debated is whether logic-based inferencing relies on the relation between inference generation and text same processes as text-based inferencing. In an fMRI integration. Inference generation is often necessary study, Caplan and Dapretto (2001) directly addressed in building accurate text representations and this issue by comparing the generation of logic-related conversely, an accurate text representation is often inferences versus text-based inferences. There were two necessary for inference generation. types of sentence pairs as shown below Inferences may be drawn to fill in missing information, resolve discrepancies or to predict yet Text based: “Do you believe in angels?” unmentioned events or facts. There are several types “Yes, I have my own special angel” of inferences, including coherence, predictive, elaborative, and causal. The classification of various Logic-related: “Do you like having fun?” types of inferences has been a continuing topic in “Yes, because it makes me happy” discourse research and there have been several excellent attempts to resolve this classification While the logic condition produced greater problem (e.g. Singer, 1994; van den Broek, 1994). activation within the left language areas, the text-based A successful inference generally occurs as a condition revealed more activation in the right result of generating a possible inference and then hemisphere. This study is significant in that it suggests integrating that inference into the internal that at the neural level, and consequently at the process representation of the text. The Construction- level, there are significant differences between logic- Integration (CI) model of text comprehension based and text-based inferencing. Also, the results (Kintsch, 1988) is consistent with this general converge with the neuropsychological findings showing description of inferencing. According to the CI that the right hemisphere is intimately involved in text- model, there is a first process in which the many related inference processing. possible inferences are liberally generated (inference Mason and Just (2004) reported an fMRI construction), followed by a second process of experiment that was designed to examine the cortical integrating only those inferences that have a high areas that are involved in making causal inferences degree of connection with the reader's representation during reading. Participants read sentence pairs that of the preceding text (integration). A successful varied in terms of their causal relatedness. For example, integration of an inference results in a representation an “outcome” sentence (e.g., The next day his body was of the text that involves both the specific propositions covered with bruises.) was preceded by one of three contained in the text and those propositions that were sentences (equated for overall length and number of generated by the reader to connect information in the propositions) that described an antecedent condition: text. Patients with lesions to the right hemisphere Highly Related: Joey's big brother punched him generally have trouble drawing inferences in order to again and again. integrate sentences and maintain coherence (Beeman, Moderately Related: Joey's crazy mother became 1993; Brownell et al., 1986). These patients are less furiously angry with him. likely to mistakenly false alarm to inferences in a text recognition task, presumably because they never Distantly Related: Joey went to a neighbor's house generated the inferences in the first place (Grafman et to play. al., 1987). Such patients also make elaborative inferences more easily than bridging inferences In previous behavioral studies using similar (Tompkins and Mateer, 1985). Furthermore, Beeman materials, Keenan, Baillet, and Brown (1984) and and colleagues (1994) have shown that when probes Myers, Shinjo, and Duffy (1987) found that reading are inference-related, they are primed in the left times on the sentences increased as the degree of visual field-Right Hemisphere immediately, and are relatedness between the sentences decreased; that is, the 9 In M. Traxler & M. A. Gernsbacher, (Eds.), BRAIN IMAGING OF DISCOURSE PROCESSING Handbook of Psycholinguistics (pp. 765-799). Amsterdam: Elsevier. reading times increased from the highly- to suggest that, as proposed by Just et al. (1996), there is an moderately- to distantly-related conditions. inherent limit on how much cognitive processing can be Paradoxically, however, the participants’ memory for done per unit of time in left hemisphere language areas. the two-sentence passages (as measured using a In the context of text processing, this limit means that to variety of recall and recognition tests) followed an the degree that working memory resources are being inverted U-shaped function; that is, the sentences in used to process the text and generate inferences, those the moderately-related condition were remembered resources will not be available for integrating those better than those in both the highly- and distantly- inferences into long-term memory. Reichle and Mason related conditions. Thus, the participants’ memory propose a computational model to demonstrate that these for the sentences does not seem to be a simple resources are exceeded only in the moderately related monotonic function of either their reading times or condition, such that the remaining processing load that the degree of causal relatedness between the two cannot be accommodated in the left hemisphere spills sentences being read. over into the right hemisphere. In the fMRI study of causal relatedness, Mason Research is only now beginning to map out the and Just (2004) found three main foci of fMRI- cortical network associated with drawing inferences in measured cortical activation among the language reading. Evidence to date suggests that the right areas. In the left hemisphere language areas, the hemisphere plays a key role in such a process (Beeman, activation volume did not vary across the three et al., 1994; Mason & Just 2004). At this point, at least relatedness conditions. In contrast, the activation three plausible theories have been developed to volume in the bilateral dorsolateral prefrontal cortices illuminate the right hemisphere’s role. Reichle and showed a marked (albeit not statistically reliable) Mason (in press) building on work by Just et al. propose increase as the sentences became less causally that a limited capacity is exceeded across inferencing related. Finally, the most interesting pattern of component processes. This then leads to processing activation volume was observed in the right being passed to the right hemisphere. This is in contrast hemisphere homologues of the language areas: the to an account proposed by Mason et al.; they propose activation volume was consistent with the patterns that the right hemisphere is utilized during the that had been reported with recognition and recall integration of an inference and that inferences are measures. generated by utilizing the dorsolateral prefrontal cortex. To account for their fMRI data, Mason and Just Finally, Beeman’s coarse coding theory leads to the (2004) proposed that two different cortical networks prediction that inferences are accomplished as the result support the generation and the integration of of activation of coarsely coded semantic information in inferences during reading. The first network, the right hemisphere. In this account, it is not the consisting of the left and right dorsolateral prefrontal inference per se that is processed in the right hemisphere cortex is more involved in generating the inferences. but rather the information from which the inference is As the causal distance between two sentences developed. A further account combines aspects of these increases, the dorsolateral prefrontal regions generate previous explanations. Inference generation is supported more inferences, leading to an increased volume of by the right hemisphere coarse semantic network. activation. The second network, consisting of the Additional attempts to utilize this network will be right inferior frontal gyrus, right superior and middle signaled by the dorsolateral prefrontal coherence temporal gyri, and right inferior parietal lobe, is more monitor, provided resources are available. As the involved in integrating the possible inferences that propositionalization work of the left anterior temporal have been generated. Because of the relative text integration network becomes more demanding (and differences in integrating the inferences in the three resources are consumed), this processing will spillover conditions, the volume of cortical activation that was into the right anterior temporal region. It is clear that observed in this region is described by an inverted U- further research will be needed to specify at which level shaped function, with more activation occurring with of difficulty each of the networks are engaged for the moderately related than either the highly-related various readers’ abilities. or distantly-related sentences. Reichle & Mason (in press) present a working New Perspectives on Figurative Language Processing memory account for the additional right hemisphere The study of metaphor comprehension has long activation that occurs as a result of generating an been a major area of interest in behavioral studies of inference in the moderately related condition. They figurative language (e.g., Allbritton, McKoon & Gerrig, 10
Description: