<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.2" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id journal-id-type="issn">2767-0279</journal-id>
<journal-title-group>
<journal-title>Glossa Psycholinguistics</journal-title>
</journal-title-group>
<issn pub-type="epub">2767-0279</issn>
<publisher>
<publisher-name>eScholarship Publishing</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5070/G601166</article-id>
<article-categories>
<subj-group>
<subject>Regular article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>The processing of ambiguous pronominal reference is sensitive to depth of processing</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Creemers</surname>
<given-names>Ava</given-names>
</name>
<email>Ava.Creemers@mpi.nl</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Meyer</surname>
<given-names>Antje S.</given-names>
</name>
<email>Antje.Meyer@mpi.nl</email>
<xref ref-type="aff" rid="aff-1">1</xref>
<xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
</contrib-group>
<aff id="aff-1"><label>1</label>Max Planck Institute for Psycholinguistics, Nijmegen, the Netherlands</aff>
<aff id="aff-2"><label>2</label>Radboud University Nijmegen, the Netherlands</aff>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2022-04-27">
<day>27</day>
<month>04</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>1</volume>
<issue>1</issue>
<elocation-id>3</elocation-id>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2022 The Author(s)</copyright-statement>
<copyright-year>2022</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="https://glossapsycholinguistics.journalpub.escholarship.org/articles/10.5070/G601166/"/>
<abstract>
<p>Previous studies on the processing of ambiguous pronominal reference have led to contradictory results: some suggested that ambiguity may hinder processing (<xref ref-type="bibr" rid="B33">Stewart, Holler, &amp; Kidd, 2007</xref>), while others showed an ambiguity advantage (<xref ref-type="bibr" rid="B16">Grant, Sloggett, &amp; Dillon, 2020</xref>) similar to what has been reported for structural ambiguities. This study provides a conceptual replication of Stewart et al. (<xref ref-type="bibr" rid="B33">2007, Experiment 1</xref>), to examine whether the discrepancy in earlier results is caused by the processing depth that participants engage in (cf. <xref ref-type="bibr" rid="B34">Swets, Desmet, Clifton, &amp; Ferreira, 2008</xref>). We present the results from a word-by-word self-paced reading experiment with Dutch sentences that contained a personal pronoun in an embedded clause that was either ambiguous or disambiguated through gender features. Depth of processing of the embedded clause was manipulated through offline comprehension questions. The results showed that the difference in reading times for ambiguous versus unambiguous sentences depends on the processing depth: a significant ambiguity penalty was found under deep processing but not under shallow processing. No significant ambiguity advantage was found, regardless of processing depth. This replicates the results in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) using a different methodology and a larger sample size for appropriate statistical power. These findings provide further evidence that ambiguous pronominal reference resolution is a flexible process, such that the way in which ambiguous sentences are processed depends on the depth of processing of the relevant information. Theoretical and methodological implications of these findings are discussed.</p>
</abstract>
</article-meta>
</front>
<body>
<sec>
<title>Introduction</title>
<p>Recent work by Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) made explicit a discrepancy in results on the processing of <italic>structural</italic> ambiguities, in which the syntactic structure is ambiguous, and <italic>referential</italic> ambiguities, in which the reference of an anaphoric expression is ambiguous. While some results indicate that ambiguous sentences are read faster than unambiguous ones (i.e., an ambiguity <italic>advantage</italic>), others show that ambiguous sentences slow down reading (i.e., an ambiguity <italic>penalty</italic>). In particular, studies of structural ambiguities typically report that ambiguity facilitates processing (e.g. <xref ref-type="bibr" rid="B35">Traxler, Pickering, &amp; Clifton, 1998</xref>; <xref ref-type="bibr" rid="B36">Van Gompel, Pickering, Pearson, &amp; Liversedge, 2005</xref>; <xref ref-type="bibr" rid="B38">Van Gompel, Pickering, &amp; Traxler, 2001</xref>, for an overview see <xref ref-type="bibr" rid="B9">Clifton and Staub 2008</xref>). In contrast, studies on the processing of ambiguous pronominal reference typically suggest that ambiguity may hinder referential processing (<xref ref-type="bibr" rid="B3">Badecker &amp; Straub, 2002</xref>; <xref ref-type="bibr" rid="B33">Stewart et al., 2007</xref>).</p>
<p>To examine the difference in the processing of structural and referential ambiguities, Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) directly compared the two ambiguity types and argued that they are processed similarly when assessed under the same conditions. The results showed an ambiguity advantage for the processing of sentences that contained an ambiguous pronominal reference (e.g., <italic>We met the brother of the waiter when <bold>he</bold> visited the restaurant</italic>). A similar ambiguity advantage was found for sentences that involve a structural ambiguity in the form of a prepositional phrase attachment ambiguity (e.g., <italic>We met the brother of the waiter <bold>with a beard</bold></italic>). These findings prompted an investigation of the cause of the discrepancy between studies that find an ambiguity advantage versus studies that find an ambiguity penalty.</p>
<p>There is some evidence based on the processing of <italic>structural</italic> ambiguities suggesting that comprehenders may underspecify certain linguistic relations when they only engage in shallow, rather than deep processing (<xref ref-type="bibr" rid="B34">Swets et al., 2008</xref>, see also <xref ref-type="bibr" rid="B27">Loga&#269;ev and Vasishth 2016b</xref>). If, as Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) suggest, the language processing system handles the resolution of attachment ambiguities and pronominal reference ambiguities in a similar way, we expect that the processing of sentences with <italic>ambiguous pronouns</italic> should also be sensitive to processing depth. As will be discussed in more detail, a study by Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) provided initial empirical support for the influence of depth of processing on the processing of ambiguous pronominal reference. In the present study, we provide a conceptual replication of Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>), using a different methodology and analysis, and a larger sample size. We test the hypothesis that an ambiguity penalty occurs when the reader is motivated to process sentences at a deeper level, while an ambiguity advantage occurs under more shallow processing conditions as ambiguous pronouns may remain underspecified. If this is the case, a model of ambiguity resolution would be needed that accounts for both ambiguity advantages and penalties.</p>
<sec>
<title>Ambiguity penalty</title>
<p>While ambiguous attachment sentences such as the ones used in Swets et al. (<xref ref-type="bibr" rid="B34">2008</xref>) are often complex and fairly unnatural, ambiguous pronominal references are found in everyday language and in less complex sentence constructions. A study by Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) examined the processing of ambiguous pronouns using a self-paced reading task. Their experimental design manipulated the ambiguity of the pronominal reference and the processing depth. Participants were presented with sentences that contained two referents and a personal pronoun that was either ambiguous, as in (1), or could refer to only one of the referents due to the presence of disambiguating gender information, as in (2) and (3).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(1)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Paul<italic><sub>i</sub></italic> lent Rick<italic><sub>j</sub></italic> the CD before <bold>he</bold><italic><sub>i/j</sub></italic> left for the holidays. (<italic>ambiguous</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(2)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Paul<italic><sub>i</sub></italic> lent Kate<italic><sub>j</sub></italic> the CD before <bold>he</bold><italic><sub>i</sub></italic> left for the holidays. (<italic>first character reference</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(3)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Paul<italic><sub>i</sub></italic> lent Kate<italic><sub>j</sub></italic> the CD before <bold>she</bold><italic><sub>j</sub></italic> left for the holidays. (<italic>second character reference</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Depth of processing was manipulated through offline comprehension questions. To probe <italic>shallow</italic> processing, participants were asked questions requiring a <italic>yes/no</italic> answer on only a third of the target sentences (e.g., <italic>Did Paul lend Kate the CD</italic>?) and a third of the filler items. Half of these questions targeted the information content of the main clause and half targeted the information content of the embedded clause. In the <italic>deep</italic> processing condition, questions on <italic>all</italic> experimental items and fillers targeted the information content of the embedded clause, and, hence, required resolution of the pronoun (e.g., <italic>Who left for the holidays?</italic>).</p>
<p>The results showed that in the shallow processing condition the total sentence reading times in the different conditions did not differ from each other (although numerically the ambiguous condition was read fastest). In the deep processing condition, ambiguous sentences were read more slowly than first and second character reference sentences (i.e., an ambiguity penalty or disadvantage). Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) proposed that readers invest processing resources to interpret the ambiguous pronoun when engaged in deep processing only, and not under shallow processing.</p>
<p>A second experiment, in which participants read sentences in a cumulative word-by-word self-paced reading paradigm paired only with shallow processing comprehension questions, again showed no effect of ambiguity on response times to the pronoun region and to the word after the pronoun in the first sentence. However, the critical sentences in this experiment were followed by a second sentence in which the ambiguous pronoun from the first sentence was disambiguated (e.g., for (1): <italic>He went to the Bahamas and sent <bold>Rick</bold> a postcard from the hotel</italic>, for which disambiguation occurs at the repeated name of the character to whom the pronoun does not refer). In the second sentence, a significant effect of ambiguity was found for the disambiguating region. The region that contained the repeated name was read more slowly in the ambiguous conditions than in the unambiguous conditions.</p>
<p>Based on the results of both experiments, Stewart et al. concluded that, under deep processing, readers attempt to assign a referent to the ambiguous pronoun even in a context where no disambiguation is available to indicate which assignment is correct. In contrast, under shallow processing, ambiguous pronouns would ultimately be interpreted but only when the reader is able to do this with certainty. The authors further argued that readers delayed the resolution of the ambiguous pronoun until disambiguating information was available, such that initial underspecification becomes more specified when disambiguating information is encountered, even under shallow processing.</p>
</sec>
<sec>
<title>Ambiguity advantage</title>
<p>In contrast to the results by Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>), Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) reported an ambiguity <italic>advantage</italic> for both structural and referential ambiguities in two experiments measuring eye movements during reading. We focus here on the results of the referential ambiguity study, for which the conditions are shown in (4) and (6).<xref ref-type="fn" rid="n1">1</xref></p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(4)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>We met the brother<italic><sub>i</sub></italic> of the waiter<italic><sub>j</sub></italic> when <bold>he</bold><italic><sub>i/j</sub></italic> visited the restaurant, but we didn&#8217;t talk for long. (<italic>ambiguous</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(5)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>We met the brother<italic><sub>i</sub></italic> of the waitress<italic><sub>j</sub></italic> when <bold>he</bold><italic><sub>i</sub></italic> visited the restaurant, but we didn&#8217;t talk for long. (<italic>first character reference</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(6)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>We met the sister<italic><sub>j</sub></italic> of the waiter<italic><sub>i</sub></italic> when <bold>he</bold><italic><sub>i</sub></italic> visited the restaurant, but we didn&#8217;t talk for long. (<italic>second character reference</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>In the ambiguous condition, as in (4), both nouns (e.g., <italic>brother</italic> and <italic>waiter</italic>) were possible referents, while in the unambiguous conditions, as in (5) and (6), only one of the nouns was a possible referent. Rather than using personal names as in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>), gender information was based on a mix of lexical-semantic cues (e.g., <italic>brother</italic>), morphological cues (e.g., <italic>waitress</italic>), and stereotypical bias (e.g., <italic>beautician</italic>). The results showed an overall ambiguity advantage: similar to the results for structural ambiguities, Grant et al. reported significantly shorter reading times for ambiguous pronouns compared to unambiguous pronouns in go-past time (defined as the sum of all fixations from first entering a region until leaving it to the right).</p>
<p>To rule out that the ambiguity advantage was due to the lack of salience for the referents, a second experiment was run, in which, similar to the design in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>), the main clause subject and object formed the potential referents. The referents were either definite descriptions that had definitional gender (e.g., <italic>king</italic>), or were proper names that were gender unambiguous (e.g., <italic>Olivia</italic>). The conditions were otherwise similar to those in the first experiment, as illustrated in (7) to (9).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(7)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The young prince<italic><sub>i</sub></italic> showed the revered king<italic><sub>j</sub></italic> that <bold>he</bold><italic><sub>i/j</sub></italic> would be a fine leader of the Tharassian empire. (<italic>ambiguous</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(8)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The young prince<italic><sub>i</sub></italic> showed the revered queen<italic><sub>j</sub></italic> that <bold>he</bold><italic><sub>i</sub></italic> would be a fine leader of the Tharassian empire. (<italic>first character reference</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(9)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The revered queen<italic><sub>j</sub></italic> showed the young prince<italic><sub>i</sub></italic> that <bold>he</bold><italic><sub>i</sub></italic> would be a fine leader of the Tharassian empire. (<italic>second character reference</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>The results again revealed an ambiguity advantage effect in three out of five reading measures (first pass, go-past, and total time) in the post-critical region, with lower reading times for the ambiguous condition compared to either unambiguous condition. Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) interpreted these results along the lines of an Unrestricted Race Model (cf. <xref ref-type="bibr" rid="B37">Van Gompel, Pickering, &amp; Traxler, 2000</xref>) in which different referents &#8216;race&#8217; to be selected as an antecedent for a pronoun, with the one that becomes available most quickly being adopted as the referent. If the antecedent that wins turns out to not match the pronoun&#8217;s gender features, the reference of the pronoun needs to be re-evaluated. The slower reading times in the unambiguous conditions reflect a penalty due to reanalysis: on some portion of trials, the processor selects the ultimately incorrect reference, which must then be reanalyzed. Such reanalysis never occurs for ambiguous trials as the parser will never be revealed to be wrong. Hence, reading times are predicted to be faster for ambiguous sentences compared to unambiguous ones.</p>
<p>In sum, Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) showed an ambiguity penalty when all sentences were followed by a targeted comprehension question in the deep processing condition (Experiment 1), and no significant difference between ambiguous and unambiguous sentences when 4/24 experimental trials were followed by a targeted comprehension question in the shallow processing condition (Experiment 1 and 2). Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) showed ambiguity advantages when 12/30 sentences (Experiment 1) and 12/24 sentences (Experiment 2) were followed by a targeted comprehension question. However, these studies cannot be compared directly as different methodologies were used: Stewart et al. (<xref ref-type="bibr" rid="B33">2007, Experiment 1</xref>) analyzed whole-sentence self-paced reading times and Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) analyzed eye-tracking measures while reading.</p>
</sec>
<sec>
<title>Depth of processing</title>
<p>There is some evidence suggesting that a less demanding task may lead comprehenders to underspecify certain syntactic relations. Ferreira and Yang (<xref ref-type="bibr" rid="B15">2019</xref>), for example, discuss how readers may flexibly adjust their reading strategies depending on their goals, and that the ultimate comprehension level may depend on reader or listener engagement or motivation. The results by Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) are suggestive in this respect, showing an ambiguity penalty only under deep processing conditions, and not under shallow processing. Swets et al. (<xref ref-type="bibr" rid="B34">2008</xref>), focusing on structural ambiguities, further showed that the processing of structurally ambiguous sentences is sensitive to depth of processing. They examined structural ambiguities in the form of sentences with a relative clause that could attach to one of two sites, as in (10), and sentences in which the relative clause occurred with a disambiguating reflexive pronoun, as in (11) and (12).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(10)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The maid of the princess <bold>who scratched herself</bold> in public was terribly humiliated. (<italic>ambiguous</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(11)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The son of the princess <bold>who scratched himself</bold> in public was terribly humiliated. (<italic>high attachment</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(12)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The son of the princess <bold>who scratched herself</bold> in public was terribly humiliated. (<italic>low attachment</italic>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Depth of processing was manipulated through the number and difficulty of the comprehension questions that followed the sentences. The results showed an ambiguity advantage only when comprehension questions were sparse and superficial, but an advantage for disambiguated low attachment sentences when comprehension questions were frequent and targeted the ambiguity.</p>
<p>Swets et al. argued that the comprehension system creates an underspecified representation when there is no basis on which to fully commit to an attachment decision in an ambiguous sentence. In other words, the syntactic representation in the globally ambiguous case may remain underspecified under shallow processing, but not under deep processing (see also <xref ref-type="bibr" rid="B11">Ferreira, Bailey, &amp; Ferraro, 2002</xref>). Building on the results by Swets et al. (<xref ref-type="bibr" rid="B34">2008</xref>), Loga&#269;ev and Vasishth (<xref ref-type="bibr" rid="B26">2016a</xref>) provided further evidence that, given a sufficiently high degree of task difficulty, sentences with attachment ambiguities are read more slowly than unambiguous sentences.</p>
</sec>
<sec>
<title>The current study</title>
<p>The goal of the present study was to examine to what extent the processing of referential ambiguities is modulated by processing depth, as manipulated through comprehension questions. If deep processing conditions result in an ambiguity penalty, and shallow processing conditions in an ambiguity advantage, this could potentially explain the discrepancy in earlier results on ambiguous referential processing (<xref ref-type="bibr" rid="B16">Grant et al., 2020</xref>; <xref ref-type="bibr" rid="B33">Stewart et al., 2007</xref>). To examine the effects of processing depth on the processing of referential ambiguities, we ran a conceptual replication of the first experiment in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>).</p>
<p>The materials consisted of Dutch sentences similar to the English sentences used in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) and in Grant et al. (<xref ref-type="bibr" rid="B16">2020, Experiment 2</xref>). The sentences were ambiguous or disambiguated through gender features. In the ambiguous sentences, as in (13), the pronominal reference could refer to either character. In the disambiguated sentences, the pronoun could only refer to the gender-consistent referent, which was the first-mentioned character in the first character reference (14), or the second-mentioned in the second character reference (15). Sample items, including translations, are given below (a full stimulus list is provided as supplemental material; see Data Accessibility Statement).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(13)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p><italic>Ambiguous</italic>:</p></list-item>
<list-item><p>Abel<italic><sub>i</sub></italic> leende Gijs<italic><sub>j</sub></italic> het boek voor <bold>hij</bold><italic><sub>i/j</sub></italic> op vakantie ging.</p></list-item>
<list-item><p>(Translation: Abel<italic><sub>i</sub></italic> lent Gijs<italic><sub>j</sub></italic> the book before he<italic><sub>i/j</sub></italic> went on holiday.)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(14)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p><italic>First character reference</italic>:</p></list-item>
<list-item><p>Abel<italic><sub>i</sub></italic> leende Zo&#235;<italic><sub>j</sub></italic> het boek voor <bold>hij</bold><italic><sub>i</sub></italic> op vakantie ging.</p></list-item>
<list-item><p>(Translation: Abel<italic><sub>i</sub></italic> lent Zo&#235;<italic><sub>j</sub></italic> the book before he<italic><sub>i</sub></italic> went on holiday.)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(15)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p><italic>Second character reference</italic>:</p></list-item>
<list-item><p>Zo&#235;<italic><sub>i</sub></italic> leende Abel<italic><sub>j</sub></italic> het boek voor <bold>hij</bold><italic><sub>j</sub></italic> op vakantie ging.</p></list-item>
<list-item><p>(Translation: Zo&#235;<italic><sub>i</sub></italic> lent Abel<italic><sub>j</sub></italic> the book before he<italic><sub>j</sub></italic> went on holiday.)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Processing depth was manipulated through the type of comprehension questions that participants were asked to answer. Sentences in the shallow processing condition were followed by questions that probed the information content of the main clause (e.g., <italic>Wie leende het boek aan Gijs?</italic> &#8216;Who lent the book to Gijs?&#8217;) and which did not require the participant to resolve the pronominal reference in the embedded clause. Sentences in the deep processing condition were followed by a question that probed the information content of the embedded clause and that could not be answered without resolving the pronominal reference (e.g., <italic>Wie ging op vakantie?</italic> &#8216;Who went on holiday?&#8217;). Note that we manipulated the required depth of processing of the embedded clause, not of the entire sentence. We return to this point in the Discussion.</p>
<p>If depth of processing modulates ambiguity resolution, we predict that participants show an ambiguity advantage under shallow processing conditions, but an ambiguity penalty under deep processing conditions. Hence, under deep processing, we predict that reading times for ambiguous pronouns would be longer than those for unambiguous pronouns as readers will attempt to resolve the reference. We predict shorter reading times for ambiguous pronouns under shallow processing as ambiguous pronominal references may remain underspecified under certain conditions, as was argued for English by Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>).</p>
<p>We also examined question responses and question answering times to shed further light on the way in which pronominal resolution occurs, testing whether readers used a heuristic that favored one referent over the other. Referents in subject position are often argued to hold a privileged status, which predicts that the first-mentioned character may be more often co-indexed with the pronoun, even in the absence of disambiguating information (e.g. <xref ref-type="bibr" rid="B1">Arnold, 1998</xref>; <xref ref-type="bibr" rid="B2">Arnold, Eisenband, Brown-Schmidt, &amp; Trueswell, 2000</xref>; <xref ref-type="bibr" rid="B29">McDonald &amp; MacWhinney, 1995</xref>; <xref ref-type="bibr" rid="B33">Stewart et al., 2007</xref>; <xref ref-type="bibr" rid="B39">Van Rij, Van Rijn, &amp; Hendriks, 2013</xref>).</p>
<p>Our study improved on the original design in Stewart et al. (<xref ref-type="bibr" rid="B33">2007, Experiment 1</xref>) in several ways, providing a different methodology, a larger sample size, and up-to-date statistical analyses. We used a <italic>word-by-word</italic> self-paced reading task (<xref ref-type="bibr" rid="B20">Just, Carpenter, &amp; Woolley, 1982</xref>), which allowed us to localize the effects and obtain a more detailed picture of the time course of processing, compared to an analysis based on whole-sentence reading times in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>). In addition, we used a non-cumulative (moving window) paradigm, instead of a cumulative reading task. The cumulative paradigm has been shown to be problematic for studying on-line parsing, as participants develop a reading strategy in which they reveal several segments of a stimulus at a time and then read them all at once, or even wait to process a sentence until it is entirely revealed (<xref ref-type="bibr" rid="B12">Ferreira &amp; Henderson, 1990</xref>; <xref ref-type="bibr" rid="B20">Just et al., 1982</xref>).</p>
<p>Finally, given the fact that the replication crisis has surfaced also in the pronoun interpretation literature (see e.g., <xref ref-type="bibr" rid="B6">Chow, Lewis, and Phillips 2014&#8217;s</xref> failed attempts to replicate the findings in <xref ref-type="bibr" rid="B3">Badecker and Straub 2002</xref>), we included more experimental items. While the original study included 24 target sentences, we included a total of 60 target sentences. We performed an a-priori power analysis to determine the sample needed to observe an effect with at least 80% power (see below). While it is possible that the lack of a significant ambiguity advantage in the shallow processing condition in Stewart et al. (<xref ref-type="bibr" rid="B33">2007, Experiment 1</xref>) was due to a lack of power, our experiment should have appropriate power to detect the effect. We further used linear mixed-effects models to analyze the data, rather than traditional ANOVAs.</p>
</sec>
</sec>
<sec sec-type="methods">
<title>Methods</title>
<sec>
<title>Participants</title>
<p>Participants were 156 non-dyslexic native speakers of Dutch who reported having normal or corrected-to-normal vision (mean age = 24.9; sd = 4.36). Participants were recruited from the participant pool at the Max Planck Institute for Psycholinguistics, and anyone who took part in the norming study was excluded from participation. Participants received a payment of &#8364;6 for their participation. Informed consent was obtained from each participant. The study was approved by the ethics board of the Faculty of Social Sciences of Radboud University.</p>
<p>The number of participants was determined through an a-priori power analysis conducted in G*Power (<xref ref-type="bibr" rid="B10">Faul, Erdfelder, Lang, &amp; Buchner, 2007</xref>), which suggested that a sample size of 39 participants (13 per list) would give us at least 80% power to observe a main effect of sentence type with an effect size <italic>f</italic> of 0.15 or larger (which was calculated based on the results in <xref ref-type="bibr" rid="B33">Stewart et al. 2007, Experiment 2</xref>).<xref ref-type="fn" rid="n2">2</xref> To account for the interaction with processing depth, we included twice as many participants per cell, hence, including four times as many participants, leading to a total of 156 participants.</p>
</sec>
<sec>
<title>Materials and design</title>
<p>As in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>), we used a 2 &#215; 3 mixed design, consisting of a between-participants factor &#8216;depth of processing&#8217; of the embedded clause with two levels (shallow processing, deep processing), and a within-participants factor &#8216;sentence type&#8217; with three levels (ambiguous, first character reference, second character reference). The materials consisted of 60 target sentences, with an ambiguous version, a first character reference version, and a second character reference version each. Because self-paced reading experiments typically show spill-over effects, in that difficulty induced by a given word slows down the reading times of subsequent words, the sentences were created such that the word immediately following the pronoun (PRONOUN+1) was always a closed class item (a preposition or determiner) and the word after that (PRONOUN+2) was an open class item. We made sure that the frequency of the open class items was not extremely low (i.e., &gt;1 per million words) or extremely high (&#8804;100 per million). The mean word frequency of the PRONOUN+1 items was 17862.88 per million words (sd = 8345.13); the mean frequency of PRONOUN+2 items was 33.65 per million (sd = 29.28). Word frequencies were retrieved from the SubtLex-NL database (<xref ref-type="bibr" rid="B22">Keuleers, Brysbaert, &amp; New, 2010</xref>).</p>
<p>We ensured that the critical regions of interest in the sentences in the different conditions were identical. To do so, we switched the order of the female and male proper names in the first and second character reference sentences (e.g., <italic>Abel lent Zo&#235;</italic> and <italic>Zo&#235; lent Abel</italic>) which ensured the use of the same pronoun across conditions. All of the critical sentences occurred with the third person masculine personal pronoun <italic>hij</italic> &#8216;he&#8217;, as the Dutch feminine pronoun <italic>zij</italic> or <italic>ze</italic> &#8216;she&#8217; can also refer to the third person plural pronoun &#8216;they&#8217;, in which case a sentence as in (13) would be disambiguated only at the sentence-final verb. Dutch stereotypically male or female proper names were selected from the Dutch First Name database (Nederlandse Voornamenbank<xref ref-type="fn" rid="n3">3</xref>). The different versions (ambiguous, first character reference, second character reference) of each experimental item were rotated across three lists, so that each participant saw every item only once. We further included 30 filler sentences that also consisted of a main and an embedded clause, but differed from the experimental items in that they only contained one character. Twenty of the filler sentences occurred with the feminine personal pronoun <italic>ze</italic> &#8216;she&#8217; and ten occurred without any personal pronoun or with the masculine personal pronoun <italic>hij</italic> &#8216;he&#8217;. Filler items were identical for each of the lists.</p>
<p>To ensure that participants could not use plausibility information to guide the interpretation of the pronoun in the critical sentences, we conducted a norming study to establish that the first and second character reference interpretations were equally plausible. The norming materials consisted of 70 unambiguous sentences. Following the procedure reported in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>), a first character reference and a second character reference version were included for each sentence. The norming study consisted of two lists, with first and second character references of the same sentence occurring on different lists. Each list contained equal numbers of first and second character reference sentences. In addition, we included 20 sentences that were implausible on the basis of pragmatic or world knowledge (e.g., <italic>Paul lachte Sophie uit toen hij van zijn fiets viel</italic> &#8216;Paul laughed at Sophie when he fell off his bike&#8217; and <italic>Rick stuurde een onderzee&#235;r om de bosbrand te blussen</italic> &#8216;Rick sent a submarine to extinguish the forest fire&#8217;). The norming study was run online, using an online tool for web experiments (Frinex) developed by the technical group of the MPI.</p>
<p>A total of 40 native speakers of Dutch (<italic>n</italic> = 20 per list; mean age = 23.28; sd = 2.92) were asked to rate each sentence for plausibility of the event it described (<italic>Hoe waarschijnlijk is het dat dit gebeurt?</italic> &#8216;How likely is this happening?&#8217;) on a 7-point Likert scale, with 1 corresponding to &#8216;highly implausible&#8217; (<italic>zeer onwaarschijnlijk</italic> in Dutch) and 7 to &#8216;highly plausible&#8217; (<italic>zeer waarschijnlijk</italic>). Based on the results of the norming study, a set of 60 sentences were selected for which the difference between the mean ratings for the two versions of the same sentence was minimal. The mean difference between the first and second character reference sentences for these 60 sentences was 0.40 (sd = 0.36), with a range of 0 to 1.2. The mean plausibility of the sentences referring to the first-mentioned character was 5.45 (sd = 0.84), and the mean plausibility of the sentences referring to the second-mentioned character was 5.39 (sd = 0.83).</p>
<p>All sentences in the main experiment were followed by a two-choice comprehension question. As described above, these questions served to manipulated the processing depth. In the shallow processing condition, all questions probed the information content of the main clause. Hence, the embedded clause was irrelevant to the comprehension question and participants were not required to resolve the pronominal reference. Sentence (13) above, for instance, was followed by the question <italic>Wie leende het boek aan Gijs?</italic> &#8216;Who lent the book to Gijs?&#8217;, with answer options <italic>Abel</italic> (correct) or <italic>Zo&#235;</italic> (incorrect). To answer this question, participants were merely required to remember who the characters were in the sentence, since only one of the two answer options occurred in the sentence. A third of the sentences were followed by a question that did not mention any of the characters (e.g., <italic>Wie werd er uitgelachen?</italic> &#8216;Who was laughed at?&#8217; after the sentence <italic>Jordy lachte Marc uit toen hij de grap begreep</italic> &#8216;Jordy laughed at Marc when he understood the joke&#8217;), with answer options <italic>Jordy</italic> (incorrect) and <italic>Marc</italic> (correct). In these cases, participants needed to remember who did what as expressed in the main clause, but they, crucially, did not need to resolve the pronominal reference in the embedded clause. Half of the shallow processing questions targeted the first-mentioned referent, and half targeted the second-mentioned referent (i.e., <italic>Aan wie leende Abel het boek?</italic> &#8216;To whom did Abel lend the book?&#8217;).</p>
<p>In the deep processing condition, all sentences were followed by a question that probed the information content of the embedded clause, such that the questions could not be answered without resolving the pronominal reference. Sentence (13) above was paired with the question <italic>Wie ging op vakantie?</italic> &#8216;Who went on holiday?&#8217; with answer choices <italic>Abel</italic> and <italic>Gijs</italic>. In the ambiguous condition, both answers reflected potential interpretations of the pronoun, while in the disambiguated sentences only one of the two choices was correct (e.g., for (18): <italic>Abel</italic> (correct) or <italic>Zo&#235;</italic> (incorrect)).</p>
</sec>
<sec>
<title>Procedure</title>
<p>The experiment was run online, using the same online tool for web experiments as was used for the norming study (Frinex). In the experiment, participants read sentences in a word-by-word self-paced reading paradigm (<xref ref-type="bibr" rid="B20">Just et al., 1982</xref>). Participants used their own keyboards to respond. Upon pressing the space bar, a trial began and the first word of the sentence appeared. Participants continued to press the space bar to read each successive word, and as each word appeared, preceding words were masked again. Participants were instructed to read at a natural pace and to make sure they understood what they were reading so that they could answer the comprehension questions accurately. Reading times were measured for each word from the time it appeared on the screen until the space bar was pressed.</p>
<p>After participants read the complete sentence, a comprehension question appeared on the screen with two answer choices, to which participants responded with a key press. Participants were randomly assigned to either the shallow processing or the deep processing condition. In the shallow processing condition, the answers consisted of the correct character name and a character name that had not appeared in the preceding sentence. In a portion of the questions no character was mentioned in the question (e.g., <italic>Wie vertelde over het probleem?</italic> &#8216;Who mentioned the problem?&#8217;), in which case the two character names from the sentence were given as possible answers. In the deep processing condition, the two character names that had appeared in the preceding sentence were always depicted on the screen as possible answers. The positions of the two answers on the screen were randomized. After answering, participants proceeded to the next trial by pressing the space bar.</p>
<p>We generated twelve pseudo-random orders of item presentation to which participants were randomly assigned. The experimental session was preceded by a set of 8 practice trials. The practice trials were meant to familiarize the participant with the experiment and to mimic the different conditions of the between-participants manipulation of processing depth: in the shallow processing condition, practice trials were followed by questions targeting main clause information, while questions targeted the information content of the embedded clause were used in the deep processing condition.</p>
</sec>
<sec>
<title>Analysis</title>
<p>We recorded reading times (RTs) for each word, and accuracy and question-answering times for the comprehension questions after each sentence. Eight participants were removed for technical reasons (only part of their responses were properly recorded, likely due to connection issues or because they continuously pressed the space bar for at least a portion of the experiment). We then calculated the mean accuracy on the comprehension questions after filler sentences for each participant, and removed one participant with &lt;75% accuracy from further analyses. All remaining 147 participants had a high accuracy on the filler items (on average 99% accurate on the filler items under both deep and shallow processing conditions).</p>
<p>The reading and question-answering time data were analyzed with linear mixed-effects models, using the <monospace>lme4</monospace> package (<xref ref-type="bibr" rid="B5">Bates, Kliegl, Vasishth, &amp; Baayen, 2015</xref>) in the <monospace>R</monospace> environment (<xref ref-type="bibr" rid="B30">R Core Team, 2016</xref>) (version 3.6.0). Post-hoc contrasts were performed using the <monospace>emmeans</monospace> package (<xref ref-type="bibr" rid="B25">Lenth, Singmann, Love, Buerkner, &amp; Herve, 2018</xref>) and <italic>p</italic>-values were computed using the Satterthwaite approximation for degrees of freedom, as implemented in the <monospace>lmerTest</monospace> package (<xref ref-type="bibr" rid="B23">Kuznetsova, Brockhoff, &amp; Christensen, 2016</xref>). We used the optimiser &#8220;bobyqa&#8221; to help with convergence issues.</p>
</sec>
<sec>
<title>Analysis of reading times</title>
<p>Unreasonably short and long latencies (&lt;100 ms and &gt;2500 ms) were excluded prior to analysis, which resulted in a loss of less than 1% of the data. No additional outlier removal was performed. Reading times were log-transformed (natural log) to normalize residuals and reduce the effect of extreme data points. Log-transformed reading times were then regressed against two factors that are known to affect reading times in self-paced reading tasks: word length and the list position of the sentence in the stimulus list (i.e., longer words predict longer reading times and later list positions predict faster reading times; <xref ref-type="bibr" rid="B18">Hofmeister, 2011</xref>; <xref ref-type="bibr" rid="B19">Hofmeister &amp; Vasishth, 2014</xref>). The model estimating these effects included by-participant slopes of <sc>word length</sc> and <sc>list position</sc>. Both <sc>word length</sc> and <sc>list position</sc> were log-transformed with a natural logarithm and then centered. The data from all sentences (experimental and filler) were used to produce maximally general estimates (<xref ref-type="bibr" rid="B19">Hofmeister &amp; Vasishth, 2014</xref>).</p>
<p>We analyzed the residual log reading times in five separate regions of interest: the pronoun itself (<sc>pronoun</sc>), the two regions immediately following the pronoun (<sc>pronoun</sc>+1 and <sc>pronoun</sc>+2), and the two regions preceding it (<sc>pronoun</sc>-1 and <sc>pronoun</sc>-2). The words used in each of these regions were identical across conditions. We included the two words following the pronoun as self-paced reading experiments often show spill-over effects, i.e., effects on words following the region of interest in the sentence. To establish whether there were any differences between the conditions prior to the pronoun, we further included the two words preceding the pronoun: the <sc>pronoun</sc>-1 region (which was always the first word of the embedded clause) and the <sc>pronoun</sc>-2 region. Linear mixed-effects models for the different regions of interest were fitted with fixed effects of <sc>sentence type</sc> and <sc>depth of processing</sc> and their interactions. <sc>sentence type</sc> was coded with successive differences contrast coding (using the <monospace>contr.sdif()</monospace> function in the <sc>mass</sc> package; <xref ref-type="bibr" rid="B40">Venables &amp; Ripley, 2002</xref>), comparing the differences between the means of the second and first levels (ambiguous vs. first character reference), and the third and second levels (second character reference vs. ambiguous). <sc>depth of processing</sc> was coded with scaled sum contrasts (or effect coding; &#8211;0.5 and 0.5). We fitted the models with the maximal random effects structure justified by the design (<xref ref-type="bibr" rid="B4">Barr, Levy, Scheepers, &amp; Tily, 2013</xref>), which included by-participant and by-item intercepts, by-participant slopes for <sc>sentence type</sc>, and by-item slopes for <sc>sentence type, depth</sc>, and their interactions. Random slopes were removed when the model did not converge; random slopes correlated above 0.95 were also removed to avoid overfitting. This resulted in models that included by-participant and by-item intercepts and a by-item slope for <sc>depth</sc> in the <sc>pronoun, pronoun</sc>+1, and <sc>pronoun</sc>+2 regions, and models with by-participant and by-item intercepts and by-item slopes for <sc>sentence type, depth</sc>, and their interactions for the <sc>pronoun</sc>-1 and <sc>pronoun</sc>-2 regions.</p>
</sec>
<sec>
<title>Analysis of question responses</title>
<p>For the question-answering times, unreasonably high (&gt;10000 ms) and low (&lt;700 ms) response times were removed. This resulted in 0.85% of the response time data being excluded. Response times were log-transformed (natural log) and log response times more than 2.5 standard deviations from the condition mean (per participant) were excluded, resulting in the removal of a further 1.22% of the data. Log-transformed question-answering times were analyzed with linear mixed-effects models that included a fixed effect of <sc>sentence type</sc> and <sc>depth of processing</sc> and their interactions. As with the analysis of reading times, <sc>sentence type</sc> was coded with successive differences contrast coding, comparing the ambiguous condition to the first character reference and to the second character reference conditions. <sc>depth of processing</sc> was coded with scaled sum contrasts and random intercepts for subjects and items were included.</p>
</sec>
</sec>
<sec>
<title>Results</title>
<sec>
<title>Reading times</title>
<p>A summary of the non-transformed reading times across the experimental conditions is given in <xref ref-type="table" rid="T1">Table 1</xref>. Analyses were conducted on residual log reading times, as shown in <xref ref-type="fig" rid="F1">Figure 1</xref>. <xref ref-type="table" rid="T2">Table 2</xref> gives the output of the mixed-effect models.</p>
<table-wrap id="T1">
<label>Table 1</label>
<caption>
<p>Means (and SEs) for reading latencies in the three regions of interest under shallow and deep processing conditions.</p>
</caption>
<table>
<tr>
<th colspan="7"><hr/></th>
</tr>
<tr>
<th align="left" valign="top">Depth</th>
<th align="left" valign="top">Sentence type</th>
<th align="left" valign="top"><sc>pronoun</sc>-2</th>
<th align="left" valign="top"><sc>pronoun</sc>-1</th>
<th align="left" valign="top"><sc>pronoun</sc></th>
<th align="left" valign="top"><sc>pronoun</sc>+1</th>
<th align="left" valign="top"><sc>pronoun</sc>+2</th>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Shallow</td>
<td align="left" valign="top">Ambiguous</td>
<td align="left" valign="top">309 (4.82)</td>
<td align="left" valign="top">313 (4.95)</td>
<td align="left" valign="top">280 (3.52)</td>
<td align="left" valign="top">273 (3.21)</td>
<td align="left" valign="top">285 (3.50)</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top">First</td>
<td align="left" valign="top">314 (5.10)</td>
<td align="left" valign="top">319 (4.81)</td>
<td align="left" valign="top">285 (3.29)</td>
<td align="left" valign="top">272 (3.05)</td>
<td align="left" valign="top">288 (3.31)</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top">Second</td>
<td align="left" valign="top">309 (5.00)</td>
<td align="left" valign="top">310 (4.86)</td>
<td align="left" valign="top">283 (3.31)</td>
<td align="left" valign="top">276 (3.23)</td>
<td align="left" valign="top">285 (3.71)</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Deep</td>
<td align="left" valign="top">Ambiguous</td>
<td align="left" valign="top">364 (5.76)</td>
<td align="left" valign="top">377 (5.57)</td>
<td align="left" valign="top">350 (4.51)</td>
<td align="left" valign="top">335 (4.18)</td>
<td align="left" valign="top">355 (4.41)</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top">First</td>
<td align="left" valign="top">368 (5.69)</td>
<td align="left" valign="top">366 (5.16)</td>
<td align="left" valign="top">345 (4.34)</td>
<td align="left" valign="top">326 (3.50)</td>
<td align="left" valign="top">338 (3.77)</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top">Second</td>
<td align="left" valign="top">367 (5.83)</td>
<td align="left" valign="top">365 (5.00)</td>
<td align="left" valign="top">351 (5.05)</td>
<td align="left" valign="top">330 (3.69)</td>
<td align="left" valign="top">345 (4.23)</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
</table>
</table-wrap>
<fig id="F1">
<label>Figure 1</label>
<caption><p>Residual log reading times in the regions of interest under shallow and deep processing conditions (regressed against word length and the log list position). Error bars represent 95% confidence intervals.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-1-1-66-g1.png"/>
</fig>
<table-wrap id="T2">
<label>Table 2</label>
<caption>
<p>Fixed effect estimates, standard errors, and p-values for linear mixed effects regression for the residual log reading times. Results from the full model are given there was a marked change in fixed effects in the reduced model.</p>
</caption>
<table>
<tr>
<th colspan="11"><hr/></th>
</tr>
<tr>
<th align="left" valign="top"></th>
<th align="left" valign="top" colspan="2"><sc>pronoun</sc>-2</th>
<th align="left" valign="top" colspan="2"><sc>pronoun</sc>-1</th>
<th align="left" valign="top" colspan="2"><sc>pronoun</sc></th>
<th align="left" valign="top" colspan="2"><sc>pronoun</sc>+1</th>
<th align="left" valign="top" colspan="2"><sc>pronoun</sc>+2</th>
</tr>
<tr>
<th colspan="11"><hr/></th>
</tr>
<tr>
<th align="left" valign="top"></th>
<th align="left" valign="top"><italic>&#946;</italic> (SE)</th>
<th align="left" valign="top"><italic>p</italic>-value</th>
<th align="left" valign="top"><italic>&#946;</italic> (SE)</th>
<th align="left" valign="top"><italic>p</italic>-value</th>
<th align="left" valign="top"><italic>&#946;</italic> (SE)</th>
<th align="left" valign="top"><italic>p</italic>-value</th>
<th align="left" valign="top"><italic>&#946;</italic> (SE)</th>
<th align="left" valign="top"><italic>p</italic>-value</th>
<th align="left" valign="top"><italic>&#946;</italic> (SE)</th>
<th align="left" valign="top"><italic>p</italic>-value</th>
</tr>
<tr>
<td colspan="11"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Intercept</td>
<td align="left" valign="top">&#8211;0.081 (0.012)</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
<td align="left" valign="top">&#8211;0.060 (0.011)</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
<td align="left" valign="top">&#8211;0.060 (0.008)</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
<td align="left" valign="top">&#8211;0.077 (0.008)</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
<td align="left" valign="top">&#8211;0.158 (0.009)</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td colspan="11"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">First vs. Amb</td>
<td align="left" valign="top">&#8211;0.006 (0.010)</td>
<td align="left" valign="top">0.577</td>
<td align="left" valign="top">0.005 (0.009)</td>
<td align="left" valign="top">0.540</td>
<td align="left" valign="top">0.005 (0.007)</td>
<td align="left" valign="top">0.478</td>
<td align="left" valign="top">0.014 (0.007)</td>
<td align="left" valign="top"><bold>0.033</bold></td>
<td align="left" valign="top">0.019 (0.007)</td>
<td align="left" valign="top"><bold>0.008</bold></td>
</tr>
<tr>
<td colspan="11"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Amb vs. Sec</td>
<td align="left" valign="top">&#8211;0.005 (0.009)</td>
<td align="left" valign="top">0.596</td>
<td align="left" valign="top">&#8211;0.015 (0.010)</td>
<td align="left" valign="top">0.127</td>
<td align="left" valign="top">0.001 (0.007)</td>
<td align="left" valign="top">0.893</td>
<td align="left" valign="top">&#8211;0.002 (0.007)</td>
<td align="left" valign="top">0.763</td>
<td align="left" valign="top">&#8211;0.020 (0.007)</td>
<td align="left" valign="top"><bold>0.006</bold></td>
</tr>
<tr>
<td colspan="11"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Depth</td>
<td align="left" valign="top">&#8211;0.014 (0.013)</td>
<td align="left" valign="top">0.280</td>
<td align="left" valign="top">&#8211;0.026 (0.014)</td>
<td align="left" valign="top">0.069</td>
<td align="left" valign="top">&#8211;0.057 (0.012)</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
<td align="left" valign="top">&#8211;0.043 (0.011)</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
<td align="left" valign="top">&#8211;0.037 (0.014)</td>
<td align="left" valign="top"><bold>0.012</bold></td>
</tr>
<tr>
<td colspan="11"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">First vs. Amb &#215; Depth</td>
<td align="left" valign="top">&#8211;0.004 (0.017)</td>
<td align="left" valign="top">0.804</td>
<td align="left" valign="top">&#8211;0.038 (0.019)</td>
<td align="left" valign="top">0.053</td>
<td align="left" valign="top">&#8211;0.030 (0.014)</td>
<td align="left" valign="top"><bold>0.033</bold></td>
<td align="left" valign="top">&#8211;0.011 (0.013)</td>
<td align="left" valign="top">0.393</td>
<td align="left" valign="top">&#8211;0.050 (0.014)</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td colspan="11"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Amb vs. Sec &#215; Depth</td>
<td align="left" valign="top">0.007 (0.017)</td>
<td align="left" valign="top">0.667</td>
<td align="left" valign="top">0.029 (0.020)</td>
<td align="left" valign="top">0.159</td>
<td align="left" valign="top">0.031 (0.014)</td>
<td align="left" valign="top"><bold>0.026</bold></td>
<td align="left" valign="top">0.026 (0.013)</td>
<td align="left" valign="top">0.053</td>
<td align="left" valign="top">0.035 (0.014)</td>
<td align="left" valign="top"><bold>0.013</bold></td>
</tr>
<tr>
<td colspan="11"><hr/></td>
</tr>
</table>
</table-wrap>
<p>Throughout the <sc>pronoun</sc> region and the two regions following it, reading times were strongly affected by processing depth. The shallow processing condition elicited faster average reading times (280 ms on average) than the deep processing condition (342 ms on average) when comparing reading times in the <sc>pronoun</sc> region and the two spill-over regions. A main effect of <sc>depth</sc> was found for each of these three regions (<sc>pronoun</sc> region: <italic>&#946;</italic> = &#8211;0.057, <italic>p</italic> &lt; 0.001; <sc>pronoun</sc>+1 region: <italic>&#946;</italic> = &#8211;0.043, <italic>p</italic> &lt; 0.001; <sc>pronoun</sc>+2 region: <italic>&#946;</italic>= &#8211;0.037, <italic>p</italic> = 0.012). When considering the whole-sentence reading times, the deep processing condition also led to longer average RTs (4746 ms) compared to the shallow processing condition (3950 ms). However, the effect of <sc>depth</sc> was only marginally significant in the region immediately preceding the pronoun (<sc>pronoun</sc>-1 region: <italic>&#946;</italic> = &#8211;0.026, <italic>p</italic> = 0.069) and was not significant in the region before that (<sc>pronoun</sc>-2 region: <italic>p</italic> = 0.280). This indicates that participants who were asked &#8216;superficial&#8217; questions in our shallow processing condition read the embedded clause faster than participants who were asked questions that specifically targeted the information content of that embedded clause. While the manipulation of processing depth affected the processing of the embedded clause (which started at the word in the <sc>pronoun</sc>-1 region), this further shows that it did not necessarily affect the generic processing depth over the entire sentence.</p>
<p>Zooming in on the reading times per region of interest, the results in the <sc>pronoun</sc> region did not reveal a main effect of <sc>sentence type</sc> when comparing the ambiguous condition to the first character (<italic>p</italic> = 0.478) and second character reference (<italic>p</italic> = 0.893) conditions. Crucial for the aim of the current experiment are the interactions between <sc>sentence type</sc> and <sc>depth</sc>. For the <sc>pronoun</sc> region, these interactions were significant when comparing ambiguous and first character reference sentences (<italic>&#946;</italic> = &#8211;0.030, <italic>p</italic> = 0.033) and when comparing second character reference and ambiguous sentences (<italic>&#946;</italic> = 0.031, <italic>p</italic> = 0.026).<xref ref-type="fn" rid="n4">4</xref> This suggests that there is an effect of processing depth (or task demand) on the difference in reading times between ambiguous and unambiguous sentences in this region, with shorter reading times for ambiguous sentences under shallow processing but longer reading times under deep processing, which would be in line with earlier findings of an ambiguity advantage. Numerically, the ambiguous condition was read faster under shallow processing compared to the unambiguous conditions. However, post-hoc comparisons (with a correction for multiple comparisons using the Tukey method), revealed no significant differences in reading times between the different sentence types under shallow processing (first-second: <italic>p</italic> = 0.787; first-ambiguous: <italic>p</italic> = 0.575; ambiguous-second: <italic>p</italic> = 0.219) or under deep processing (first-second: <italic>p</italic> = 0.855; first-ambiguous: <italic>p</italic> = 0.111; ambiguous-second: <italic>p</italic> = 0.304).</p>
<p>The results in the <sc>pronoun</sc>+1 region, which consisted of highly frequent words such as prepositions, showed a difference between the reading times in the ambiguous and first character reference conditions across levels of <sc>depth</sc> (<italic>&#946;</italic> = 0.014, <italic>p</italic> = 0.033).<xref ref-type="fn" rid="n5">5</xref> This suggests that the reading times for the first character reference sentences were faster compared to the ambiguous sentences when looking at both processing depths. However, and as shown in <xref ref-type="table" rid="T2">Table 2</xref>, none of the interactions between <sc>sentence type</sc> and <sc>depth</sc> were significant.</p>
<p>The <sc>pronoun</sc>+2 region showed a main effect of <sc>sentence type</sc> when comparing reading times for ambiguous to the first character reference (<italic>&#946;</italic> = 0.019, <italic>p</italic> = 0.008) and the second character reference sentences (<italic>&#946;</italic> = &#8211;0.020, <italic>p</italic> = 0.006). This shows that, across levels of <sc>depth</sc>, ambiguous sentences were read more slowly than either of the unambiguous ones, although this effect is probably driven by the deep processing condition as discussed below. The interaction between <sc>sentence type</sc> and <sc>depth</sc> was significant when comparing the ambiguous and the first character reference sentences (<italic>&#946;</italic> = &#8211;0.050, <italic>p</italic> &lt; 0.001) and the ambiguous and second character reference sentences (<italic>&#946;</italic> = 0.035, <italic>p</italic> = 0.013).<xref ref-type="fn" rid="n6">6</xref> This again suggests that the difference in reading times between ambiguous and unambiguous sentences depends on the depth of processing that participants engaged in. Post-hoc comparisons indeed showed that the ambiguous sentences were read significantly more slowly than the unambiguous sentences under deep processing (first-ambiguous: <italic>&#946;</italic> = &#8211;0.044, <italic>p</italic> &lt; 0.001; ambiguous-second: <italic>&#946;</italic> = 0.037, <italic>p</italic> &lt; 0.001). No differences were found between the unambiguous conditions (<italic>p</italic> = 0.794). The longer reading times for ambiguous sentences reflect an ambiguity disadvantage or penalty under deep processing. Under shallow processing, no significant differences between any of the sentence types were found (first-ambiguous: <italic>p</italic> = 0.817; first-second: <italic>p</italic> = 0.704; ambiguous-second: <italic>p</italic> = 0.980).</p>
<p>Finally, we analyzed the two regions preceding the pronoun which, similar to the regions following the pronoun, were identical across conditions. In the <sc>pronoun</sc>-1 region, the model revealed a marginally significant interaction when comparing the ambiguous sentences to the first character reference sentences (<italic>&#946;</italic> = &#8211;0.038, <italic>p</italic> = 0.053), in the same direction as the interaction in the <sc>pronoun</sc> region. Although this interaction was only marginally significant (in both the full and reduced models) and should be interpreted with caution, this could indicate that participants may have learned over the course of the experiment that an ambiguous pronoun may follow after having seen two stereotypically male proper names in the main clause (as opposed to one male and one female name). No other effects were significant. The model for the <sc>pronoun</sc>-2 region revealed no significant effects (see <xref ref-type="table" rid="T2">Table 2</xref>).</p>
</sec>
<sec>
<title>Question responses</title>
<p>The analysis of the question responses showed high accuracy rates across conditions, as summarized in <xref ref-type="table" rid="T3">Table 3</xref>. When examining answers to questions after ambiguous sentences in the deep processing condition (for which participants had to resolve the pronominal reference), a strong preference to resolve the ambiguous pronominal reference to the first character reference was observed: out of 1460 answers, 1040 (71%) referred to the first character and 420 (29%) to the second character (<xref ref-type="fig" rid="F2">Figure 2A</xref>). This is in line with the consistent finding of a tendency for the referent of the subject or first-mentioned noun phrase to be more accessible than other entities (e.g. <xref ref-type="bibr" rid="B1">Arnold, 1998</xref>; <xref ref-type="bibr" rid="B2">Arnold et al., 2000</xref>; <xref ref-type="bibr" rid="B29">McDonald &amp; MacWhinney, 1995</xref>; <xref ref-type="bibr" rid="B39">Van Rij et al., 2013</xref>). It is likely that the fact that referents in subject position are more prominent or salient resulted in this subject bias when resolving pronoun reference.</p>
<table-wrap id="T3">
<label>Table 3</label>
<caption>
<p>Accuracy (and SEs) per sentence type and under shallow and deep processing conditions.</p>
</caption>
<table>
<tr>
<th colspan="4"><hr/></th>
</tr>
<tr>
<th align="left" valign="top">Depth</th>
<th align="left" valign="top">Sentence type</th>
<th align="left" valign="top" colspan="2">Accuracy</th>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Shallow</td>
<td align="left" valign="top">First</td>
<td align="left" valign="top">0.95</td>
<td align="left" valign="top">(0.006)</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top">Second</td>
<td align="left" valign="top">0.94</td>
<td align="left" valign="top">(0.006)</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top">Ambiguous</td>
<td align="left" valign="top">0.94</td>
<td align="left" valign="top">(0.006)</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top">Filler</td>
<td align="left" valign="top">0.99</td>
<td align="left" valign="top">(0.002)</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Deep</td>
<td align="left" valign="top">First</td>
<td align="left" valign="top">0.97</td>
<td align="left" valign="top">(0.004)</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top">Second</td>
<td align="left" valign="top">0.97</td>
<td align="left" valign="top">(0.004)</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top">Filler</td>
<td align="left" valign="top">0.99</td>
<td align="left" valign="top">(0.002)</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
</table>
</table-wrap>
<fig id="F2">
<label>Figure 2</label>
<caption><p>Question responses. <bold>A:</bold> Relative frequency of answers referring to either the first character or the second character given by participants after ambiguous sentences in the deep processing condition. <bold>B:</bold> Log-transformed (natural log) question answering times for the different sentence types under deep and shallow processing; the lines correspond to the first, second, and third quartile.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-1-1-66-g2.png"/>
</fig>
<p>The analysis of question answering times (see <xref ref-type="fig" rid="F2">Figure 2B</xref> and <xref ref-type="table" rid="T4">Table 4</xref> for the complete model output) showed a difference between the ambiguous and the first character reference conditions (<italic>&#946;</italic> = 0.184; <italic>p</italic> &lt; 0.001) and between the ambiguous and the second character reference conditions (<italic>&#946;</italic> = 0.172; <italic>p</italic> &lt; 0.001) across levels of <sc>depth. depth</sc> did not result in a significant main effect (<italic>p</italic> = 0.292). However, the interactions between <sc>sentence type</sc> and <sc>depth</sc> were significant and showed a larger difference between ambiguous and first character reference sentences (<italic>&#946;</italic> = 0.318; <italic>p</italic> &lt; 0.001) and between ambiguous and second character reference sentences (<italic>&#946;</italic> = 0.308; <italic>p</italic> &lt; 0.001) under deep processing compared to shallow processing. Post-hoc comparisons (with a correction for multiple comparisons using the Tukey method) indeed showed no significant differences between sentence types under shallow processing (first-second: <italic>p</italic> = 0.838; first-ambiguous: <italic>p</italic> = 0.124; second-ambiguous: <italic>p</italic> = 0.347). Under deep processing conditions, however, participants took significantly longer to answer comprehension questions after ambiguous sentences compared to both first character reference (<italic>&#946;</italic> = 0.344, <italic>p</italic> &lt; 0.001) and second character reference sentences (<italic>&#946;</italic> = 0.326, <italic>p</italic> &lt; 0.001). No difference between the unambiguous sentences was found (<italic>p</italic> = 0.341).</p>
<table-wrap id="T4">
<label>Table 4</label>
<caption>
<p>Fixed effect estimates for linear mixed effects regression for log-transformed (natural log) question answering times.</p>
</caption>
<table>
<tr>
<th colspan="5"><hr/></th>
</tr>
<tr>
<th align="left" valign="top"></th>
<th align="left" valign="top" colspan="4">Question answering times</th>
</tr>
<tr>
<th colspan="5"><hr/></th>
</tr>
<tr>
<th align="left" valign="top"></th>
<th align="left" valign="top">Estimate</th>
<th align="left" valign="top">SE</th>
<th align="left" valign="top"><italic>t</italic>-value</th>
<th align="left" valign="top"><italic>p</italic>-value</th>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Intercept</td>
<td align="left" valign="top">7.593</td>
<td align="left" valign="top">0.023</td>
<td align="left" valign="top">336.119</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">First vs. Amb</td>
<td align="left" valign="top">0.185</td>
<td align="left" valign="top">0.009</td>
<td align="left" valign="top">20.030</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Amb vs. Sec</td>
<td align="left" valign="top">&#8211;0.172</td>
<td align="left" valign="top">0.009</td>
<td align="left" valign="top">&#8211;18.653</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Depth</td>
<td align="left" valign="top">&#8211;0.037</td>
<td align="left" valign="top">0.035</td>
<td align="left" valign="top">&#8211;1.057</td>
<td align="left" valign="top">0.292</td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">First vs. Amb &#215; Depth</td>
<td align="left" valign="top">&#8211;0.319</td>
<td align="left" valign="top">0.018</td>
<td align="left" valign="top">&#8211;17.319</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Amb vs. Sec &#215; Depth</td>
<td align="left" valign="top">0.308</td>
<td align="left" valign="top">0.018</td>
<td align="left" valign="top">16.736</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td colspan="5"><hr/></td>
</tr>
</table>
<table-wrap-foot>
<fn><p><italic>Note</italic>. First = first character reference condition; Sec = second character reference condition; Amb = ambiguous condition.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec>
<title>General discussion</title>
<p>In previous studies, the processing of different types of ambiguities has been argued to result in either an <italic>ambiguity advantage</italic> or in an <italic>ambiguity penalty</italic>. The present study contributes to this line of research by examining why ambiguous sentences sometimes lead to ambiguity advantages and other times to ambiguity penalties. We tested the hypothesis that an ambiguity penalty occurs when task demands require a &#8216;deeper&#8217; interpretation of the embedded clause and the pronoun in it, while an ambiguity advantage occurs when task demands allow for more superficial interpretations of the pronominal reference. To do so, we provided a conceptual replication of the experiments reported in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>).</p>
<p>The results showed a large effect of depth of processing of the embedded clause, with a reading time difference between the shallow and deep processing conditions of about 60 ms on average in the pronoun region and the two regions following it. Participants who were asked comprehension questions that were only superficial, in that they did not target the content of the embedded clause and did not require resolving the pronominal reference (the shallow processing condition), showed faster reading times in the regions of interest than participants who were asked questions that targeted the information content of the embedded clause and that could not be answered without resolving the pronominal reference (deep processing). This suggests that the questions in our deep processing condition did indeed lead to &#8216;deeper&#8217; processing: participants read the relevant parts of the sentences more slowly, and hence likely more carefully, when they expected more difficult questions. The effect of depth was not significant in the <sc>pronoun</sc>-2 region (i.e., the region two words before the pronoun) and only marginally significant in the <sc>pronoun</sc>-1 region (i.e., the region immediately preceding the pronoun), which suggests that our manipulation of processing depth mainly affected the processing of the embedded clause, and not the processing depth of the entire sentence. This is most likely a consequence of the way in which we manipulated processing depth, as the comprehension questions manipulated the relevance of the embedded clause. In other words, participants read the embedded clauses, specifically on and following the critical pronouns, more slowly under deep processing, but not necessarily the entire sentences.</p>
<p>The most important results from the reading time analysis for the different regions were the significant interactions between sentence type and processing depth in the <sc>pronoun</sc> and <sc>pronoun</sc>+2 regions. In these two regions, we found that the difference in reading times for ambiguous versus unambiguous sentences depended on the depth of processing.<xref ref-type="fn" rid="n7">7</xref> Post-hoc comparisons revealed a significant <italic>ambiguity penalty</italic> (or <italic>disadvantage</italic>) in the <sc>pronoun</sc>+2 region under deep processing, but not under shallow processing. In the <sc>pronoun</sc> region, the results for shallow processing showed numerically faster reading times for the ambiguous condition relative to the unambiguous conditions, which is in line with earlier findings of an <italic>ambiguity advantage</italic> under shallow conditions. However, this effect was not significant.</p>
<sec>
<title>Comparison to earlier findings</title>
<p>Our reading time results replicate the results reported in Stewart et al. (<xref ref-type="bibr" rid="B33">2007, Experiment 1</xref>), using a different methodology (i.e., measuring non-cumulative word-by-word reading times rather than whole-sentence reading times), a larger sample size to provide adequate power, up-to-date statistical analyses, and a language that is different from but related to English.</p>
<p>Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) found that whole-sentence self-paced reading times in the ambiguous and unambiguous conditions did not differ significantly from each other under shallow processing, while ambiguous sentences were read more slowly (i.e., an ambiguity penalty) than unambiguous sentences under deep processing. These results for whole-sentence reading times showed a similar pattern to our results for word-by-word reading times: in our results, the conditions also did not differ from each under under shallow processing, while an ambiguity penalty was found under deep processing. In contrast, Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) observed a significant ambiguity advantage in the post-critical region when comparing sentences with an ambiguous pronominal reference to unambiguous sentences in an eye-tracking while reading experiment.</p>
<p>A relevant question is why we did not find a significant ambiguity advantage under shallow processing. One potential reason is that participants in our study, and in the experiments reported in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>), did not read the embedded clause well enough to lead to differences between the sentence types. Since, in our study, none of the comprehension questions in the shallow processing version targeted the content of the embedded clause, this may have lead some participants to only skim this part of the sentence. However, considering the fact that this was a self-paced reading experiment in which participants were presented each word of the sentence in a cumulative way (different from an eye-tracking experiment in which it is easier to skip over parts of the sentence), this seems unlikely. Moreover, Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) did include some (4/24) comprehension questions that targeted the embedded clause and did not find a significant ambiguity advantage either.</p>
<p>A different explanation relating to the shallow comprehension questions, suggested to us by a reviewer, is that our &#8216;shallow&#8217; questions may not have been shallow enough to observe an ambiguity advantage. While none of the shallow questions targeted the pronominal reference, the questions did refer to the referents (asking who did something). This may have invited strategic attention to the referents, which may have wiped out a possible ambiguity advantage. In their study, Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) also included questions that probed the comprehension of the pronoun, but did so after only a proportion of the trials. It could, therefore, be the case that the occurrence of a question relating to a referent on all trials motivated a less superficial type of referential processing even in the shallow condition, resulting in the absence of a significant ambiguity advantage.</p>
<p>It could also be the case that the processing system handles the resolution of structural (i.e., attachment) ambiguities and pronominal reference ambiguities differently. Unlike structural/attachment ambiguities, which have typically been shown to facilitate processing (e.g. <xref ref-type="bibr" rid="B9">Clifton &amp; Staub, 2008</xref>; <xref ref-type="bibr" rid="B35">Traxler et al., 1998</xref>; <xref ref-type="bibr" rid="B36">Van Gompel et al., 2005</xref>, <xref ref-type="bibr" rid="B38">2001</xref>), referential ambiguities would then more often <italic>hinder</italic> processing (<xref ref-type="bibr" rid="B3">Badecker &amp; Straub, 2002</xref>; <xref ref-type="bibr" rid="B33">Stewart et al., 2007</xref>). While this position is not unreasonable considering that referential and structural ambiguities are distinct phenomena, the findings by Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) suggests that, when compared directly and under equivalent conditions, structural and referential ambiguity show similar patterns in eye movements during reading. This suggests that readers react to referential and structural ambiguity in a similar fashion.</p>
<p>Another possible explanation is that the ambiguity advantage for pronominal references may constitute such a small effect that even more statistical power and/or a more sensitive experimental paradigm is needed to observe it. For instance, it could be that the effect is more pronounced in an eye-tracking study, explaining why Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>) did find a significant ambiguity advantage.</p>
<p>Moving back to comparisons to earlier findings, we used a word-by-word self-paced reading task, while Stewart et al. (<xref ref-type="bibr" rid="B33">2007, Experiment 1</xref>) examined whole-sentence reading times. This allowed us to localize the effects and obtain a more detailed picture of the time course of processing. The results showed an ambiguity disadvantage in the <sc>pronoun</sc>+2 region, which was a post-critical region. While spill-over effects are expected in self-paced reading experiments, this could potentially indicate that referential ambiguity does not immediately result in an ambiguity penalty, but that this effect is delayed. This is similar to what Loga&#269;ev and Vasishth (<xref ref-type="bibr" rid="B26">2016a</xref>) reported for an ambiguity disadvantage in structurally ambiguous sentences, where a relatively late ambiguity disadvantage was observed at the last word of the relative clause. It is also in line with Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>), who argued that readers may delay the resolution of ambiguous pronouns. A different line of literature on the processing of gender features also suggests that ambiguous pronouns do not disrupt comprehension immediately (e.g. <xref ref-type="bibr" rid="B17">Greene, McKoon, &amp; Ratcliff, 1992</xref>; <xref ref-type="bibr" rid="B31">Rigalleau, Caplan, &amp; Baudiffier, 2004</xref>), but may do so after a delay.</p>
<p>Finally, the question-answering results showed a larger difference between the ambiguous and both of the unambiguous conditions when questions required resolving the pronominal reference compared to when questions were only superficial. Similar results were reported in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) and Swets et al. (<xref ref-type="bibr" rid="B34">2008</xref>), who showed that question-answering times were significantly longer after ambiguous than after unambiguous sentences when &#8216;deep&#8217; processing questions were asked, but no such difference was found when questions were only superficial.</p>
</sec>
<sec>
<title>Theoretical implications</title>
<p>Our results provide additional evidence, along the lines of Swets et al. (<xref ref-type="bibr" rid="B34">2008</xref>) and Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>), that the difference in reading times between referentially ambiguous and unambiguous sentences depends on the readers&#8217; goals, in our case whether the manipulation of &#8216;depth of processing&#8217; required the reader to answer questions that targeted the antecedent of the pronoun in the embedded clause. This has important implications for models of sentence processing. In particular, it suggests that ambiguity resolution is a <italic>flexible</italic> process, such that the way in which ambiguous sentences are processed (relative to unambiguous ones) depends on the processing depth or task demands.</p>
<p>In previous studies of structural and referential ambiguities, ambiguity penalties have been taken as evidence for constraint-based parsing models or competition-based models, in which the parallel consideration of multiple structures gives rise to competition effects in processing (e.g., <xref ref-type="bibr" rid="B3">Badecker &amp; Straub, 2002</xref>; <xref ref-type="bibr" rid="B28">MacDonald, 1994</xref>). In contrast, ambiguity advantages have been taken as support for non-competitive models such as the Unrestricted Race Model (e.g. <xref ref-type="bibr" rid="B37">Van Gompel et al., 2000</xref>), according to which potential syntactic analyses engage in a race before a single analysis is adopted. In a race-based model, disambiguated sentences are predicted to take longer to process than ambiguous ones because processing delays occur when subsequent information is inconsistent with a previously adopted analysis and the initial analysis needs to be revised (<xref ref-type="bibr" rid="B37">Van Gompel et al., 2000</xref>). Alternatively, an ambiguity advantage is predicted because the adopted analysis is the one that is completed the fastest, leading to shorter times on average when there are several candidates, as in globally ambiguous cases compared to unambiguous cases (<xref ref-type="bibr" rid="B26">Loga&#269;ev &amp; Vasishth, 2016a</xref>). The current results warrant the formulation of a model of ambiguity resolution that takes into account the effect of processing depth on ambiguity resolution, and that should be able to capture <italic>both</italic> ambiguity advantages under shallow processing and ambiguity penalties under deep processing.</p>
<p>One proposal that has been put forward to account for moderating effects of processing depth or task demands is based on <italic>underspecification</italic> (cf. <xref ref-type="bibr" rid="B11">Ferreira et al., 2002</xref>; <xref ref-type="bibr" rid="B21">Karimi &amp; Ferreira, 2016</xref>; <xref ref-type="bibr" rid="B33">Stewart et al., 2007</xref>; <xref ref-type="bibr" rid="B34">Swets et al., 2008</xref>). This proposal is in line with the &#8216;good-enough&#8217; processing approach, which argues that readers or listeners may construct sentence representations that are only &#8216;good-enough&#8217;, i.e., not necessarily complete and fully specified, but that suffice to accomplish the communicative goal at hand (e.g., <xref ref-type="bibr" rid="B7">Christianson, 2016</xref>; <xref ref-type="bibr" rid="B8">Christianson, Hollingworth, Halliwell, &amp; Ferreira, 2001</xref>; <xref ref-type="bibr" rid="B13">Ferreira &amp; Lowder, 2016</xref>; <xref ref-type="bibr" rid="B14">Ferreira &amp; Patson, 2007</xref>). For structural ambiguities, Swets et al. (<xref ref-type="bibr" rid="B34">2008</xref>) argued that readers may strategically underspecify the representation of structurally ambiguous sentences to save time, unless disambiguation is required by task demands as under deep processing. Committing to one reading and building the corresponding syntactic structure is argued to take time. Therefore, not having to make a commitment because of underspecification under shallow processing is expected to result in a reading time advantage for ambiguous sentences. The longer question answering times for ambiguous sentences under deep processing, as also found in our study, have been taken as additional evidence in favor of the delay of a commitment to a particular reading (<xref ref-type="bibr" rid="B34">Swets et al., 2008</xref>).</p>
<p>For referential ambiguities, Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) argued that ambiguous pronouns may initially be underspecified under shallow processing, resulting from the absence of co-indexation between the pronoun and its antecedent. According to this approach, readers invest processing resources to interpret the ambiguous pronoun only when they are required to engage in deep processing; under shallow processing ambiguous pronouns may remain underspecified. A sentence such as <italic>Abel lent Gijs the book before he went on holiday</italic> will be understood by the comprehender to mean that someone went on holiday, but it will be left undecided whether that was Abel or Gijs. Because of this underspecification, under shallow processing reading times for ambiguous sentences are expected to be faster or equivalent compared to unambiguous sentences. By contrast, under deep processing, reading times for ambiguous sentences are expected to be slower.</p>
<p>The finding of processing depth affecting ambiguous sentence processing can also be explained without the notion of underspecification. Loga&#269;ev and Vasishth (<xref ref-type="bibr" rid="B26">2016a</xref>) proposed a Stochastic Multiple-Channel Model of structural ambiguity resolution as an extension of Van Gompel et al.&#8217;s (<xref ref-type="bibr" rid="B37">2000</xref>) Unrestricted Race Model. To account for the flexible nature of ambiguity resolution, Loga&#269;ev and Vasishth (<xref ref-type="bibr" rid="B26">2016a</xref>) stipulated a <italic>stopping rule</italic> that is determined by task demands. If the task does not require more than one permissible relative clause attachment to be constructed, the parser stops after one attachment has been computed (i.e., a <italic>first-terminating</italic> stopping rule). However, if the task requires access to all available attachment options, the parser may choose to wait for all permissible structures to be built (i.e., an <italic>exhaustive</italic> stopping rule). Under a first-terminating stopping rule, an ambiguity <italic>advantage</italic> is predicted, because the parser needs to wait for only one particular attachment in unambiguous conditions and the fastest will be adopted. However, when the exhaustive stopping rule is used, an ambiguity <italic>disadvantage</italic> is predicted because the parser needs to wait for both attachments to be computed in the ambiguous condition, which will lead to more instances of long completion times.</p>
<p>This approach can easily be applied to referential ambiguities as well. Under deep processing, as implemented in the experiment reported here, the task suggests to the participant that an exhaustive stopping rule should be used, because the task requires resolving the pronominal references or because the reader is aware that the pronouns sometimes have more than one referent. In that case, the parser waits for both referents that form potential antecedents for the ambiguous pronoun to be computed (and then possibly discards the least plausible of them). This predicts an ambiguity penalty under deep processing, because computing both referents will always take longer than computing one, even in a parallel processing system (<xref ref-type="bibr" rid="B26">Loga&#269;ev &amp; Vasishth, 2016a</xref>). In contrast, under shallow processing conditions, one referent will suffice and an ambiguity advantage is predicted as the adopted referent is the one that takes the least time, leading to shorter time on average when there are more candidates as in the ambiguous cases.</p>
<p>In sum, both underspecification and race-based models are, in modified form, able to account for the finding that processing depth may influence the difference in reading times between referentially ambiguous and unambiguous sentences: the former through strategic underspecification of the co-indexation between the pronoun and its antecedent; the latter through the use of different stopping rules determined by task demands. A difference between the approaches is that at least one referent is selected as an antecedent to the pronoun in the Stochastic Multiple-Channel Model, while the co-indexation is not necessarily specified under an underspecification model. Hence, important questions are what the resulting representations look like under shallow processing and, under an underspecification approach, whether the resulting representations are partially specified or non-specified (see <xref ref-type="bibr" rid="B27">Loga&#269;ev &amp; Vasishth 2016b, for discussion</xref>). As pointed out by Van Gompel et al. (<xref ref-type="bibr" rid="B38">2001</xref>), the relevant parts of ambiguous sentences need to be processed to some extent under an underspecification approach as well, in order for the comprehension system to be able to determine what can be left underspecified (ambiguous cases) and what cannot (unambiguous cases). In response, Swets et al. (<xref ref-type="bibr" rid="B34">2008</xref>) argued that the relevant part is processed just enough to identify disambiguation cues and inappropriate interpretations, but not deeply enough to fully disambiguate when there are two appropriate interpretations. On the basis of the present data we cannot tell which representations were generated while the sentences were read and exactly how specified these were under shallow processing. It is possible that both referents remained available as possible antecedents, as predicted under an underspecification account. It is also possible that a specific referent was selected, either at random, or, more likely, because stored knowledge was taken into account (such as the bias for referents in subject position to be more accessible). Further research is needed to distinguish between these options and evaluate the validity of both models.</p>
<p>Thinking about the broader implications of the current findings, an important question is how representative our materials and task are for other reading/listening situations. With respect to the task, people do not usually face comprehension questions. Instead, depth of processing or the processing strategy more generally may be driven by both general constraints (e.g., whether the conversation or text is judged to be important) and local factors (e.g., information structure). The consequences of shallow/deep processing may also vary across situations. In our experiment, participants read isolated sentences in which the ambiguity always concerned a choice between two referents. Considering an underspecifiation model, these materials may have biased participants towards partial specification (e.g., <italic>either Abel or Gijs went on holiday</italic>), while a context in which the potential referents are not so clearly specified may bias participants towards complete underspecification (e.g., <italic>someone went on holiday</italic>).</p>
</sec>
<sec>
<title>Concluding remarks</title>
<p>In conclusion, the present results show, once again, that the way in which ambiguous sentences are processed depends on the depth of processing that participants engage in. This finding may not only be true for ambiguity resolution, but likely generalizes to other areas of language processing as well. For instance, recent work by Laurinavichyute and von der Malsburg (<xref ref-type="bibr" rid="B24">2021</xref>) showed that the illusion of ungrammaticality (i.e., a slow-down in reading due to agreement attraction in sentences like <italic>The key to the cabinets <bold>is</bold> rusty</italic>) disappeared when participants engaged in deep processing. In addition, a study by Schotter, Bicknell, Howard, Levy, and Rayner (<xref ref-type="bibr" rid="B32">2014</xref>) showed that the type of proofreading task that participants engaged in affected properties of word processing (such as word frequency and predictability effects). An important methodological implication of these findings is that the type of comprehension questions or tasks included in an experiment may affect the manner of processing. Care must, therefore, be taken in the choice of these comprehension questions and tasks (see also <xref ref-type="bibr" rid="B15">Ferreira &amp; Yang, 2019</xref>). At a theoretical level, the present findings illustrate the need for models of ambiguity resolution that take into account the effect of processing depth on ambiguity resolution, and that can flexibly account for both ambiguity advantages and ambiguity penalties.</p>
</sec>
</sec>
<sec>
<title>Data Accessibility Statement</title>
<p>The experimental stimuli, the data, and our analysis code are available at <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://hdl.handle.net/1839/a3c0374b-8b05-4929-9458-a829f786b7ef">https://hdl.handle.net/1839/a3c0374b-8b05-4929-9458-a829f786b7ef</ext-link>.</p>
</sec>
</body>
<back>
<fn-group>
<fn id="n1"><p>In Grant et al. (<xref ref-type="bibr" rid="B16">2020</xref>), the first character reference condition is referred to as &#8216;high reference&#8217; and the second character reference condition is referred to as &#8216;low reference&#8217;; we adopt the terminology used in Stewart et al. (<xref ref-type="bibr" rid="B33">2007</xref>) for consistency.</p></fn>
<fn id="n2"><p>G*Power uses Cohen&#8217;s <italic>f</italic>, which is an effect size appropriate for a within-subjects repeated measures analysis of variance (ANOVA). We used this effect size in our power calculation despite using mixed-effect models for analysis as the two types of analyses are conceptually similar.</p></fn>
<fn id="n3"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.meertens.knaw.nl/nvb/">https://www.meertens.knaw.nl/nvb/</ext-link>.</p></fn>
<fn id="n4"><p>In the full model, the interactions between <sc>sentence type</sc> and <sc>depth</sc> in the <sc>pronoun</sc> region were only marginally significant (ambiguous vs. first character: <italic>&#946;</italic> = &#8211;0.029, <italic>p</italic> = 0.069; second character vs. ambiguous: <italic>&#946;</italic> = 0.031, <italic>p</italic> = 0.062).</p></fn>
<fn id="n5"><p>This difference was only marginally significant in the full model (<italic>p</italic> = 0.058).</p></fn>
<fn id="n6"><p>The interaction when comparing the ambiguous and second character reference sentences was only marginally significant in the full model (<italic>p</italic> = 0.083).</p></fn>
<fn id="n7"><p>We acknowledge that the marginal interaction in the pre-critical region points to the possibility that the ambiguity penalty for pronouns in the deep processing condition could, at least in part, be caused by participants learning that an ambiguous pronoun may follow two male proper names. As pointed out by a reviewer, if this were the case, the difference in processing behaviour between deep and shallow conditions may, at least in part, be an anticipatory effect. We leave this for further research.</p></fn>
</fn-group>
<sec>
<title>Ethics and Consent</title>
<p>The experiments were conducted in accordance with the Declaration of Helsinki. Informed consent was obtained from each participants prior to the start of the experiment. The study was approved by the ethics board of the Faculty of Social Sciences of Radboud University.</p>
</sec>
<ack>
<title>Acknowledgements</title>
<p>The authors would like to thank Thijs Rinsma for his assistance with programming the experiment, and Annelies van Wijngaarden, Carlijn van Herpt, and Dennis Joossen for their help with stimuli creation and participant recruitment. We also thank Laurel Brehm for useful discussions regarding the statistical analyses reported here, and three anonymous reviewers for comments on previous versions of this paper.</p>
</ack>
<sec>
<title>Competing Interests</title>
<p>The authors have no competing interests to declare.</p>
</sec>
<sec>
<title>Author Contributions</title>
<p>AC and AM conceptualized the experiment, decided on the methodology, and contributed to the interpretation of the data. AC took the lead on data collection, data analysis, and writing; AM reviewed and edited.</p>
</sec>
<ref-list>
<ref id="B1"><label>1</label><mixed-citation publication-type="book"><string-name><surname>Arnold</surname>, <given-names>J. E.</given-names></string-name> (<year>1998</year>). <source>Reference form and discourse patterns</source>. <publisher-name>Stanford University</publisher-name>.</mixed-citation></ref>
<ref id="B2"><label>2</label><mixed-citation publication-type="journal"><string-name><surname>Arnold</surname>, <given-names>J. E.</given-names></string-name>, <string-name><surname>Eisenband</surname>, <given-names>J. G.</given-names></string-name>, <string-name><surname>Brown-Schmidt</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Trueswell</surname>, <given-names>J. C.</given-names></string-name> (<year>2000</year>). <article-title>The rapid use of gender information: Evidence of the time course of pronoun resolution from eyetracking</article-title>. <source>Cognition</source>, <volume>76</volume>(<issue>1</issue>), <fpage>B13</fpage>&#8211;<lpage>B26</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0010-0277(00)00073-1</pub-id></mixed-citation></ref>
<ref id="B3"><label>3</label><mixed-citation publication-type="journal"><string-name><surname>Badecker</surname>, <given-names>W.</given-names></string-name>, &amp; <string-name><surname>Straub</surname>, <given-names>K.</given-names></string-name> (<year>2002</year>). <article-title>The processing role of structural constraints on interpretation of pronouns and anaphors</article-title>. <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>, <volume>28</volume>(<issue>4</issue>), <fpage>748</fpage>&#8211;<lpage>769</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0278-7393.28.4.748</pub-id></mixed-citation></ref>
<ref id="B4"><label>4</label><mixed-citation publication-type="journal"><string-name><surname>Barr</surname>, <given-names>D. J.</given-names></string-name>, <string-name><surname>Levy</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Scheepers</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Tily</surname>, <given-names>H. J.</given-names></string-name> (<year>2013</year>). <article-title>Random effects structure for confirmatory hypothesis testing: Keep it maximal</article-title>. <source>Journal of memory and language</source>, <volume>68</volume>(<issue>3</issue>), <fpage>255</fpage>&#8211;<lpage>278</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2012.11.001</pub-id></mixed-citation></ref>
<ref id="B5"><label>5</label><mixed-citation publication-type="journal"><string-name><surname>Bates</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Kliegl</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Baayen</surname>, <given-names>R. H.</given-names></string-name> (<year>2015</year>). <article-title>Parsimonious mixed models</article-title>. <source>arXiv preprint arXiv:1506.04967</source>.</mixed-citation></ref>
<ref id="B6"><label>6</label><mixed-citation publication-type="journal"><string-name><surname>Chow</surname>, <given-names>W.-Y.</given-names></string-name>, <string-name><surname>Lewis</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Phillips</surname>, <given-names>C.</given-names></string-name> (<year>2014</year>). <article-title>Immediate sensitivity to structural constraints in pronoun resolution</article-title>. <source>Frontiers in Psychology</source>, <volume>5</volume>, <fpage>630</fpage>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2014.00630</pub-id></mixed-citation></ref>
<ref id="B7"><label>7</label><mixed-citation publication-type="journal"><string-name><surname>Christianson</surname>, <given-names>K.</given-names></string-name> (<year>2016</year>). <article-title>When language comprehension goes wrong for the right reasons: Good-enough, underspecified, or shallow language processing</article-title>. <source>Quarterly Journal of Experimental Psychology</source>, <volume>69</volume>(<issue>5</issue>), <fpage>817</fpage>&#8211;<lpage>828</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/17470218.2015.1134603</pub-id></mixed-citation></ref>
<ref id="B8"><label>8</label><mixed-citation publication-type="journal"><string-name><surname>Christianson</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Hollingworth</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Halliwell</surname>, <given-names>J. F.</given-names></string-name>, &amp; <string-name><surname>Ferreira</surname>, <given-names>F.</given-names></string-name> (<year>2001</year>). <article-title>Thematic roles assigned along the garden path linger</article-title>. <source>Cognitive Psychology</source>, <volume>42</volume>(<issue>4</issue>), <fpage>368</fpage>&#8211;<lpage>407</lpage>. DOI: <pub-id pub-id-type="doi">10.1006/cogp.2001.0752</pub-id></mixed-citation></ref>
<ref id="B9"><label>9</label><mixed-citation publication-type="journal"><string-name><surname>Clifton</surname>, <given-names>J. C.</given-names></string-name>, &amp; <string-name><surname>Staub</surname>, <given-names>A.</given-names></string-name> (<year>2008</year>). <article-title>Parallelism and competition in syntactic ambiguity resolution</article-title>. <source>Language and Linguistics Compass</source>, <volume>2</volume>(<issue>2</issue>), <fpage>234</fpage>&#8211;<lpage>250</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/j.1749-818X.2008.00055.x</pub-id></mixed-citation></ref>
<ref id="B10"><label>10</label><mixed-citation publication-type="journal"><string-name><surname>Faul</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Erdfelder</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Lang</surname>, <given-names>A.-G.</given-names></string-name>, &amp; <string-name><surname>Buchner</surname>, <given-names>A.</given-names></string-name> (<year>2007</year>). <article-title>G*power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences</article-title>. <source>Behavior Research Methods</source>, <volume>39</volume> (<issue>2</issue>), <fpage>175</fpage>&#8211;<lpage>191</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/BF03193146</pub-id></mixed-citation></ref>
<ref id="B11"><label>11</label><mixed-citation publication-type="journal"><string-name><surname>Ferreira</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Bailey</surname>, <given-names>K. G.</given-names></string-name>, &amp; <string-name><surname>Ferraro</surname>, <given-names>V.</given-names></string-name> (<year>2002</year>). <article-title>Good-enough representations in language comprehension</article-title>. <source>Current directions in psychological science</source>, <volume>11</volume>(<issue>1</issue>), <fpage>11</fpage>&#8211;<lpage>15</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/1467-8721.00158</pub-id></mixed-citation></ref>
<ref id="B12"><label>12</label><mixed-citation publication-type="journal"><string-name><surname>Ferreira</surname>, <given-names>F.</given-names></string-name>, &amp; <string-name><surname>Henderson</surname>, <given-names>J. M.</given-names></string-name> (<year>1990</year>). <article-title>Use of verb information in syntactic parsing: Evidence from eye movements and word-by-word self-paced reading</article-title>. <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>, <volume>16</volume>(<issue>4</issue>), <fpage>555</fpage>&#8211;<lpage>568</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0278-7393.16.4.555</pub-id></mixed-citation></ref>
<ref id="B13"><label>13</label><mixed-citation publication-type="book"><string-name><surname>Ferreira</surname>, <given-names>F.</given-names></string-name>, &amp; <string-name><surname>Lowder</surname>, <given-names>M. W.</given-names></string-name> (<year>2016</year>). <chapter-title>Prediction, information structure, and good-enough language processing</chapter-title>. In <source>Psychology of learning and motivation</source>, <volume>65</volume>, <fpage>217</fpage>&#8211;<lpage>247</lpage>. <publisher-name>Elsevier</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1016/bs.plm.2016.04.002</pub-id></mixed-citation></ref>
<ref id="B14"><label>14</label><mixed-citation publication-type="journal"><string-name><surname>Ferreira</surname>, <given-names>F.</given-names></string-name>, &amp; <string-name><surname>Patson</surname>, <given-names>N. D.</given-names></string-name> (<year>2007</year>). <article-title>The &#8216;good enough&#8217; approach to language comprehension</article-title>. <source>Language and Linguistics Compass</source>, <volume>1</volume>(<issue>1&#8211;2</issue>), <fpage>71</fpage>&#8211;<lpage>83</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/j.1749-818X.2007.00007.x</pub-id></mixed-citation></ref>
<ref id="B15"><label>15</label><mixed-citation publication-type="journal"><string-name><surname>Ferreira</surname>, <given-names>F.</given-names></string-name>, &amp; <string-name><surname>Yang</surname>, <given-names>Z.</given-names></string-name> (<year>2019</year>). <article-title>The problem of comprehension in psycholinguistics</article-title>. <source>Discourse Processes</source>, <volume>56</volume>(<issue>7</issue>), <fpage>485</fpage>&#8211;<lpage>495</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/0163853X.2019.1591885</pub-id></mixed-citation></ref>
<ref id="B16"><label>16</label><mixed-citation publication-type="journal"><string-name><surname>Grant</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Sloggett</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Dillon</surname>, <given-names>B.</given-names></string-name> (<year>2020</year>). <article-title>Processing ambiguities in attachment and pronominal reference</article-title>. <source>Glossa: A Journal of General Linguistics</source>, <volume>5</volume>(<issue>1</issue>), <fpage>77</fpage>. DOI: <pub-id pub-id-type="doi">10.5334/gjgl.852</pub-id></mixed-citation></ref>
<ref id="B17"><label>17</label><mixed-citation publication-type="journal"><string-name><surname>Greene</surname>, <given-names>S. B.</given-names></string-name>, <string-name><surname>McKoon</surname>, <given-names>G.</given-names></string-name>, &amp; <string-name><surname>Ratcliff</surname>, <given-names>R.</given-names></string-name> (<year>1992</year>). <article-title>Pronoun resolution and discourse models</article-title>. <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>, <volume>18</volume>(<issue>2</issue>), <fpage>266</fpage>. DOI: <pub-id pub-id-type="doi">10.1037/0278-7393.18.2.266</pub-id></mixed-citation></ref>
<ref id="B18"><label>18</label><mixed-citation publication-type="journal"><string-name><surname>Hofmeister</surname>, <given-names>P.</given-names></string-name> (<year>2011</year>). <article-title>Representational complexity and memory retrieval in language comprehension</article-title>. <source>Language and Cognitive Processes</source>, <volume>26</volume>(<issue>3</issue>), <fpage>376</fpage>&#8211;<lpage>405</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/01690965.2010.492642</pub-id></mixed-citation></ref>
<ref id="B19"><label>19</label><mixed-citation publication-type="journal"><string-name><surname>Hofmeister</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2014</year>). <article-title>Distinctiveness and encoding effects in online sentence comprehension</article-title>. <source>Frontiers in Psychology</source>, <volume>5</volume>, <fpage>1237</fpage>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2014.01237</pub-id></mixed-citation></ref>
<ref id="B20"><label>20</label><mixed-citation publication-type="journal"><string-name><surname>Just</surname>, <given-names>M. A.</given-names></string-name>, <string-name><surname>Carpenter</surname>, <given-names>P. A.</given-names></string-name>, &amp; <string-name><surname>Woolley</surname>, <given-names>J. D.</given-names></string-name> (<year>1982</year>). <article-title>Paradigms and processes in reading comprehension</article-title>. <source>Journal of Experimental Psychology: General</source>, <volume>111</volume>(<issue>2</issue>), <fpage>228</fpage>&#8211;<lpage>238</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0096-3445.111.2.228</pub-id></mixed-citation></ref>
<ref id="B21"><label>21</label><mixed-citation publication-type="journal"><string-name><surname>Karimi</surname>, <given-names>H.</given-names></string-name>, &amp; <string-name><surname>Ferreira</surname>, <given-names>F.</given-names></string-name> (<year>2016</year>). <article-title>Good-enough linguistic representations and online cognitive equilibrium in language processing</article-title>. <source>Quarterly Journal of Experimental Psychology</source>, <volume>69</volume>(<issue>5</issue>), <fpage>1013</fpage>&#8211;<lpage>1040</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/17470218.2015.1053951</pub-id></mixed-citation></ref>
<ref id="B22"><label>22</label><mixed-citation publication-type="journal"><string-name><surname>Keuleers</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Brysbaert</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>New</surname>, <given-names>B.</given-names></string-name> (<year>2010</year>). <article-title>SUBTLEX-NL: A new measure for Dutch word frequency based on film subtitles</article-title>. <source>Behavior Research Methods</source>, <volume>42</volume>(<issue>3</issue>), <fpage>643</fpage>&#8211;<lpage>650</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/BRM.42.3.643</pub-id></mixed-citation></ref>
<ref id="B23"><label>23</label><mixed-citation publication-type="webpage"><string-name><surname>Kuznetsova</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Brockhoff</surname>, <given-names>P. B.</given-names></string-name>, &amp; <string-name><surname>Christensen</surname>, <given-names>R. H. B.</given-names></string-name> (<year>2016</year>). <article-title>lmertest: Tests in linear mixed effects models [Computer software manual]</article-title>. Retrieved from <uri>https://CRAN.R-project.org/package=lmerTest</uri> (R package version 2.0-32)</mixed-citation></ref>
<ref id="B24"><label>24</label><mixed-citation publication-type="confproc"><string-name><surname>Laurinavichyute</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>von der Malsburg</surname>, <given-names>T.</given-names></string-name> (<year>2021</year>). <source>Agreement attraction in grammatical sentences arises only in the good-enough processing mode</source>. <conf-name>Paper presented at the 34th Annual CUNY Conference on Human Sentence Processing</conf-name>, <conf-sponsor>University of Pennsylvania</conf-sponsor>.</mixed-citation></ref>
<ref id="B25"><label>25</label><mixed-citation publication-type="journal"><string-name><surname>Lenth</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Singmann</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Love</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Buerkner</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name><surname>Herve</surname>, <given-names>M.</given-names></string-name> (<year>2018</year>). <article-title>Emmeans: Estimated marginal means, aka least-squares means</article-title>. <source>R package version</source>, <volume>1</volume>(<issue>1</issue>), <fpage>3</fpage>.</mixed-citation></ref>
<ref id="B26"><label>26</label><mixed-citation publication-type="journal"><string-name><surname>Logacev</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2016a</year>). <article-title>A multiple-channel model of task-dependent ambiguity resolution in sentence comprehension</article-title>. <source>Cognitive Science</source>, <volume>40</volume>(<issue>2</issue>), <fpage>266</fpage>&#8211;<lpage>298</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/cogs.12228</pub-id></mixed-citation></ref>
<ref id="B27"><label>27</label><mixed-citation publication-type="journal"><string-name><surname>Loga&#269;ev</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2016b</year>). <article-title>Understanding underspecification: A comparison of two computational implementations</article-title>. <source>Quarterly Journal of Experimental Psychology</source>, <volume>69</volume>(<issue>5</issue>), <fpage>996</fpage>&#8211;<lpage>1012</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/17470218.2015.1134602</pub-id></mixed-citation></ref>
<ref id="B28"><label>28</label><mixed-citation publication-type="journal"><string-name><surname>MacDonald</surname>, <given-names>M. C.</given-names></string-name> (<year>1994</year>). <article-title>Probabilistic constraints and syntactic ambiguity resolution</article-title>. <source>Language and Cognitive Processes</source>, <volume>9</volume>(<issue>2</issue>), <fpage>157</fpage>&#8211;<lpage>201</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/01690969408402115</pub-id></mixed-citation></ref>
<ref id="B29"><label>29</label><mixed-citation publication-type="journal"><string-name><surname>McDonald</surname>, <given-names>J. L.</given-names></string-name>, &amp; <string-name><surname>MacWhinney</surname>, <given-names>B.</given-names></string-name> (<year>1995</year>). <article-title>The time course of anaphor resolution: Effects of implicit verb causality and gender</article-title>. <source>Journal of Memory and Language</source>, <volume>34</volume>(<issue>4</issue>), <fpage>543</fpage>&#8211;<lpage>566</lpage>. DOI: <pub-id pub-id-type="doi">10.1006/jmla.1995.1025</pub-id></mixed-citation></ref>
<ref id="B30"><label>30</label><mixed-citation publication-type="webpage"><collab>R Core Team</collab>. (<year>2016</year>). <chapter-title>R: A language and environment for statistical computing [Computer software manual]</chapter-title>. <publisher-loc>Vienna, Austria</publisher-loc>. (<uri>https://www.R-project.org/</uri>)</mixed-citation></ref>
<ref id="B31"><label>31</label><mixed-citation publication-type="journal"><string-name><surname>Rigalleau</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Caplan</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Baudiffier</surname>, <given-names>V.</given-names></string-name> (<year>2004</year>). <article-title>New arguments in favour of an automatic gender pronominal process</article-title>. <source>The Quarterly Journal of Experimental Psychology Section A</source>, <volume>57</volume>(<issue>5</issue>), <fpage>893</fpage>&#8211;<lpage>933</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/02724980343000549</pub-id></mixed-citation></ref>
<ref id="B32"><label>32</label><mixed-citation publication-type="journal"><string-name><surname>Schotter</surname>, <given-names>E. R.</given-names></string-name>, <string-name><surname>Bicknell</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Howard</surname>, <given-names>I.</given-names></string-name>, <string-name><surname>Levy</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Rayner</surname>, <given-names>K.</given-names></string-name> (<year>2014</year>). <article-title>Task effects reveal cognitive flexibility responding to frequency and predictability: Evidence from eye movements in reading and proofreading</article-title>. <source>Cognition</source>, <volume>131</volume>(<issue>1</issue>), <fpage>1</fpage>&#8211;<lpage>27</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.cognition.2013.11.018</pub-id></mixed-citation></ref>
<ref id="B33"><label>33</label><mixed-citation publication-type="journal"><string-name><surname>Stewart</surname>, <given-names>A. J.</given-names></string-name>, <string-name><surname>Holler</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Kidd</surname>, <given-names>E.</given-names></string-name> (<year>2007</year>). <article-title>Shallow processing of ambiguous pronouns: Evidence for delay</article-title>. <source>Quarterly Journal of Experimental Psychology</source>, <volume>60</volume>(<issue>12</issue>), <fpage>1680</fpage>&#8211;<lpage>1696</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/17470210601160807</pub-id></mixed-citation></ref>
<ref id="B34"><label>34</label><mixed-citation publication-type="journal"><string-name><surname>Swets</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Desmet</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Clifton</surname>, <given-names>J. C.</given-names></string-name>, &amp; <string-name><surname>Ferreira</surname>, <given-names>F.</given-names></string-name> (<year>2008</year>). <article-title>Underspecification of syntactic ambiguities: Evidence from self-paced reading</article-title>. <source>Memory &amp; Cognition</source>, <volume>36</volume>(<issue>1</issue>), <fpage>201</fpage>&#8211;<lpage>216</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/MC.36.1.201</pub-id></mixed-citation></ref>
<ref id="B35"><label>35</label><mixed-citation publication-type="journal"><string-name><surname>Traxler</surname>, <given-names>M. J.</given-names></string-name>, <string-name><surname>Pickering</surname>, <given-names>M. J.</given-names></string-name>, &amp; <string-name><surname>Clifton</surname>, <given-names>J. C.</given-names></string-name> (<year>1998</year>). <article-title>Adjunct attachment is not a form of lexical ambiguity resolution</article-title>. <source>Journal of Memory and Language</source>, <volume>39</volume>(<issue>4</issue>), <fpage>558</fpage>&#8211;<lpage>592</lpage>. DOI: <pub-id pub-id-type="doi">10.1006/jmla.1998.2600</pub-id></mixed-citation></ref>
<ref id="B36"><label>36</label><mixed-citation publication-type="journal"><string-name><surname>Van Gompel</surname>, <given-names>R. P.</given-names></string-name>, <string-name><surname>Pickering</surname>, <given-names>M. J.</given-names></string-name>, <string-name><surname>Pearson</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Liversedge</surname>, <given-names>S. P.</given-names></string-name> (<year>2005</year>). <article-title>Evidence against competition during syntactic ambiguity resolution</article-title>. <source>Journal of Memory and Language</source>, <volume>52</volume>(<issue>2</issue>), <fpage>284</fpage>&#8211;<lpage>307</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2004.11.003</pub-id></mixed-citation></ref>
<ref id="B37"><label>37</label><mixed-citation publication-type="book"><string-name><surname>Van Gompel</surname>, <given-names>R. P.</given-names></string-name>, <string-name><surname>Pickering</surname>, <given-names>M. J.</given-names></string-name>, &amp; <string-name><surname>Traxler</surname>, <given-names>M. J.</given-names></string-name> (<year>2000</year>). <chapter-title>Unrestricted race: A new model of syntactic ambiguity resolution</chapter-title>. In <source>Reading as a perceptual process</source>, <fpage>621</fpage>&#8211;<lpage>648</lpage>. <publisher-name>Elsevier</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1016/B978-008043642-5/50029-2</pub-id></mixed-citation></ref>
<ref id="B38"><label>38</label><mixed-citation publication-type="journal"><string-name><surname>Van Gompel</surname>, <given-names>R. P.</given-names></string-name>, <string-name><surname>Pickering</surname>, <given-names>M. J.</given-names></string-name>, &amp; <string-name><surname>Traxler</surname>, <given-names>M. J.</given-names></string-name> (<year>2001</year>). <article-title>Reanalysis in sentence processing: Evidence against current constraint-based and two-stage models</article-title>. <source>Journal of Memory and Language</source>, <volume>45</volume>(<issue>2</issue>), <fpage>225</fpage>&#8211;<lpage>258</lpage>. DOI: <pub-id pub-id-type="doi">10.1006/jmla.2001.2773</pub-id></mixed-citation></ref>
<ref id="B39"><label>39</label><mixed-citation publication-type="journal"><string-name><surname>Van Rij</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Van Rijn</surname>, <given-names>H.</given-names></string-name>, &amp; <string-name><surname>Hendriks</surname>, <given-names>P.</given-names></string-name> (<year>2013</year>). <article-title>How WM load influences linguistic processing in adults: A computational model of pronoun interpretation in discourse</article-title>. <source>Topics in Cognitive Science</source>, <volume>5</volume>(<issue>3</issue>), <fpage>564</fpage>&#8211;<lpage>580</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/tops.12029</pub-id></mixed-citation></ref>
<ref id="B40"><label>40</label><mixed-citation publication-type="webpage"><string-name><surname>Venables</surname>, <given-names>W. N.</given-names></string-name>, &amp; <string-name><surname>Ripley</surname>, <given-names>B. D.</given-names></string-name> (<year>2002</year>). <source>Modern applied statistics with s</source> (<edition>Fourth</edition> ed.). <publisher-loc>New York</publisher-loc>: <publisher-name>Springer</publisher-name>. Retrieved from <uri>http://www.stats.ox.ac.uk/pub/MASS4</uri>(ISBN 0-387-95457-0). DOI: <pub-id pub-id-type="doi">10.1007/978-0-387-21706-2_14</pub-id></mixed-citation></ref>
</ref-list>
</back>
</article>