<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.2" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id journal-id-type="issn">2767-0279</journal-id>
<journal-title-group>
<journal-title>Glossa Psycholinguistics</journal-title>
</journal-title-group>
<issn pub-type="epub">2767-0279</issn>
<publisher>
<publisher-name>eScholarship Publishing</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5070/G60111681</article-id>
<article-categories>
<subj-group>
<subject>Regular article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Searching for semantic distance effects</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-0301-8471</contrib-id>
<name>
<surname>Winkowski</surname>
<given-names>Jan</given-names>
</name>
<email>j.l.winkowski@uu.nl</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0001-9571-4644</contrib-id>
<name>
<surname>Nouwen</surname>
<given-names>Rick</given-names>
</name>
<email>r.w.f.nouwen@uu.nl</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-5337-8432</contrib-id>
<name>
<surname>Dotla&#269;il</surname>
<given-names>Jakub</given-names>
</name>
<email>j.dotlacil@uu.nl</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
</contrib-group>
<aff id="aff-1"><label>1</label>Institute for Language Sciences, Utrecht University, NL</aff>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2024-07-30">
<day>30</day>
<month>07</month>
<year>2024</year>
</pub-date>
<pub-date pub-type="collection">
<year>2024</year>
</pub-date>
<volume>3</volume>
<issue>1</issue>
<elocation-id>18</elocation-id>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2024 The Author(s)</copyright-statement>
<copyright-year>2024</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="https://glossapsycholinguistics.journalpub.escholarship.org/articles/10.5070/G60111681/"/>
<abstract>
<p>Language processing relies on memory. There exists a considerable body of literature on retrieval in <italic>sentence processing</italic> and, in particular, on cases involving recall of syntax-relevant information. There is no reason to doubt, however, that memory is involved in semantic aspects of language processing as well. In this work, we look at the case of additive presuppositions, such as those involved in interpreting the additive particle <italic>too</italic>. When one hears <italic>Mary went to the party, too</italic>, one should recall that someone other than Mary went to the party. We make the case that, as a starting hypothesis, it would be expected that the retrieval of this kind of information should share basic features of memory processes in language with the better-known cases of recall involved in syntactic parsing. In particular, we argue that, given certain assumptions and linking hypotheses, all prominent retrieval theories predict the existence of <italic>distance effects</italic> for the recall of previous information, independent of whether the recall is driven by syntactic or semantic, sentential or inter-sentential, considerations. As the distance increases, so does the difficulty of processing. We test this prediction in four experiments that investigate the role of retrieval in interpreting <italic>too</italic>. Using the Bayesian hierarchical modelling paradigm, we find evidence in two self-paced reading experiments that it takes more time to read sentences with <italic>too</italic> when the distance between the trigger and its antecedent is greater, compared to a baseline that lacks the presupposition trigger. This result shows that theories of the role of memory in language are just as relevant to the domain of discourse interpretation as they are to syntactic parsing. The fact that our evidence was relatively hard to find, however, suggests additionally that there are interesting differences to explore in the future.</p>
</abstract>
</article-meta>
</front>
<body>
<sec>
<title>1. Introduction</title>
<p>In linguistics, a lot of work has focused on studying the properties of syntactic dependencies (<xref ref-type="bibr" rid="B15">Chomsky, 1957</xref>; <xref ref-type="bibr" rid="B79">Tesni&#232;re, 1965</xref>). Research into syntactic dependencies has been accompanied by experimental research in psycholinguistics on the nature and properties of memory systems deployed in the resolution of syntactic dependencies (<xref ref-type="bibr" rid="B34">J&#228;ger et al., 2017</xref>; <xref ref-type="bibr" rid="B49">Lewis &amp; Vasishth, 2005</xref>; <xref ref-type="bibr" rid="B54">McElree et al., 2003</xref>; <xref ref-type="bibr" rid="B86">Van Dyke &amp; McElree, 2011</xref>; <xref ref-type="bibr" rid="B89">Wagers et al., 2009</xref>).</p>
<p>We believe that the reliance on memory does not stop at sentence boundaries. Language users have to remember elements and information spread out through whole discourses to arrive at the correct interpretation of complete texts. In linguistics and philosophy, the properties of discourse dependencies have been investigated in a formally explicit way by branches of formal semantics, such as discourse semantics and dynamic semantics (<xref ref-type="bibr" rid="B4">Asher &amp; Lascarides, 2003</xref>; <xref ref-type="bibr" rid="B32">Heim, 1982</xref>; <xref ref-type="bibr" rid="B37">Kamp, 1981</xref>; <xref ref-type="bibr" rid="B38">Kamp &amp; Reyle, 1993</xref>; <xref ref-type="bibr" rid="B62">Nouwen et al., 2016</xref>). While this research is well established, it has not been supplemented by detailed psycholinguistic investigations into the properties of memory systems used for the resolution of such discourse dependencies to the same extent that properties of memory processes for syntactic dependencies have been studied.</p>
<p>In this article, we investigate the memory processes involved in resolving one particular case of discourse dependencies. We focus on the particle <italic>too</italic>, which signals a dependency that can stretch back many sentences to be resolved (<xref ref-type="bibr" rid="B43">Kripke, 2009</xref>). Consider the following example:</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(1)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>A contestant started opening boxes one by one. The first box was empty. The second box contained a piece of gold. The contestant hesitated before opening the third box. This box was empty, too.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>The particle <italic>too</italic> triggers the presupposition that there was another box apart from <italic>this box</italic> that was empty (for instance, <xref ref-type="bibr" rid="B83">van der Sandt &amp; Geurts, 2001</xref>). Resolving this presupposition is only feasible if readers recall what was mentioned at the beginning of the discourse. Such dependencies, spanning potentially large portions of a discourse, are subject to various structural and non-structural constraints, investigated in the field of semantics (<xref ref-type="bibr" rid="B43">Kripke, 2009</xref>; <xref ref-type="bibr" rid="B78">Szabolcsi, 2017</xref>; <xref ref-type="bibr" rid="B81">van der Sandt, 1992</xref>). Our goal in this article is to investigate how the resolution of this presupposition is affected by the distance between <italic>too</italic> and its antecedent, the information present in the sentence <italic>The first box was empty</italic> in (1). The investigation allows us to test a basic prediction made if we assume that the processes involved in recalling semantic information, potentially across sentence boundaries, share basic features with syntactic sentence processing.</p>
<p>The article is designed in the following way. In the next, section we discuss distance effects that have been found to exist for syntactic retrieval, and we show that, given certain assumptions, we can predict similar effects for the semantic processing of additives like <italic>too</italic>. Sections 3, 4, 5, and 6 present two acceptability judgment experiments and two self-paced reading experiments. Then, in Section 7, we use the Bayesian paradigm to inspect the pooled data from all experiments. The model fit to the pooled data will present evidence that confirms our prediction. Finally, we discuss some important questions raised by these findings.</p>
</sec>
<sec>
<title>2. Distance effects in syntactic and semantic processing</title>
<sec>
<title>2.1 Distance effects in syntactic dependencies</title>
<p>Distance effects (also called <italic>locality effects</italic>) occur when processing complexity is affected by the distance between words that are involved in some kind of dependency. An example of such a dependency is the relation between the head noun and the verb of a relative clause, as <italic>the administrator</italic> &#8211; <italic>supervised</italic> in (2) (based on <xref ref-type="bibr" rid="B31">Grodner &amp; Gibson, 2005</xref>).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(2)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The administrator who the nurse supervised scolded the medic.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The administrator who the nurse who was from the clinic supervised scolded the medic.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>The additional material in (2b), compared to (2a), results in processing difficulties. The difficulty manifests itself as increased reading times in eye-tracking while reading studies and self-paced reading studies (<xref ref-type="bibr" rid="B7">Bartek et al., 2011</xref>; <xref ref-type="bibr" rid="B31">Grodner &amp; Gibson, 2005</xref>; <xref ref-type="bibr" rid="B57">Nicenboim et al., 2016</xref>, among others) or as a decrease in accuracy in studies using the speed-accuracy tradeoff paradigm (e.g. <xref ref-type="bibr" rid="B52">McElree, 2000</xref>).</p>
<p>Assuming a certain view of how memory is involved in language processing (<xref ref-type="bibr" rid="B1">Anderson, 1991</xref>; <xref ref-type="bibr" rid="B50">Lewis et al., 2006</xref>), there are two possible sources for such findings: decay and interference (see also <xref ref-type="bibr" rid="B7">Bartek et al., 2011</xref>). Both of these can play a role in processing. As the distance to the retrieval target grows, the activation of that target decays. At the same time, with greater distance comes an increase in the number of similar elements that compete with the antecedent for retrieval. This competition reduces accuracy, but can also have an impact on speed when faulty resolutions need to be rectified.</p>
<p>Distance effects are accounted for in different ways by various theoretical frameworks. For instance, in dependency locality theory (<xref ref-type="bibr" rid="B30">Gibson, 2000</xref>), the integration cost of connecting some current structure to a previous structure depends directly on the number of discourse entities (discourse referents, events, times) that have been introduced in the meantime.</p>
<p>Another dominant theory of how memory is involved in sentence processing posits that dependents in long-distance dependencies are retrieved via a so-called cue-based mechanism (<xref ref-type="bibr" rid="B34">J&#228;ger et al., 2017</xref>; <xref ref-type="bibr" rid="B49">Lewis &amp; Vasishth, 2005</xref>; <xref ref-type="bibr" rid="B53">McElree, 2001</xref>; <xref ref-type="bibr" rid="B54">McElree et al., 2003</xref>; <xref ref-type="bibr" rid="B61">Nicenboim et al., 2015</xref>; <xref ref-type="bibr" rid="B60">Nicenboim &amp; Vasishth, 2018, and others</xref>). This means that when an item previously seen is needed to parse a dependency, it is retrieved from memory using cues determined by the word being processed. Consider (3) (taken from <xref ref-type="bibr" rid="B85">Van Dyke, 2007</xref>).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(3)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The worker was surprised that the resident who was living near the dangerous neighbor <bold>was complaining</bold> about the investigation.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>When reading the phrase <italic>was complaining</italic>, the parser has to find the subject of the clause (<italic>the resident</italic>) to arrive at the correct interpretation. To retrieve the subject from memory, the parser uses cues. In this way, the search is restricted to the elements that satisfy the constraints introduced by the verb phrase. We can assume that the parser would consider the cues {<italic>noun phrase</italic>}, {<italic>local subject</italic>} and {<italic>animate</italic>}, among others.</p>
<p>In examples like (3), there are several elements that match the cue {<italic>animate</italic>} and {<italic>noun phrase</italic>} without matching the cue {<italic>local subject</italic>}: <italic>the worker</italic>, and <italic>the neighbour</italic> are such cases. Such partially-matching elements are commonly called <italic>distractors</italic>. The interaction of phrases carrying partially matching sets of features with the retrieval process is called <italic>cue interference</italic> and is usually taken as evidence in favour of the cue-based retrieval mechanism (<xref ref-type="bibr" rid="B34">J&#228;ger et al., 2017</xref>). Due to cue interference, distractors can in some environments cause misretrieval and slowdown in reading times at the moment of dependency resolution (e.g. <xref ref-type="bibr" rid="B34">J&#228;ger et al., 2017</xref>; <xref ref-type="bibr" rid="B59">Nicenboim &amp; Vasishth, 2018</xref>).</p>
<p>There is a plethora of research studying the cue-based retrieval mechanism in the syntactic domain (see <xref ref-type="bibr" rid="B34">J&#228;ger et al., 2017</xref>, for an overview and a metaanalysis). This body of work extends also to syntactic phenomena involving retrieval of semantic information, such as the licensing of negative polarity items (<xref ref-type="bibr" rid="B63">Parker &amp; Phillips, 2016</xref>) and to the role of semantic information in resolving syntactic dependencies (e.g. <xref ref-type="bibr" rid="B16">Cunnings &amp; Sturt, 2018</xref>; <xref ref-type="bibr" rid="B46">Laurinavichyute &amp; von der Malsburg, 2022</xref>).</p>
<p>In contrast to this research, the investigation of retrieval mechanisms for discourse dependencies is much more limited. Most studies in psycholinguistics that go beyond the domain of syntax consider pronoun resolution (<xref ref-type="bibr" rid="B45">Kush &amp; Eik, 2019</xref>; <xref ref-type="bibr" rid="B44">Kush &amp; Dillon, 2021</xref>; <xref ref-type="bibr" rid="B73">Schmitz et al., 2024</xref>). Yet, it is clear that there are other dependencies in discourse that could be investigated from this perspective.</p>
<p>In what follows, we will show in a theory-independent way that the processing of <italic>too</italic> relies on memory. This fact will allow us to derive experimental predictions using cue-based theories of memory retrieval. We note here that the predictions we are to discuss and test are likely shared with various other theories of memory use in language comprehension, like dependency locality theory. Our goal here is not to decide which of the theories is right. Rather, we want to show that under quite general conditions on memory, which we will make explicit using the cue-based retrieval model as our illustrative framework, we can establish whether there is a cost to processing <italic>too</italic> due to retrieval.</p>
</sec>
<sec>
<title>2.2 Processing too</title>
<p>In contrast to studies concerned with syntactic dependencies, the phenomenon we are interested in is largely independent of syntax. Yet, the additive presuppositions we focus on in what follows <italic>do</italic> clearly qualify as <italic>dependencies</italic>. So, in this sense, they <italic>are</italic> quite close to what has been studied in the syntactic domain (see also <xref ref-type="bibr" rid="B8">Brasoveanu &amp; Dotla&#269;il, 2020</xref>; <xref ref-type="bibr" rid="B14">Chen &amp; Husband, 2018</xref>, for a similar argument).</p>
<p>To see this, consider a sentence like (4). This triggers a presupposition that is connected to the so-called <italic>associate</italic> of the additive <italic>too</italic>, here assumed to be <italic>Sue</italic>: namely, that someone other than Sue lives in France.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(4)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Sue lives in France, too.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Given this presupposition, we can now demonstrate that additives like <italic>too</italic> are involved in dependencies and that processing them relies on retrieval. <italic>Too</italic> is subject to a so-called <italic>strong contextual felicity constraint</italic> (<xref ref-type="bibr" rid="B80">Tonhauser et al., 2013</xref>), which means that it is infelicitous whenever the presupposition is not satisfied in the preceding context.<xref ref-type="fn" rid="n1">1</xref> Consider the example in (5) (intended as an utterance out of the blue).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(5)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Anne lives in Brussels. Sue lives in France, #too.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>The fact that we can recognise that (5) is infelicitous means that we use our knowledge of the preceding context when we process the additive. To be more specific, our capacity to judge the second sentence in (5) infelicitous after encountering <italic>too</italic> entails that upon encountering the additive, a retrieval process is triggered to match the presupposition. The result of this process is known to have an effect on sentence processing. Schwarz (<xref ref-type="bibr" rid="B74">2007</xref>) shows that reading times are longer in conditions where a trigger lacks an antecedent, compared to a condition where an antecedent is available. Presumably, such effects, as well as the infelicity judgment of (5), are caused by a failure of the match following the retrieval process.</p>
<p>The basis for this retrieval process is information-structural rather than syntactic. There is a general consensus that the associate of the additive is in focus (e.g. <xref ref-type="bibr" rid="B29">Geurts &amp; van der Sandt, 2004</xref>; <xref ref-type="bibr" rid="B69">Ruys, 2015</xref>) or is a contrastive topic (e.g. <xref ref-type="bibr" rid="B41">Krifka, 1998</xref>). The semantic import of bearing focus (and being a contrastive topic) is the activation of alternatives (<xref ref-type="bibr" rid="B42">Krifka, 2008</xref>; <xref ref-type="bibr" rid="B67">Rooth, 1992</xref>). Such alternatives usually trigger an exhaustification implicature, the inference that all alternatives except for the asserted one are false (<xref ref-type="bibr" rid="B23">Fox &amp; Katzir, 2011</xref>; <xref ref-type="bibr" rid="B39">Kiss, 1998</xref>; <xref ref-type="bibr" rid="B67">Rooth, 1992</xref>). For instance, <italic>Sue lives in France</italic> (with the subject bearing focus) triggers an implicature that none of the other contextually relevant individuals (i.e. alternatives to <italic>Sue</italic>) live in France. The additive in (4) overtly cancels that implicature, by presupposing that one of the alternatives <italic>is</italic> true (<xref ref-type="bibr" rid="B5">Bade, 2014</xref>; <xref ref-type="bibr" rid="B41">Krifka, 1998</xref>; <xref ref-type="bibr" rid="B70">S&#230;b&#248;, 2004</xref>).</p>
<p>The additive&#8217;s presupposition can be obtained by looking at the alternative propositions triggered by the associate bearing focus. For instance, focus on <italic>Sue</italic> in the second sentence of (5) creates alternatives like <italic>Peter lives in France, Bob lives in France, Anne lives in France</italic>. The presupposition of <italic>too</italic> says that at least one such alternative is true. In other words, if the presupposition is satisfied textually, then the preceding discourse should entail one of these alternatives. Since <italic>Anne lives in Paris</italic> entails <italic>Anne lives in France</italic>, the presupposition is met in an example like (6), but since <italic>Anne lives in Brussels</italic> does not have that entailment, (5) ends up being infelicitous.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(6)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Anne lives in Paris. Sue lives in France, too.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Given this picture, any proposition entailed in the preceding discourse is a potential antecedent to the additive. Importantly, this means that the number of potential antecedents increases as the preceding discourse gets larger. This is not dependent on the number of preceding clauses, but rather on the number of constituents. Consider, for instance, (7):</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(7)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Last week, Sue gave a 14 year old bottle of wine to John. Sue&#8217;s cat is 14 years old, too. But that&#8217;s a coincidence.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Examples like this show that the addition of any constituent has the potential to be responsible for an entailment that could satisfy some presupposition down the line. Focus on the subject of the second sentence in (7) activates alternatives of the form <italic>X is 14 years old</italic>. The additive triggers the presupposition that one of these alternatives is true, where <italic>X</italic> is not Sue&#8217;s cat. The first sentence entails that the bottle of wine given to John is 14 years old and, as a consequence, the presupposition is satisfied. However, obviously, without the addition of the AP <italic>14 year old</italic> modifying <italic>bottle of wine</italic>, the presupposition would not be met and the discourse in (7) would be infelicitous. This observation has consequences for how we may understand the retrieval process involved in additive presuppositional dependencies. As examples like (7) show, the number of potential dependents (and, so, the number of distractors) increases not just with the number of intervening clauses, but also with the number of intervening constituents.</p>
<p>There are obvious parallels between the retrieval process involved in syntactic dependencies and that involved in presuppositions. Just as the number of syntactic attachment points increases as the sentence proceeds, there is a comparable accumulation of entailments as the discourse progresses. Consequently, just as we expect processing complexity to increase with increased distance within a syntactic dependency, all else being equal, we expect a similar effect of complexity to occur with presuppositions whenever the antecedent entailment is triggered earlier in the discourse. Put simply, if we assume that distance effects are due to inherent constraints on the memory processes involved in language processing, and given that additive presuppositions involve retrieval, we come to expect that the processing of additive presuppositions will display distance effects.</p>
<p>It should be noted, though, that it is possible to satisfy an additive presupposition without identifying a single word or phrase responsible for this entailment. Take the following example:</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(8)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>By Thanksgiving, his spots disappeared, his coat turned a rich, dull, field-mouse brown. He gained weight, abandoned the goat&#8217;s milk for forage. Buck became a fixture on Church Road, bounding alongside Skye when she went jogging, greeting vehicles at the mailbox, startling the FedEx delivery man, and charming neighbors into leaving treats out for him &#8212; a few apples, some oatmeal, a pile of hickory nuts. Billy was changing, too. (taken from Corpus of Contemporary American English, <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.english-corpora.org/coca/">https://www.english-corpora.org/coca/</ext-link>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Clearly the presupposition is satisfied here; the set of sentences in the preceding discourse seems to entail the proposition <italic>Buck was changing</italic> for some relevant meaning of <italic>changing</italic>. However, the word itself does not appear in that discourse. So, in contexts like this, it would be hard to pinpoint a location in the preceding text where the entailment originates. This is not surprising; since additive presuppositions involve dependencies built on entailments, not all such dependencies will be reducible to a dependency between <italic>too</italic> and a word or a phrase.<xref ref-type="fn" rid="n2">2</xref> We purposefully avoid examples like this in our stimuli, which will allow us to match the antecedent of the presuppositional dependency (an entailment) with a position in the preceding text. This will allow us to test the expectation that presuppositional dependencies are subject to distance effects.</p>
<p>This expectation is dependent on presuppositions being triggered rapidly. If presuppositions are slow inferences, the comparison to syntactic processing becomes moot. In the literature, presupposition processing speed has been connected to the question whether presuppositions are more like semantic entailment (<xref ref-type="bibr" rid="B32">Heim, 1982</xref>) or more like pragmatic implicature (e.g. <xref ref-type="bibr" rid="B72">Schlenker, 2008</xref>). This is because, for a while, it was thought that pragmatic inferences are typically slow and costly (e.g. <xref ref-type="bibr" rid="B33">Huang &amp; Snedeker, 2011</xref>, views tend to be more nuanced now, e.g. <xref ref-type="bibr" rid="B17">Degen &amp; Tanenhaus, 2016</xref>). It has proven to be difficult to use acceptance/rejection reaction time studies to compare presupposition and implicature (<xref ref-type="bibr" rid="B13">Chemla &amp; Bott, 2013</xref>; <xref ref-type="bibr" rid="B66">Romoli &amp; Schwarz, 2015</xref>), but there is clear evidence from other paradigms that presuppositions are not delayed. Using the visual world paradigm, Romoli et al. (<xref ref-type="bibr" rid="B65">2015</xref>); Schwarz (<xref ref-type="bibr" rid="B75">2015</xref>) show that fixation shifts to pictures representing potential antecedents of additive presupposition triggers very soon after such a trigger is encountered. This suggests that the retrieval process is started rapidly. That is, antecedent retrieval is like the rapid processing of syntactic information. As a result, comparison with the resolution of syntactic dependencies makes sense from a processing point of view.</p>
<p>At the same time, there are also obvious differences between retrieval of syntactic material and retrieval involving semantics. It is much less evident what kind of cues could guide the retrieval of entailments. Entailments do not always coincide with linguistic events, such as the processing of a word or constituent. In this sense, it is harder to translate them into cues than, for instance, connecting a phrase to the feature {<italic>subject</italic>}. Still, such connections <italic>are</italic> proposed in representational theories of (discourse) semantics, most prominently in Discourse Representation Theory (DRT; <xref ref-type="bibr" rid="B37">Kamp, 1981</xref>; <xref ref-type="bibr" rid="B38">Kamp &amp; Reyle, 1993</xref>). In DRT, an incremental semantic representation of the discourse is formed as the discourse progresses. The ensuing representations impose constraints on what information is accessible where in the discourse. As such, it offers a representational basis for semantic notions like entailment and, therefore, it could potentially act as a framework for imagining what a cue-based mechanism for presupposition retrieval looks like. In fact, DRT has been an influential framework for theories of presupposition (e.g. <xref ref-type="bibr" rid="B28">Geurts, 1999</xref>; <xref ref-type="bibr" rid="B81">van der Sandt, 1992</xref>).</p>
</sec>
<sec>
<title>2.3 Predictions</title>
<p>The items used in the experiments below allow us to simplify, to some extent, the complex picture introduced above. Consider, for instance, (9), which is a stimulus from Experiment 1.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(9)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a dancer and the waiter, who is a great boxer from southern Amsterdam, dances too, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>When encountering the verb <italic>dances</italic> followed by the additive <italic>too</italic>, the parser needs to find the information that makes the additive licensed. In particular, some part of the discourse (representation) needs to be found that entails <italic>X dances</italic> for something other than <italic>X</italic> = <italic>the waiter</italic>. The fully matching part of the discourse is <italic>The cook is a dancer</italic>, since this clause entails that the cook dances, but between that and the additive there are many constituents that introduce predicate-argument propositions that fail to match the presupposition. For instance, <italic>a great boxer</italic> has the same subject as <italic>dances</italic> and is thus unsuitable. The same goes for <italic>from southern Amsterdam</italic>. Both these predicates also fail to subsume the property of <italic>dancing</italic>, which is another reason why they are unsuitable antecedents. Still, they do qualify as distractors, since they introduce entailments (that the waiter is a great boxer, that the waiter is from southern Amsterdam) that potentially could satisfy upcoming presuppositions. This means we could simplify things by matching distractors to <italic>predicates</italic> appearing somewhere else in the text. Such predicates can be realised as a VP, an appositive noun, a PP, etc.<xref ref-type="fn" rid="n3">3</xref> What matters is the semantic content present in the discourse. We assume that {<italic>predicate</italic>} can be used as a cue when searching for a relevant item to retrieve. Since predicates are ubiquitous in language, we believe that this makes it theoretically possible to investigate whether the cue-based retrieval mechanism is a plausible theory of the on-line interpretation of additives. Under this theory, we expect interference due to the number of predicates, which can be approximated by the number of words intervening between the antecedent and the additive. Based on this assumption, the cue-based retrieval mechanism is compatible with the observation of more errors and/or slower reading times as the number of words intervening between the antecedent and the additive is increased.</p>
<p>Let us spell out the prediction in more detail. Assuming that predicates act as distractors, they interfere with the retrieval process. The distractors could be retrieved erroneously. If readers need to correct the misretrieval by attempting retrieval again, or they notice that the information retrieved does not match what they wanted to retrieve and end up with an infelicitous discourse, an increase in their reading time is expected (<xref ref-type="bibr" rid="B59">Nicenboim &amp; Vasishth, 2018</xref>). Alternatively, according to some theories of cue-based retrieval, the presence of distractors should lower the activation of the antecedent, which is consequently harder to retrieve, so its retrieval should take more time (<xref ref-type="bibr" rid="B34">J&#228;ger et al., 2017</xref>; <xref ref-type="bibr" rid="B49">Lewis &amp; Vasishth, 2005</xref>). Either assumption would derive a slowdown when more words intervene between the antecedent and <italic>too</italic>. However, even outside of the cue-based retrieval mechanism, we predict an interfering role for intervening words. Under models of memory in which memory traces decay over time (<xref ref-type="bibr" rid="B9">Brown et al., 2007</xref>, base-level activation; <xref ref-type="bibr" rid="B3">Anderson &amp; Lebiere, 1998</xref>), a similar prediction is made. Since increasing the number of words between the antecedent and the additive should increase the time elapsed, it is expected that the antecedent should be harder to retrieve when more words intervene.</p>
<p>In summary, given the assumptions about the various ways retrieval processes work in language processing and given that presupposition resolution involves retrieval, just as syntactic dependency resolution does, we predict that the greater the distance between the trigger and the antecedent, the more difficult it will be to retrieve the antecedent. This difficulty should manifest itself in longer reading times observed on the relevant regions. Besides reading times, we also examine whether the predicted difficulty will manifest itself as diminished acceptability of sentences with greater distance between the trigger and the antecedent.</p>
</sec>
</sec>
<sec>
<title>3. Experiment 1: Acceptability judgment task</title>
<sec>
<title>3.1 Design</title>
<p>The first experiment was designed to test whether linguistic distance has an influence on the perceived acceptability of sentences with presupposition carried by an additive <italic>too</italic>.</p>
<p>It is usually assumed that processing difficulty has an influence on acceptability (<xref ref-type="bibr" rid="B55">Miller &amp; Chomsky, 1963</xref>, <xref ref-type="bibr" rid="B77">Staum Casasanto, Hofmeister, &amp; Sag, 2010</xref> and many others). The relation between the two is not so clear when it comes to distance (locality) effects. For example, Konieczny (<xref ref-type="bibr" rid="B40">2000</xref>) reports conflicting results from acceptability judgments and self-paced reading experiments on locality effects. This suggests that, in this particular domain, processing difficulty and acceptability are not connected in a straightforward way. We decided to collect acceptability judgment data as a sanity check and a first step in investigating the processing of <italic>too</italic>. In case distance effects were observed in the acceptability judgment task, we would have evidence that distance effects play a role even in off-line measures. If we do not observe such effects, the natural next step is to investigate on-line processing of <italic>too</italic>.</p>
<p>We compared sentences like (10a), in which an additive presupposition is triggered by <italic>too</italic>, with sentences like (10b), where there was no such trigger and no presupposition.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(10)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a <bold>dancer</bold> and the waiter dances <bold>too</bold>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a <bold>swimmer</bold> and the waiter dances <bold>often</bold>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Apart from varying the presence or absence of a presupposition trigger, we also manipulated the length of the sentences used. This varied the distance between the presuppositional antecedent and the trigger, in a way designed to help us observe the differences in acceptability (or reading times) between the trigger/no trigger conditions.</p>
<p>In the experiment, four levels of distance (labelled 0, 4, 8, or 12 for extensions of 0 words, 4&#8211;5 words, 8&#8211;10 words, or 12&#8211;15 words; 4 words change between the conditions at minimum) were crossed with two levels of trigger (<italic>too/nil</italic> &#8211; for trigger present or no trigger, respectively). This results in a within-participants design with 4 &#215; 2 = 8 conditions. The distance factor was manipulated by increasing the number of words between the trigger site and the antecedent. The trigger factor was manipulated by replacing the trigger with a non-triggering word, usually an adverb, and aligning the sentence meaning with this change. This usually involved changing the nominal predicate to a different noun. An example of an item is shown below.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p><italic>too</italic></p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>&#160;</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>0</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a dancer and the waiter dances too, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>4</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a dancer and the waiter, who is a boxer, dances too, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>8</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a dancer and the waiter, who is a great boxer from southern Amsterdam, dances too, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>12</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a dancer and the waiter, who is a great lightweight boxer from east parts of southern Amsterdam, dances too, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p><italic>nil</italic></p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>&#160;</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>0</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a swimmer and the waiter dances often, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>4</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a swimmer and the waiter, who is a boxer, dances often, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>8</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a swimmer and the waiter, who is a great boxer from southern Amsterdam, dances often, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>12</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a swimmer and the waiter, who is a great lightweight boxer from east parts of southern Amsterdam, dances often, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>The target items were sentences consisting of two parallel clauses. In the first clause, some property was asserted of the subject using a noun (e.g. <italic>dancer</italic> in (<italic>too</italic>)). In the second clause, this property was recalled using a verb followed by <italic>too</italic> (e.g. <italic>dances too</italic> in (<italic>too</italic>)). In the <italic>nil</italic> condition, the noun in the first clause was unrelated to the verb in the second clause, and <italic>too</italic> was replaced by an adverb (e.g., <italic>swimmer</italic> and <italic>dances often</italic> in (<italic>nil</italic>)).</p>
<p>We constructed 32 items of this kind. The target items were combined with 22 filler items. The fillers were grammatical sentences of similar length to the target items. Additionally, we used 10 items as controls. Their purpose was to check on the engagement of the participants. These were sentences that clearly elicit judgments of unacceptability, because they violate some syntactic or semantic constraint. All the stimuli are available in the supplementary material.</p>
</sec>
<sec>
<title>3.2 Participants</title>
<p>32 participants (female = 18 (56%), male = 12 (38%), other/don&#8217;t want to say = 2 (6%)) were recruited using Prolific (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.prolific.co">https://www.prolific.co</ext-link>). Participants were compensated 1.97 USD for taking part, which was calculated assuming that the experiment should take around 10 minutes to complete, and given that the effective minimum wage in the United States was 11.80 USD at the time of the experiment <inline-formula>
<alternatives>
<mml:math id="eq001-mml">
<mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mstyle scriptlevel='+1'><mml:mfrac><mml:mn>1</mml:mn><mml:mn>6</mml:mn></mml:mfrac></mml:mstyle><mml:mo>&#x2009;</mml:mo><mml:mo>&#x2009;</mml:mo><mml:mo>&#x00D7;</mml:mo><mml:mo>&#x2009;</mml:mo><mml:mo>&#x2009;</mml:mo><mml:mn>11.80</mml:mn><mml:mo>&#x2009;</mml:mo><mml:mo>&#x2009;</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x2009;</mml:mo><mml:mo>&#x2009;</mml:mo><mml:mn>1.97</mml:mn><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:math>
<tex-math id="M1">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
({\textstyle{1 \over 6}}\,\, \times \,\,11.80\,\, = \,\,1.97)
\]
\end{document}
</tex-math>
<graphic xlink:href="glossapx-3-1-1681-e1.gif"/>
</alternatives>
</inline-formula>. The participants were self-reported English native speakers and self-reported US nationals.</p>
</sec>
<sec>
<title>3.3 Procedure</title>
<p>The experiment was conducted using Ibex.<xref ref-type="fn" rid="n4">4</xref> The 31 item sets were distributed across 8 lists in a Latin Square design, and participants were assigned at random to each list.<xref ref-type="fn" rid="n5">5</xref> Each of the target sentences was interspersed with either a filler item or a control item. The order of items across lists was randomized. The experiment was run from the participants&#8217; internet browser.</p>
<p>Before starting the experiment, there was an introduction containing a consent form, a question regarding native language (or languages), and a question about the participant&#8217;s gender with three possible answers: <italic>male, female, other/don&#8217;t want to say</italic>. After the introduction, there was an explanation of the task, followed by a training session with two training items.</p>
<p>During the actual task, participants saw a sentence in the middle of the screen. Below the sentence was a question asking: <italic>How does the sentence sound?</italic> The question was accompanied by a 5-point scale labelled <italic>unnatural</italic> near <italic>1</italic> on the left, and <italic>natural</italic> near <italic>5</italic> on the right. On the top of the screen, a progress bar was displayed. After choosing a score either with a mouse cursor or with a keyboard, the participant saw a screen with the message: <italic>Please wait for the next sentence</italic>, and after one second, they were presented with the next item.</p>
</sec>
<sec>
<title>3.4 Predictions</title>
<p>There are two specific predictions for this experiment. First, we predict that longer items will be judged as worse than the shorter ones. This prediction is in line with the findings of Dillon et al. (<xref ref-type="bibr" rid="B19">2014</xref>), who found that longer restrictive relative clauses are judged as worse than shorter ones.</p>
<p>The prediction which is of greater importance to us, concerns the difference in processing difficulty between the sentences with <italic>too</italic> and without <italic>too</italic>. We predict that increasing distance in the sentences with <italic>too</italic> will make them more difficult to process. However, if we test this prediction only on sentences with <italic>too</italic>, we cannot be sure if this difficulty is not just a general effect of the greater distance, independent of <italic>too</italic>. To overcome this difficulty, we are comparing sentences with <italic>too</italic> to sentences without <italic>too</italic>, and we will look for an interaction of the trigger with distance. The difference in difficulty between the long and short sentences should be larger for sentences with <italic>too</italic> than for those without it. We expect this difficulty to be visible as a difference in ratings across relevant conditions. That is, if distance effects in retrieval are visible in an off-line study, we predict a larger decrease in ratings across distance in the <italic>too</italic> conditions than in the <italic>nil</italic> conditions.</p>
</sec>
<sec>
<title>3.5 Data analysis</title>
<p>After visual inspection, two participants were removed from the analysis, since their median response to the control sentences was above 3, indicating that they treated purposefully unnatural sentences as quite natural.</p>
<p>A Bayesian hierarchical model was fit to the data obtained. In Bayesian data analysis, one specifies the likelihood, and the prior distributions over the parameters of interest (<xref ref-type="bibr" rid="B26">Gelman et al., 2003</xref>; <xref ref-type="bibr" rid="B27">Gelman &amp; Hill, 2006</xref>; <xref ref-type="bibr" rid="B51">McElreath, 2018</xref>; <xref ref-type="bibr" rid="B58">Nicenboim et al., 2021</xref>). The analysis results in posterior probability distributions of plausible values for a given model and data. We report means and 89% credible intervals (CRI), i.e. the range of values for which we can be 89% certain that the true effect lies therein. The mean and the range of credible intervals are supposed to convey the shape of the posterior distribution. We specifically avoid using 95% credible intervals to discourage the readers from conducting unconscious hypothesis tests (see also <xref ref-type="bibr" rid="B51">McElreath, 2018, p. 58</xref>).</p>
<p>We used ordinal regression models, which are particularly well suited to the analysis of discrete data (<xref ref-type="bibr" rid="B12">B&#252;rkner &amp; Vuorre, 2019</xref>; <xref ref-type="bibr" rid="B88">Ver&#237;ssimo, 2021</xref>). In particular, we used a cumulative model with a logit link function and so-called flexible thresholds (i.e. the estimated distance between the different scores can vary). Models used 4 sampling chains, with 4,000 samples drawn from each chain. Half of these samples were discarded for warm-up; hence each model had 8,000 samples available for the analysis. Trace plots and plots of posterior predictive distribution were visually inspected to identify convergence issues. Additionally, only the models with all &#710;<italic>R</italic> &#8804; 1.01, which suggests convergence, were used in the analyses.</p>
<p>All the models used were fit with a full variance-covariance matrix, i.e. they were so-called maximal models. The analysis was conducted in R software for statistical computing (<xref ref-type="bibr" rid="B64">R Core Team, 2021</xref>), and, in particular, with the use of the brms package (<xref ref-type="bibr" rid="B11">B&#252;rkner, 2017</xref>), which uses the Stan probabilistic language (<xref ref-type="bibr" rid="B76">Stan Development Team, 2021</xref>).</p>
<p>The distance, levels of which were labelled with numbers closely matching the number of words between the trigger and its antecedent, was treated as a numeric variable which was scaled and centered before fitting the model. The trigger was sum-contrast coded (<italic>nil</italic> = &#8211;1, <italic>too</italic> = 1). Fixed effects of interest were: distance, trigger and their interaction. Participants and items were used as random effects.</p>
<p>We used regularising prior distributions for slopes and intercepts. For the intercept, we used a normal distribution with <italic>&#956;</italic> = 0 and <italic>&#963;</italic> = 2. This means that any of the intercepts (i.e. the thresholds for different Likert scale outcomes) should fall with 89% probability between 0.04 and 0.96 on the probability scale. For the slopes, we used a normal distribution with <italic>&#956;</italic> = 0 and <italic>&#963;</italic> = 0.5.</p>
<p>This assumes that if we fix the intercept at 0 log-odds (i.e. 0.5 on the probability scale), and the input at 1, then the values of the outcome will be, with 89% probability, between 0.69 and 0.31 on the probability scale. Or, to put it differently, the priors used for the slopes mean that when the intercept is at 0.5 probability, we assume, with 89% probability, that the slope coefficient will change the outcome by at most 0.197 on the probability scale. The priors were additionally examined using prior predictive checks.</p>
<p>Note that the size of the effect depends on the intercept, because of the logit link function. We report the results on the log-odds scale.</p>
<p>The 89% credible intervals for the slopes of interest are summarised in <xref ref-type="fig" rid="F1">Figure 1</xref>.</p>
<fig id="F1">
<caption>
<p><bold>Figure 1:</bold> The posterior probability distribution of the parameters of interest in Experiment 1. The thicker lines delimit 50% probability intervals, the thinner lines delimit 89% probability intervals. The point signifies the mean value.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-1681-g1.png"/>
</fig>
<p>The estimate of the distance parameter indicates that the longer items were judged as less natural (89% CrI = (&#8211;0.4, &#8211;0.2), mean = &#8211;0.3, sd = 0.08).</p>
<p>In the case of the trigger, the sign of the estimate is rather uncertain, with the 89% (&#8211;0.09, 0.18) interval lying on the both sides of 0. The mean (= 0.04, sd = 0.09) is positive (i.e. the sentences with <italic>too</italic> were judged as slightly more natural), but the spread of the posterior distribution indicates that the sign of estimate is not conclusive.</p>
<p>For our purposes, the most interesting parameter is the slope of the interaction. The posterior distribution of the slope of the interaction (mean = &#8211;0.07, sd = 0.08) has the 89% credible interval (&#8211;0.2, 0.05), that is, we can be 89% certain that the value of the interaction lies in this interval. The interval is more negative than positive, but the fact that it spans zero suggests that zero (no effect) is a possible value for the interaction term.</p>
<p>In sum, there is a clear effect of distance (longer items are less natural), but the model does not show that the presence of the trigger and the interaction have an effect on acceptability.</p>
</sec>
</sec>
<sec>
<title>4. Experiment 2: Self-paced reading</title>
<sec>
<title>4.1 Design</title>
<p>In the self-paced reading task, we used a subset of the conditions and items from the acceptability judgment task. These were chosen based on our intuition, i.e. one of the authors went through all the items and selected the ones they thought sounded good. Some of the remaining items which we thought could be improved were edited for the self-paced reading experiment. The exact changes between the items can be seen at the OSF repository.</p>
<p>With respect to the experimental manipulation, two levels of distance (<italic>short</italic> and <italic>long</italic>, originally 0 and 8) were crossed with two levels of trigger (<italic>too</italic> and <italic>nil</italic>), which resulted in a 2 &#215; 2 within-participant design. We used 24 target items in the self-paced reading task. These were accompanied by 25 filler items.</p>
</sec>
<sec>
<title>4.2 Participants</title>
<p>82 participants (female = 44 (54%), male = 37 (45%), other/don&#8217;t want to say = 1 (1%)) were recruited using Prolific (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.prolific.co">https://www.prolific.co</ext-link>). They were compensated 3.00 GBP for taking part in the experiment. The experiment took, on average, 12 minutes and 38 seconds to finish; therefore, the participants would get 14.25 GPB per hour, on average. The participants were self-reported English native speakers.</p>
</sec>
<sec>
<title>4.3 Procedure</title>
<p>The experiment was conducted in a noncumulative self-paced reading moving window paradigm (<xref ref-type="bibr" rid="B36">Just et al., 1982</xref>): during reading, only one word appeared and participants had to press the space bar to reveal the next word and hide the previous word. In the initial position, every word was masked by a line which was roughly equal to the word&#8217;s length. When the words disappeared, they went back to the initial position. A comprehension question followed each item.</p>
</sec>
<sec>
<title>4.4 Predictions</title>
<p>As was the case for the acceptability judgment experiment, there are also two specific predictions for this experiment. The first prediction follows from the observed relationship between reading speed and distance. It is commonly observed (<xref ref-type="bibr" rid="B18">Demberg &amp; Keller, 2008</xref>; <xref ref-type="bibr" rid="B22">Ferreira &amp; Henderson, 1993</xref>; and others) that readers speed up with the length of a sentence. Therefore, we predict that the relevant regions in the longer sentences will be read faster than in the shorter sentences.</p>
<p>The prediction which is of greater importance to us concerns the difference in processing difficulty between the sentences with <italic>too</italic> and without <italic>too</italic>. We predict that increasing distance in the sentences with <italic>too</italic> will make them more difficult to process. Following the reasoning from the previous experiment, we should note that if we test this prediction only on sentences with <italic>too</italic>, we cannot distinguish the difficulty due to retrieval from the general effect of longer texts on reading. To overcome this, we are comparing sentences with <italic>too</italic> to sentences without <italic>too</italic>, and we will look for an interaction. The difference in difficulty between the long and short sentences should be larger for sentences with <italic>too</italic> than for those without it. We expect this difficulty to be visible as a difference in reading times across relevant conditions. That is, we predict that compared to the effect of distance in the <italic>nil</italic> conditions, there should be a slowdown effect due to distance in the <italic>too</italic> conditions.</p>
</sec>
<sec>
<title>4.5 Data analysis</title>
<p>Before the analysis, we investigated the quality of the responses to the comprehension questions. Out of 82 participants, 1 with the worst ratio (&#8804; 0.7 correct) of correct responses to the comprehension questions was removed from the results. The median correct response ratio among the remaining participants was equal to 0.9.</p>
<p>Additionally, time measurements which were under 50 ms or above 3,000 ms were removed from the results. Similar data trimming was done, for example, by Futrell et al. (<xref ref-type="bibr" rid="B25">2021</xref>). This resulted in the removal of a further 0.2% of the data.</p>
<p>Next, we fitted a Bayesian mixed-effects model to the response time data. This model assumed log-normal likelihood, since reading time data are approximately log-normally distributed and, therefore, our model can resemble the data-generating process more closely (see also <xref ref-type="bibr" rid="B57">Nicenboim et al., 2016</xref>, <xref ref-type="bibr" rid="B60">2018</xref>; <xref ref-type="bibr" rid="B68">Rouder et al., 2008</xref>; for a more detailed discussion). Besides the choice of priors and likelihood, the details of model fitting were similar to those in the analysis of the acceptability judgment task. We analyzed the critical region (the word <italic>too</italic> and the corresponding word in the <italic>nil</italic> condition), the post-critical region (the word immediately following the critical region), and the post-post-critical region (two words after the critical region). The experimental factor <italic>trigger</italic> was coded using sum contrasts, with the <italic>nil</italic> condition as &#8211;1 and the <italic>too</italic> condition as 1. The experimental factor <italic>distance</italic> was also coded with sum contrasts. The <italic>long</italic> condition was coded as 1 and the <italic>short</italic> condition as &#8211;1. Both of these variables were entered into the model as fixed effects. Participants and items were entered as random effects.</p>
<p>Besides models quantifying the effects of interest on the entirety of the data, we also fit so-called nested models to quantify the effect of distance for each of the trigger conditions. These models were implemented by specifying two predictors: too_distance and nil_distance. For too_distance, we coded the conditions where the trigger was <italic>too</italic> and distance was <italic>long</italic> as 1, the conditions where the trigger was <italic>too</italic> and the distance was <italic>short</italic> as &#8211;1, and the <italic>nil</italic> conditions as 0. Analogously, for nil_distance, the conditions where the trigger was <italic>nil</italic> and the distance was <italic>long</italic> were coded as 1, the conditions where the trigger was <italic>nil</italic> and the distance was <italic>short</italic> were coded as &#8211;1, and the <italic>too</italic> conditions were coded as 0. Both of these predictors were then used to fit varying slopes and varying intercepts models. These models provide a comparison of the effect of distance separately for each trigger condition. They used 4 sampling chains, with 6,000 samples drawn from each chain. 2,000 samples from each chain were discarded for warm-up; hence each model had 16,000 samples available for the analysis.</p>
<p>For the intercept, we used a normal distribution with <italic>&#956;</italic> = 6 and <italic>&#963;</italic> = 1.5. This means that the intercept (i.e. the grand mean in our models) is assumed to be between 36.7 ms and 4435 ms, with 89% probability. For the slopes, we used a normal distribution with <italic>&#956;</italic> = 0 and <italic>&#963;</italic> = 0.1. This assumes that if we fix the intercept at 6 (which is <italic>e</italic><sup>6</sup> = 403.4 ms), and the input at 1, the output will be between 473.3 and 343.8 ms, with 89% probability. Note that the degree to which slope coefficients influence the outcome depends on the intercept, because of the log-normal likelihood.</p>
<p>If the prior distribution of the slopes seems too constraining, one should consider that it is in line with the estimates of the meta-analysis of interference effects studies (<xref ref-type="bibr" rid="B34">J&#228;ger et al., 2017</xref>). In that meta-analysis, the estimate of the largest relevant effect (p. 328, Table 4: target mismatch configurations, <italic>AND/OR</italic> prominence effect) was 39.3 ms (95% CrI (10.8, 67.9)). One can use the credible interval to guide the selection of the prior distribution for the standard deviation. Following Schad et al. (<xref ref-type="bibr" rid="B71">2021</xref>), we come up with an estimate for the <italic>&#963;</italic> parameter on the log scale equal to 0.017. Since we cannot be sure whether effects related to the processing of semantic/pragmatic dependencies are of a similar order of magnitude, we settle on a much more uncertain prior of 0.1. Furthermore, we evaluated the prior distributions with prior predictive checks. The visual summaries of the checks are available at the OSF repository. As a prior distribution for both the standard deviations of the random effects, and for the residual standard deviation, we used a truncated normal with <italic>&#956;</italic> = 0 and <italic>&#963;</italic> = 1. For the random effects correlation between the intercept and the slope, we used the LKJ distribution (<xref ref-type="bibr" rid="B48">Lewandowski et al., 2009</xref>; <xref ref-type="bibr" rid="B76">Stan Development Team, 2021</xref>) with <italic>&#951;</italic> = 2.</p>
<p>The results are reported on the log-ms scale.</p>
</sec>
<sec>
<title>4.6 Results</title>
<p>The descriptive summaries of the reading times across conditions from one word before to two words after the critical region (the additive <italic>too</italic> and the corresponding word in the <italic>nil</italic> condition) are shown on <xref ref-type="fig" rid="F2">Figure 2</xref>.</p>
<fig id="F2">
<caption>
<p><bold>Figure 2:</bold> Descriptive summary of reading times in Experiment 2. The points connected by dashed lines denote mean reading times across conditions. These were computed by taking an average of all the data points for each condition, i.e. without grouping by participants or items. The bars represent bootstrapped 0.95 confidence intervals.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-1681-g2.png"/>
</fig>
<p>Let us turn to the findings of the Bayesian mixed-effects model. First, we discuss the results for the critical region (i.e. <italic>too</italic> or the parallel word), then the post-critical region (i.e. the word after the <italic>too</italic> or the parallel word), and finally the post-post-critical region (i.e. the two words after <italic>too</italic> or the parallel word).</p>
<p>For the critical region, the main effect of trigger was &#8211;0.02 log-ms (89% CrI: (&#8211;0.04, &#8211;0.007)), which suggests that there was a difference between the <italic>nil</italic> and <italic>too</italic> conditions, in particular, the <italic>nil</italic> condition took longer to read than the <italic>too</italic> condition. The main effect of distance was &#8211;0.02 log-ms (89% CrI: (&#8211;0.04, &#8211;0.008)), which means that the participants read the word faster when the distance increased. The interaction effect was 0.003 log-ms (89% CrI: (&#8211;0.01, 0.015)). The mean value of the interaction being very close to 0 and the spread of the CrI across both positive and negative values suggest that its direction is inconclusive.</p>
<p>Using the nested model, we further inspect the effect of distance in the <italic>too</italic> and the <italic>nil</italic> conditions. Within the <italic>too</italic> condition, the effect of distance is &#8211;0.02 log-ms (89% CrI: (&#8211;0.04, &#8211;0.001)), which means that the <italic>long</italic> condition was read faster than the <italic>short</italic> condition. The effect of distance within the <italic>nil</italic> condition was &#8211;0.03 log-ms (89% CrI: (&#8211;0.05, &#8211;0.006)), which also means that the <italic>long</italic> condition was read faster than the <italic>short</italic> condition.</p>
<p><xref ref-type="fig" rid="F3">Figure 3</xref> shows the posterior probability distribution of the relevant parameters in the post-critical region. The main effect of trigger was &#8211;0.02 log-ms (89% CrI: (&#8211;0.04, &#8211;0.004)), which again suggests that there was a difference between the <italic>nil</italic> and <italic>too</italic> conditions, and that the <italic>nil</italic> condition took a little longer to read than the <italic>too</italic> condition. The main effect of distance was &#8211;0.02 log-ms (89% CrI: (&#8211;0.03, &#8211;0.004)), which suggests that participants read the word faster when the distance increased. The interaction effect was 0.02 log-ms (89% CrI: (0.005, 0.031)). The positive interval shows that the negative difference in distance, when comparing the <italic>long</italic> and <italic>short</italic> conditions, was diminished and maybe even reversed in the <italic>too</italic> condition.</p>
<fig id="F3">
<caption>
<p><bold>Figure 3:</bold> The posterior probability distribution of the parameters of interest at the post-critical region, in Experiment 2. The thicker lines delimit 50% probability intervals, the thinner lines delimit 89% probability intervals. The point signifies the mean value. The plot on the left shows main effects and the interaction, the plot on the right shows effects of distance nested within a given trigger level.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-1681-g3.png"/>
</fig>
<p>In the nested model, the effect of distance within the <italic>too</italic> condition was 0.001 log-ms (89% CrI: (&#8211;0.02, 0.02)), that is, there was no difference in reading times between the conditions. In the same model, the effect of distance within the <italic>nil</italic> condition was &#8211;0.04 log-ms (89% CrI: (&#8211;0.06, &#8211;0.02)), which means than the <italic>long</italic> condition was read faster than the <italic>short</italic> condition.</p>
<p>Finally, for the post-post-critical, region the main effect of trigger was &#8211;0.01 log-ms (89% CrI: (&#8211;0.03, &#8211;0.002)) which, as in the previous regions, suggests that the <italic>nil</italic> condition took longer to read than the <italic>too</italic> condition. The main effect of distance was 0.005 log-ms (89% CrI: (&#8211;0.007, 0.02)), which suggests almost no distance effect. The interaction effect was positive, just as in the post-critical region, even though the credible interval included zero this time (mean 0.008 log-ms, 89% CrI: (&#8211;0.003, 0.02)).</p>
<p>In the nested model, we see that the effect of distance within the <italic>too</italic> condition was 0.01 log-ms (89% CrI: (&#8211;0.005, 0.03)), which means that the <italic>short</italic> condition was read faster than the <italic>long</italic> condition, even though the null effect is not excluded among possible values in the credible interval. The effect of distance within the <italic>nil</italic> condition was &#8211;0.003 log-ms (89% CrI: (&#8211;0.018, 0.012)), which means that there was no difference in reading times between the conditions.</p>
</sec>
<sec>
<title>4.7 Discussion</title>
<p>The posterior distribution of the trigger was (in the 89% credible interval) negative in the post-critical region. This suggests that the <italic>nil</italic> condition took longer to read than the <italic>too</italic> condition, possibly because the trigger was more predictable than the corresponding element in the <italic>nil</italic> condition (an adverb). The main effect of distance was also negative in the same region, which suggests that the post-critical region was read faster when the distance was greater. Crucially, the effect of distance interacted with the trigger: there was a positive interaction in the post-critical region. As we mentioned, one way to interpret the positive interaction is that it shows that the negative effect of distance is diminished and maybe even reversed in items with <italic>too</italic>. The pattern of the interaction is easier to understand when we turn to the nested model, shown in <xref ref-type="fig" rid="F3">Figure 3</xref>. We see that there was a speed-up in reading due to distance in the <italic>nil</italic> condition, but no such effect in the <italic>too</italic> condition. In other words, the interaction comes about due to the interplay of two effects: the speed-up effect due to distance in the <italic>nil</italic> condition (longer items are read faster in the post-critical region), and the absence of this speed-up in the <italic>too</italic> condition.</p>
<p>We interpret the finding as follows. First, the fact that the <italic>long</italic> condition is read faster than the <italic>short</italic> condition in the <italic>nil</italic> case is in line with the general observation that readers speed up during reading texts or experimental items, as seen in experimental research (<xref ref-type="bibr" rid="B22">Ferreira and Henderson, 1993</xref>; see also <xref ref-type="bibr" rid="B87">Vasishth &amp; Lewis, 2006, p. 780</xref>) and the analysis of psycholinguistic reading corpora (<xref ref-type="bibr" rid="B18">Demberg &amp; Keller, 2008</xref>; <xref ref-type="bibr" rid="B21">Dotla&#269;il, 2021</xref>). It is also in line with the fact that we observe a (general) speed-up on the critical region in this experiment. Second, in contrast to the <italic>nil</italic> condition, something nullifies the speed-up in the <italic>too</italic> condition. This, we submit, is the retrieval process due to <italic>too</italic>. Since readers have to resolve the presupposition, they have to recall the antecedent. Such a recall is harder and slower in the <italic>long</italic> condition compared to the <italic>short</italic> condition. Under this interpretation, our findings would represent a novel instance of semantic inhibitory interference (<xref ref-type="bibr" rid="B63">Parker &amp; Phillips, 2016</xref>; <xref ref-type="bibr" rid="B85">Van Dyke, 2007</xref>), one that is observed and can be studied in texts and long discourses, not just sentence-internally.</p>
<p>We can think of three possible explanations for the proposed slow-down in <italic>too</italic> compared to the baseline (the <italic>nil</italic> condition). First, it is possible that readers retrieve a distractor and subsequently correct their misretrieval, which takes extra time. Since misretrievals are more likely to happen with increased distance, the discussed reading times pattern is observed. Alternatively, readers might not correct their wrong retrieval but stick to it and consider the presupposition as unresolved. This would result in an infelicitous sentence, which would likely be read slower compared to a fully grammatical, felicitous sentence. Finally, it is possible that with greater distance, the antecedent decays in memory, resulting in longer retrieval time.</p>
<p>It should also be noted that the interaction effect persists to some extent at the post-post-critical region. If this is related to the observed effect, this might suggest that the retrieval difficulty affected the reading process for longer than just immediately after retrieval.</p>
<p>We should mention one possible confound in this study. As we explained in Section 1, additive presuppositions depend on information structure. In particular, the associate of the additive trigger has focus. Given that our stimuli are not spoken, there is no guarantee the information structure we intended for our stimuli matches how participants perceived them. Take, for instance, the following example:</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(11)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a dancer and the waiter, who is a boxer, dances too, &#8230;.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Here, we assume that when participants read this sentence, they interpret <italic>waiter</italic> as being in focus. The additive then associates with the subject. Potentially, however, participants read (11) with focus on the verb <italic>dances</italic>. In that case, the presupposition becomes that the waiter has properties besides being someone who dances. Given that such properties are provided in the appositive clause intervening between the subject and the verb, our manipulation of distance would stop working under this interpretation. This is even more so because this interpretation is fully unavailable in the short distance condition, where no alternative properties of the subject are given.</p>
<p>While we acknowledge that this alternative understanding of our stimuli is available, we doubt that it played a major role in the results of Experiments 1 and 2. This is because informal intuitions that we gathered about our stimuli suggest that our own intended understanding is much more prominent than the understanding with focus on the verb. However, to be sure, the follow-up experiments were designed to avoid this possible confound.</p>
</sec>
</sec>
<sec>
<title>5. Experiment 3: Acceptability judgment task</title>
<sec>
<title>5.1 Design</title>
<p>Experiments 3 and 4 were designed to further investigate the retrieval of the antecedents of additives. We wanted to see whether the effect of distance on retrieval can be observed by manipulating the number of sentences in a discourse, not by manipulating the number of words in an appositive, i.e. a non-restrictive relative clause, as was done in Experiments 1 and 2. One reason to consider this is that a non-restrictive relative clause might be processed differently from, and independently of, the rest of the discourse (<xref ref-type="bibr" rid="B19">Dillon et al., 2014</xref>), adding unnecessary complications and orthogonal issues to the first two experiments. Another reason is that this allows us to exclude the possible confound in Experiment 1 and 2, that is, the possibility that in the <italic>long</italic> condition, the presupposition was resolved with a different information structure than in the <italic>short</italic> condition.</p>
<p>In the same way as in Experiment 2, we used two factors in Experiment 3: distance and trigger. Two levels of distance were crossed with two levels of trigger, resulting in a 2 &#215; 2 within-participant design.</p>
<p>The distance factor was manipulated by changing the number of clauses between the items. In the <italic>short</italic> case, there was 1 additional clause introduced between the first sentence and the critical sentence. In the <italic>long</italic> case, there were 3 such clauses, see (<italic>short</italic>) and (<italic>long</italic>) in <italic>too</italic> and <italic>nil</italic> below. The sentences were always in the past tense.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p><italic>too</italic></p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>&#160;</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(<italic>short</italic>)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Anne came first in her race. She burst into tears. Caroline won too, setting the lap record.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(<italic>long</italic>)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Anne came first in her race. She raised her hands. She screamed out loud. She burst into tears. Caroline won too, setting the lap record.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p><italic>nil</italic></p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>&#160;</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(<italic>short</italic>)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Anne fell down in her race. She burst into tears. Caroline won gold, setting the lap record.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(<italic>long</italic>)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Anne fell down in her race. She raised her hands. She screamed out loud. She burst into tears. Caroline won gold, setting the lap record.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>The way these discourse fragments were constructed permits us to treat VPs as predicates.<xref ref-type="fn" rid="n6">6</xref> Then, the contrast of distance can be interpreted as the number of these VP-predicates that appear in the <italic>short</italic> and <italic>long</italic> conditions. In the case of (<italic>short</italic>), only one extra clause, introducing one such extra predicate, appeared between the predicate with <italic>too</italic> (<italic>won too</italic>) and the antecedent (<italic>(Anne) came first in her race</italic>). In case of (<italic>long</italic>), three extra sentences, introducing three extra predicates, interfered between the predicate with <italic>too</italic> and the antecedent. We hypothesised that the number of predicates between the antecedent and the trigger should influence how quickly the participants will be able to retrieve the information required by the trigger.</p>
<p>The trigger factor was manipulated by replacing the trigger <italic>too</italic> with a non-triggering word, often a noun, as in the illustrative item given above. The corresponding word in the <italic>nil</italic> condition (<italic>gold</italic> above) had to be of similar length as <italic>too</italic> (between 3 and 5 characters). As in Experiments 1 and 2, the items in the <italic>nil</italic> condition also differed in the first sentence. Care was taken to ensure that sentences that were changed between the conditions had similar lengths and numbers of words.</p>
<p>Notice that the items above no longer allow for the possible confound we discussed in connection with Experiment 2. There, some stimuli had more than one candidate for the additive&#8217;s associate. In this experiment, however, the only felicitous associate for <italic>too</italic> is the subject. In particular, assuming the predicate <italic>won</italic> as the associate would make the presupposition unresolved in both <italic>short</italic> and <italic>long</italic> contexts. What&#8217;s more, the subject of the target sentence is easily recognized as a contrastive topic. In the first two sentences, <italic>Anne</italic> is the topic. The shift of subject in the target sentence means that <italic>Caroline</italic> becomes a contrastive topic (see <xref ref-type="bibr" rid="B10">B&#252;ring, 1999</xref>, section 1.3). As such, it is the natural associate of the additive (<xref ref-type="bibr" rid="B41">Krifka, 1998</xref>; <xref ref-type="bibr" rid="B70">S&#230;b&#248;, 2004</xref>).</p>
<p>We created 24 items of the type described above. The experiment furthermore included 24 filler items. These were simple and short grammatical discourses of a similar form as the target items. Additionally, 10 control items were added to the stimuli. These items were short discourse fragments which violated various semantic and syntactic constraints. They were similar to but not the same as, the ones used in Experiment 1.</p>
</sec>
<sec>
<title>5.2 Participants</title>
<p>Participant recruitment and compensation were carried out in the same way as in Experiment 1. 32 participants (female = 14 (44%), male = 18 (56%)) were recruited.</p>
</sec>
<sec>
<title>5.3 Procedure</title>
<p>The procedure was very similar to Experiment 1. Some modifications were needed to accommodate the different design of the experiment. The experiment consisted of 4 conditions so the 24 target items were distributed across 4 lists and participants were assigned at random to each of these lists. Each participant saw 6 items from every condition.</p>
</sec>
<sec>
<title>5.4 Predictions</title>
<p>The predictions are the same as in Experiment 1, with the caveat that the stimuli are discourse fragments and not sentences.</p>
</sec>
<sec>
<title>5.5 Data analysis and results</title>
<p>No participants were removed from the data. In the same way as in Experiment 1 we fit a cumulative ordinal Bayesian model to the data. All the model details were the same as in Experiment 1. The distance was sum-contrast coded with <italic>long</italic> conditions coded as 1 and <italic>short</italic> conditions coded as &#8211;1. The trigger was sum-contrast coded (<italic>nil</italic> = &#8211;1, <italic>too</italic> = 1). Fixed effects of interest were: distance, trigger and their interaction.</p>
<p>We used regularising prior distributions for slopes and intercepts. For the intercept, we used a normal distribution with <italic>&#956;</italic> = 0 and <italic>&#963;</italic> = 2. For the slopes, we used a normal distribution with <italic>&#956;</italic> = 0 and <italic>&#963;</italic> = 0.5. See also Experiment 1 for a more extensive discussion of the prior structures. We report the results on the log-odds scale.</p>
<p>The 89% credible intervals for the slopes of the interest are summarised on <xref ref-type="fig" rid="F4">Figure 4</xref>.</p>
<fig id="F4">
<caption>
<p><bold>Figure 4:</bold> The posterior probability distribution of the parameters of interest in Experiment 3. The thicker lines delimit 50% probability intervals, the thinner lines delimit 89% probability intervals. The point signifies the mean value.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-1681-g4.png"/>
</fig>
<p>As in the previous experiments, the crucial parameter here is the slope of the interaction. The posterior distribution of the slope of the interaction (89% CrI = (&#8211;0.2, 0.1), mean = &#8211;0.04, sd = 0.09) suggests that the effect of distance on acceptability does not clearly differ between the trigger conditions.</p>
<p>The estimate of the distance parameter indicates that the longer items were judged as less natural (89% CrI = (&#8211;0.3, &#8211;0.04), mean = &#8211;0.2, sd = 0.09).</p>
<p>In the case of the trigger, the fragments with <italic>too</italic> were more natural to the participants than those without it (89% CrI = (0.1, 0.5), mean = 0.3, sd = 0.1).</p>
</sec>
</sec>
<sec>
<title>6. Experiment 4: Self-paced reading</title>
<sec>
<title>6.1 Design</title>
<p>We conducted a self-paced reading study using the same design as in Experiment 3. Some nonessential changes were applied to the stimuli, and the control items were removed. Out of 48 items, 28 (17 target items and 11 filler items) were followed by a comprehension question.</p>
<p>The list of items is provided at the OSF repository.</p>
</sec>
<sec>
<title>6.2 Participants</title>
<p>79 participants (female = 45 (57%), male = 31 (39%), other/don&#8217;t want to say = 3 (4%)) were recruited for the self-paced reading study using Prolific (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.prolific.co">https://www.prolific.co</ext-link>).</p>
<p>The participants were compensated 3.00 GBP for taking part in the experiment. The experiment took, on average, 16 minutes and 46 seconds to finish; therefore the participants would get 10.7 GPB per hour, on average. The participants were self-reported English native speakers.</p>
</sec>
<sec>
<title>6.3 Procedure</title>
<p>The recruitment of the participants, randomization, and data collection mimicked that in Experiment 2. The experiment was conducted in a noncumulative self-paced reading moving window paradigm, as in Experiment 2. Since the stimuli were constructed of quite long text fragments, we inserted line breaks manually to ensure that the regions of interest would not end up before or across the line break.</p>
</sec>
<sec>
<title>6.4 Predictions</title>
<p>The predictions are the same as in Experiment 2, with the caveat that the stimuli are discourse fragments and not sentences.</p>
</sec>
<sec>
<title>6.5 Data analysis</title>
<p>Every participant answered more than 70% of comprehension questions correctly. For this reason, no participant was removed. Time measurements which were under 50 ms or above 3,000 ms were removed from the results. Similar data trimming was done, for example, by Futrell et al. (<xref ref-type="bibr" rid="B25">2021</xref>). This resulted in the removal of 0.46% of the data.</p>
<p>Data analysis of the final data from 79 participants was similar to the data analysis conducted for Experiment 2.</p>
<p>We fitted a Bayesian mixed-effects model to the response time data, measured at the critical, post-critical, and post-post-critical regions. The experimental factor <italic>trigger</italic> was coded using sum contrasts, with the <italic>nil</italic> condition as &#8211;1 and the <italic>too</italic> condition as 1. The factor <italic>distance</italic> was also coded using sum contrasts, with the <italic>short</italic> condition coded as &#8211;1 and <italic>long</italic> condition as 1. Both of these variables were used as fixed effects in the model. Participants and items were entered as random effects.</p>
<p>We also fit two additional models to the data. The first of these was the same nested model as in Experiment 2. With the second of those, we wanted to check for the effect of distance in regions other than the critical and post-critical. Here, we first pooled the data from the regions starting at the critical region &#8211;5, i.e. the first region at which the distance manipulation could be detected, up to the pre-critical region and the post-post-critical region (excluding the critical and post-critical region). The post-post-critical was the last region at which all the items had measurements. To these data, we fit a similar model as before, but without the trigger predictor. After this, we also examined the effect of distance at each of those particular regions. This was done to check for secondary support for our initial explanation, i.e. that the retrieval of the antecedent is erasing the general speed-up. The reasoning was that, if there is a general speed-up, we should also be able to observe it in other regions besides the critical and the post-critical regions. These models used 4 sampling chains with 6,000 samples drawn from each chain. 2,000 samples from each chain were discarded for warm-up; hence, each model had 16,000 samples available for the analysis.</p>
<p>The priors used were the same as the ones used Experiment 2. For the intercept, we used a normal distribution with <italic>&#956;</italic> = 6 and <italic>&#963;</italic> = 1.5. For the slopes, we used a normal distribution with <italic>&#956;</italic> = 0 and <italic>&#963;</italic> = 0.1. The prior distribution used for the standard deviations of the random effects and for the residual standard deviation was a truncated normal distribution with <italic>&#956;</italic> = 0 and <italic>&#963;</italic> = 1. For the random effects correlation between the intercept and the slope we used the LKJ distribution (<xref ref-type="bibr" rid="B48">Lewandowski et al., 2009</xref>; <xref ref-type="bibr" rid="B76">Stan Development Team, 2021</xref>) with <italic>&#951;</italic> = 2. See also Section 4.5 for a more extensive discussion of the prior structures.</p>
<p>Model estimates are reported on the log-ms scale.</p>
</sec>
<sec>
<title>6.6 Results</title>
<p>The descriptive summaries of the reading times across conditions for one word before and two words after the critical region (the additive <italic>too</italic> and the corresponding word in the <italic>nil</italic> condition) are shown on <xref ref-type="fig" rid="F5">Figure 5</xref>.</p>
<fig id="F5">
<caption>
<p><bold>Figure 5:</bold> Descriptive summary of reading times in Experiment 4. The points connected by dashed lines denote mean reading times across conditions. These were computed by taking an average of all the data points for each condition, i.e. without grouping by participants or items. The bars represent bootstrapped 0.95 confidence intervals.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-1681-g5.png"/>
</fig>
<p>At the critical region, the main effect of trigger was &#8211;0.02 log-ms (89% CrI: (&#8211;0.04, &#8211;0.008)). This suggests a negative difference between the conditions, i.e. the <italic>too</italic> condition was read faster than the <italic>nil</italic> condition. The main effect of distance was &#8211;0.009 log-ms (89% CrI: (&#8211;0.02, 0.003)). Since the probability density of the 89% credible interval is on the both sides of zero, one cannot conclusively infer the direction of the effect from the estimate. However, the region has much more density below zero which means that it is more likely that the participants were reading the word faster in the <italic>long</italic> condition than in the <italic>short</italic> one. The interaction effect was 0.005 log-ms (89% CrI: (&#8211;0.01, 0.02)). This suggests that the difference between the <italic>long</italic> and <italic>short</italic> conditions was smaller in the <italic>too</italic> conditions than in the <italic>nil</italic> conditions, although a caveat similar to the distance results applies here also. All the results observed at the critical region had the same direction as in Experiment 2. The magnitude of the results was also similar excluding the effect of distance, which, in Experiment 2, was more negative.</p>
<p>Let us turn to the nested model for the critical region. The model shows that the effect of distance within the <italic>too</italic> condition was &#8211;0.006 log-ms (89% CrI: (&#8211;0.02, 0.01)) and that the effect of distance within the <italic>nil</italic> condition was &#8211;0.01 log-ms (89% CrI: (&#8211;0.03, 0.007)). The results suggest that <italic>long</italic> was read faster than <italic>short</italic> in both conditions, even though the credible intervals include zero and, thus, the speed-up should not be taken as conclusive. Both of these effects were much closer to 0 than in the Experiment 2, but the fact that more probability density is below 0 aligns with the results observed in Experiment 2.</p>
<p>At the post-critical region, the main effect of trigger was &#8211;0.02 log-ms (89% CrI: (&#8211;0.03, &#8211;0.005)), i.e. the <italic>too</italic> condition was read faster than the <italic>nil</italic> condition. The main effect of distance was &#8211;0.01 log-ms (89% CrI: (&#8211;0.02, 0.001)). As in the case of the critical region, the probability density of the 89% credible interval is on both sides of zero, but most of the density is below zero. Therefore, it is more likely that the participants were reading the word faster in the <italic>long</italic> condition than in the <italic>short</italic> one. The interaction effect was 0.01 log-ms (89% CrI: (0, 0.02)). This suggests that the difference between the <italic>long</italic> and <italic>short</italic> conditions, which was negative as the main effect, was diminished or even (as we indeed will see) reversed in the <italic>too</italic> condition. The effects of trigger and interaction were virtually the same as in Experiment 2, the effect of distance was diminished compared to Experiment 2. The posterior probability distribution of the parameters is summarized in <xref ref-type="fig" rid="F6">Figure 6</xref>.</p>
<fig id="F6">
<caption>
<p><bold>Figure 6:</bold> The posterior probability distribution of the parameters of interest, from the model fit to the post-critical region in Experiment 4. The thicker lines delimit 50% probability intervals, the thinner lines delimit 89% probability intervals. The point signifies the mean value.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-1681-g6.png"/>
</fig>
<p>The interaction can be more easily understood in the nested model, which shows that the effect of distance within the <italic>too</italic> condition was 0.01 log-ms (89% CrI: (&#8211;0.007, 0.03)). That is, it is more likely that the <italic>too</italic> condition took more time to read in the <italic>long</italic> than in the <italic>short</italic> condition. However, since the credible interval crosses zero this interpretation should not be taken as conclusive. The effect of distance within the <italic>nil</italic> condition was &#8211;0.02 log-ms (89% CrI: (&#8211;0.04, &#8211;0.006)), which means that the <italic>long</italic> condition was read faster than the <italic>short</italic> condition. Both of these effects were more positive than in Experiment 2.</p>
<p>For the post-post-critical region, the results are very similar. The main effect of trigger was &#8211;0.02 log-ms (89% CrI: (&#8211;0.04, &#8211;0.008)), that is, the <italic>too</italic> condition was read faster than the <italic>nil</italic> condition. The main effect of distance was &#8211;0.004 log-ms (89% CrI: (&#8211;0.01, 0.006)). The interaction effect was 0.007 log-ms (89% CrI: (&#8211;0.003, 0.02)). The effects of trigger and interaction were virtually the same as in Experiment 2, the effect of distance was more negative compared to Experiment 2.</p>
<p>In the nested model of the post-post-critical region, the effect of distance within the <italic>too</italic> condition was 0.001 log-ms (89% CrI: (&#8211;0.01, 0.02)), that is, there was no clear difference in reading times between the distance conditions. The effect of distance within the <italic>nil</italic> condition was &#8211;0.01 log-ms (89% CrI: (&#8211;0.03, 0.003)), which means that it is more likely that the participants were reading the <italic>long</italic> condition faster than the <italic>short</italic> condition, but since the credible interval crosses zero, the interpretation is inconclusive. These effects were the only ones that were reversed compared to the results from Experiment 2. The effect of distance within <italic>too</italic> condition was much smaller in this experiment, and the effect of distance within <italic>nil</italic> condition was much larger in this experiment.</p>
</sec>
<sec>
<title>6.7 Discussion</title>
<p>The 89% credible interval of the estimate of the interaction parameter is predominantly situated on the positive side in the critical and the post-post-critical regions, and it is fully situated on the positive side in the post-critical region. The finding in the post-critical region replicates what we observed in Experiment 2. In the nested comparison of the post-critical region, we see that the interaction is driven by the speed-up in long items in the <italic>nil</italic> condition, which is crossed with a slowdown in long items in the <italic>too</italic> condition. The slowdown due to distance in the <italic>too</italic> condition, however, was not fully conclusive, since the credible interval crossed zero. In Experiment 2, we suggested that this can be explained as an interplay of two effects: readers speed up during reading, hence the speed-up due to distance in the <italic>nil</italic> condition, whereas the retrieval process is more effortful when the antecedent being retrieved is further away, hence the lack of speed-up, or even slowdown, in the <italic>too</italic> condition. We assume the same explanation accounts for the interaction in this experiment.</p>
<p>We can explore this explanation further since the <italic>short</italic> and <italic>long</italic> conditions were matched in several words preceding the critical region. Somewhat abstractly, we can represent the situation in this experiment as in (12). Sentence1, Sentence2 and Sentence3 are matched across the <italic>short</italic> and <italic>long</italic> conditions. Sentence3 includes the crucial manipulation, the additive particle <italic>too</italic>, which triggers retrieval. We can go up to 5 words back to the beginning of Sentence2 to explore the effect of distance on matched words before <italic>too</italic> was read.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(12)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>(<italic>short</italic>) Sentence1. Sentence2. Sentence3.</p></list-item>
<list-item><p>(<italic>long</italic>) Sentence1. &#8230;<bold>extra material</bold> &#8230;Sentence2. Sentence3.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>If our explanation is right, we expect to find a speed-up in these regions. In fact, the findings are mixed. When we pool regions in Sentence2 and Sentence3 before the critical region, but including the post-post-critical region, we see a negative effect, corresponding to the expected speed-up: &#8211;0.002 log-ms (89% CrI: (&#8211;0.01, 0.008)). However, as we can see, the credible interval does not exclude zero. When we look at individual words, we see a clear negative effect (i.e. faster reading in longer distance) at region &#8211;4, four words back from the critical region: &#8211;0.02 log-ms (89% CrI: (&#8211;0.04, &#8211;0.006)). It is probably relevant that in this region a clause (Sentence2) ended in 18 out of 24 items, and there was a line break in the majority of test items after this word. In this respect, it might be worth pointing out that the critical region, which is the next region to show a quite clear speed-up effect of distance, is also the end of a clause, which is signalled by a comma, and followed by a gerund clause. However, it is not clear why we would see a spillover of the speed-up effect to the post-critical and the post-post-critical regions, but no comparable spillover from the region &#8211;4. Thus, while we have some evidence that the speed-up effect of distance can be found outside of the critical and the post-critical regions, the evidence is not entirely conclusive, since only one region shows this effect clearly.</p>
<p>The results of the post-critical region replicate the results from Experiment 2 even though Experiment 4 looks at retrieval across a series of sentences, while Experiment 2 investigated the effect of distance within a clause. This suggests that the memory processes involved in the retrieval of the antecedent of <italic>too</italic> might be similar both at the sentence level and at the discourse level.</p>
<p>As can be seen in <xref ref-type="fig" rid="F5">Figure 5</xref>, the data are noisier than in Experiment 2. This can be related to the stimuli being discourse fragments instead of simple sentences.</p>
</sec>
</sec>
<sec>
<title>7. A pooled model</title>
<p>Since the design of all the experiments was similar, we decided to take advantage of the Bayesian paradigm and fit a model to the data pooled from all experiments and grouped by task type. We wanted to check what trend the data point to when taken together. Furthermore, we calculated Bayes factors on the pooled data to compare two hypotheses of interest. There are some important caveats to be made here. The Bayes factor is somewhat controversial (see <xref ref-type="bibr" rid="B26">Gelman et al., 2003, pp. 182&#8211;184</xref>; <xref ref-type="bibr" rid="B56">Navarro, 2019</xref>). One reason for this status is that the Bayes factor is &#8220;highly sensitive to, and crucially depends on, prior assumptions about model parameters&#8221; (<xref ref-type="bibr" rid="B71">Schad et al., 2021, p. 6</xref>). We tried our best to be cautious, but the reader should keep these possible shortcomings in mind. We think that the Bayes factor estimation is useful, but it should be mostly treated as an addition to the main content of the analysis. For a good overview of the Bayes factor methodology, see Schad et al. (<xref ref-type="bibr" rid="B71">2021</xref>).</p>
<p>In the model estimation used for the calculation of the Bayes factor for the self-paced reading experiments, we used 4 chains, with 30,000 iterations per chain and 2,000 warmup iterations. The stability of the estimates obtained was checked by fitting the models (the null model and the alternative model) 4 times, estimating the marginal likelihood for each of these fits via bridge sampling, and, finally, computing the Bayes factor from these estimates. A similar approach was used in the case of the acceptability judgment experiments, but these models used 4 chains with 15,000 iterations per chain and 2,000 warmup iterations. The stability of these estimates was checked by completing the estimation procedure 5 times.<xref ref-type="fn" rid="n7">7</xref></p>
<p>In the first hypothesis, which we label H1, it is assumed that next to the effect of the trigger (<italic>too</italic> vs. <italic>nil</italic>) and the effect of distance, the interaction term <italic>trigger</italic> &#215; <italic>distance</italic> as a fixed effect influences reading times. In the second hypothesis, which we label H0, it is assumed that the interaction term does not influence reading times. Using the common notation of the R model formula, we write these two hypotheses as the following two formulae:</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(13)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>[H1]</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>log(rt) &#126; 1 + distance * trigger +</p></list-item>
<list-item><p>(1+ distance * trigger &#124; participant) +</p></list-item>
<list-item><p>(1+ distance * trigger &#124; item)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>[H0]</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>log(rt) &#126; 1 + distance + trigger +</p></list-item>
<list-item><p>(1+ distance * trigger &#124; participant) +</p></list-item>
<list-item><p>(1+ distance * trigger &#124; item)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<sec>
<title>7.1 Data preparation</title>
<p>For each experiment type (i.e. acceptability judgment and self-paced reading), we took all the collected data and treated them as if they were gathered in one experiment in which subsets of participants saw subsets of items.</p>
<p>For the self-paced reading tasks, we only considered data from the post-critical regions. To such datasets we fit a model with random intercepts and slopes for participants and items. The fixed effects were <italic>distance, trigger</italic> and their interaction. The predictor <italic>trigger</italic> was sum-coded with <italic>nil</italic> = &#8211;1, and <italic>too</italic> = 1. The predictor <italic>distance</italic> was the number of words between the antecedent and the trigger z-normalized, i.e. rescaled to a mean of 0 and standard deviation of 1.</p>
<p>For the acceptability judgment experiments, we treated the data in a similar way, but since the distance factors differed between the experiments (4 levels in the first experiment and 2 levels in the second), we took a subset of items from the first experiment (levels labelled 0 as <italic>short</italic> and 8 as <italic>long</italic>) and fit models to this subset. The distance was sum-coded with the <italic>long</italic> distance coded as 1 and the <italic>short</italic> distance as &#8211;1.<xref ref-type="fn" rid="n8">8</xref></p>
</sec>
<sec>
<title>7.2 Results</title>
<p>First, we consider the self-paced reading data. We report the estimates of the models with the interaction. The main effect of the trigger was &#8211;0.02 log-ms (89% CrI: (&#8211;0.03, &#8211;0.008)). The negative credible interval shows that the post-critical regions are more likely read faster in the <italic>too</italic> condition than in the <italic>nil</italic> condition. The main effect of distance was &#8211;0.016 log-ms (89% CrI: (&#8211;0.03, &#8211;0.006)). This means that the participants read the post-critical region faster when the distance was greater. Finally, the interaction effect was 0.017 log-ms (89% CrI: (0.008, 0.03)). This shows that the negative effect of distance was diminished, or even reversed, in the <italic>too</italic> condition. The interaction estimate replicates the findings from individual experiments and supports our hypothesis that increased distance is problematic for presupposition resolution, because the retrieval of the antecedent is more effortful.</p>
<p>For the acceptability judgment data, the main effect of the trigger was 0.2 log-odds (89% CrI: (0.08, 0.35)), suggesting that the <italic>too</italic> condition was judged as more natural than the <italic>nil</italic> condition. The main effect of <italic>distance</italic> was &#8211;0.3 log-odds (89% CrI: (&#8211;0.5, &#8211;0.2)). This means that the participants judged longer sentences as worse on average than short sentences. The interaction effect was &#8211;0.08 log-odds, showing that the effect of distance was greater (more negative) in the <italic>too</italic> condition, but the credible interval included zero (89% CrI: (&#8211;0.19, 0.019)).</p>
<p>We now turn to the comparison of the two hypotheses H0 and H1 using the Bayes factor. Since, as mentioned above, the Bayes factor is sensitive to assumptions about prior distributions (<xref ref-type="bibr" rid="B26">Gelman et al., 2003</xref>; <xref ref-type="bibr" rid="B56">Navarro, 2019</xref>; <xref ref-type="bibr" rid="B71">Schad et al., 2021</xref>), we provide a sensitivity analysis. This means that we check several priors for the part of the model in which H0 and H1 differ, i.e. the interaction term distance &#215; trigger.</p>
<p>The summary of the sensitivity analysis for Bayes factor <italic>BF</italic><sub>10</sub> regarding the self-paced reading studies is provided in <xref ref-type="table" rid="T1">Table 1</xref>. In this sensitivity analysis, the prior distribution for the interaction term is normal, with <italic>&#956;</italic> centered at zero and <italic>&#963;</italic> one value from the list 0.01, 0.05, 0.10, 0.20, 0.50. We chose these values for <italic>&#963;</italic> since small values (0.01 and 0.05) restrict the interaction term within the most plausible range and should thus lead to a <italic>BF</italic><sub>10</sub> that shows evidence in favor of the model with the interaction term, H1 (provided the data support H1). The remaining values (0.1, 0.2, 0.5) are unreasonably large and overestimate the effect size of the interaction term, and, thus, it is likely that somewhere among these values, we will observe that <italic>BF</italic><sub>10</sub> should become smaller than 1 and thus reveal evidence in favor of H0, the model without the interaction term.</p>
<table-wrap id="T1">
<caption>
<p><bold>Table 1:</bold> Bayes factors (<italic>BF</italic><sub>10</sub>) for the self-paced reading studies for the models in which the prior distribution of the interaction term was normal distribution, with the mean centered at zero and the standard deviation a value from the list 0.01, 0.05, 0.10, 0.20, 0.50.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top"><bold><italic>&#963;</italic></bold></td>
<td align="left" valign="top"><bold><italic>BF</italic><sub>10</sub> (mean)</bold></td>
<td align="left" valign="top"><bold><italic>BF</italic><sub>10</sub> (individual runs)</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">0.01</td>
<td align="left" valign="top">13.3</td>
<td align="left" valign="top">10.9, 12.1, 14.8, 15.4</td>
</tr>
<tr>
<td align="left" valign="top">0.05</td>
<td align="left" valign="top">7.8</td>
<td align="left" valign="top">5.8, 8.5, 8.5, 8.7</td>
</tr>
<tr>
<td align="left" valign="top">0.10</td>
<td align="left" valign="top">4.7</td>
<td align="left" valign="top">4.1, 4.6, 4.6, 5.5</td>
</tr>
<tr>
<td align="left" valign="top">0.20</td>
<td align="left" valign="top">2.1</td>
<td align="left" valign="top">1.87, 1.85, 2.26, 2.32</td>
</tr>
<tr>
<td align="left" valign="top">0.50</td>
<td align="left" valign="top">0.82</td>
<td align="left" valign="top">0.76, 0.8, 0.8, 0.91</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>A similar reasoning could be applied to the sensitivity analysis of the Bayes factor regarding the acceptability judgment task. Again, we decided to use a normal prior distribution, with <italic>&#956;</italic> set to 0 and <italic>&#963;</italic> taking values from the list 0.005, 0.009, 0.05, 0.1, 0.3, 0.5. We assumed that the smaller values of <italic>&#963;</italic> should provide more evidence for the model with the interaction term and the larger values, for the model without the interaction term, but note that here we do not really have a good reference point for what effect size should be expected.</p>
<p>In <xref ref-type="table" rid="T1">Table 1</xref>, we see that when the standard deviation of the prior distribution is &#8804; 0.2, Bayes factor estimates show evidence in favour of the model with the interaction, and only when the standard deviation is quite large, <italic>&#963;</italic> = 0.5, is <italic>BF</italic><sub>10</sub> smaller than 1, showing evidence in favour of the null model (the model without the interaction term). This decrease of evidence in favor of H1 is not surprising, given that other studies of interference effects in memory retrieval found clear evidence in favor of H1 only when the effect was small (<xref ref-type="bibr" rid="B71">Schad et al., 2021</xref>).</p>
<p>Among the values considered, there is a maximum, (<italic>BF</italic><sub>10</sub> = 13.3), when the standard deviation is 0.01, i.e. the observed data in the pooled model are 13.3 times more likely to occur under H1 than H0 in this case. This size of <italic>BF</italic><sub>10</sub> is sometimes labeled as moderate-to-strong evidence in favor of the full model (<xref ref-type="bibr" rid="B84">van Doorn, et al., 2021</xref>).</p>
<p>Regarding the acceptability study, we see in <xref ref-type="table" rid="T2">Table 2</xref> that no values of <italic>&#963;</italic> provide evidence for the model with the interaction. In fact, the estimates for most values of <italic>&#963;</italic> are close to 1, which means that there is no change in evidence for either of the models, as far as acceptability judgment studies are concerned.</p>
<table-wrap id="T2">
<caption>
<p><bold>Table 2:</bold> Bayes factors (<italic>BF</italic><sub>10</sub>) for the acceptability judgment studies. Listed are the models in which the prior distribution for the interaction term was normal distribution, with the mean centered at zero and the standard deviation a value from the list 0.005, 0.009, 0.05, 0.1, 0.3, 0.5.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top"><bold><italic>&#963;</italic></bold></td>
<td align="left" valign="top"><bold><italic>BF</italic></bold><sub>10</sub></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">0.005</td>
<td align="left" valign="top">1.129</td>
</tr>
<tr>
<td align="left" valign="top">0.009</td>
<td align="left" valign="top">1.045</td>
</tr>
<tr>
<td align="left" valign="top">0.050</td>
<td align="left" valign="top">1.041</td>
</tr>
<tr>
<td align="left" valign="top">0.100</td>
<td align="left" valign="top">0.985</td>
</tr>
<tr>
<td align="left" valign="top">0.300</td>
<td align="left" valign="top">0.468</td>
</tr>
<tr>
<td align="left" valign="top">0.500</td>
<td align="left" valign="top">0.347</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>7.3 Discussion</title>
<p>In the pooled reading studies, we see that the 89% credible interval for the interaction parameter lies on the positive side of 0, indicating that for the pooled model, it is likely that the effect is positive. Moreover, the Bayes factor estimates indicate that there is clear evidence (moderate-to-strong in the range of most reasonable priors) in favor of the H1 model. There was no evidence in favor of the interaction model in case of the pooled acceptability studies, that is, the detectable slowdown in reading does not translate to decreased acceptability in our experiments.</p>
<p>The sensitivity analysis of the Bayes factor shows the relationship between the standard deviation of the prior distribution on the interaction parameter, and the strength of the evidence for the full model (the model with the interaction term).</p>
<p>The interaction term expresses the difference between the effect of distance in the <italic>too</italic> condition and the effect of distance in the <italic>nil</italic> condition. The model fitted to the pooled data predicted the mean estimate of the interaction term as 0.0173 log-ms. It is easier to interpret these estimates on the ms scale. At the intercept estimate of the model, this corresponds to a difference of 14 ms. The difference is calculated as follows:</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(14)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>exp(I<sc>ntercept</sc> + <italic>&#946;<sub>int</sub></italic>) &#8211; exp(I<sc>ntercept</sc> &#8211; <italic>&#946;<sub>int</sub></italic>) = exp(6 + 0.0173) &#8211; exp(6 &#8211; 0.0173)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>This means that at the intercept (here, the grand mean reading time), we expect 14 ms of difference per one unit of distance. Recall that the distance is measured in the scaled number of words between the beginning of the text and the trigger. On this scale, one unit of distance corresponds to 6.2 words.</p>
</sec>
</sec>
<sec>
<title>8. General discussion</title>
<sec>
<title>8.1 The size and source of the effect</title>
<p>The results of these experiments provide evidence that reading is slowed down when the antecedent is further away from the trigger, the additive particle <italic>too</italic>. However, given that retrieval of the antecedent is needed, and given that it seems clear that this retrieval should be more taxing with increased distance, it should not be difficult to find evidence for this distance effect. Yet, the effects in both experiments are rather small. The strength of this evidence, as quantified by the Bayes factor, is conditional on the prior distribution for the slope of the interaction used in the model. We saw moderate to strong evidence for models with prior distributions that assume rather small effects sizes.</p>
<p>Notice, however, that memory retrieval effects seem to be small in general (<xref ref-type="bibr" rid="B34">J&#228;ger et al., 2017</xref>). Their small effect size is also what makes them difficult to establish. As an example, consider facilitatory interference effects in antecedent-reflexive constructions. A study by Dillon et al. (<xref ref-type="bibr" rid="B20">2013</xref>) found no facilitation, but a later experiment which collected data from many more participants (<xref ref-type="bibr" rid="B35">J&#228;ger et al., 2020</xref>) did find some such effect.</p>
<p>The size of the effect also provides clues as to how we should reason about its source. It is usually assumed that retrieval can happen either via a serial search mechanism or via a direct-access, cue-based mechanism (<xref ref-type="bibr" rid="B54">McElree et al., 2003</xref>). The former is a costly mechanism; the parser has to attend to every possible item in search for a match. To believe that a similar mechanism is responsible for retrieval in the experiments presented, we would expect a drastic slowdown with every additional candidate. We did not observe such a slowdown. Rather, the approximate size of the effect is comparable to the findings on retrieval in syntax. Assuming that only these two possibilities are on the table, we are compelled to conclude that the mechanism responsible is direct access.</p>
<p>The slowdown observed in the experiments could be interpreted as an effect of decay: the greater distance resulted in a slower retrieval of the relevant information, because its representation in memory weakened with time. Interpreting the findings in terms of decay is compatible with cue-based theories of retrieval, which can be cast in the framework in which temporal decay is one of the factors affecting retrieval time (e.g. ACT-R <xref ref-type="bibr" rid="B3">Anderson &amp; Lebiere, 1998</xref>; <xref ref-type="bibr" rid="B2">Anderson, 2007</xref>).</p>
<p>Another possibility is that the effect is driven by a different kind of cue from the ones we assumed so far. The experiments assumed that all the cue-value pairs other than the ones directly manipulated remained constant across the conditions. However, there are cue-value pairs the status of which might change with distance. For example, the salience of the licensing element might decrease with distance.<xref ref-type="fn" rid="n9">9</xref> If the retrieval of the antecedent is guided by a cue {<italic>salient</italic>}, and salience diminishes with distance, we would expect the parser to have trouble retrieving the target.</p>
</sec>
<sec>
<title>8.2 Additive presupposition between retrieval and expectation</title>
<p>We started this work arguing that additive presuppositions trigger a retrieval process, on the basis of the fact that language users have the capacity to judge the felicity of additive <italic>too</italic>, given the presence or absence of appropriate material earlier in the discourse. That is, the contrast between (15) and (16) shows that the processing <italic>too</italic> relies on recall of earlier parts of the discourse.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(15)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Ann lives in Paris. Sue lives in France, #(too).</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(16)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Ann lives in Brussels. Sue lives in France, (#too).</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>The way we have presented the reliance on memory so far is in line with the treatment of presupposition as an anaphora-like phenomenon, as per quite a few proposals from semantic theory (<xref ref-type="bibr" rid="B43">Kripke, 2009</xref>; <xref ref-type="bibr" rid="B82">van der Sandt &amp; Geurts, 1991</xref>; <xref ref-type="bibr" rid="B81">van der Sandt, 1992</xref>;). However, one aspect of the contrast between (15) and (16) may be relevant for processing, yet does not follow from the anaphoric character of additive presuppositions. Not only is the additive felicitous in (15), it is also obligatory.<xref ref-type="fn" rid="n10">10</xref> That is, upon having processed <italic>Sue lives in France</italic> in (15), anything other than an additive will be deemed infelicitous. This obligatory presence of <italic>too</italic> is usually explained along the following lines. If <italic>Sue</italic> is a contrastive topic in the second sentence and the additive is left out, then the presence of alternatives to Sue would trigger the implicature that (out of the contextually salient set of individuals) only Sue lives in France. But this implicature is contradicted by what we just learned about where Anne lives. By presupposing that some alternative is true in addition to the asserted one, exhaustification is cancelled. This is because only one operation can associate with the focus/contrastive topic at a time. So, the presence of <italic>too</italic> precludes exhaustification from taking place and this is why the additive is obligatory in (15) (e.g. <xref ref-type="bibr" rid="B6">Bade, 2016</xref>; <xref ref-type="bibr" rid="B41">Krifka, 1998</xref>; <xref ref-type="bibr" rid="B70">S&#230;b&#248;, 2004</xref>).</p>
<p>We do not believe this aspect of additive presuppositions drastically changes the retrieval picture, though it may suggest a slightly different narrative about what is behind our results.<xref ref-type="fn" rid="n11">11</xref> As explained above, the obligatory nature of <italic>too</italic> makes it very predictable. This predictability is probably supported by various factors, one of which is the parallelism in the items of our first two experiments, as, for example, in (17).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(17)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The cook is a dancer and the waiter, who is a great boxer from southern Amsterdam, dances too, I have been told recently.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>When processing such a sentence, after arriving at the site of the verb in the second clause (<italic>dances</italic>), participants can retrieve the predicate (<italic>is a dancer</italic>) from the first clause and, as a result, predict <italic>too</italic>. This prediction is likely to result in an increase in reading speed, but such a speed-up is preceded by a retrieval effort, which is less costly for shorter sentences (since retrieval of the previous predicate is less inhibited by the distracting predicates). Because the items in Experiments 3 and 4 do not include the kind of parallelism found in (17), it could be that the prediction of <italic>too</italic> is less profound in those experiments, which could account for the smaller effect.</p>
<p>On this view, one may expect to see the effect of distance earlier, since the retrieval happens at the verb, but it is likely such differences cannot be captured in the self-paced reading paradigm. In short, then, we think that although the predictability of <italic>too</italic> allows for a narrative that is slightly different from the anaphoric processing framing we used elsewhere, the overall predictions are not so different, since the kind of retrieval involved is the same.</p>
</sec>
<sec>
<title>8.3 Slowing down or just not speeding-up?</title>
<p>Our starting point was to combine insights from semantic approaches to additive presuppositions with psycholinguistic findings on memory retrieval. The combination lead us to the hypothesis that an increase in distance between the additive <italic>too</italic> and the antecedent should result in more costly retrieval, which should translate into a slowdown in reading. In two self-paced reading experiments, this slowdown in <italic>too</italic> was observed, but, crucially, the slowdown was observed <italic>with respect to the baseline</italic>, that is, the sentences that lacked the additive. Furthermore, as we saw, an increase in distance caused a speed-up in the baseline. A slowdown in sentences with <italic>too</italic> relative to this baseline means, in the absolute measures studied in the nested comparisons, no speed-up/slowdown (Experiment 2) or a slowdown which, however, is not fully conclusive (Experiment 4).</p>
<p>We suggested that the speed-up in the baseline is expected independently, based on previous findings that show that readers speed up as they proceed in reading text (<xref ref-type="bibr" rid="B18">Demberg and Keller, 2008</xref>; <xref ref-type="bibr" rid="B21">Dotla&#269;il, 2021</xref>; <xref ref-type="bibr" rid="B22">Ferreira and Henderson, 1993</xref>; see also <xref ref-type="bibr" rid="B87">Vasishth &amp; Lewis, 2006, p. 780</xref>). Furthermore, we claimed that the fact that we observe a slowdown compared to the baseline but no absolute slowdown, in sentences with <italic>too</italic> should be understood to arise through the combination of two factors. On one hand there is a slowdown caused by more costly retrieval when the distance from the antecedent increases. On the other hand, readers read longer texts and, therefore, they should speed up. The net result is potentially no observable absolute slowdown in sentences with <italic>too</italic>.</p>
<p>Such a trade-off between speed-up due to increased sentence length and slowdown due to harder retrieval was observed in other studies on retrieval and was discussed, among others, by Levy &amp; Keller (<xref ref-type="bibr" rid="B47">2013</xref>); Futrell et al. (<xref ref-type="bibr" rid="B24">2020</xref>). In those studies, however, the focus was on how specific readers&#8217; expectations, modeled in Surprisal Theory, interact with the distance effects observed in retrieval.</p>
<p>One way to strengthen the results of the current study would be to design an experiment in which the distance in <italic>too</italic> does not cause just a relative speed-up but in which a slowdown can be seen irrespective of any baseline. Nicenboim et al. (<xref ref-type="bibr" rid="B57">2016</xref>) present an extended discussion of problems that arise when researching distance effects which could potentially be useful in constructing novel experiments that would further strengthen the results in this way.</p>
</sec>
</sec>
<sec>
<title>9. Conclusion</title>
<p>We presented four experiments that investigated the role of the retrieval needed to correctly interpret utterances with the presupposition trigger <italic>too</italic>. The trigger used in the experiments is an example of a linguistic element that forms a semantic dependency, which is not mediated by particular syntactic constructions.</p>
<p>Across the experiments, we found that it takes more time to read sentences with the presupposition trigger <italic>too</italic> when the distance between the trigger and its antecedent is greater, compared to the baseline, which lacked the presupposition trigger. This is compatible with the assumption that <italic>too</italic> triggers retrieval and that retrieval is cue-based, or that the activation of memory traces is subject to temporal decay.</p>
<p>The fact that retrieval is needed to resolve both syntactic and semantic dependencies, and that the retrieval mechanisms seem to show similar properties, suggests that a general-purpose mechanism for memory structuring and memory retrieval could exist in language processing.</p>
</sec>
</body>
<back>
<sec>
<title>Data accessibility statement</title>
<p>The data, materials, and code are available at an Open Science Foundation repository, at: <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://osf.io/2xcgu/">https://osf.io/2xcgu/</ext-link>, DOI: <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.17605/OSF.IO/2XCGU">10.17605/OSF.IO/2XCGU</ext-link></p>
</sec>
<sec>
<title>Ethics and consent</title>
<p>The studies in this article have been conducted after approval by the Faculty Ethics Assessment Committee Humanities (FEtC-H) of Utrecht University under reference number 4058356-01-2019.</p>
</sec>
<ack>
<title>Acknowledgements</title>
<p>We thank Sol Lago and two anonymous reviewers for their valuable feedback on this article. The research in this article was supported by an NWO Vrije competitie grant under grant number VC.GW17.112.</p>
<p>The third author was funded by the European Union (the European Research Council grant 101088098, MEMLANG). The views and opinions expressed are, however, those of the authors only and do not necessarily reflect those of the European Union or the European Research Council Executive Agency. Neither the European Union nor the granting authority can be held responsible for them.</p>
</ack>
<sec>
<title>Competing interests</title>
<p>The authors have no competing interests to declare.</p>
</sec>
<sec>
<title>Authors&#8217; contributions</title>
<p>Conceptualization: Rick Nouwen, Jakub Dotla&#269;il, Jan Winkowski; Data curation: Jan Winkowski; Formal analysis: Jan Winkowski, Jakub Dotla&#269;il; Funding acquisition: Rick Nouwen; Investigation: Jan Winkowski; Methodology: Jakub Dotla&#269;il, Jan Winkowski; Project administration: Jan Winkowski; Supervision: Rick Nouwen, Jakub Dotla&#269;il; Visualization: Jan Winkowski; Writing &#8212; original draft: Jan Winkowski, Jakub Dotla&#269;il, Rick Nouwen; Writing &#8212; review &amp; editing: Jan Winkowski, Jakub Dotla&#269;il, Rick Nouwen.</p>
</sec>
<sec>
<title>ORCiD IDs</title>
<p>Jan Winkowski <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://orcid.org/0000-0002-0301-8471">0000-0002-0301-8471</ext-link></p>
<p>Rick Nouwen <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://orcid.org/0000-0001-9571-4644">0000-0001-9571-4644</ext-link></p>
<p>Jakub Dotla&#269;il <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://orcid.org/0000-0002-5337-8432">0000-0002-5337-8432</ext-link></p>
</sec>
<fn-group>
<fn id="n1"><p>There are some well-known cases where additives <italic>are</italic> felicitous out of the blue. For instance: <italic>The rich should suffer, too!</italic> Such examples, in which the presupposition of <italic>too</italic> is accommodated, play an important role in the debate on the nature of additive presupposition. In the remainder of the article, however, we only look at examples where the presupposition is not accommodated. In other words, in all the cases we look at, it is evident that some memory process is involved.</p></fn>
<fn id="n2"><p>This does not mean, however, that the notion of distance is not applicable to the general case. Even in cases such as (8), one can identify a point at which enough information has been accumulated for the relevant entailment to be satisfied. Precise identification might be difficult, because the presupposition triggered by <italic>too</italic> does not need an exact match to be resolved, but it is possible to identify the last contributing expression. E.g. in (8), adding material after <italic>hickory nuts</italic> will lengthen the distance between the antecedent and the trigger.</p></fn>
<fn id="n3"><p>The connection between phrases and predicates is not always trivial. For our items, it is enough to know at which point the predicate is established in discourse, so that it can serve as an antecedent. This also means that the same point in the discourse can establish multiple predicates, for instance, in cases where a transitive verb phrase contains a noun denoting a predicate.</p></fn>
<fn id="n4"><p>We used Alex Drummond&#8217;s Ibex farm (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://ibex.spellout.net">https://ibex.spellout.net</ext-link>), which is no longer operational. PCIbex farm (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://farm.pcibex.net/">https://farm.pcibex.net/</ext-link>) could be used to re-implement the experiments in their original form.</p></fn>
<fn id="n5"><p>Item 32 was not used in the actual experiment, because of a coding mistake. We became aware of this after the data was collected.</p></fn>
<fn id="n6"><p>Obviously, predicates cannot be equated with VPs in the general case. See the discussion in 2.3.</p></fn>
<fn id="n7"><p>There were also other differences. In particular, for the SPR experiments stability of the estimates was checked for each value of <italic>&#963;</italic> separately, while for the AJ experiments this was done only once. See the OSF repository for details.</p></fn>
<fn id="n8"><p>A more complete model for the pooled data would also account for the source of the data, i.e. which experiment the data comes from. Such a model would assume an additional group <italic>experiment</italic> and would estimate the parameters for that group also. However, when we tried this, we encountered issues with the convergence of the model, and hence we decided to use the model described in the text instead.</p></fn>
<fn id="n9"><p>We are grateful to an anonymous reviewer for suggesting this explanation.</p></fn>
<fn id="n10"><p>Note, that there are some exceptions to the obligatoriness of <italic>too</italic>. For instance, <italic>too</italic> is not obligatory in lists: <italic>Sue lives in Paris, Anne lives in Paris, Bo lives in Paris, all my friends live in Paris! Too</italic> can also be skipped when the fragment continues, e.g. <italic>&#8230; Sue lives in France but commutes to Rome</italic>. is fine. None of the examples that we looked at in this work are like this, however.</p></fn>
<fn id="n11"><p>We would like to thank an anonymous reviewer for discussion.</p></fn>
</fn-group>
<ref-list>
<ref id="B1"><label>1</label><mixed-citation publication-type="journal"><string-name><surname>Anderson</surname>, <given-names>J. R.</given-names></string-name> (<year>1991</year>). <article-title>Is human cognition adaptive?</article-title> <source>Behavioral and Brain Sciences</source>, <volume>14</volume>(<issue>3</issue>), <fpage>471</fpage>&#8211;<lpage>485</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S0140525X00070801</pub-id></mixed-citation></ref>
<ref id="B2"><label>2</label><mixed-citation publication-type="journal"><string-name><surname>Anderson</surname>, <given-names>J. R.</given-names></string-name> (<year>2007</year>). <article-title>Cognitive architecture</article-title>. <source>How Can the Human Mind Occur in the Physical Universe?</source> <fpage>344</fpage>. DOI: <pub-id pub-id-type="doi">10.1093/acprof:oso/9780195324259.003.0001</pub-id></mixed-citation></ref>
<ref id="B3"><label>3</label><mixed-citation publication-type="book"><string-name><surname>Anderson</surname>, <given-names>J. R.</given-names></string-name>, &amp; <string-name><surname>Lebiere</surname>, <given-names>C.</given-names></string-name> (<year>1998</year>). <source>The atomic components of thought</source>. <publisher-name>Lawrence Erlbaum Associates</publisher-name>.</mixed-citation></ref>
<ref id="B4"><label>4</label><mixed-citation publication-type="book"><string-name><surname>Asher</surname>, <given-names>N.</given-names></string-name>, &amp; <string-name><surname>Lascarides</surname>, <given-names>A.</given-names></string-name> (<year>2003</year>). <source>Logics of conversation</source>. <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation></ref>
<ref id="B5"><label>5</label><mixed-citation publication-type="journal"><string-name><surname>Bade</surname>, <given-names>N.</given-names></string-name> (<year>2014</year>). <article-title>Obligatory implicatures and the presupposition of &#8220;too&#8221;</article-title>. <source>Proceedings of Sinn und Bedeutung</source>, <volume>18</volume>, <fpage>42</fpage>&#8211;<lpage>59</lpage>.</mixed-citation></ref>
<ref id="B6"><label>6</label><mixed-citation publication-type="thesis"><string-name><surname>Bade</surname>, <given-names>N.</given-names></string-name> (<year>2016</year>). <source>Obligatory presupposition triggers in discourse: Empirical investigations of the theories maximize presupposition and obligatory implicatures</source> [Doctoral Thesis]. <publisher-name>University of Tuebingen</publisher-name>.</mixed-citation></ref>
<ref id="B7"><label>7</label><mixed-citation publication-type="journal"><string-name><surname>Bartek</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Lewis</surname>, <given-names>R. L.</given-names></string-name>, <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Smith</surname>, <given-names>M. R.</given-names></string-name> (<year>2011</year>). <article-title>In search of on-line locality effects in sentence comprehension</article-title>. <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>, <volume>37</volume>(<issue>5</issue>), <elocation-id>1178</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1037/a0024194</pub-id></mixed-citation></ref>
<ref id="B8"><label>8</label><mixed-citation publication-type="book"><string-name><surname>Brasoveanu</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Dotla&#269;il</surname>, <given-names>J.</given-names></string-name> (<year>2020</year>). <source>Computational cognitive modeling and linguistic theory</source>. <publisher-name>Springer (Open Access)</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1007/978-3-030-31846-8</pub-id></mixed-citation></ref>
<ref id="B9"><label>9</label><mixed-citation publication-type="journal"><string-name><surname>Brown</surname>, <given-names>G. D.</given-names></string-name>, <string-name><surname>Neath</surname>, <given-names>I.</given-names></string-name>, &amp; <string-name><surname>Chater</surname>, <given-names>N.</given-names></string-name> (<year>2007</year>). <article-title>A temporal ratio model of memory</article-title>. <source>Psychological Review</source>, <volume>114</volume>(<issue>3</issue>), <fpage>539</fpage>. DOI: <pub-id pub-id-type="doi">10.1037/0033-295X.114.3.539</pub-id></mixed-citation></ref>
<ref id="B10"><label>10</label><mixed-citation publication-type="book"><string-name><surname>B&#252;ring</surname>, <given-names>D.</given-names></string-name> (<year>1999</year>). <chapter-title>Topic: Linguistic, cognitive, and computational perspectives</chapter-title>. In <string-name><given-names>P.</given-names> <surname>Bosch</surname></string-name> &amp; <string-name><given-names>R.</given-names> <surname>van der Sandt</surname></string-name> (Eds.), <source>Focus</source> (pp. <fpage>142</fpage>&#8211;<lpage>165</lpage>). <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation></ref>
<ref id="B11"><label>11</label><mixed-citation publication-type="journal"><string-name><surname>B&#252;rkner</surname>, <given-names>P.-C.</given-names></string-name> (<year>2017</year>). <article-title>brms: An R package for Bayesian multilevel models using Stan</article-title>. <source>Journal of Statistical Software</source>, <volume>80</volume>(<issue>1</issue>), <fpage>1</fpage>&#8211;<lpage>28</lpage>. DOI: <pub-id pub-id-type="doi">10.18637/jss.v080.i01</pub-id></mixed-citation></ref>
<ref id="B12"><label>12</label><mixed-citation publication-type="journal"><string-name><surname>B&#252;rkner</surname>, <given-names>P.-C.</given-names></string-name>, &amp; <string-name><surname>Vuorre</surname>, <given-names>M.</given-names></string-name> (<year>2019</year>). <article-title>Ordinal regression models in psychology: A tutorial</article-title>. <source>Advances in Methods and Practices in Psychological Science</source>, <volume>2</volume>(<issue>1</issue>), <fpage>77</fpage>&#8211;<lpage>101</lpage>. DOI: <pub-id pub-id-type="doi">10.1177/2515245918823199</pub-id></mixed-citation></ref>
<ref id="B13"><label>13</label><mixed-citation publication-type="journal"><string-name><surname>Chemla</surname>, <given-names>E.</given-names></string-name>, &amp; <string-name><surname>Bott</surname>, <given-names>L.</given-names></string-name> (<year>2013</year>). <article-title>Processing presuppositions: Dynamic semantics vs pragmatic enrichment</article-title>. <source>Language and Cognitive Processes</source>, <volume>28</volume>(<issue>3</issue>), <fpage>241</fpage>&#8211;<lpage>260</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/01690965.2011.615221</pub-id></mixed-citation></ref>
<ref id="B14"><label>14</label><mixed-citation publication-type="journal"><string-name><surname>Chen</surname>, <given-names>S. Y.</given-names></string-name>, &amp; <string-name><surname>Husband</surname>, <given-names>E. M.</given-names></string-name> (<year>2018</year>). <article-title>Comprehending anaphoric presuppositions involves memory retrieval too</article-title>. <source>Proceedings of the Linguistic Society of America</source>, <volume>3</volume>(<issue>1</issue>), <fpage>1</fpage>&#8211;<lpage>44</lpage>. DOI: <pub-id pub-id-type="doi">10.3765/plsa.v3i1.4288</pub-id></mixed-citation></ref>
<ref id="B15"><label>15</label><mixed-citation publication-type="book"><string-name><surname>Chomsky</surname>, <given-names>N.</given-names></string-name> (<year>1957</year>). <source>Syntactic structures</source>. <publisher-name>De Gruyter Mouton</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1515/9783112316009</pub-id></mixed-citation></ref>
<ref id="B16"><label>16</label><mixed-citation publication-type="journal"><string-name><surname>Cunnings</surname>, <given-names>I.</given-names></string-name>, &amp; <string-name><surname>Sturt</surname>, <given-names>P.</given-names></string-name> (<year>2018</year>). <article-title>Retrieval interference and semantic interpretation</article-title>. <source>Journal of Memory and Language</source>, <volume>102</volume>, <fpage>16</fpage>&#8211;<lpage>27</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2018.05.001</pub-id></mixed-citation></ref>
<ref id="B17"><label>17</label><mixed-citation publication-type="journal"><string-name><surname>Degen</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Tanenhaus</surname>, <given-names>M. K.</given-names></string-name> (<year>2016</year>). <article-title>Availability of alternatives and the processing of scalar implicatures: A visual world eye-tracking study</article-title>. <source>Cognitive Science</source>, <volume>40</volume>(<issue>1</issue>), <fpage>172</fpage>&#8211;<lpage>201</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/cogs.12227</pub-id></mixed-citation></ref>
<ref id="B18"><label>18</label><mixed-citation publication-type="journal"><string-name><surname>Demberg</surname>, <given-names>V.</given-names></string-name>, &amp; <string-name><surname>Keller</surname>, <given-names>F.</given-names></string-name> (<year>2008</year>). <article-title>Data from eye-tracking corpora as evidence for theories of syntactic processing complexity</article-title>. <source>Cognition</source>, <volume>109</volume>(<issue>2</issue>), <fpage>193</fpage>&#8211;<lpage>210</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.cognition.2008.07.008</pub-id></mixed-citation></ref>
<ref id="B19"><label>19</label><mixed-citation publication-type="journal"><string-name><surname>Dillon</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Clifton</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Frazier</surname>, <given-names>L.</given-names></string-name> (<year>2014</year>). <article-title>Pushed aside: Parentheticals, memory and processing</article-title>. <source>Language, Cognition and Neuroscience</source>, <volume>29</volume>(<issue>4</issue>), <fpage>483</fpage>&#8211;<lpage>498</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/01690965.2013.866684</pub-id></mixed-citation></ref>
<ref id="B20"><label>20</label><mixed-citation publication-type="journal"><string-name><surname>Dillon</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Mishler</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Sloggett</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Phillips</surname>, <given-names>C.</given-names></string-name> (<year>2013</year>). <article-title>Contrasting intrusion profiles for agreement and anaphora: Experimental and modeling evidence</article-title>. <source>Journal of Memory and Language</source>, <volume>69</volume>(<issue>2</issue>), <fpage>85</fpage>&#8211;<lpage>103</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2013.04.003</pub-id></mixed-citation></ref>
<ref id="B21"><label>21</label><mixed-citation publication-type="journal"><string-name><surname>Dotla&#269;il</surname>, <given-names>J.</given-names></string-name> (<year>2021</year>). <article-title>Parsing as a cue-based retrieval model</article-title>. <source>Cognitive Science</source>, <volume>45</volume>(<issue>8</issue>). DOI: <pub-id pub-id-type="doi">10.1111/cogs.13020</pub-id></mixed-citation></ref>
<ref id="B22"><label>22</label><mixed-citation publication-type="journal"><string-name><surname>Ferreira</surname>, <given-names>F.</given-names></string-name>, &amp; <string-name><surname>Henderson</surname>, <given-names>J.</given-names></string-name> (<year>1993</year>). <article-title>Reading processes during syntactic analysis and reanalysis</article-title>. <source>Canadian Journal of Experimental Psychology</source>, <volume>47</volume>(<issue>2</issue>), <fpage>247</fpage>&#8211;<lpage>275</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/h0078819</pub-id></mixed-citation></ref>
<ref id="B23"><label>23</label><mixed-citation publication-type="journal"><string-name><surname>Fox</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Katzir</surname>, <given-names>R.</given-names></string-name> (<year>2011</year>). <article-title>On the characterization of alternatives</article-title>. <source>Natural Language Semantics</source>, <volume>19</volume>(<issue>1</issue>), <fpage>87</fpage>&#8211;<lpage>107</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/s11050-010-9065-3</pub-id></mixed-citation></ref>
<ref id="B24"><label>24</label><mixed-citation publication-type="journal"><string-name><surname>Futrell</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name>, &amp; <string-name><surname>Levy</surname>, <given-names>R. P.</given-names></string-name> (<year>2020</year>). <article-title>Lossy-context surprisal: An information-theoretic model of memory effects in sentence processing</article-title>. <source>Cognitive Science</source>, <volume>44</volume>(<issue>3</issue>), <elocation-id>e12814</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1111/cogs.12814</pub-id></mixed-citation></ref>
<ref id="B25"><label>25</label><mixed-citation publication-type="journal"><string-name><surname>Futrell</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Tily</surname>, <given-names>H. J.</given-names></string-name>, <string-name><surname>Blank</surname>, <given-names>I.</given-names></string-name>, <string-name><surname>Vishnevetsky</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Piantadosi</surname>, <given-names>S. T.</given-names></string-name>, &amp; <string-name><surname>Fedorenko</surname>, <given-names>E.</given-names></string-name> (<year>2021</year>). <article-title>The natural stories corpus: A reading-time corpus of English texts containing rare syntactic constructions</article-title>. <source>Language Resources and Evaluation</source>, <volume>55</volume>, <fpage>63</fpage>&#8211;<lpage>77</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/s10579-020-09503-7</pub-id></mixed-citation></ref>
<ref id="B26"><label>26</label><mixed-citation publication-type="book"><string-name><surname>Gelman</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Carlin</surname>, <given-names>J. B.</given-names></string-name>, <string-name><surname>Stern</surname>, <given-names>H. S.</given-names></string-name>, <string-name><surname>Vehtari</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Dunson</surname>, <given-names>D. B.</given-names></string-name>, &amp; <string-name><surname>Rubin</surname>, <given-names>D. B.</given-names></string-name> (<year>2003</year>). <source>Bayesian data analysis</source>. <publisher-name>Chapman; Hall/CRC</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1201/9780429258480</pub-id></mixed-citation></ref>
<ref id="B27"><label>27</label><mixed-citation publication-type="book"><string-name><surname>Gelman</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Hill</surname>, <given-names>J.</given-names></string-name> (<year>2006</year>). <source>Data analysis using regression and multilevel/hierarchical models</source>. <publisher-name>Cambridge University Press</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1017/CBO9780511790942</pub-id></mixed-citation></ref>
<ref id="B28"><label>28</label><mixed-citation publication-type="book"><string-name><surname>Geurts</surname>, <given-names>B.</given-names></string-name> (<year>1999</year>). <source>Presuppositions and pronouns</source>. <publisher-name>Elsevier</publisher-name>.</mixed-citation></ref>
<ref id="B29"><label>29</label><mixed-citation publication-type="journal"><string-name><surname>Geurts</surname>, <given-names>B.</given-names></string-name>, &amp; <string-name><surname>van der Sandt</surname>, <given-names>R.</given-names></string-name> (<year>2004</year>). <article-title>Interpreting focus</article-title>. <source>Theoretical Linguistics</source>. DOI: <pub-id pub-id-type="doi">10.1515/thli.2004.005</pub-id></mixed-citation></ref>
<ref id="B30"><label>30</label><mixed-citation publication-type="journal"><string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name> (<year>2000</year>). <article-title>The dependency locality theory: A distance-based theory of linguistic complexity</article-title>. <source>Image, Language, Brain</source>, <volume>2000</volume>, <fpage>95</fpage>&#8211;<lpage>126</lpage>. DOI: <pub-id pub-id-type="doi">10.7551/mitpress/3654.003.0008</pub-id></mixed-citation></ref>
<ref id="B31"><label>31</label><mixed-citation publication-type="journal"><string-name><surname>Grodner</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name> (<year>2005</year>). <article-title>Consequences of the serial nature of linguistic input for sentenial complexity</article-title>. <source>Cognitive Science</source>, <volume>29</volume>(<issue>2</issue>), <fpage>261</fpage>&#8211;<lpage>290</lpage>. DOI: <pub-id pub-id-type="doi">10.1207/s15516709cog0000_7</pub-id></mixed-citation></ref>
<ref id="B32"><label>32</label><mixed-citation publication-type="thesis"><string-name><surname>Heim</surname>, <given-names>I.</given-names></string-name> (<year>1982</year>). <source>The semantics of definite and indefinite noun phrases</source> [Doctoral dissertation, <publisher-name>UMass Amherst</publisher-name>].</mixed-citation></ref>
<ref id="B33"><label>33</label><mixed-citation publication-type="journal"><string-name><surname>Huang</surname>, <given-names>Y. T.</given-names></string-name>, &amp; <string-name><surname>Snedeker</surname>, <given-names>J.</given-names></string-name> (<year>2011</year>). <article-title>Logic and conversation revisited: Evidence for a division between semantic and pragmatic content in real-time language comprehension</article-title>. <source>Language and Cognitive Processes</source>, <volume>26</volume>(<issue>8</issue>), <fpage>1161</fpage>&#8211;<lpage>1172</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/01690965.2010.508641</pub-id></mixed-citation></ref>
<ref id="B34"><label>34</label><mixed-citation publication-type="journal"><string-name><surname>J&#228;ger</surname>, <given-names>L. A.</given-names></string-name>, <string-name><surname>Engelmann</surname>, <given-names>F.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2017</year>). <article-title>Similarity-based interference in sentence comprehension: Literature review and Bayesian meta-analysis</article-title>. <source>Journal of Memory and Language</source>, <volume>94</volume>, <fpage>316</fpage>&#8211;<lpage>339</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2017.01.004</pub-id></mixed-citation></ref>
<ref id="B35"><label>35</label><mixed-citation publication-type="journal"><string-name><surname>J&#228;ger</surname>, <given-names>L. A.</given-names></string-name>, <string-name><surname>Mertzen</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Van Dyke</surname>, <given-names>J. A.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2020</year>). <article-title>Interference patterns in subject-verb agreement and reflexives revisited: A large-sample study</article-title>. <source>Journal of Memory and Language</source>, <volume>111</volume>, <elocation-id>104063</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2019.104063</pub-id></mixed-citation></ref>
<ref id="B36"><label>36</label><mixed-citation publication-type="journal"><string-name><surname>Just</surname>, <given-names>M. A.</given-names></string-name>, <string-name><surname>Carpenter</surname>, <given-names>P. A.</given-names></string-name>, &amp; <string-name><surname>Woolley</surname>, <given-names>J. D.</given-names></string-name> (<year>1982</year>). <article-title>Paradigms and processes in reading comprehension</article-title>. <source>Journal of Experimental Psychology: General</source>, <volume>111</volume>(<issue>2</issue>), <fpage>228</fpage>&#8211;<lpage>238</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0096-3445.111.2.228</pub-id></mixed-citation></ref>
<ref id="B37"><label>37</label><mixed-citation publication-type="book"><string-name><surname>Kamp</surname>, <given-names>H.</given-names></string-name> (<year>1981</year>). <chapter-title>A theory of truth and semantic representation</chapter-title>. In <string-name><given-names>J.</given-names> <surname>Groenendijk</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Janssen</surname></string-name>, &amp; <string-name><given-names>M.</given-names> <surname>Stokhof</surname></string-name> (Eds.), <source>Formal methods in the study of language</source> (pp. <fpage>277</fpage>&#8211;<lpage>322</lpage>). <publisher-name>Mathematical Centre Tracts</publisher-name>.</mixed-citation></ref>
<ref id="B38"><label>38</label><mixed-citation publication-type="book"><string-name><surname>Kamp</surname>, <given-names>H.</given-names></string-name>, &amp; <string-name><surname>Reyle</surname>, <given-names>U.</given-names></string-name> (<year>1993</year>). <source>From discourse to logic: Introduction to modeltheoretic semantics of natural language, formal logic and discourse representation theory</source>. <publisher-name>Kluwer Academic Publishers</publisher-name>.</mixed-citation></ref>
<ref id="B39"><label>39</label><mixed-citation publication-type="journal"><string-name><surname>Kiss</surname>, <given-names>K. &#201;.</given-names></string-name> (<year>1998</year>). <article-title>Identificational focus versus information focus</article-title>. <source>Language</source>, <volume>74</volume>(<issue>2</issue>), <fpage>245</fpage>&#8211;<lpage>273</lpage>. DOI: <pub-id pub-id-type="doi">10.1353/lan.1998.0211</pub-id></mixed-citation></ref>
<ref id="B40"><label>40</label><mixed-citation publication-type="journal"><string-name><surname>Konieczny</surname>, <given-names>L.</given-names></string-name> (<year>2000</year>). <article-title>Locality and parsing complexity</article-title>. <source>Journal of Psycholinguistic Research</source>, <volume>29</volume>(<issue>6</issue>), <fpage>627</fpage>&#8211;<lpage>645</lpage>. DOI: <pub-id pub-id-type="doi">10.1023/A:1026528912821</pub-id></mixed-citation></ref>
<ref id="B41"><label>41</label><mixed-citation publication-type="journal"><string-name><surname>Krifka</surname>, <given-names>M.</given-names></string-name> (<year>1998</year>). <article-title>Additive particles under stress</article-title>. <source>Proceedings of Semantics and Linguistic Theory</source>, <volume>8</volume>(<issue>8</issue>), <fpage>111</fpage>&#8211;<lpage>128</lpage>. DOI: <pub-id pub-id-type="doi">10.3765/salt.v8i0.2799</pub-id></mixed-citation></ref>
<ref id="B42"><label>42</label><mixed-citation publication-type="journal"><string-name><surname>Krifka</surname>, <given-names>M.</given-names></string-name> (<year>2008</year>). <article-title>Basic notions of information structure</article-title>. <source>Acta Linguistica Hungarica</source>, <volume>55</volume>(<issue>3&#8211;4</issue>), <fpage>243</fpage>&#8211;<lpage>276</lpage>. DOI: <pub-id pub-id-type="doi">10.1556/ALing.55.2008.3-4.2</pub-id></mixed-citation></ref>
<ref id="B43"><label>43</label><mixed-citation publication-type="journal"><string-name><surname>Kripke</surname>, <given-names>S. A.</given-names></string-name> (<year>2009</year>). <article-title>Presupposition and anaphora: Remarks on the formulation of the projection problem</article-title>. <source>Linguistic Inquiry</source>, <volume>40</volume>(<issue>3</issue>), <fpage>367</fpage>&#8211;<lpage>386</lpage>. DOI: <pub-id pub-id-type="doi">10.1162/ling.2009.40.3.367</pub-id></mixed-citation></ref>
<ref id="B44"><label>44</label><mixed-citation publication-type="journal"><string-name><surname>Kush</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Dillon</surname>, <given-names>B.</given-names></string-name> (<year>2021</year>). <article-title>Principle B constrains the processing of cataphora: Evidence for syntactic and discourse predictions</article-title>. <source>Journal of Memory and Language</source>, <volume>120</volume>, <elocation-id>104254</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2021.104254</pub-id></mixed-citation></ref>
<ref id="B45"><label>45</label><mixed-citation publication-type="journal"><string-name><surname>Kush</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Eik</surname>, <given-names>R.</given-names></string-name> (<year>2019</year>). <article-title>Antecedent accessibility and exceptional covariation: Evidence from Norwegian donkey pronouns</article-title>. <source>Glossa: a journal of general linguistics</source>, <volume>4</volume>(<issue>1</issue>). DOI: <pub-id pub-id-type="doi">10.5334/gjgl.930</pub-id></mixed-citation></ref>
<ref id="B46"><label>46</label><mixed-citation publication-type="journal"><string-name><surname>Laurinavichyute</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>von der Malsburg</surname>, <given-names>T.</given-names></string-name> (<year>2022</year>). <article-title>Semantic attraction in sentence comprehension</article-title>. <source>Cognitive Science</source>, <volume>46</volume>(<issue>2</issue>). DOI: <pub-id pub-id-type="doi">10.1111/cogs.13086</pub-id></mixed-citation></ref>
<ref id="B47"><label>47</label><mixed-citation publication-type="journal"><string-name><surname>Levy</surname>, <given-names>R. P.</given-names></string-name>, &amp; <string-name><surname>Keller</surname>, <given-names>F.</given-names></string-name> (<year>2013</year>). <article-title>Expectation and locality effects in German verb-final structures</article-title>. <source>Journal of Memory and Language</source>, <volume>68</volume>(<issue>2</issue>), <fpage>199</fpage>&#8211;<lpage>222</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2012.02.005</pub-id></mixed-citation></ref>
<ref id="B48"><label>48</label><mixed-citation publication-type="journal"><string-name><surname>Lewandowski</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Kurowicka</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Joe</surname>, <given-names>H.</given-names></string-name> (<year>2009</year>). <article-title>Generating random correlation matrices based on vines and extended onion method</article-title>. <source>Journal of Multivariate Analysis</source>, <volume>100</volume>(<issue>9</issue>), <fpage>1989</fpage>&#8211;<lpage>2001</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jmva.2009.04.008</pub-id></mixed-citation></ref>
<ref id="B49"><label>49</label><mixed-citation publication-type="journal"><string-name><surname>Lewis</surname>, <given-names>R. L.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2005</year>). <article-title>An activation-based model of sentence processing as skilled memory retrieval</article-title>. <source>Cognitive Science</source>, <volume>29</volume>(<issue>3</issue>), <fpage>375</fpage>&#8211;<lpage>419</lpage>. DOI: <pub-id pub-id-type="doi">10.1207/s15516709cog0000_25</pub-id></mixed-citation></ref>
<ref id="B50"><label>50</label><mixed-citation publication-type="journal"><string-name><surname>Lewis</surname>, <given-names>R. L.</given-names></string-name>, <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Van Dyke</surname>, <given-names>J. A.</given-names></string-name> (<year>2006</year>). <article-title>Computational principles of working memory in sentence comprehension</article-title>. <source>Trends in Cognitive Sciences</source>, <volume>10</volume>(<issue>10</issue>), <fpage>447</fpage>&#8211;<lpage>454</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.tics.2006.08.007</pub-id></mixed-citation></ref>
<ref id="B51"><label>51</label><mixed-citation publication-type="book"><string-name><surname>McElreath</surname>, <given-names>R.</given-names></string-name> (<year>2018</year>). <source>Statistical rethinking: A Bayesian course with examples in R and Stan</source>. <publisher-name>Chapman; Hall/CRC</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1201/9781315372495</pub-id></mixed-citation></ref>
<ref id="B52"><label>52</label><mixed-citation publication-type="journal"><string-name><surname>McElree</surname>, <given-names>B.</given-names></string-name> (<year>2000</year>). <article-title>Sentence comprehension is mediated by content-addressable memory structures</article-title>. <source>Journal of Psycholinguistic Research</source>, <volume>29</volume>(<issue>2</issue>), <fpage>111</fpage>&#8211;<lpage>123</lpage>. DOI: <pub-id pub-id-type="doi">10.1023/A:1005184709695</pub-id></mixed-citation></ref>
<ref id="B53"><label>53</label><mixed-citation publication-type="journal"><string-name><surname>McElree</surname>, <given-names>B.</given-names></string-name> (<year>2001</year>). <article-title>Working memory and focal attention</article-title>. <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>, <volume>27</volume>(<issue>3</issue>), <fpage>817</fpage>&#8211;<lpage>835</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0278-7393.27.3.817</pub-id></mixed-citation></ref>
<ref id="B54"><label>54</label><mixed-citation publication-type="journal"><string-name><surname>McElree</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Foraker</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Dyer</surname>, <given-names>L.</given-names></string-name> (<year>2003</year>). <article-title>Memory structures that subserve sentence comprehension</article-title>. <source>Journal of Memory and Language</source>, <volume>48</volume>(<issue>1</issue>), <fpage>67</fpage>&#8211;<lpage>91</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0749-596X(02)00515-6</pub-id></mixed-citation></ref>
<ref id="B55"><label>55</label><mixed-citation publication-type="book"><string-name><surname>Miller</surname>, <given-names>G. A.</given-names></string-name>, &amp; <string-name><surname>Chomsky</surname>, <given-names>N.</given-names></string-name> (<year>1963</year>). <chapter-title>Finitary models of language users</chapter-title>. In <string-name><given-names>D.</given-names> <surname>Luce</surname></string-name> (Ed.), <source>Handbook of mathematical psychology</source>. <publisher-name>John Wiley &amp; Sons</publisher-name>.</mixed-citation></ref>
<ref id="B56"><label>56</label><mixed-citation publication-type="journal"><string-name><surname>Navarro</surname>, <given-names>D. J.</given-names></string-name> (<year>2019</year>). <article-title>Between the devil and the deep blue sea: Tensions between scientific judgement and statistical model selection</article-title>. <source>Computational Brain &amp; Behavior</source>, <volume>2</volume>(<issue>1</issue>), <fpage>28</fpage>&#8211;<lpage>34</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/s42113-018-0019-z</pub-id></mixed-citation></ref>
<ref id="B57"><label>57</label><mixed-citation publication-type="journal"><string-name><surname>Nicenboim</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Loga&#269;ev</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Gattei</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2016</year>). <article-title>When high-capacity readers slow down and low-capacity readers speed up: Working memory and locality effects</article-title>. <source>Frontiers in Psychology</source>, <volume>7</volume>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2016.00280</pub-id></mixed-citation></ref>
<ref id="B58"><label>58</label><mixed-citation publication-type="webpage"><string-name><surname>Nicenboim</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Schad</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2021</year>). <source>An introduction to Bayesian data analysis for cognitive science</source>. <uri>https://vasishth.github.io/Bayes_CogSci/</uri></mixed-citation></ref>
<ref id="B59"><label>59</label><mixed-citation publication-type="journal"><string-name><surname>Nicenboim</surname>, <given-names>B.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2018</year>). <article-title>Models of retrieval in sentence comprehension: A computational evaluation using Bayesian hierarchical modeling</article-title>. <source>Journal of Memory and Language</source>, <volume>99</volume>, <fpage>1</fpage>&#8211;<lpage>34</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2017.08.004</pub-id></mixed-citation></ref>
<ref id="B60"><label>60</label><mixed-citation publication-type="journal"><string-name><surname>Nicenboim</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Engelmann</surname>, <given-names>F.</given-names></string-name>, &amp; <string-name><surname>Suckow</surname>, <given-names>K.</given-names></string-name> (<year>2018</year>). <article-title>Exploratory and confirmatory analyses in sentence processing: A case study of number interference in German</article-title>. <source>Cognitive Science</source>, <volume>42</volume>(<issue>S4</issue>), <fpage>1075</fpage>&#8211;<lpage>1100</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/cogs.12589</pub-id></mixed-citation></ref>
<ref id="B61"><label>61</label><mixed-citation publication-type="journal"><string-name><surname>Nicenboim</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Gattei</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Sigman</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Kliegl</surname>, <given-names>R.</given-names></string-name> (<year>2015</year>). <article-title>Working memory differences in long-distance dependency resolution</article-title>. <source>Frontiers in Psychology</source>, <volume>6</volume>, <fpage>312</fpage>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2015.00312</pub-id></mixed-citation></ref>
<ref id="B62"><label>62</label><mixed-citation publication-type="book"><string-name><surname>Nouwen</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Brasoveanu</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>van Eijck</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Visser</surname>, <given-names>A.</given-names></string-name> (<year>2016</year>). <chapter-title>Dynamic semantics</chapter-title>. In <string-name><given-names>E. N.</given-names> <surname>Zalta</surname></string-name> (Ed.), <source>The Stanford encyclopedia of philosophy</source> (Winter 2016). <publisher-name>Metaphysics Research Lab, Stanford University</publisher-name>.</mixed-citation></ref>
<ref id="B63"><label>63</label><mixed-citation publication-type="journal"><string-name><surname>Parker</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Phillips</surname>, <given-names>C.</given-names></string-name> (<year>2016</year>). <article-title>Negative polarity illusions and the format of hierarchical encodings in memory</article-title>. <source>Cognition</source>, <volume>157</volume>, <fpage>321</fpage>&#8211;<lpage>339</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.cognition.2016.08.016</pub-id></mixed-citation></ref>
<ref id="B64"><label>64</label><mixed-citation publication-type="webpage"><collab>R Core Team</collab>. (<year>2021</year>). <source>R: A language and environment for statistical computing</source>. <publisher-name>R Foundation for Statistical Computing</publisher-name>. <uri>https://www.R-project.org/</uri></mixed-citation></ref>
<ref id="B65"><label>65</label><mixed-citation publication-type="book"><string-name><surname>Romoli</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Khan</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Sudo</surname>, <given-names>Y.</given-names></string-name>, &amp; <string-name><surname>Snedeker</surname>, <given-names>J.</given-names></string-name> (<year>2015</year>). <chapter-title>Resolving temporary referential ambiguity using presupposed content</chapter-title>. In <source>Experimental perspectives on presuppositions</source> (pp. <fpage>67</fpage>&#8211;<lpage>87</lpage>). <publisher-name>Springer</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1007/978-3-319-07980-6_3</pub-id></mixed-citation></ref>
<ref id="B66"><label>66</label><mixed-citation publication-type="book"><string-name><surname>Romoli</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Schwarz</surname>, <given-names>F.</given-names></string-name> (<year>2015</year>). <chapter-title>An experimental comparison between presuppositions and indirect scalar implicatures</chapter-title>. In <source>Experimental perspectives on presuppositions</source> (pp. <fpage>215</fpage>&#8211;<lpage>240</lpage>). <publisher-name>Springer</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1007/978-3-319-07980-6_10</pub-id></mixed-citation></ref>
<ref id="B67"><label>67</label><mixed-citation publication-type="journal"><string-name><surname>Rooth</surname>, <given-names>M.</given-names></string-name> (<year>1992</year>). <article-title>A theory of focus interpretation</article-title>. <source>Natural Language Semantics</source>, <volume>1</volume>(<issue>1</issue>), <fpage>75</fpage>&#8211;<lpage>116</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/BF02342617</pub-id></mixed-citation></ref>
<ref id="B68"><label>68</label><mixed-citation publication-type="journal"><string-name><surname>Rouder</surname>, <given-names>J. N.</given-names></string-name>, <string-name><surname>Tuerlinckx</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Speckman</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Lu</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Gomez</surname>, <given-names>P.</given-names></string-name> (<year>2008</year>). <article-title>A hierarchical approach for fitting curves to response time measurements</article-title>. <source>Psychonomic Bulletin &amp; Review</source>, <volume>15</volume>. DOI: <pub-id pub-id-type="doi">10.3758/PBR.15.6.1201</pub-id></mixed-citation></ref>
<ref id="B69"><label>69</label><mixed-citation publication-type="journal"><string-name><surname>Ruys</surname>, <given-names>E. G.</given-names></string-name> (<year>2015</year>). <article-title>On the anaphoricity of too</article-title>. <source>Linguistic Inquiry</source>, <volume>46</volume>(<issue>2</issue>), <fpage>343</fpage>&#8211;<lpage>361</lpage>. DOI: <pub-id pub-id-type="doi">10.1162/LING_a_00184</pub-id></mixed-citation></ref>
<ref id="B70"><label>70</label><mixed-citation publication-type="journal"><string-name><surname>S&#230;b&#248;</surname>, <given-names>K. J.</given-names></string-name> (<year>2004</year>). <article-title>Conversational contrast and conventional parallel: Topic implicatures and additive presuppositions</article-title>. <source>Journal of Semantics</source>, <volume>21</volume>(<issue>2</issue>), <fpage>199</fpage>&#8211;<lpage>217</lpage>. DOI: <pub-id pub-id-type="doi">10.1093/jos/21.2.199</pub-id></mixed-citation></ref>
<ref id="B71"><label>71</label><mixed-citation publication-type="journal"><string-name><surname>Schad</surname>, <given-names>D. J.</given-names></string-name>, <string-name><surname>Nicenboim</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>B&#252;rkner</surname>, <given-names>P.-C.</given-names></string-name>, <string-name><surname>Betancourt</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2021</year>). <article-title>Workflow techniques for the robust use of Bayes factors</article-title>. DOI: <pub-id pub-id-type="doi">10.48550/arXiv.2103.08744</pub-id></mixed-citation></ref>
<ref id="B72"><label>72</label><mixed-citation publication-type="journal"><string-name><surname>Schlenker</surname>, <given-names>P.</given-names></string-name> (<year>2008</year>). <article-title>Be articulate: A pragmatic theory of presupposition projection</article-title>. <source>Theoretical Linguistics</source>, <volume>34</volume>. DOI: <pub-id pub-id-type="doi">10.1515/THLI.2008.013</pub-id></mixed-citation></ref>
<ref id="B73"><label>73</label><mixed-citation publication-type="journal"><string-name><surname>Schmitz</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Winkowski</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Hoeks</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Nouwen</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Dotla&#269;il</surname>, <given-names>J.</given-names></string-name> (<year>2024</year>). <article-title>Semantic accessibility and interference in pronoun resolution</article-title>. <source>Glossa Psycholinguistics</source>, <volume>3</volume>(<issue>1</issue>). DOI: <pub-id pub-id-type="doi">10.5070/G60111424</pub-id></mixed-citation></ref>
<ref id="B74"><label>74</label><mixed-citation publication-type="journal"><string-name><surname>Schwarz</surname>, <given-names>F.</given-names></string-name> (<year>2007</year>). <article-title>Processing presupposed content</article-title>. <source>Journal of Semantics</source>, <volume>24</volume>(<issue>4</issue>), <fpage>373</fpage>&#8211;<lpage>416</lpage>. DOI: <pub-id pub-id-type="doi">10.1093/jos/ffm011</pub-id></mixed-citation></ref>
<ref id="B75"><label>75</label><mixed-citation publication-type="book"><string-name><surname>Schwarz</surname>, <given-names>F.</given-names></string-name> (<year>2015</year>). <chapter-title>Presuppositions vs. asserted content in online processing</chapter-title>. In <source>Experimental perspectives on presuppositions</source> (pp. <fpage>89</fpage>&#8211;<lpage>108</lpage>). <publisher-name>Springer</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1007/978-3-319-07980-6_4</pub-id></mixed-citation></ref>
<ref id="B76"><label>76</label><mixed-citation publication-type="webpage"><collab>Stan Development Team</collab>. (<year>2021</year>). <source>Stan modeling language users guide and reference manual</source>. Version 2.27. <uri>https://mc-stan.org</uri></mixed-citation></ref>
<ref id="B77"><label>77</label><mixed-citation publication-type="webpage"><string-name><surname>Staum Casasanto</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Hofmeister</surname>, <given-names>P.</given-names></string-name>, &amp; <string-name><surname>Sag</surname>, <given-names>I. A.</given-names></string-name> (<year>2010</year>). <article-title>Understanding acceptability judgments: Additivity and working memory effects</article-title>. <source>Proceedings of the Annual Meeting of the Cognitive Science Society</source>, <volume>32</volume>. <uri>https://escholarship.org/uc/item/08400553</uri></mixed-citation></ref>
<ref id="B78"><label>78</label><mixed-citation publication-type="webpage"><string-name><surname>Szabolcsi</surname>, <given-names>A.</given-names></string-name> (<year>2017</year>). <article-title>Additive presuppositions are derived through activating focus alternatives</article-title>. In <string-name><given-names>A.</given-names> <surname>Cremers</surname></string-name>, <string-name><given-names>T.</given-names> <surname>van Gessel</surname></string-name>, &amp; <string-name><given-names>F.</given-names> <surname>Roelofsen</surname></string-name> (Eds.), <source>Proceedings of the 21st Amsterdam colloquium</source> (pp. <fpage>455</fpage>&#8211;<lpage>464</lpage>). <uri>https://semanticsarchive.net/Archive/jZiM2FhZ/AC2017-Proceedings.pdf</uri></mixed-citation></ref>
<ref id="B79"><label>79</label><mixed-citation publication-type="book"><string-name><surname>Tesni&#232;re</surname>, <given-names>L.</given-names></string-name> (<year>1965</year>). <source>&#201;l&#233;ments de syntaxe structurale</source> (<elocation-id>2nd</elocation-id> ed.). <publisher-name>Klincksieck</publisher-name>.</mixed-citation></ref>
<ref id="B80"><label>80</label><mixed-citation publication-type="journal"><string-name><surname>Tonhauser</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Beaver</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Roberts</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Simons</surname>, <given-names>M.</given-names></string-name> (<year>2013</year>). <article-title>Toward a taxonomy of projective content</article-title>. <source>Language</source>, <fpage>66</fpage>&#8211;<lpage>109</lpage>. DOI: <pub-id pub-id-type="doi">10.1353/lan.2013.0001</pub-id></mixed-citation></ref>
<ref id="B81"><label>81</label><mixed-citation publication-type="journal"><string-name><surname>van der Sandt</surname>, <given-names>R.</given-names></string-name> (<year>1992</year>). <article-title>Presupposition projection as anaphora resolution</article-title>. <source>Journal of Semantics</source>, <volume>9</volume>(<issue>4</issue>), <fpage>333</fpage>&#8211;<lpage>377</lpage>. DOI: <pub-id pub-id-type="doi">10.1093/jos/9.4.333</pub-id></mixed-citation></ref>
<ref id="B82"><label>82</label><mixed-citation publication-type="book"><string-name><surname>van der Sandt</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Geurts</surname>, <given-names>B.</given-names></string-name> (<year>1991</year>). <chapter-title>Presupposition, anaphora, and lexical content</chapter-title>. In <string-name><given-names>O.</given-names> <surname>Herzog</surname></string-name> &amp; <string-name><given-names>C.-R.</given-names> <surname>Rollinger</surname></string-name> (Eds.), <source>Text understanding in LILOG: Integrating computational linguistics and artificial intelligence. Final report on the IBM Germany LILOG-project</source> (pp. <fpage>259</fpage>&#8211;<lpage>296</lpage>). <publisher-name>Springer</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1007/3-540-54594-8_65</pub-id></mixed-citation></ref>
<ref id="B83"><label>83</label><mixed-citation publication-type="book"><string-name><surname>van der Sandt</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Geurts</surname>, <given-names>B.</given-names></string-name> (<year>2001</year>). <chapter-title>Too</chapter-title>. In <string-name><given-names>R.</given-names> <surname>van Rooy</surname></string-name> &amp; <string-name><given-names>M.</given-names> <surname>Stokhof</surname></string-name> (Eds.), <source>Proceedings of the 13th Amsterdam colloquium</source> (pp. <fpage>180</fpage>&#8211;<lpage>185</lpage>). <publisher-name>University of Amsterdam</publisher-name>.</mixed-citation></ref>
<ref id="B84"><label>84</label><mixed-citation publication-type="journal"><string-name><surname>van Doorn</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>van den Bergh</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>B&#246;hm</surname>, <given-names>U.</given-names></string-name>, <string-name><surname>Dablander</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Derks</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Draws</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Etz</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Evans</surname>, <given-names>N. J.</given-names></string-name>, <string-name><surname>Gronau</surname>, <given-names>Q. F.</given-names></string-name>, <string-name><surname>Haaf</surname>, <given-names>J. M.</given-names></string-name>, et al. (<year>2021</year>). <article-title>The JASP guidelines for conducting and reporting a Bayesian analysis</article-title>. <source>Psychonomic Bulletin &amp; Review</source>, <volume>28</volume>(<issue>3</issue>), <fpage>813</fpage>&#8211;<lpage>826</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/s13423-020-01798-5</pub-id></mixed-citation></ref>
<ref id="B85"><label>85</label><mixed-citation publication-type="journal"><string-name><surname>Van Dyke</surname>, <given-names>J. A.</given-names></string-name> (<year>2007</year>). <article-title>Interference effects from grammatically unavailable constituents during sentence processing</article-title>. <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>, <volume>33</volume>(<issue>2</issue>), <fpage>407</fpage>. DOI: <pub-id pub-id-type="doi">10.1037/0278-7393.33.2.407</pub-id></mixed-citation></ref>
<ref id="B86"><label>86</label><mixed-citation publication-type="journal"><string-name><surname>Van Dyke</surname>, <given-names>J. A.</given-names></string-name>, &amp; <string-name><surname>McElree</surname>, <given-names>B.</given-names></string-name> (<year>2011</year>). <article-title>Cue-dependent interference in comprehension</article-title>. <source>Journal of Memory and Language</source>, <volume>65</volume>(<issue>3</issue>), <fpage>247</fpage>&#8211;<lpage>263</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2011.05.002</pub-id></mixed-citation></ref>
<ref id="B87"><label>87</label><mixed-citation publication-type="journal"><string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Lewis</surname>, <given-names>R. L.</given-names></string-name> (<year>2006</year>). <article-title>Argument-head distance and processing complexity: Explaining both locality and antilocality effects</article-title>. <source>Language</source>, <volume>82</volume>(<issue>4</issue>), <fpage>767</fpage>&#8211;<lpage>794</lpage>. DOI: <pub-id pub-id-type="doi">10.1353/lan.2006.0236</pub-id></mixed-citation></ref>
<ref id="B88"><label>88</label><mixed-citation publication-type="journal"><string-name><surname>Ver&#237;ssimo</surname>, <given-names>J.</given-names></string-name> (<year>2021</year>). <article-title>Analysis of rating scales: A pervasive problem in bilingualism research and a solution with Bayesian ordinal models</article-title>. <source>Bilingualism: Language and Cognition</source>, <volume>24</volume>(<issue>5</issue>), <fpage>842</fpage>&#8211;<lpage>848</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S1366728921000316</pub-id></mixed-citation></ref>
<ref id="B89"><label>89</label><mixed-citation publication-type="journal"><string-name><surname>Wagers</surname>, <given-names>M. W.</given-names></string-name>, <string-name><surname>Lau</surname>, <given-names>E. F.</given-names></string-name>, &amp; <string-name><surname>Phillips</surname>, <given-names>C.</given-names></string-name> (<year>2009</year>). <article-title>Agreement attraction in comprehension: Representations and processes</article-title>. <source>Journal of Memory and Language</source>, <volume>61</volume>(<issue>2</issue>), <fpage>206</fpage>&#8211;<lpage>237</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2009.04.002</pub-id></mixed-citation></ref>
</ref-list>
</back>
</article>