<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.2" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id journal-id-type="issn">2767-0279</journal-id>
<journal-title-group>
<journal-title>Glossa Psycholinguistics</journal-title>
</journal-title-group>
<issn pub-type="epub">2767-0279</issn>
<publisher>
<publisher-name>eScholarship Publishing</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5070/G601120916</article-id>
<article-categories>
<subj-group>
<subject>Regular article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Revisiting processing complexity of nested and cross-serial dependencies</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Yadav</surname>
<given-names>Himanshu</given-names>
</name>
<email>himanshu@iitk.ac.in</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Frank</surname>
<given-names>Stefan L.</given-names>
</name>
<email>stefan.frank@ru.nl</email>
<xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Futrell</surname>
<given-names>Richard</given-names>
</name>
<email>rfutrell@uci.edu</email>
<xref ref-type="aff" rid="aff-3">3</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Husain</surname>
<given-names>Samar</given-names>
</name>
<email>samar@hss.iitd.ac.in</email>
<xref ref-type="aff" rid="aff-4">4</xref>
</contrib>
</contrib-group>
<aff id="aff-1"><label>1</label>Indian Institute of Technology Kanpur, Kanpur, India</aff>
<aff id="aff-2"><label>2</label>Radboud University, Nijmegen, the Netherlands</aff>
<aff id="aff-3"><label>3</label>University of California Irvine, Irvine (CA), USA</aff>
<aff id="aff-4"><label>4</label>Indian Institute of Technology Delhi, Delhi, India</aff>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2025-01-30">
<day>30</day>
<month>01</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<volume>4</volume>
<issue>1</issue>
<elocation-id>8</elocation-id>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2025 The Author(s)</copyright-statement>
<copyright-year>2025</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="https://glossapsycholinguistics.journalpub.escholarship.org/articles/10.5070/G601120916/"/>
<abstract>
<p>In two web-based experiments, we compare comprehension difficulty between Dutch and German sentences with clusters of two or three verbs. In Dutch, such sentences involve crossing dependencies, whereas these dependencies are nested in German. Replicating the seminal finding of Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>), we find that the crossing (Dutch) structure is easier to comprehend than the nested (German) structure, although we find a different pattern of results in terms of where this difficulty manifests. The results are in line with predictions from dependency locality theory.</p>
</abstract>
</article-meta>
</front>
<body>
<sec>
<title>1. Introduction</title>
<p>Languages differ in the kind of formal structures they use to encode grammatical relationships: for example, to encode embedded subject&#8211;verb relations, German uses nested structures while equivalent sentences in Dutch can use crossing structures, which correspond to discontinuous phrase structures: see (1a). Crossing structures such as these are at the heart of debates about the formal characterization of natural language grammar and parsing (<xref ref-type="bibr" rid="B12">Bresnan et al., 1982</xref>; <xref ref-type="bibr" rid="B41">Joshi et al., 1991</xref>; <xref ref-type="bibr" rid="B44">Kuhlmann, 2013</xref>; <xref ref-type="bibr" rid="B61">Shieber, 1985</xref>), both in terms of computational systems and human language processing (<xref ref-type="bibr" rid="B48">Levy et al., 2012</xref>).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(1)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p><italic>Dutch</italic></p></list-item>
<list-item><p><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-4-1-20916-g7.png"/></p></list-item>
<list-item><p>&#8216;Hans has seen Peter help the children swim.&#8217;</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p><italic>German</italic></p></list-item>
<list-item><p><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-4-1-20916-g8.png"/></p></list-item>
<list-item><p>&#8216;Hans has seen Peter help the children swim.&#8217;</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Early research involving these structures established that natural language grammars are not context-free, in terms of both their weak and their strong generative capacity (<xref ref-type="bibr" rid="B12">Bresnan et al., 1982</xref>; <xref ref-type="bibr" rid="B61">Shieber, 1985</xref>); also see Kobele (<xref ref-type="bibr" rid="B42">2006</xref>). Crossing dependency structures are precisely where we can identify deviations from context-free grammar. Any formal grammar that encompasses these crossing dependencies must be more complex than context-free: this additional complexity often takes the form of special mechanisms in the grammar which specifically handle discontinuous constituents (for example, the Move / Internal Merge operation in Minimalist grammars, or adjunction in Tree-Adjoining Grammars (TAG), or the multiple components of Multiple Context-Free Grammars; <xref ref-type="bibr" rid="B60">Seki et al., 1991</xref>). Grammars that minimally capture natural-language-like crossing dependencies are called <bold>mildly context-sensitive</bold> (<xref ref-type="bibr" rid="B39">Joshi, 1985</xref>; <xref ref-type="bibr" rid="B41">Joshi et al., 1991</xref>; <xref ref-type="bibr" rid="B67">Weir, 1988</xref>).</p>
<p>How do these findings from the formal language theory literature relate to processing of such structures? In a seminal study, Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) investigated the processing difficulty associated with crossing dependencies by comparing the comprehension difficulty of sentences containing crossing structures in Dutch with sentences containing nested structures in German. If there is inherent processing difficulty for crossing dependencies, Dutch sentences containing crossings should be more difficult than their German non-crossing counterparts. Surprisingly, the authors found that the German sentences were more difficult to comprehend than the Dutch sentences. This study inspired work in formal syntax, parsing, and psycholinguistics, because it suggested that crossing dependencies may not be a major determinant of processing difficulty (Graf et al. (<xref ref-type="bibr" rid="B31">2017</xref>); Joshi (<xref ref-type="bibr" rid="B40">1990</xref>); Rambow and Joshi (<xref ref-type="bibr" rid="B53">1994</xref>); Rambow and Satta (<xref ref-type="bibr" rid="B54">1994</xref>); among others).</p>
<p>However, this influential result has been found using only one methodology and set of materials, and the data is not available for more in-depth analysis using modern techniques and theories. Here we report two web-based replications and extensions of Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>), investigating if comprehension difficulty of German nested structures and equivalent Dutch crossing structures differs, and whether this depends on the depth of embedding. Consistent with the original results, we find that processing is subjectively more difficult with multiple embeddings in the German word order than the Dutch word order. We interpret these results in terms of dependency locality: when compared to the Dutch sentences, the equivalent German sentences have longer dependencies, which have independently been found to be associated with processing difficulty due to working memory constraints regardless of crossings (<xref ref-type="bibr" rid="B7">Bartek et al., 2011</xref>; <xref ref-type="bibr" rid="B16">Fedorenko et al., 2013</xref>; <xref ref-type="bibr" rid="B29">Gibson, 1998</xref>, <xref ref-type="bibr" rid="B30">2000</xref>; <xref ref-type="bibr" rid="B32">Grodner and Gibson, 2005</xref>).</p>
</sec>
<sec>
<title>2. Crossing dependencies in psycholinguistics</title>
<sec>
<title>2.1 Why crossing dependencies might be difficult</title>
<p>Since crossing dependencies can only be captured through more complex grammars, it would be reasonable to hypothesize that these dependencies also come with additional processing cost. The move from context-free to mildly-context-sensitive grammars comes with a definite cost in terms of the worst-case time complexity of exact parsing, from cubic time <italic>O</italic>(<italic>n</italic><sup>3</sup>) for context-free grammars to <italic>O</italic>(<italic>n</italic><sup>5</sup>), <italic>O</italic>(<italic>n</italic><sup>7</sup>), etc. for various mildly context-sensitive grammars (<xref ref-type="bibr" rid="B60">Seki et al., 1991</xref>), up to <italic>O</italic>(<italic>n</italic><sup>28</sup>) for a wide-coverage Minimalist grammar (<xref ref-type="bibr" rid="B65">Torr et al., 2019</xref>). Furthermore, under an assumption of strong competence (<xref ref-type="bibr" rid="B11">Bresnan, 1982</xref>; <xref ref-type="bibr" rid="B13">Chomsky, 1965</xref>), the added complexity in formal grammars such as TAG, MG, etc., which accounts for non-context-freeness in natural language, could reflect special operations for crossing dependencies during human language processing (<xref ref-type="bibr" rid="B48">Levy et al., 2012</xref>).</p>
<p>In theories where the production system faithfully encodes syntactic dependencies in an utterance (<xref ref-type="bibr" rid="B10">Bock et al., 2002</xref>), it has been assumed that generating a context-free structure should be less costly than generating a non-context-free structure (involving a crossing dependency). In particular, theories of sentence production assume some degree of incrementality (<xref ref-type="bibr" rid="B45">Levelt, 1989</xref>), but in some views, non-context-free dependencies such as filler&#8211;gap dependencies require advance planning (<xref ref-type="bibr" rid="B51">Momma, 2021</xref>). There is also evidence that sentences that are difficult to produce tend to also be difficult to comprehend (<xref ref-type="bibr" rid="B50">MacDonald, 2013</xref>; <xref ref-type="bibr" rid="B59">Scontras et al., 2015</xref>). On such accounts, the comprehension system should find the non-context-free structures involving crossing dependencies difficult to parse because it is rarely exposed to such configurations.</p>
<p>One implication is that the processing system would rather form a simpler context-free structure than a complex non-context-free structure. Evidence for this comes from comprehension of structures involving filler&#8211;gap dependencies in English where it has been found that native speakers tend to avoid forming filler&#8211;gap dependencies if possible (<xref ref-type="bibr" rid="B64">Staub et al., 2018</xref>), and that they tend to resolve the gap as early as possible (<xref ref-type="bibr" rid="B14">Clifton and Frazier, 1989</xref>; <xref ref-type="bibr" rid="B24">Frazier, 1985</xref>), limiting the range of the non-context-free dependency. Relatedly, recent results have shown that it is difficult to prime structures involving crossing dependencies (<xref ref-type="bibr" rid="B38">Husain and Yadav, 2020</xref>). Again, it is assumed that comprehension of such structures involves a (parsing) process that establishes all the required dependencies faithfully. Either way, it is quite reasonable to assume that the comprehension system should find parsing non-context-free structures to be difficult (cf. <xref ref-type="bibr" rid="B25">Frazier, 1987</xref>).</p>
</sec>
<sec>
<title>2.2 Why crossing dependencies might not be difficult</title>
<p>Despite the above, the most common theoretical and empirical stance in psycholinguistics has been that crossing dependencies do not pose special processing challenges, and the Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) result that we revisit here was a key piece of evidence in establishing this idea.</p>
<p>While mildly context-sensitive grammars have worse time complexity than context-free grammars, the time complexity analysis is based on the worst case. For individual sentences, the time taken to parse may be higher or lower under context-free or mildly context-sensitive grammars. Indeed, although Torr et al. (<xref ref-type="bibr" rid="B65">2019</xref>) calculate a worst-case time complexity of <italic>O</italic>(<italic>n</italic><sup>28</sup>) for their parser, they find that its <italic>average</italic> parsing time seems to reflect cubic time complexity, similarly to context-free grammars. Most relevantly, inspired by the Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) results, Joshi (<xref ref-type="bibr" rid="B40">1990</xref>) presents a parsing model for TAG based on an extended pushdown automaton in which the German-style nested dependencies require items to be stored on a (structured) stack for a longer time than the Dutch-style cross-serial dependencies; further automaton models along these lines include Rambow and Joshi (<xref ref-type="bibr" rid="B53">1994</xref>) and Kobele et al. (<xref ref-type="bibr" rid="B43">2013</xref>). If the number of items stored on the stack is a predictor of human processing difficulty (a common assumption: see for example <xref ref-type="bibr" rid="B1">Abney and Johnson, 1991</xref>; <xref ref-type="bibr" rid="B15">De Santo, 2020</xref>; <xref ref-type="bibr" rid="B31">Graf et al., 2017</xref>; <xref ref-type="bibr" rid="B55">Resnik, 1992</xref>; <xref ref-type="bibr" rid="B63">Stabler, 1994</xref>; <xref ref-type="bibr" rid="B69">Yngve, 1960</xref>), then the particular sentences with crossing dependencies would be easier than those with deep nested dependencies, even though the need to accommodate such dependencies results in worse worst-case behavior.</p>
<p>The preponderance of empirical psycholinguistic evidence also seems to suggest that it is long dependencies, not crossing dependencies, that cause processing difficulty under appropriate circumstances, because long dependencies tax working memory resources (<xref ref-type="bibr" rid="B7">Bartek et al., 2011</xref>; <xref ref-type="bibr" rid="B18">Ferrer-i-Cancho, 2006</xref>; <xref ref-type="bibr" rid="B29">Gibson, 1998</xref>, <xref ref-type="bibr" rid="B30">2000</xref>; <xref ref-type="bibr" rid="B32">Grodner and Gibson, 2005</xref>; <xref ref-type="bibr" rid="B47">Levy, 2013</xref>).<xref ref-type="fn" rid="n1">1</xref> First, right-extraposition (which leads to a crossing dependency) is preferred over its embedded counterpart (which leads to a non-crossing dependency) when the right-extraposed relative clause is longer (<xref ref-type="bibr" rid="B20">Francis, 2010</xref>; <xref ref-type="bibr" rid="B35">Hawkins, 1994</xref>). In other words, if the total dependency distance in the sentence with an extraposed relative clause is less than that in the sentence with an embedded relative clause, then the former configuration is preferred. Crossing constructions can also be preferred due to information structure (<xref ref-type="bibr" rid="B37">Huck and Na, 1990</xref>; <xref ref-type="bibr" rid="B56">Rochemont and Culicover, 1990</xref>). Second, there is evidence that parsing crossing dependencies is not difficult as long as the dependency is highly expected (e.g., <xref ref-type="bibr" rid="B48">Levy et al., 2012</xref>). Finally, although crossing dependencies are relatively rare in naturalistic corpora (<xref ref-type="bibr" rid="B19">Ferrer-i-Cancho et al., 2018</xref>; <xref ref-type="bibr" rid="B34">Havelka, 2007</xref>; <xref ref-type="bibr" rid="B44">Kuhlmann, 2013</xref>), this could be genre dependent: the dependency corpus data showing rarity of crossing dependencies mainly comes from news corpora in various languages. It could very well be the case that crossing dependencies are much more common in naturalistic dialogue settings, where factors such as information structure and accessibility dictate sentence formulation more strongly than the presumed complexity of crossing dependencies does.</p>
<p>The first and key empirical result suggesting that crossing dependencies are not especially difficult was Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>), which showed that crossing dependencies in Dutch are easier to comprehend than meaning-equivalent multiply nested structures in German. Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) conducted a rating study using recorded spoken items similar to those in (1a) and (1b). The levels of nesting and crossing in German and Dutch respectively ranged from 1 to 4 (where Level 1 corresponded to no nesting/crossing). Each of these nested/crossing items had a corresponding paraphrase in order to control for semantic/propositional complexity. Unlike the nested/crossing items, the paraphrases used right-branching structures that were identical between the two languages. Participants were asked to rate the items on a 9-point scale from &#8216;easy&#8217; to &#8216;difficult&#8217;. In addition, participants were asked comprehension questions that targeted specific NPs in the sentence (only at Levels 2 and 3, and corresponding Paraphrase items). The results showed that, relative to the paraphrases, crossing structures in Dutch were subjectively easier to understand and achieved higher comprehension accuracy than their German nested counterparts. Moreover, this difference between languages was larger at deeper levels of embedding.</p>
<p>The dependency locality perspective predicts that crossing dependencies in Dutch should be easier than meaning-equivalent nested dependencies in German, because the multiple nesting of subject-verb dependencies in German creates a long dependency between the auxiliary and the verb (see <xref ref-type="fig" rid="F1">Figure 1</xref>). The locality theory assumes that increased distance between a head and its dependent causes difficulty in maintaining the co-dependent in memory, predicting a higher cost when the long dependency must be integrated. For example, the AUX &#8594; V3 dependency in German has six intervening words (see <xref ref-type="fig" rid="F1">Figure 1</xref>). In contrast to German, Dutch has relatively shorter dependencies (distance &#8804; 4 for all dependencies) due to crossing structure. Dependency locality theory thus naturally predicts lower comprehension difficulty for Dutch crossing structures than German nested structures.</p>
<fig id="F1">
<caption>
<p><bold>Figure 1:</bold> Schematic representation of the dependency structure in German vs. Dutch sentences. The dependencies going from verbs (V1, V2, V3) to nouns (N1, N2, N3) are stacked over each other in German, but cross each other in Dutch. The numbered labels on each dependency arc represent the length of the dependency.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-4-1-20916-g1.png"/>
</fig>
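<p>The locality argument can be illustrated with a minimal sketch that computes dependency lengths and counts crossing arcs for a schematic three-verb sentence. The word order, the unit labels (N1 to N4, AUX, Va, Vb, Vc), and the set of head-dependent arcs below are illustrative assumptions, with each noun phrase treated as one unit and length measured as the difference in linear position; they are not taken from Figure 1, and the resulting values need not coincide with the labels shown there.</p>
<code language="python"><![CDATA[
```python
from itertools import combinations

# Hypothetical schematic: every noun phrase counts as a single unit.
PREFIX = ["N1", "AUX", "N2", "N3", "N4"]
GERMAN = PREFIX + ["Vc", "Vb", "Va"]  # nested order: most deeply embedded verb first
DUTCH = PREFIX + ["Va", "Vb", "Vc"]   # cross-serial order: highest verb first

# Assumed (head, dependent) pairs, identical for both word orders.
ARCS = [
    ("AUX", "N1"), ("AUX", "Va"),
    ("Va", "N2"), ("Va", "Vb"),
    ("Vb", "N3"), ("Vb", "Vc"),
    ("Vc", "N4"),
]


def spans(order):
    """Return each arc as a (left, right) pair of linear positions."""
    pos = {unit: i for i, unit in enumerate(order)}
    return [tuple(sorted((pos[h], pos[d]))) for h, d in ARCS]


def lengths(order):
    """Dependency length = difference in linear position."""
    return [right - left for left, right in spans(order)]


def crossings(order):
    """Count pairs of arcs that strictly interleave."""
    return sum(
        1
        for (a, b), (c, d) in combinations(spans(order), 2)
        if a < c < b < d or c < a < d < b
    )


for name, order in [("German", GERMAN), ("Dutch", DUTCH)]:
    ls = lengths(order)
    print(name, "total:", sum(ls), "max:", max(ls), "crossings:", crossings(order))
```
]]></code>
<p>Under these assumptions, the German order has no crossing arcs but a total dependency length of 18 with a longest arc of 6, whereas the Dutch order has five crossing arc pairs but a total length of 16 with no arc longer than 4. The sketch only illustrates the direction of the locality prediction; it is not an implementation of dependency locality theory's integration-cost metric.</p>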
<p>Given the status of Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) as a starting point for the formal and psycholinguistic work described above, we propose to replicate it. While the original study found robust results, they were obtained in only one modality (audio presentation) and the by-trial and by-participant data are no longer available for analysis. In this article, we present two replications of Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) using new materials in a written format and analyze the results from the perspective of dependency locality.</p>
</sec>
</sec>
<sec>
<title>3. Experiment 1</title>
<sec>
<title>3.1 Methods</title>
<p>We replicate Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) in an online experiment in the written modality, presenting Dutch and German native speakers with sentences like those in (1a) and (1b), and collecting difficulty ratings and answers to comprehension questions.</p>
<sec>
<title>3.1.1 Materials</title>
<sec>
<title>3.1.1.1 Experimental stimuli</title>
<p>Exactly six verbs<xref ref-type="fn" rid="n2">2</xref> can be used in these constructions in both German and Dutch: <italic>sehen/zien</italic> &#8216;to see&#8217;, <italic>helfen/helpen</italic> &#8216;to help&#8217;, <italic>h&#246;ren/horen</italic> &#8216;to hear&#8217;, <italic>lehren/leren</italic> &#8216;to teach&#8217;, <italic>f&#252;hlen/voelen</italic> &#8216;to feel&#8217;, and <italic>lassen/laten</italic> &#8216;to let&#8217;. For each of these verbs, we constructed four German and four Dutch translation-equivalent items. Each item comes in a two- and a three-levels-of-embedding version, that is, with two or three consecutive verbs. Furthermore, following Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>), each embedded item was paired with a non-embedded paraphrase to control for any non-syntactic differences between languages or between embedding levels. Only the German versions of these paraphrases contain commas, which are obligatory before complement clauses in German but not commonly included in Dutch.</p>
<p><xref ref-type="table" rid="T1">Table 1</xref> presents one item in all 2 (Language) &#215; 2 (Level) &#215; 2 (Paraphrase) = 8 conditions. Within each language, the 96 experimental sentences (6 verbs &#215; 4 items &#215; 4 Level-by-Paraphrase conditions) were divided over four lists such that, in each list, each condition occurred once for each of the six verbs, and each item occurred in only one of the conditions. The item order was randomized per list.</p>
<table-wrap id="T1">
<caption>
<p><bold>Table 1:</bold> Example item in all eight experimental conditions. English translations of the sentences are &#8216;Timo saw the athlete run the marathon&#8217; (Level 2) and &#8216;The binoculars helped Timo see the athlete run the marathon.&#8217; (Level 3)</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top"><bold>Language</bold></td>
<td align="left" valign="top"><bold>Level</bold></td>
<td align="left" valign="top"><bold>Par.</bold></td>
<td align="left" valign="top"><bold>Example</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="4">German</td>
<td align="left" valign="top" rowspan="2">2</td>
<td align="left" valign="top">no</td>
<td align="left" valign="top">Timo hat den Athleten den Marathon laufen sehen.</td>
</tr>
<tr>
<td align="left" valign="top">yes</td>
<td align="left" valign="top">Timo hat gesehen, dass der Athlet den Marathon lief.</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">3</td>
<td align="left" valign="top">no</td>
<td align="left" valign="top">Das Fernglas hat Timo den Athleten den Marathon laufen sehen geholfen.</td>
</tr>
<tr>
<td align="left" valign="top">yes</td>
<td align="left" valign="top">Das Fernglas hat Timo geholfen, um zu sehen, dass der Athlet den Marathon lief.</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">Dutch</td>
<td align="left" valign="top" rowspan="2">2</td>
<td align="left" valign="top">no</td>
<td align="left" valign="top">Timo heeft de atleet de marathon zien lopen.</td>
</tr>
<tr>
<td align="left" valign="top">yes</td>
<td align="left" valign="top">Timo heeft gezien dat de atleet de marathon liep.</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">3</td>
<td align="left" valign="top">no</td>
<td align="left" valign="top">De verrekijker heeft Timo de atleet de marathon helpen zien lopen.</td>
</tr>
<tr>
<td align="left" valign="top">yes</td>
<td align="left" valign="top">De verrekijker heeft Timo geholpen om te zien dat de atleet de marathon liep.</td>
</tr>
</tbody>
</table>
</table-wrap>
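<p>One way to picture the list construction is as a Latin-square rotation within each language. The sketch below is a hypothetical reconstruction, not the procedure actually used (the real list assignment is in the supplementary materials): it assumes 24 items per language (six verbs with four items each), rotates each verb's items through the four Level-by-Paraphrase conditions, and shuffles each list with an arbitrary seed. The verb labels, item indices, and seeds are placeholders.</p>
<code language="python"><![CDATA[
```python
import random
from collections import Counter
from itertools import product

VERBS = ["see", "help", "hear", "teach", "feel", "let"]  # English glosses as labels
ITEMS_PER_VERB = 4
# Conditions within one language: Level (2 or 3) x Paraphrase (no / yes).
CONDITIONS = list(product([2, 3], ["no", "yes"]))


def build_lists():
    """Rotate each verb's items through the conditions (Latin square)."""
    lists = []
    for list_id in range(len(CONDITIONS)):
        trials = []
        for verb in VERBS:
            for item in range(ITEMS_PER_VERB):
                level, paraphrase = CONDITIONS[(item + list_id) % len(CONDITIONS)]
                trials.append((verb, item, level, paraphrase))
        # Hypothetical per-list randomization with a fixed seed.
        random.Random(list_id).shuffle(trials)
        lists.append(trials)
    return lists


lists = build_lists()
for list_id, trials in enumerate(lists):
    per_cell = Counter((verb, level, par) for verb, _, level, par in trials)
    print(list_id, len(trials), set(per_cell.values()))
```
]]></code>
<p>In this sketch every list holds 24 experimental sentences, each verb appears once in each of the four conditions per list, and the four lists together cover all 96 item-by-condition combinations for a language.</p>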
</sec>
<sec>
<title>3.1.1.2 Verb forms</title>
<p>In the no-paraphrase condition, all Dutch verbs (except for the auxiliary <italic>heeft</italic> &#8216;has&#8217;) were in the infinitive form. German speakers, however, disagree on whether the final verb of the cluster needs to be an infinitive or past participle. Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) solved this by testing two groups of German participants, one on infinitives and one on participles. A potential issue with that approach is that it may reduce average comprehensibility ratings. Assuming that encountering the dispreferred form reduces comprehensibility, variance in preference between participants (or between verbs, for that matter) will lead to lower average ratings in German than in Dutch.</p>
<p>To tackle this issue, we ran a German verb-form preference pretest<xref ref-type="fn" rid="n3">3</xref> in which 45 native German speakers indicated their preference for one of two sentences that differed only in the final verb form, for example:</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(2)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Timo hat den Athleten den Marathon laufen sehen. (infinitive)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Timo hat den Athleten den Marathon laufen gesehen. (participle)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Participants could also select an &#8216;equally good&#8217; option. One sentence pair was presented for each of the six verbs. For our experimental stimuli, we then selected the most frequently preferred form of each verb. These were the infinitives <italic>sehen</italic> and <italic>lassen</italic>, and the past participles <italic>geholfen</italic> &#8216;helped&#8217;, <italic>geh&#246;rt</italic> &#8216;heard&#8217;, <italic>gelehrt</italic> &#8216;taught&#8217;, and <italic>gef&#252;hlt</italic> &#8216;felt&#8217;.</p>
</sec>
<sec>
<title>3.1.1.3 Fillers</title>
<p>Each of the four experimental lists included the same 52 filler sentences. Sixteen of the fillers, taken from Frank et al. (<xref ref-type="bibr" rid="B23">2016</xref>), had doubly nested center-embedded relative clauses, making them very difficult to understand. The other 36 fillers varied in the number of main verbs (one, two, or three; twelve sentences each) but did not have a purposefully difficult structure. German and Dutch fillers were translation-equivalents.</p>
</sec>
<sec>
<title>3.1.1.4 Comprehension tests</title>
<p>In order to measure how well the experimental and filler sentences were understood, each sentence was paired with four statements, only one of which corresponded to the content of the sentence. The nouns and verbs of the three distractor statements also occurred in the corresponding sentence. For example, for the Level-2 sentences of <xref ref-type="table" rid="T1">Table 1</xref> these statements (translated to English) were:</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(3)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The athlete ran the marathon. (true)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The athlete saw the marathon. (false)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>c.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The athlete saw Timo. (false)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>d.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Timo ran the marathon. (false)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>The four statements were identical for the paraphrase and non-paraphrase conditions, and translation-equivalent between German and Dutch. The statements were visible simultaneously, in random order and without the stimulus sentence present.</p>
<p>All experimental stimuli with their list assignment, filler sentences, and comprehension test items are available as supplementary materials.</p>
</sec>
</sec>
<sec>
<title>3.1.2 Participants and procedure</title>
<p>Using the Prolific platform, we recruited 40 adult native German speakers living in Germany, and 40 adult native Dutch speakers living in the Netherlands. Participants of the verb form pretest were excluded from Experiment 1. The experiment was fully web-based and in the participant&#8217;s native language; we use English translations in the description below.</p>
<p>After giving informed consent, participants read the instruction to rate sentences on how easy or difficult they are to understand using a slider response, and then choose one of four statements that is correct given the previously seen sentence. The experiment began with a single practice item, where feedback was provided on the comprehension test, followed by a reminder of the instructions. No feedback was provided on the other comprehension tests.</p>
<p>Each sentence was presented with a horizontal slider below it. The slider was labeled (from left to right) &#8216;very easy&#8217;, &#8216;easy&#8217;, &#8216;hard&#8217;, and &#8216;very hard&#8217;. The slider&#8217;s midpoint (which was also its initial position) was indicated by a dot. Participants could move the slider using the mouse. Their difficulty ratings were recorded on a continuous scale from 0 (very easy) to 100 (very hard). After clicking on the slider at least once, participants could click the &#8216;continue&#8217; button to replace the sentence with the comprehension question, which asked which of the four statements shown on the screen was correct given the content of the sentence. They then clicked what they considered to be the correct statement, and the next sentence item appeared directly.</p>
<p>After the last comprehension test, participants in the German condition received instructions about the following acceptability test in which they would select which of two sentences sounds more natural, or click a button marked &#8216;Don&#8217;t know / equally good&#8217;. They were then shown pairs of new Level-2 No-paraphrase items (one pair for each of the six verbs) where one sentence used the infinitive verb form and the other the past participle. There was also one control trial with one clearly ungrammatical option.</p>
<p>Finally, both versions of the experiment asked for demographic information: Age, region in which the participant grew up (free text), region of current residence (free text), and highest level of education (multiple choice). German participants then indicated whether they were fluent in Dutch, while Dutch participants were asked about fluency in German.</p>
<p>Median completion time was 35 minutes for Dutch and 36 minutes for German. Participants were paid US$ 7.</p>
</sec>
<sec>
<title>3.1.3 Differences in methodology from Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>)</title>
<p>A careful reader will have noted that while we attempt a replication of Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>), there are some differences in our experimental setup compared to the original study; we list the key differences below.</p>
<list list-type="roman-lower">
<list-item><p>The items used in the current study are not identical to those used in the original study, because the original items are unavailable. In addition, we did not include Level 1 and Level 4 items.</p></list-item>
<list-item><p>The German items in the current study were selected through a norming study, which ensured that the final verb cluster had the most preferred form (either infinitive or past participle). The original Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) study treated the verb form as a between-subjects factor, thereby testing two groups, one for infinitives and another for past participles. As stated above, a potential issue with that approach is that it may reduce average comprehensibility ratings if people&#8217;s actual preferences are graded and variable across verbs, presenting a potential confound for the results of Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>).</p></list-item>
<list-item><p>The non-availability of the original items also meant that the filler items in the current study were different from those in Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>). However, as in the original study, some filler items matched the critical items in the number of nouns and verbs. The current study used more filler items than the original (52 vs. 36) and also included some difficult fillers to obscure the difficulty associated exclusively with the critical items.</p></list-item>
<list-item><p>The comprehension questions in the original study were open-ended: &#8220;Was tat NP?&#8221;/&#8220;Wat deed NP?&#8221;. In contrast, the comprehension test in the current study was a forced-choice task in which participants had to choose the correct statement out of four options. These options targeted specific dependencies between nouns and verbs.</p></list-item>
<list-item><p>The items in the original study were presented auditorily. The current replication used visual (written) presentation.</p></list-item>
<list-item><p>The rating in the original was done on a discrete scale of 1&#8211;9 (1 was labeled as &#8216;easy&#8217;, 9 was labeled as &#8216;difficult&#8217;). In the current study, ratings were given using a slider that was labeled as &#8216;very easy&#8217;, &#8216;easy&#8217;, &#8216;hard&#8217;, &#8216;very hard&#8217;; the ratings were recorded on a continuous scale of 0 (very easy) to 100 (very hard).</p></list-item>
<list-item><p>Apart from the rating and the comprehension tasks (as in <xref ref-type="bibr" rid="B3">Bach et al., 1986</xref>), the current study had an additional task that was presented at the end of the German experiment. In this task, the participants indicated their preference for the infinitive vs. the past participle form of the six critical verbs used in the study.</p></list-item>
<list-item><p>Finally, the original Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) study was conducted in a lab setting. The data for the current study were collected online.</p></list-item>
</list>
</sec>
<sec>
<title>3.1.4 Data analysis</title>
<p>Participants who reported fluency in the other language or who had less than 40% accuracy (chance accuracy is 25%) on comprehension questions were rejected from the analysis. For German, individual trials were rejected for participants who indicated a preference for the non-presented verb form, so that only items with the subject&#8217;s preferred verb form were included in the analysis. The final data used for analysis consisted of 5,616 observations (36 participants) from Dutch and 6,314 observations (40 participants) from German.</p>
<p>We want to infer whether the difference in comprehension difficulty between embedded and paraphrase sentences is significantly higher for German sentences compared to Dutch, and whether this difference in relative difficulty in German vs. Dutch is higher in double-embedded (Level 3) sentences compared to single-embedded (Level 2) sentences.</p>
<p>To draw these inferences from the data, we fit a linear mixed-effects model (<xref ref-type="bibr" rid="B2">Baayen et al., 2008</xref>; <xref ref-type="bibr" rid="B28">Gelman and Hill, 2007</xref>) with varying intercepts and slope adjustments for subjects and items using the lme4 package (<xref ref-type="bibr" rid="B9">Bates et al., 2014</xref>) in R. We tested the effect of language and embedding level on the mean difficulty ratings using the following formula:</p>
<disp-quote>
<p>difficulty &#8764; language &#8727; level &#8727; paraphrase + (level + paraphrase &#124; subject) + (1 &#124; item),</p>
</disp-quote>
<p>which is the maximal converging model structure (<xref ref-type="bibr" rid="B6">Barr et al., 2013</xref>; <xref ref-type="bibr" rid="B8">Bates et al., 2015</xref>).</p>
<p>To test the effect of language and embedding levels on the accuracy of comprehension question responses, we used a logistic regression model with the same fixed- and random-effects structure as the previous model, predicting whether a participant&#8217;s response is correct (1) or not (0).</p>
<p>The mixed-effects linear regression for the difficulty rating data assumes that the model residuals are normally distributed, so we need to verify that this assumption holds. <xref ref-type="fig" rid="F2">Figure 2</xref> shows the distribution of residuals obtained from the mixed-effects model fitted to the ratings data. The residuals are approximately normally distributed, but the rating scores are slightly inflated at the boundaries of the scale. An alternative way of analyzing these data is to use a Beta regression, which would allow us to model bounded data with inflation at the boundaries. The details and results of the Beta regression are given in Supplementary Materials Section 3. Our main conclusion does not change with the use of a Beta regression, so we report the linear regression analysis in the main text.</p>
<fig id="F2">
<caption>
<p><bold>Figure 2:</bold> The distributions of residuals obtained from the linear mixed models fitted to difficulty ratings data from Experiment 1 (left panel) and Experiments 2a&#8211;b (right panel). Plot (a) shows the histograms of the residuals and Plot (b) shows the boxplots.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-4-1-20916-g2.png"/>
</fig>
</sec>
<sec>
<title>3.1.5 Predictions</title>
<p>The effects of interest for our analysis are the interaction effect of sentence type (paraphrase or embedded) and language, and the three-way interaction of language, sentence type, and embedding level. If these interaction effects have (significantly) positive estimates, Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>)&#8217;s results are successfully replicated.</p>
<p>The locality theory predicts that German embedded sentences should be more difficult to process than Dutch ones, i.e., the interaction of language and sentence type should be positive. Moreover, the increase in difficulty with respect to embedding level should be larger in German, that is, the three-way interaction of language, level, and paraphrase should be positive. Thus, both Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>)&#8217;s claim&#8212;that crossing dependencies are easier to comprehend than meaning-equivalent multiple-nested structures&#8212;and the locality account predict a positive estimate for the effect of Paraphrase = No &#215; Language = German and for the effect of Level 3 &#215; Paraphrase = No &#215; Language = German.</p>
</sec>
</sec>
<sec>
<title>3.2 Results and discussion</title>
<sec>
<title>3.2.1 Comprehension difficulty ratings</title>
<p><xref ref-type="fig" rid="F3">Figure 3</xref> shows the average difficulty rating in each of the eight conditions. Unsurprisingly, Level-3 sentences are rated as more difficult to understand than Level-2 sentences, and paraphrases are rated as less difficult than non-paraphrases. Critically, the difference in difficulty between paraphrase and non-paraphrase sentences appears to be larger in German than in Dutch, which is confirmed by a significant positive interaction of Language and Paraphrase in the regression analysis results presented in <xref ref-type="table" rid="T2">Table 2</xref>.</p>
<fig id="F3">
<caption>
<p><bold>Figure 3:</bold> Average comprehension difficulty in each condition of Experiment 1. The difference between the orange (No-paraphrase) and gray (Paraphrase) bars indicates the comprehension difficulty caused by the use of an embedded structure, which is the effect of interest. The error bar shows the standard error of the mean difficulty rating.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-4-1-20916-g3.png"/>
</fig>
<table-wrap id="T2">
<caption>
<p><bold>Table 2:</bold> The estimated effects of embedding level, language, and paraphrase condition on comprehension difficulty ratings in Experiment 1. The estimates were obtained from a linear mixed-effects regression model fitted to the difficulty rating data. The two theoretically important interaction effects are highlighted with a gray shade; significant effects are marked with an asterisk *.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top"><bold>Effect</bold></td>
<td align="left" valign="top"><bold>Estimate</bold></td>
<td align="left" valign="top"><bold>Std.Err.</bold></td>
<td align="left" valign="top" colspan="2"><bold><italic>t</italic>-value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Intercept</td>
<td align="right" valign="top">13.70</td>
<td align="right" valign="top">2.56</td>
<td align="right" valign="top">5.36</td>
<td align="center" valign="top">*</td>
</tr>
<tr>
<td align="left" valign="top">Level 3</td>
<td align="right" valign="top">26.47</td>
<td align="right" valign="top">2.19</td>
<td align="right" valign="top">12.10</td>
<td align="center" valign="top">*</td>
</tr>
<tr>
<td align="left" valign="top">Paraphrase = No</td>
<td align="right" valign="top">2.82</td>
<td align="right" valign="top">1.94</td>
<td align="right" valign="top">1.45</td>
<td align="right" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top">Language = German</td>
<td align="right" valign="top">&#8211;0.43</td>
<td align="right" valign="top">3.10</td>
<td align="right" valign="top">&#8211;0.14</td>
<td align="right" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top">Level 3 &#215; Paraphrase = No</td>
<td align="right" valign="top">20.44</td>
<td align="right" valign="top">2.39</td>
<td align="right" valign="top">8.55</td>
<td align="center" valign="top">*</td>
</tr>
<tr>
<td align="left" valign="top">Level 3 &#215; Language = German</td>
<td align="right" valign="top">2.87</td>
<td align="right" valign="top">3.02</td>
<td align="right" valign="top">0.95</td>
<td align="right" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top" style="background-color:#d9d9d9;">Paraphrase = No &#215; Language = German</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">12.45</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">2.74</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">4.55</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">*</td>
</tr>
<tr>
<td align="left" valign="top" style="background-color:#d9d9d9;">Level 3 &#215; Paraphrase = No &#215; Language = German</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">&#8211;8.87</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">3.46</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">&#8211;2.56</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">*</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>This result implies that nested structures in German are more difficult than crossing structures in Dutch. The result supports the locality theory and confirms a successful replication of Bach et al.&#8217;s (<xref ref-type="bibr" rid="B3">1986</xref>) general finding.</p>
<p>Unexpectedly, however, the three-way interaction of Language, Level, and Paraphrase is negative. This means that the relative difficulty of German decreases, rather than increases, with the extra level of embedding, which is unexpected both from the perspective of the results of Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) and from a locality theory of processing difficulty. The negative interaction is unlikely to be a ceiling effect: the maximal difficulty rating is 100, but the average difficulty rating for level-3 non-paraphrase sentences in German is around 70. A possible explanation is that, while processing difficulty is higher in German than in Dutch for level-2 sentences, subjective difficulty saturates at level-3 embedding in both languages.</p>
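<p>To make the reading of the two shaded coefficients concrete, the following Python sketch recomputes the model-implied cost of embedding (No-paraphrase minus Paraphrase rating) for each language and level from the fixed-effect estimates in <xref ref-type="table" rid="T2">Table 2</xref>. It is an illustration only and not part of the reported analysis. It assumes treatment (dummy) coding with Dutch, Level 2, and Paraphrase = Yes as reference levels, and the names <monospace>COEF</monospace>, <monospace>predicted_rating</monospace>, and <monospace>embedding_cost</monospace> are hypothetical.</p>
<code language="python"><![CDATA[
```python
# Fixed-effect estimates copied from Table 2 (difficulty ratings, Experiment 1).
# Assumption: treatment (dummy) coding with reference levels
# Level 2, Paraphrase = Yes, Language = Dutch.
COEF = {
    "intercept": 13.70,
    "level3": 26.47,
    "nopara": 2.82,
    "german": -0.43,
    "level3:nopara": 20.44,
    "level3:german": 2.87,
    "nopara:german": 12.45,
    "level3:nopara:german": -8.87,
}


def predicted_rating(level3: bool, nopara: bool, german: bool) -> float:
    """Model-implied mean rating for one cell of the 2 x 2 x 2 design."""
    active = {"level3": level3, "nopara": nopara, "german": german}
    total = COEF["intercept"]
    for name, value in COEF.items():
        if name == "intercept":
            continue
        # A term contributes only if all of its factors are switched on.
        if all(active[factor] for factor in name.split(":")):
            total += value
    return round(total, 2)


def embedding_cost(level3: bool, german: bool) -> float:
    """No-paraphrase minus Paraphrase rating: the effect of interest."""
    return round(
        predicted_rating(level3, True, german)
        - predicted_rating(level3, False, german),
        2,
    )


for level3 in (False, True):
    dutch = embedding_cost(level3, german=False)
    german = embedding_cost(level3, german=True)
    print(f"Level {3 if level3 else 2}: Dutch {dutch}, German {german}, "
          f"difference {round(german - dutch, 2)}")
```
]]></code>
<p>Under these assumptions, the German-minus-Dutch difference in embedding cost is 12.45 rating points at Level 2 but only 3.58 points at Level 3 (12.45 &#8722; 8.87), which is the pattern expressed by the negative three-way interaction. The implied mean rating for German level-3 non-paraphrase sentences is 69.45, in line with the value of around 70 mentioned above.</p>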
</sec>
<sec>
<title>3.2.2 Comprehension question accuracy</title>
<p>Comprehension accuracy results are shown in <xref ref-type="fig" rid="F4">Figure 4</xref>. The logistic regression in <xref ref-type="table" rid="T3">Table 3</xref> shows no significant effects of interest on question response accuracy. There is a significant general effect whereby level-3 stimuli, paraphrase or not, have lower comprehension accuracy.</p>
<fig id="F4">
<caption>
<p><bold>Figure 4:</bold> Average question response accuracy in each condition of Experiment 1. The difference between the orange (No-paraphrase) and gray (Paraphrase) bars indicates the decrease in question response accuracy caused by the use of an embedded structure, which is the effect of interest. The error bar shows the standard error of the mean response accuracy.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-4-1-20916-g4.png"/>
</fig>
<table-wrap id="T3">
<caption>
<p><bold>Table 3:</bold> The estimated effects of embedding level, language, and paraphrase condition on question response accuracy in Experiment 1. The estimates were obtained from a mixed-effects logistic regression fitted to the question response data. The two theoretically important interaction effects are highlighted with a gray shade.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top"><bold>Effect</bold></td>
<td align="left" valign="top"><bold>Estimate</bold></td>
<td align="left" valign="top"><bold>Std.Err.</bold></td>
<td align="left" valign="top"><bold><italic>z</italic>-value</bold></td>
<td align="left" valign="top"><bold><italic>p</italic>-value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Intercept</td>
<td align="right" valign="top">2.49</td>
<td align="right" valign="top">0.31</td>
<td align="right" valign="top">8.09</td>
<td align="right" valign="top">&lt;0.01</td>
</tr>
<tr>
<td align="left" valign="top">Level 3</td>
<td align="right" valign="top">&#8211;1.43</td>
<td align="right" valign="top">0.28</td>
<td align="right" valign="top">&#8211;5.10</td>
<td align="right" valign="top">&lt;0.01</td>
</tr>
<tr>
<td align="left" valign="top">Paraphrase = No</td>
<td align="right" valign="top">0.20</td>
<td align="right" valign="top">0.33</td>
<td align="right" valign="top">0.59</td>
<td align="right" valign="top">0.55</td>
</tr>
<tr>
<td align="left" valign="top">Language = German</td>
<td align="right" valign="top">0.26</td>
<td align="right" valign="top">0.37</td>
<td align="right" valign="top">0.71</td>
<td align="right" valign="top">0.48</td>
</tr>
<tr>
<td align="left" valign="top">Level 3 &#215; Paraphrase = No</td>
<td align="right" valign="top">0.54</td>
<td align="right" valign="top">0.41</td>
<td align="right" valign="top">1.32</td>
<td align="right" valign="top">0.19</td>
</tr>
<tr>
<td align="left" valign="top">Level 3 &#215; Language = German</td>
<td align="right" valign="top">&#8211;0.35</td>
<td align="right" valign="top">0.43</td>
<td align="right" valign="top">&#8211;0.81</td>
<td align="right" valign="top">0.42</td>
</tr>
<tr>
<td align="left" valign="top" style="background-color:#d9d9d9;">Paraphrase = No &#215; Language = German</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">&#8211;0.30</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.48</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">&#8211;0.64</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.53</td>
</tr>
<tr>
<td align="left" valign="top" style="background-color:#d9d9d9;">Level 3 &#215; Paraphrase = No &#215; Language = German</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.91</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.61</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">1.50</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.13</td>
</tr>
</tbody>
</table>
</table-wrap>
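<p>Because the estimates in <xref ref-type="table" rid="T3">Table 3</xref> are on the log-odds scale, it can help to convert them to probabilities. The sketch below does this for the two significant terms (Intercept and Level 3). It assumes treatment coding with Dutch, Level 2, and Paraphrase = Yes as the reference cell, and it gives the accuracy for a typical participant and item (all random effects at zero) rather than a population average. The function name <monospace>inv_logit</monospace> is hypothetical.</p>
<code language="python"><![CDATA[
```python
import math

# Fixed-effect estimates (log-odds) copied from Table 3.
# Assumption: treatment coding, reference cell = Dutch, Level 2, Paraphrase = Yes.
INTERCEPT = 2.49
LEVEL3 = -1.43


def inv_logit(log_odds: float) -> float:
    """Convert a log-odds value to a probability."""
    return 1.0 / (1.0 + math.exp(-log_odds))


# Reference cell (Dutch, Level 2, Paraphrase = Yes).
acc_level2 = inv_logit(INTERCEPT)
# Same cell at Level 3: add the Level 3 coefficient on the log-odds scale.
acc_level3 = inv_logit(INTERCEPT + LEVEL3)

print(f"Level 2: {acc_level2:.3f}, Level 3: {acc_level3:.3f}")
```
]]></code>
<p>Under these assumptions, the implied accuracy for Dutch paraphrase sentences drops from roughly 0.92 at Level 2 to roughly 0.74 at Level 3, both well above the chance level of 0.25.</p>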
</sec>
</sec>
</sec>
<sec>
<title>4. Experiments 2a and 2b</title>
<p>Although the results from Experiment 1 are partially consistent with those from Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>), there are three major concerns. First, the study might be underpowered. Second, because participants could take as long as they wanted to read each sentence (unlike in the auditory presentation of Bach et al.&#8217;s original study), higher difficulty in one language could have been compensated for by spending more time on the sentence. This may explain the lack of a crosslinguistic difference in comprehension accuracy, as German speakers may have taken more time to achieve the same level of accuracy as Dutch speakers. Third, the very high comprehension accuracy rates in the level 2 condition may indicate a ceiling effect.</p>
<p>All three concerns are dealt with in Experiment 2, where we control reading time by using a Rapid Serial Visual Presentation (RSVP) format to present the stimuli. Experiment 2a was an exploratory study, the results of which were used in a power analysis to determine the number of participants for a preregistered confirmatory study (Experiment 2b).<xref ref-type="fn" rid="n4">4</xref></p>
<sec>
<title>4.1 Methods</title>
<sec>
<title>4.1.1 Materials</title>
<p>Stimuli and comprehension tests were identical to those of Experiment 1 with three minor changes. First, a small number of the verb forms in the German no-paraphrase items of Experiment 1 turned out not to match the outcome of the verb-form preference pretest, which was corrected. Second, semantically incongruous answer options in the comprehension test were replaced by semantically meaningful alternatives in order to reduce the probability of guessing correctly. Third, minor changes were applied to the fillers to increase parallelism between languages, for example by using German-Dutch cognate words whenever possible.</p>
<p>All experimental stimuli with their list assignment, filler sentences, and comprehension test items are available as supplementary materials.</p>
</sec>
<sec>
<title>4.1.2 Participants</title>
<p>All participants were adult native German speakers living in Germany and adult native Dutch speakers living in the Netherlands, recruited on the Prolific platform. Participants of Experiment 1 and the verb form pre-test were excluded from taking part in Experiments 2a and 2b.</p>
<p>Forty-four Dutch participants and forty-one German participants were recruited for Experiment 2a. The power analysis based on effect estimates from Experiment 2a revealed that 96 participants per language would provide 80% power after taking into account likely data loss (see Supplementary Materials Section 3 for the power analysis details). After testing these additional participants, twelve additional Dutch speakers and one additional German speaker were recruited because a higher-than-expected number of Dutch speakers indicated fluency in German. Participants from Experiment 2a were excluded from taking part in Experiment 2b. Participants were paid &#163;7.20.</p>
</sec>
<sec>
<title>4.1.3 Procedure</title>
<p>The procedure was identical to that of Experiment 1 with the exception of the sentence presentation method. Each sentence was preceded by a centrally presented fixation cross above a button labeled &#8216;Start&#8217;. Upon clicking the button, the fixation cross was replaced by the sentence&#8217;s first word which was then automatically replaced by each following word until completion of the sentence. To ensure that the number of presented tokens per sentence was identical between languages, it was occasionally necessary to display two words at a time for one of the languages, as indicated in the supplementary materials.</p>
<p>Following the EEG study by Frank et al. (<xref ref-type="bibr" rid="B22">2015</xref>), tokens were visible for a length-dependent duration of 190 + 20 &#215; max{<italic>n</italic><sub>German</sub>, <italic>n</italic><sub>Dutch</sub>} ms, where <italic>n</italic> is the number of characters in the token, including any punctuation or space. This was followed by a 390 ms interval before the next token appeared. Taking the maximum of the German and Dutch token lengths ensures that the total presentation time for each sentence is identical between languages.</p>
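<p>The timing rule above can be sketched in Python as follows. This is a minimal illustration, not the experiment code: the function names and the example tokens are hypothetical, and the sketch assumes that the 390 ms interval follows every token, including the last one.</p>
<code language="python"><![CDATA[
```python
def token_duration_ms(german_token: str, dutch_token: str) -> int:
    """Display time: 190 ms plus 20 ms per character of the longer token.

    Characters are counted with len(), so punctuation and spaces are included.
    """
    n = max(len(german_token), len(dutch_token))
    return 190 + 20 * n


def sentence_duration_ms(token_pairs, gap_ms: int = 390) -> int:
    """Total presentation time for a sentence.

    Assumption: the blank interval of gap_ms follows every token.
    """
    return sum(token_duration_ms(g, d) + gap_ms for g, d in token_pairs)


# Hypothetical aligned (German, Dutch) token pairs, for illustration only.
pairs = [("Der", "De"), ("Sportler", "atleet")]
print([token_duration_ms(g, d) for g, d in pairs])
print(sentence_duration_ms(pairs))
```
]]></code>
<p>Because each pair uses the length of the longer token, swapping the German and Dutch tokens leaves every duration unchanged, which is what keeps the total sentence time equal across the two languages.</p>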
<p>After the offset of the sentence-final word, the comprehensibility rating slider would appear, followed by the comprehension test, as in Experiment 1. The German rating experiment was followed by the verb-form preference test. The same demographic information was collected as in Experiment 1, except that the German study also asked about fluency in Swiss German and the Dutch study also asked about fluency in Frisian. This is because Swiss German, like Dutch, has crossing dependencies, and Frisian, like standard German, has nested dependencies in these verb clusters.</p>
<p>Median completion time was 34 and 31 minutes for German and Dutch, respectively, in Experiment 2a, and 36 and 35 minutes in Experiment 2b.</p>
</sec>
<sec>
<title>4.1.4 Data analysis and predictions</title>
<p>Participants who reported fluency in the other language, in Swiss German, or in Frisian were rejected from the analysis. For the German experiment, individual trials were rejected for participants who indicated a preference for the non-presented verb form so that only items with the preferred verb form were included in the analysis. We analyze the data from Experiments 2a and 2b together. The data consisted of 10,184 observations (134 participants) from Dutch and 9,572 observations (132 participants) from German.</p>
<p>We want to infer whether the comprehension difficulty and accuracy are significantly higher for German sentences compared to Dutch and whether this difference in difficulty in German vs. Dutch is higher in double-embedded (level-3) sentences compared to single-embedded (level-2) sentences. The regression models and the predictions were the same as described in Sections 3.1.4 and 3.1.5, respectively. Bach and colleagues&#8217; claim and the locality hypothesis would predict a (significantly) positive estimate for the interaction effect of language (= German) and sentence type (= no-paraphrase) and also a positive three-way interaction of language, sentence type, and level of embedding. The positive estimate for these two interaction effects would imply a successful replication of Bach et al.&#8217;s main findings.</p>
</sec>
</sec>
<sec>
<title>4.2 Results and discussion</title>
<sec>
<title>4.2.1 Comprehension difficulty ratings</title>
<p><xref ref-type="fig" rid="F5">Figure 5</xref> shows the average difficulty rating in each of the eight conditions. We find that German sentences have higher comprehension difficulty than Dutch sentences, as confirmed by the significant interaction of language and paraphrase in <xref ref-type="table" rid="T4">Table 4</xref>. We also find a significant three-way interaction of language, paraphrase, and level, this time positive, implying that the difference in comprehension difficulty between German and Dutch is larger for level-3 sentences than for level-2 sentences.<xref ref-type="fn" rid="n5">5</xref> The results collectively indicate that German sentences (containing nested structures) are more difficult to comprehend than Dutch sentences (with crossed structures), and that this additional difficulty of nested structures is significantly larger in double-embedding than in single-embedding structures.</p>
<fig id="F5">
<caption>
<p><bold>Figure 5:</bold> Average comprehension difficulty in each condition of Experiment 2. The difference between the orange (No-paraphrase) and gray (Paraphrase) bars indicates the comprehension difficulty caused by the use of an embedded structure, which is the effect of interest. The error bar shows the standard error of the mean difficulty rating.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-4-1-20916-g5.png"/>
</fig>
<table-wrap id="T4">
<caption>
<p><bold>Table 4:</bold> The estimated effects of embedding level, language, and paraphrase condition on comprehension difficulty ratings in Experiment 2. The estimates were obtained from a linear mixed-effects regression model fitted to the difficulty rating data. The two theoretically important interaction effects are highlighted with a gray shade; significant effects are marked with an asterisk *.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top"><bold>Effect</bold></td>
<td align="left" valign="top"><bold>Estimate</bold></td>
<td align="left" valign="top"><bold>Std.Err.</bold></td>
<td align="left" valign="top" colspan="2"><bold><italic>t</italic>-value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Intercept</td>
<td align="right" valign="top">15.22</td>
<td align="right" valign="top">1.66</td>
<td align="right" valign="top">9.15</td>
<td align="center" valign="top">*</td>
</tr>
<tr>
<td align="left" valign="top">Level 3</td>
<td align="right" valign="top">25.82</td>
<td align="right" valign="top">1.24</td>
<td align="right" valign="top">20.90</td>
<td align="center" valign="top">*</td>
</tr>
<tr>
<td align="left" valign="top">Paraphrase = No</td>
<td align="right" valign="top">1.44</td>
<td align="right" valign="top">1.05</td>
<td align="right" valign="top">1.36</td>
<td align="right" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top">Language = German</td>
<td align="right" valign="top">2.71</td>
<td align="right" valign="top">1.70</td>
<td align="right" valign="top">1.59</td>
<td align="right" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top">Level 3 &#215; Paraphrase = No</td>
<td align="right" valign="top">12.12</td>
<td align="right" valign="top">1.31</td>
<td align="right" valign="top">9.24</td>
<td align="center" valign="top">*</td>
</tr>
<tr>
<td align="left" valign="top">Level 3 &#215; Language = German</td>
<td align="right" valign="top">0.59</td>
<td align="right" valign="top">1.76</td>
<td align="right" valign="top">0.34</td>
<td align="right" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top" style="background-color:#d9d9d9;">Paraphrase = No &#215; Language = German</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">3.11</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">1.53</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">2.03</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">*</td>
</tr>
<tr>
<td align="left" valign="top" style="background-color:#d9d9d9;">Level 3 &#215; Paraphrase = No &#215; Language = German</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">4.86</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">1.97</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">2.47</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">*</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In alternative Beta regressions shown in Supplementary Materials Section 3, we find a significant negative language by paraphrase interaction, but the three-way interaction is not supported. We note that there may be a power issue here; our power analysis was designed around detecting the two-way interaction of language and paraphrase.</p>
</sec>
<sec>
<title>4.2.2 Comprehension question accuracy</title>
<p>Despite the RSVP method, we do not find significant crosslinguistic differences in comprehension accuracy (<xref ref-type="fig" rid="F6">Figure 6</xref>), although accuracy is now overall lower than in Experiment 1. Logistic regression results predicting accuracy are shown in <xref ref-type="table" rid="T5">Table 5</xref>: the only significant effect interacting with the Paraphrase factor is that Level-3 embeddings are more difficult. A <italic>p</italic> = .03 negative interaction between Level 3 and Language = German is likely not meaningful, as it applies to the Paraphrase = Yes condition.</p>
<fig id="F6">
<caption>
<p><bold>Figure 6:</bold> Average question response accuracy in each condition of Experiment 2. The difference between the orange (No-paraphrase) and gray (Paraphrase) bars indicates the decrease in response accuracy caused by the use of an embedded structure, which is the effect of interest. The error bar shows the standard error of the mean response accuracy.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-4-1-20916-g6.png"/>
</fig>
<table-wrap id="T5">
<caption>
<p><bold>Table 5:</bold> The estimated effects of embedding level, language, and paraphrase condition on question response accuracy. The estimates were obtained from a mixed-effects logistic regression fitted to the question response data. The two theoretically important interaction effects are highlighted with a gray shade.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top"><bold>Effect</bold></td>
<td align="left" valign="top"><bold>Estimate</bold></td>
<td align="left" valign="top"><bold>Std.Err.</bold></td>
<td align="left" valign="top"><bold><italic>z</italic>-value</bold></td>
<td align="left" valign="top"><bold><italic>p</italic>-value</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Intercept</td>
<td align="right" valign="top">2.16</td>
<td align="right" valign="top">0.25</td>
<td align="right" valign="top">8.64</td>
<td align="right" valign="top">&lt;0.01</td>
</tr>
<tr>
<td align="left" valign="top">Level 3</td>
<td align="right" valign="top">&#8211;0.61</td>
<td align="right" valign="top">0.15</td>
<td align="right" valign="top">&#8211;4.10</td>
<td align="right" valign="top">&lt;0.01</td>
</tr>
<tr>
<td align="left" valign="top">Paraphrase = No</td>
<td align="right" valign="top">0.13</td>
<td align="right" valign="top">0.16</td>
<td align="right" valign="top">0.83</td>
<td align="right" valign="top">0.41</td>
</tr>
<tr>
<td align="left" valign="top">Language = German</td>
<td align="right" valign="top">0.35</td>
<td align="right" valign="top">0.19</td>
<td align="right" valign="top">1.83</td>
<td align="right" valign="top">0.07</td>
</tr>
<tr>
<td align="left" valign="top">Level 3 &#215; Paraphrase = No</td>
<td align="right" valign="top">&#8211;1.08</td>
<td align="right" valign="top">0.20</td>
<td align="right" valign="top">&#8211;5.29</td>
<td align="right" valign="top">&lt;0.01</td>
</tr>
<tr>
<td align="left" valign="top">Level 3 &#215; Language = German</td>
<td align="right" valign="top">&#8211;0.47</td>
<td align="right" valign="top">0.21</td>
<td align="right" valign="top">&#8211;2.21</td>
<td align="right" valign="top">0.03</td>
</tr>
<tr>
<td align="left" valign="top" style="background-color:#d9d9d9;">Paraphrase = No &#215; Language = German</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">&#8211;0.10</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.24</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">&#8211;0.41</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.68</td>
</tr>
<tr>
<td align="left" valign="top" style="background-color:#d9d9d9;">Level 3 &#215; Paraphrase = No &#215; Language = German</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.33</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.31</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">1.08</td>
<td align="right" valign="top" style="background-color:#d9d9d9;">0.28</td>
</tr>
</tbody>
</table>
</table-wrap>
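<p>As a reading aid for Table 5, the following minimal sketch converts the fixed-effect estimates into predicted accuracies for each design cell. It assumes treatment (dummy) coding with Level 2, Paraphrase = Yes, and Dutch as the reference levels, which is an assumption made here and not stated in this section, and it ignores the random effects. The names <monospace>COEF</monospace> and <monospace>predicted_accuracy</monospace> are illustrative and are not taken from the analysis scripts.</p>
<code language="python">
```python
from itertools import product
from math import exp

# Fixed-effect estimates (log-odds) copied from Table 5.
# Keys list the dummy variables that must all be active for a term to apply.
COEF = {
    (): 2.16,                                # intercept (assumed reference cell)
    ("level3",): -0.61,
    ("para_no",): 0.13,
    ("german",): 0.35,
    ("level3", "para_no"): -1.08,
    ("level3", "german"): -0.47,
    ("para_no", "german"): -0.10,
    ("level3", "para_no", "german"): 0.33,
}


def predicted_accuracy(level3, para_no, german):
    """Inverse-logit of the summed fixed effects for one design cell."""
    active = {"level3": level3, "para_no": para_no, "german": german}
    log_odds = sum(
        estimate
        for terms, estimate in COEF.items()
        if all(active[term] for term in terms)
    )
    return 1.0 / (1.0 + exp(-log_odds))


for german, level3, para_no in product([False, True], repeat=3):
    label = (
        f"{'German' if german else 'Dutch'}, "
        f"Level {3 if level3 else 2}, "
        f"Paraphrase = {'No' if para_no else 'Yes'}"
    )
    print(f"{label}: {predicted_accuracy(level3, para_no, german):.3f}")
```
</code>
<p>Under these assumptions, the drop from the Paraphrase = Yes cell to the Paraphrase = No cell at Level 3 reflects the Level 3 &#215; Paraphrase = No term on the probability scale. The printed values are model-based predictions for an average participant and item, not the observed means in Figure 6.</p>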
</sec>
</sec>
</sec>
<sec>
<title>5. General discussion</title>
<sec>
<title>5.1 Replication status of Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>)</title>
<p>In a broad sense, the above results replicate the findings of Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) in a different modality: we find greater difficulty in German than in Dutch. However, the particulars of the results diverge from Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>). The cross-linguistic difference in difficulty emerges at embedding level 2 in our results, whereas it emerged at level 3 in the original results. We find only mixed evidence that the cross-linguistic difference becomes stronger going from level 2 to level 3 embeddings.</p>
<p>The most noticeable difference is that we find no crosslinguistic differences in comprehension question accuracy. Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) found a difference in comprehension accuracy between languages for 2 and 3 levels of embedding, although only for the infinitive form of the German materials. It is possible that the difference found by Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>) is due to the use of infinitive verb forms, which may be dispreferred by speakers; in contrast, we analyze only data from the verb forms that participants preferred. The difference may also stem from the use of open-ended questions in the original work, whereas we used multiple-choice questions probing different dependencies.</p>
</sec>
<sec>
<title>5.2 Frequency-based explanation?</title>
<p>One natural question is whether the difference in processing between German and Dutch arises due to the differences in frequency of these verb sequences in the two languages. Indeed, a large number of processing phenomena across languages seem to be reducible to effects of surprisal (<xref ref-type="bibr" rid="B21">Frank and Bod, 2011</xref>; <xref ref-type="bibr" rid="B26">Futrell et al., 2020a</xref>; <xref ref-type="bibr" rid="B33">Hale, 2001</xref>; <xref ref-type="bibr" rid="B46">Levy, 2008</xref>), which reflects language-internal statistics (but see <xref ref-type="bibr" rid="B36">Huang et al., 2024</xref>; <xref ref-type="bibr" rid="B66">van Schijndel and Linzen, 2021</xref>). In the case of crossing dependencies, Levy et al. (<xref ref-type="bibr" rid="B48">2012</xref>) find that the difficulty of right-extraposed relative clauses is a function of their surprisal given the main verb.</p>
<p>To investigate this possibility for the Dutch and German structures under study, we collected corpus counts from web-based corpora (<xref ref-type="bibr" rid="B57">Sch&#228;fer, 2015</xref>; <xref ref-type="bibr" rid="B58">Sch&#228;fer and Bildhauer, 2012</xref>) for the sequences of verbs found in our critical experimental items. We used the actual lexical items found in our experiments in order to test the simplest frequency-based account: that the acceptability difference is due to frequency of exposure alone. In Dutch and German web text (252.8 million sentences of Dutch and 607.7 million sentences of German), we find only 3 instances of the critical verb trigrams in Dutch, and 0 in German, suggesting extreme rarity for the level-3 embedded structures. For verb bigrams, reflecting the frequency of level-2 embedded structures, we find 13,077 in Dutch and 47,326 in German, suggesting that the level-2 structures are significantly <italic>more</italic> frequent in German than Dutch (in a <italic>&#967;</italic>-squared test on the proportion of verb bigrams per sentence in the Dutch vs. German corpora, we find <italic>&#967;</italic><sup>2</sup> = 1738.9, <italic>p</italic> &lt; .001). The lower frequency of verb trigram sequences in German may explain the higher difficulty ratings for level-3 embeddings, but the higher frequency of verb bigrams in German is not reflected in lower difficulty for level-2 embeddings.</p>
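<p>The bigram comparison above can be sketched as a Pearson <italic>&#967;</italic><sup>2</sup> test on a 2-by-2 table of sentences with and without a critical verb bigram. This is a minimal illustration under stated assumptions, not the original analysis script: it treats each bigram occurrence as one sentence, applies no continuity correction, and uses the rounded sentence totals reported in the text. The function name <monospace>chi2_two_proportions</monospace> is hypothetical.</p>
<code language="python">
```python
from math import erfc, sqrt


def chi2_two_proportions(hits_a, n_a, hits_b, n_b):
    """Pearson chi-squared test (1 df, no continuity correction) on a 2x2 table.

    Rows are the two corpora; columns are sentences counted as containing
    a critical verb bigram versus all remaining sentences.
    """
    observed = [[hits_a, n_a - hits_a], [hits_b, n_b - hits_b]]
    row_totals = [n_a, n_b]
    col_totals = [hits_a + hits_b, (n_a - hits_a) + (n_b - hits_b)]
    total = n_a + n_b
    stat = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed[i][j] - expected) ** 2 / expected
    # With 1 df the statistic is a squared standard normal, so the
    # upper-tail probability is erfc(sqrt(stat / 2)).
    p_value = erfc(sqrt(stat / 2))
    return stat, p_value


# Counts as reported in the text; the sentence totals are rounded there.
dutch_bigrams, dutch_sentences = 13_077, 252_800_000
german_bigrams, german_sentences = 47_326, 607_700_000

dutch_rate = dutch_bigrams / dutch_sentences
german_rate = german_bigrams / german_sentences
stat, p_value = chi2_two_proportions(
    dutch_bigrams, dutch_sentences, german_bigrams, german_sentences
)

print(f"bigrams per million sentences: Dutch {dutch_rate * 1e6:.1f}, "
      f"German {german_rate * 1e6:.1f}")
print(f"chi-squared(1) = {stat:.1f}, p = {p_value:.3g}")
```
</code>
<p>Because the sentence totals are rounded, the statistic from this sketch should be close to, but need not exactly equal, the reported value. The <italic>p</italic>-value is small enough that it may print as 0 in floating-point arithmetic.</p>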
<p>In any case, a frequency-based explanation of the processing difference between Dutch and German cannot provide a complete account, because it leaves open the question of <italic>why</italic> some structures are more frequent than others to begin with. The low frequency of multiple embedded verb phrases may be a result of processing difficulty, rather than a cause of it, if speakers avoid using constructions that engender difficulty.</p>
</sec>
<sec>
<title>5.3 Consequences for psycholinguistic theories</title>
<p>Our results add to the evidence that crossing dependencies incur no particular processing difficulty in comprehension. Although this result has been found before, it is still somewhat remarkable given the formal complexity associated with such dependencies.</p>
<p>The results are, however, compatible with accounts of processing difficulty based on dependency locality (<xref ref-type="bibr" rid="B29">Gibson, 1998</xref>, <xref ref-type="bibr" rid="B30">2000</xref>), where difficulty occurs when a word must be integrated with a head or dependent far away from it in the linear order of words, and with automaton-based theories where processing difficulty is associated with the amount of time an item must be stored on a suitably structured stack. As shown in <xref ref-type="fig" rid="F1">Figure 1</xref>, the nested German structures involve longer (maximum) dependency lengths than the cross-serial Dutch structures. Note that it is not always the case that crossing dependencies create shorter dependency lengths, as they do here: for example, topicalization (a form of <italic>wh</italic>-movement) often increases dependency length.</p>
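<p>The dependency-length contrast can be illustrated with a schematic three-level verb cluster. The sketch below is a simplification made here for illustration: it uses abstract labels (N1 to N3, V1 to V3) instead of the experimental items, counts only noun-verb dependencies, and measures length as the difference in word positions. The helper names are hypothetical.</p>
<code language="python">
```python
from itertools import combinations


def arcs(order, pairs):
    """Return each dependency as a (left, right) pair of word positions."""
    position = {word: i for i, word in enumerate(order)}
    return [tuple(sorted((position[a], position[b]))) for a, b in pairs]


def dependency_lengths(order, pairs):
    """Linear distance, in words, spanned by each dependency."""
    return [right - left for left, right in arcs(order, pairs)]


def crossings(order, pairs):
    """Count pairs of dependency arcs that cross each other."""
    count = 0
    for (l1, r1), (l2, r2) in combinations(sorted(arcs(order, pairs)), 2):
        # After sorting, the second arc starts to the right of the first;
        # they cross when it starts inside the first arc and ends outside it.
        if r2 > r1 > l2:
            count += 1
    return count


# Each noun depends on the verb with the same index.
pairs = [("N1", "V1"), ("N2", "V2"), ("N3", "V3")]
german_nested = ["N1", "N2", "N3", "V3", "V2", "V1"]
dutch_crossed = ["N1", "N2", "N3", "V1", "V2", "V3"]

for name, order in [("nested", german_nested), ("cross-serial", dutch_crossed)]:
    lengths = dependency_lengths(order, pairs)
    print(f"{name}: lengths={lengths}, max={max(lengths)}, "
          f"total={sum(lengths)}, crossings={crossings(order, pairs)}")
```
</code>
<p>In this schematic, the nested order has the longer maximum dependency even though the summed lengths are the same in both orders, which is why the comparison is stated in terms of maximum length. Dependencies between the verbs themselves are omitted here and would need to be added to mirror a full dependency analysis.</p>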
<p>The generality of our results needs some qualification, however, because we studied only one construction, and we were comparing across two languages. Within Dutch, there is no alternative word order available to express exactly the dependency structure in example (1a). It is thus possible that the processing system for Dutch is &#8216;fine-tuned&#8217; toward such word orders because they occur inevitably within the language, and this fine-tuning masks any underlying difficulty associated with the crossing dependencies. We note, however, that the German order is equally inevitable given the dependency structure, and the frequency of the relevant verb trigrams is vanishingly low in both languages.<xref ref-type="fn" rid="n6">6</xref></p>
</sec>
<sec>
<title>5.4 Consequences for typology: Why are crossing dependencies rare?</title>
<p>While crossing dependencies are widely attested across languages, they are rare within languages. The underlying explanation for this rarity remains unknown. A common view is that there are hard formal restrictions on syntactic patterns that humans can learn: that is, a universal constraint on mental representations of grammars limits the occurrence of crossing structures in a sentence (<xref ref-type="bibr" rid="B13">Chomsky, 1965</xref>; <xref ref-type="bibr" rid="B41">Joshi et al., 1991</xref>; <xref ref-type="bibr" rid="B62">Silva et al., 2022</xref>). However, artificial language learning experiments do not support a preference for context-free structures (<xref ref-type="bibr" rid="B52">&#214;ttl et al., 2015</xref>). Our results argue against a view where crossing dependencies are avoided because they are associated with online comprehension difficulty. While avoidance of long dependencies (<xref ref-type="bibr" rid="B17">Ferrer-i-Cancho, 2004</xref>; <xref ref-type="bibr" rid="B27">Futrell et al., 2020b</xref>; <xref ref-type="bibr" rid="B35">Hawkins, 1994</xref>; <xref ref-type="bibr" rid="B49">Liu et al., 2017</xref>) does reduce the rate of crossing dependencies on average (<xref ref-type="bibr" rid="B18">Ferrer-i-Cancho, 2006</xref>), it does not fully explain their observed rarity (<xref ref-type="bibr" rid="B68">Yadav et al., 2021</xref>). Nevertheless, processing-based explanations are not yet fully ruled out: there may still be processing difficulty associated with crossing dependencies during production.</p>
</sec>
</sec>
<sec>
<title>6. Conclusion</title>
<p>We have revisited the classic psycholinguistic results on cross-serial versus nested dependencies from Bach et al. (<xref ref-type="bibr" rid="B3">1986</xref>). Our findings broadly support the conclusion that long nested dependencies in German engender more processing difficulty than the equivalent cross-serial dependencies in Dutch. The results support a dependency locality account of the relative processing difficulty of nested and crossing dependencies.</p>
</sec>
</body>
<back>
<sec>
<title>Data accessibility statement</title>
<p>All code and data are publicly available at <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://osf.io/qbc5f/">https://osf.io/qbc5f/</ext-link>. Supplementary files with the German verb-form pretest, regression modeling, and power analysis details are available at <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://osf.io/93vq6">https://osf.io/93vq6</ext-link>.</p>
</sec>
<sec>
<title>Ethics and consent</title>
<p>Experiments were performed following UC Irvine IRB protocols. Participants gave informed consent for participation.</p>
</sec>
<sec>
<title>Acknowledgments</title>
<p>We thank Walter Haesereyn for his input on Dutch syntax and Volker Struckmeier for his input on German syntax during stimuli preparation. We also thank In&#233;s Sch&#246;nmann for checking the German sentence stimuli, and three anonymous reviewers, whose comments considerably improved the quality of this manuscript.</p>
</sec>
<sec>
<title>Competing interests</title>
<p>The authors have no competing interests to declare.</p>
</sec>
<sec>
<title>Authors&#8217; contributions</title>
<p>HY, SF, RF, and SH conceived the study and its design. SF and RF wrote items with input from all authors. HY conducted data analysis with input from all authors. All authors wrote and edited the manuscript.</p>
</sec>
<fn-group>
<fn id="n1"><p>We use dependency locality theory (<xref ref-type="bibr" rid="B30">Gibson, 2000</xref>) to analyze the current results, but this theory is not in conflict with the perspective based on how long items must be stored on the stack of a parser: dependency length corresponds exactly to the minimal time that an item must be stored on the stack of a left-corner parser.</p></fn>
<fn id="n2"><p>Dutch has a seventh verb that may be used in this way, <italic>doen</italic> &#8216;to make someone do&#8217;.</p></fn>
<fn id="n3"><p>Details of the pretest can be found in the supplementary materials.</p></fn>
<fn id="n4"><p>The preregistration can be found at <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://aspredicted.org/blind.php?x=3XC_VMS">https://aspredicted.org/blind.php?x=3XC_VMS</ext-link>.</p></fn>
<fn id="n5"><p>The RSVP presentation seems to eliminate the unexpected negative three-way interaction found in Experiment 1, although it was not intended to do so.</p></fn>
<fn id="n6"><p>Interestingly, in less standardized varieties of German and in other verb clusters in Dutch, there is variation in word order for these kinds of structures (<xref ref-type="bibr" rid="B61">Shieber, 1985</xref>; <xref ref-type="bibr" rid="B4">Barbiers, 2005</xref>; <xref ref-type="bibr" rid="B5">Barbiers et al., 2018</xref>).</p></fn>
</fn-group>
<ref-list>
<ref id="B1"><mixed-citation publication-type="journal"><string-name><surname>Abney</surname>, <given-names>S. P.</given-names></string-name>, &amp; <string-name><surname>Johnson</surname>, <given-names>M.</given-names></string-name> (<year>1991</year>). <article-title>Memory requirements and local ambiguities of parsing strategies</article-title>. <source>Journal of Psycholinguistic Research</source>, <volume>20</volume>(<issue>3</issue>), <fpage>233</fpage>&#8211;<lpage>250</lpage>. <pub-id pub-id-type="doi">10.1007/BF01067217</pub-id></mixed-citation></ref>
<ref id="B2"><mixed-citation publication-type="journal"><string-name><surname>Baayen</surname>, <given-names>R. H.</given-names></string-name>, <string-name><surname>Davidson</surname>, <given-names>D. J.</given-names></string-name>, &amp; <string-name><surname>Bates</surname>, <given-names>D. M.</given-names></string-name> (<year>2008</year>). <article-title>Mixed-effects modeling with crossed random effects for subjects and items</article-title>. <source>Journal of Memory and Language</source>, <volume>59</volume>(<issue>4</issue>), <fpage>390</fpage>&#8211;<lpage>412</lpage>. <pub-id pub-id-type="doi">10.1016/j.jml.2007.12.005</pub-id></mixed-citation></ref>
<ref id="B3"><mixed-citation publication-type="journal"><string-name><surname>Bach</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Brown</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Marslen-Wilson</surname>, <given-names>W. D.</given-names></string-name> (<year>1986</year>). <article-title>Cross and nested dependencies in German and Dutch: A psycholinguistic study</article-title>. <source>Language and Cognitive Processes</source>, <volume>1</volume>(<issue>4</issue>), <fpage>249</fpage>&#8211;<lpage>262</lpage>. <pub-id pub-id-type="doi">10.1080/01690968608404677</pub-id></mixed-citation></ref>
<ref id="B4"><mixed-citation publication-type="book"><string-name><surname>Barbiers</surname>, <given-names>S.</given-names></string-name> (<year>2005</year>). <chapter-title>Word order variation in three-verb clusters and the division of labour between generative linguistics and sociolinguistics</chapter-title>. In <string-name><given-names>L.</given-names> <surname>Cornips</surname></string-name> &amp; <string-name><given-names>K.</given-names> <surname>Corrigan</surname></string-name> (Eds.), <source>Syntax and variation. Reconciling the biological and the social</source> (pp. <fpage>233</fpage>&#8211;<lpage>264</lpage>). <publisher-name>Benjamins</publisher-name>. <pub-id pub-id-type="doi">10.1075/cilt.265.14bar</pub-id></mixed-citation></ref>
<ref id="B5"><mixed-citation publication-type="journal"><string-name><surname>Barbiers</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Bennis</surname>, <given-names>H.</given-names></string-name>, &amp; <string-name><surname>Dros-Hendriks</surname>, <given-names>L.</given-names></string-name> (<year>2018</year>). <article-title>Merging verb cluster variation</article-title>. <source>Linguistic Variation</source>, <volume>18</volume>(<issue>1</issue>), <fpage>144</fpage>&#8211;<lpage>196</lpage>. <pub-id pub-id-type="doi">10.1075/lv.00008.bar</pub-id></mixed-citation></ref>
<ref id="B6"><mixed-citation publication-type="journal"><string-name><surname>Barr</surname>, <given-names>D. J.</given-names></string-name>, <string-name><surname>Levy</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Scheepers</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Tily</surname>, <given-names>H. J.</given-names></string-name> (<year>2013</year>). <article-title>Random effects structure for confirmatory hypothesis testing: Keep it maximal</article-title>. <source>Journal of Memory and Language</source>, <volume>68</volume>(<issue>3</issue>), <fpage>255</fpage>&#8211;<lpage>278</lpage>. <pub-id pub-id-type="doi">10.1016/j.jml.2012.11.001</pub-id></mixed-citation></ref>
<ref id="B7"><mixed-citation publication-type="journal"><string-name><surname>Bartek</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Lewis</surname>, <given-names>R. L.</given-names></string-name>, <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Smith</surname>, <given-names>M. R.</given-names></string-name> (<year>2011</year>). <article-title>In search of on-line locality effects in sentence comprehension</article-title>. <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>, <volume>37</volume>(<issue>5</issue>), <fpage>1178</fpage>&#8211;<lpage>1198</lpage>. <pub-id pub-id-type="doi">10.1037/a0024194</pub-id></mixed-citation></ref>
<ref id="B8"><mixed-citation publication-type="journal"><string-name><surname>Bates</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Kliegl</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Baayen</surname>, <given-names>H.</given-names></string-name> (<year>2015</year>). <article-title>Parsimonious mixed models</article-title>. <source>arXiv preprint arXiv:1506.04967</source>.</mixed-citation></ref>
<ref id="B9"><mixed-citation publication-type="journal"><string-name><surname>Bates</surname>, <given-names>D. M.</given-names></string-name>, <string-name><surname>Maechler</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Bolker</surname>, <given-names>B.</given-names></string-name>, &amp; <string-name><surname>Walker</surname>, <given-names>S.</given-names></string-name> (<year>2014</year>). <source>lme4: Linear mixed-effects models using Eigen and S4</source>. R package version 1.1-7.</mixed-citation></ref>
<ref id="B10"><mixed-citation publication-type="book"><string-name><surname>Bock</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Levelt</surname>, <given-names>W.</given-names></string-name>, &amp; <string-name><surname>Gernsbacher</surname>, <given-names>M. A.</given-names></string-name> (<year>2002</year>). <chapter-title>Language production: Grammatical encoding</chapter-title>. In <source>Psycholinguistics: Critical concepts in psychology</source> (pp. <fpage>405</fpage>&#8211;<lpage>452</lpage>). <publisher-name>Routledge</publisher-name>.</mixed-citation></ref>
<ref id="B11"><mixed-citation publication-type="book"><string-name><surname>Bresnan</surname>, <given-names>J. W.</given-names></string-name> (<year>1982</year>). <source>The mental representation of grammatical relations</source>. <publisher-name>MIT Press</publisher-name>.</mixed-citation></ref>
<ref id="B12"><mixed-citation publication-type="journal"><string-name><surname>Bresnan</surname>, <given-names>J. W.</given-names></string-name>, <string-name><surname>Kaplan</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Peters</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Zaenen</surname>, <given-names>A.</given-names></string-name> (<year>1982</year>). <article-title>Cross-serial dependencies in Dutch</article-title>. <source>Linguistic Inquiry</source>, <volume>13</volume>, <fpage>613</fpage>&#8211;<lpage>635</lpage>. <pub-id pub-id-type="doi">10.1007/978-94-009-3401-6_11</pub-id></mixed-citation></ref>
<ref id="B13"><mixed-citation publication-type="book"><string-name><surname>Chomsky</surname>, <given-names>N.</given-names></string-name> (<year>1965</year>). <source>Aspects of the theory of syntax</source>. <publisher-name>MIT Press</publisher-name>. <pub-id pub-id-type="doi">10.21236/AD0616323</pub-id></mixed-citation></ref>
<ref id="B14"><mixed-citation publication-type="book"><string-name><surname>Clifton</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Frazier</surname>, <given-names>L.</given-names></string-name> (<year>1989</year>). <chapter-title>Comprehending sentences with long-distance dependencies</chapter-title>. In <string-name><given-names>G. N.</given-names> <surname>Carlson</surname></string-name> &amp; <string-name><given-names>M. K.</given-names> <surname>Tanenhaus</surname></string-name> (Eds.), <source>Linguistic structure in language processing</source> (pp. <fpage>273</fpage>&#8211;<lpage>317</lpage>). <publisher-name>Springer</publisher-name>. <pub-id pub-id-type="doi">10.1007/978-94-009-2729-2_8</pub-id></mixed-citation></ref>
<ref id="B15"><mixed-citation publication-type="journal"><string-name><surname>De Santo</surname>, <given-names>A.</given-names></string-name> (<year>2020</year>). <article-title>MG parsing as a model of gradient acceptability in syntactic islands</article-title>. In <source>Proceedings of the Society for Computation in Linguistics</source> (pp. <fpage>59</fpage>&#8211;<lpage>69</lpage>).</mixed-citation></ref>
<ref id="B16"><mixed-citation publication-type="journal"><string-name><surname>Fedorenko</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Woodbury</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name> (<year>2013</year>). <article-title>Direct evidence of memory retrieval as a source of difficulty in non-local dependencies in language</article-title>. <source>Cognitive Science</source>, <volume>37</volume>, <fpage>378</fpage>&#8211;<lpage>394</lpage>. <pub-id pub-id-type="doi">10.1111/cogs.12021</pub-id></mixed-citation></ref>
<ref id="B17"><mixed-citation publication-type="journal"><string-name><surname>Ferrer-i-Cancho</surname>, <given-names>R.</given-names></string-name> (<year>2004</year>). <article-title>Euclidean distance between syntactically linked words</article-title>. <source>Physical Review E</source>, <volume>70</volume>, <elocation-id>056135</elocation-id>. <pub-id pub-id-type="doi">10.1103/PhysRevE.70.056135</pub-id></mixed-citation></ref>
<ref id="B18"><mixed-citation publication-type="journal"><string-name><surname>Ferrer-i-Cancho</surname>, <given-names>R.</given-names></string-name> (<year>2006</year>). <article-title>Why do syntactic links not cross?</article-title> <source>Europhysics Letters</source>, <volume>76</volume>(<issue>6</issue>), <elocation-id>1228</elocation-id>. <pub-id pub-id-type="doi">10.1209/epl/i2006-10406-0</pub-id></mixed-citation></ref>
<ref id="B19"><mixed-citation publication-type="journal"><string-name><surname>Ferrer-i-Cancho</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>G&#243;mez-Rodr&#237;guez</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Esteban</surname>, <given-names>J. L.</given-names></string-name> (<year>2018</year>). <article-title>Are crossing dependencies really scarce?</article-title> <source>Physica A: Statistical Mechanics and its Applications</source>, <volume>493</volume>, <fpage>311</fpage>&#8211;<lpage>329</lpage>. <pub-id pub-id-type="doi">10.1016/j.physa.2017.10.048</pub-id></mixed-citation></ref>
<ref id="B20"><mixed-citation publication-type="journal"><string-name><surname>Francis</surname>, <given-names>E. J.</given-names></string-name> (<year>2010</year>). <article-title>Grammatical weight and relative clause extraposition in English</article-title>. <source>Cognitive Linguistics</source>, <volume>21</volume>(<issue>1</issue>), <fpage>35</fpage>&#8211;<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1515/cogl.2010.002</pub-id></mixed-citation></ref>
<ref id="B21"><mixed-citation publication-type="journal"><string-name><surname>Frank</surname>, <given-names>S. L.</given-names></string-name>, &amp; <string-name><surname>Bod</surname>, <given-names>R.</given-names></string-name> (<year>2011</year>). <article-title>Insensitivity of the human sentence-processing system to hierarchical structure</article-title>. <source>Psychological Science</source>, <volume>22</volume>(<issue>6</issue>), <fpage>829</fpage>&#8211;<lpage>834</lpage>. <pub-id pub-id-type="doi">10.1177/0956797611409589</pub-id></mixed-citation></ref>
<ref id="B22"><mixed-citation publication-type="journal"><string-name><surname>Frank</surname>, <given-names>S. L.</given-names></string-name>, <string-name><surname>Otten</surname>, <given-names>L. J.</given-names></string-name>, <string-name><surname>Galli</surname>, <given-names>G.</given-names></string-name>, &amp; <string-name><surname>Vigliocco</surname>, <given-names>G.</given-names></string-name> (<year>2015</year>). <article-title>The ERP response to the amount of information conveyed by words in sentences</article-title>. <source>Brain and Language</source>, <volume>140</volume>, <fpage>1</fpage>&#8211;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1016/j.bandl.2014.10.006</pub-id></mixed-citation></ref>
<ref id="B23"><mixed-citation publication-type="journal"><string-name><surname>Frank</surname>, <given-names>S. L.</given-names></string-name>, <string-name><surname>Trompenaars</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Lewis</surname>, <given-names>R. L.</given-names></string-name>, &amp; <string-name><surname>Vasishth</surname>, <given-names>S.</given-names></string-name> (<year>2016</year>). <article-title>Cross-linguistic differences in processing double-embedded relative clauses: Working-memory constraints or language statistics?</article-title> <source>Cognitive Science</source>, <volume>40</volume>, <fpage>554</fpage>&#8211;<lpage>578</lpage>. <pub-id pub-id-type="doi">10.1111/cogs.12247</pub-id></mixed-citation></ref>
<ref id="B24"><mixed-citation publication-type="book"><string-name><surname>Frazier</surname>, <given-names>L.</given-names></string-name> (<year>1985</year>). <chapter-title>Syntactic complexity</chapter-title>. In <string-name><given-names>D. R.</given-names> <surname>Dowty</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Karttunen</surname></string-name>, &amp; <string-name><given-names>A. M.</given-names> <surname>Zwicky</surname></string-name> (Eds.), <source>Natural language parsing: Psychological, computational, and theoretical perspectives</source> (pp. <fpage>129</fpage>&#8211;<lpage>189</lpage>). <publisher-name>Cambridge University Press</publisher-name>. <pub-id pub-id-type="doi">10.1017/CBO9780511597855.005</pub-id></mixed-citation></ref>
<ref id="B25"><mixed-citation publication-type="book"><string-name><surname>Frazier</surname>, <given-names>L.</given-names></string-name> (<year>1987</year>). <chapter-title>Sentence processing: A tutorial review</chapter-title>. In <string-name><given-names>M.</given-names> <surname>Coltheart</surname></string-name> (Ed.), <source>Attention and performance 12: The psychology of reading</source> (pp. <fpage>559</fpage>&#8211;<lpage>586</lpage>). <publisher-name>Lawrence Erlbaum Associates, Inc</publisher-name>.</mixed-citation></ref>
<ref id="B26"><mixed-citation publication-type="journal"><string-name><surname>Futrell</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name>, &amp; <string-name><surname>Levy</surname>, <given-names>R. P.</given-names></string-name> (<year>2020a</year>). <article-title>Lossy-context surprisal: An information-theoretic model of memory effects in sentence processing</article-title>. <source>Cognitive Science</source>, <volume>44</volume>, <elocation-id>e12814</elocation-id>. <pub-id pub-id-type="doi">10.1111/cogs.12814</pub-id></mixed-citation></ref>
<ref id="B27"><mixed-citation publication-type="journal"><string-name><surname>Futrell</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Levy</surname>, <given-names>R. P.</given-names></string-name>, &amp; <string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name> (<year>2020b</year>). <article-title>Dependency locality as an explanatory principle for word order</article-title>. <source>Language</source>, <volume>96</volume>(<issue>2</issue>), <fpage>371</fpage>&#8211;<lpage>413</lpage>. <pub-id pub-id-type="doi">10.1353/lan.2020.0024</pub-id></mixed-citation></ref>
<ref id="B28"><mixed-citation publication-type="book"><string-name><surname>Gelman</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Hill</surname>, <given-names>J.</given-names></string-name> (<year>2007</year>). <source>Data analysis using regression and multilevel/hierarchical models</source>. <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation></ref>
<ref id="B29"><mixed-citation publication-type="journal"><string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name> (<year>1998</year>). <article-title>Linguistic complexity: Locality of syntactic dependencies</article-title>. <source>Cognition</source>, <volume>68</volume>(<issue>1</issue>), <fpage>1</fpage>&#8211;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1016/S0010-0277(98)00034-1</pub-id></mixed-citation></ref>
<ref id="B30"><mixed-citation publication-type="book"><string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name> (<year>2000</year>). <chapter-title>The dependency locality theory: A distance-based theory of linguistic complexity</chapter-title>. In <string-name><given-names>A.</given-names> <surname>Marantz</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Miyashita</surname></string-name>, &amp; <string-name><given-names>W.</given-names> <surname>O&#8217;Neil</surname></string-name> (Eds.), <source>Image, language, brain: Papers from the First Mind Articulation Project Symposium</source> (pp. <fpage>95</fpage>&#8211;<lpage>126</lpage>). <publisher-name>MIT Press</publisher-name>. <pub-id pub-id-type="doi">10.7551/mitpress/3654.003.0008</pub-id></mixed-citation></ref>
<ref id="B31"><mixed-citation publication-type="journal"><string-name><surname>Graf</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Monette</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Zhang</surname>, <given-names>C.</given-names></string-name> (<year>2017</year>). <article-title>Relative clauses as a benchmark for minimalist parsing</article-title>. <source>Journal of Language Modelling</source>, <volume>5</volume>(<issue>1</issue>), <fpage>57</fpage>&#8211;<lpage>106</lpage>. <pub-id pub-id-type="doi">10.15398/jlm.v5i1.157</pub-id></mixed-citation></ref>
<ref id="B32"><mixed-citation publication-type="journal"><string-name><surname>Grodner</surname>, <given-names>D.</given-names></string-name>, &amp; <string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name> (<year>2005</year>). <article-title>Consequences of the serial nature of linguistic input for sentential complexity</article-title>. <source>Cognitive Science</source>, <volume>29</volume>(<issue>2</issue>), <fpage>261</fpage>&#8211;<lpage>290</lpage>. <pub-id pub-id-type="doi">10.1207/s15516709cog0000_7</pub-id></mixed-citation></ref>
<ref id="B33"><mixed-citation publication-type="journal"><string-name><surname>Hale</surname>, <given-names>J. T.</given-names></string-name> (<year>2001</year>). <article-title>A probabilistic Earley parser as a psycholinguistic model</article-title>. In <source>Proceedings of the Second Meeting of the North American Chapter of the Association for Computational Linguistics and Language Technologies</source> (pp. <fpage>1</fpage>&#8211;<lpage>8</lpage>). <pub-id pub-id-type="doi">10.3115/1073336.1073357</pub-id></mixed-citation></ref>
<ref id="B34"><mixed-citation publication-type="book"><string-name><surname>Havelka</surname>, <given-names>J.</given-names></string-name> (<year>2007</year>). <chapter-title>Beyond projectivity: Multilingual evaluation of constraints and measures on non-projective structures</chapter-title>. In <source>Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics</source> (pp. <fpage>608</fpage>&#8211;<lpage>615</lpage>). <publisher-name>Association for Computational Linguistics</publisher-name>.</mixed-citation></ref>
<ref id="B35"><mixed-citation publication-type="book"><string-name><surname>Hawkins</surname>, <given-names>J. A.</given-names></string-name> (<year>1994</year>). <source>A performance theory of order and constituency</source>. <publisher-name>Cambridge University Press</publisher-name>. <pub-id pub-id-type="doi">10.1017/CBO9780511554285</pub-id></mixed-citation></ref>
<ref id="B36"><mixed-citation publication-type="journal"><string-name><surname>Huang</surname>, <given-names>K.-J.</given-names></string-name>, <string-name><surname>Arehalli</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Kugemoto</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Muxica</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Prasad</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Dillon</surname>, <given-names>B.</given-names></string-name>, &amp; <string-name><surname>Linzen</surname>, <given-names>T.</given-names></string-name> (<year>2024</year>). <article-title>Large-scale benchmark yields no evidence that language model surprisal explains syntactic disambiguation difficulty</article-title>. <source>Journal of Memory and Language</source>, <volume>137</volume>, <elocation-id>104510</elocation-id>. <pub-id pub-id-type="doi">10.1016/j.jml.2024.104510</pub-id></mixed-citation></ref>
<ref id="B37"><mixed-citation publication-type="journal"><string-name><surname>Huck</surname>, <given-names>G. J.</given-names></string-name>, &amp; <string-name><surname>Na</surname>, <given-names>Y.</given-names></string-name> (<year>1990</year>). <article-title>Extraposition and focus</article-title>. <source>Language</source>, <volume>66</volume>, <fpage>51</fpage>&#8211;<lpage>77</lpage>. <pub-id pub-id-type="doi">10.1353/lan.1990.0023</pub-id></mixed-citation></ref>
<ref id="B38"><mixed-citation publication-type="journal"><string-name><surname>Husain</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Yadav</surname>, <given-names>H.</given-names></string-name> (<year>2020</year>). <article-title>Target complexity modulates syntactic priming during comprehension</article-title>. <source>Frontiers in Psychology</source>, <volume>11</volume>, <elocation-id>454</elocation-id>. <pub-id pub-id-type="doi">10.3389/fpsyg.2020.00454</pub-id></mixed-citation></ref>
<ref id="B39"><mixed-citation publication-type="book"><string-name><surname>Joshi</surname>, <given-names>A. K.</given-names></string-name> (<year>1985</year>). <chapter-title>Tree adjoining grammars: How much context-sensitivity is required to provide reasonable structural descriptions?</chapter-title> In <string-name><given-names>D. R.</given-names> <surname>Dowty</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Karttunen</surname></string-name>, &amp; <string-name><given-names>A. M.</given-names> <surname>Zwicky</surname></string-name> (Eds.), <source>Natural language parsing: Psychological, computational, and theoretical perspectives</source> (pp. <fpage>190</fpage>&#8211;<lpage>205</lpage>). <publisher-name>Cambridge University Press</publisher-name>. <pub-id pub-id-type="doi">10.1017/CBO9780511597855.007</pub-id></mixed-citation></ref>
<ref id="B40"><mixed-citation publication-type="journal"><string-name><surname>Joshi</surname>, <given-names>A. K.</given-names></string-name> (<year>1990</year>). <article-title>Processing crossed and nested dependencies: An automaton perspective on the psycholinguistic results</article-title>. <source>Language and Cognitive Processes</source>, <volume>5</volume>, <fpage>1</fpage>&#8211;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.1080/01690969008402095</pub-id></mixed-citation></ref>
<ref id="B41"><mixed-citation publication-type="book"><string-name><surname>Joshi</surname>, <given-names>A. K.</given-names></string-name>, <string-name><surname>Vijay-Shanker</surname>, <given-names>K.</given-names></string-name>, &amp; <string-name><surname>Weir</surname>, <given-names>D. J.</given-names></string-name> (<year>1991</year>). <chapter-title>The convergence of mildly context-sensitive grammar formalisms</chapter-title>. In <string-name><given-names>P.</given-names> <surname>Sells</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Shieber</surname></string-name>, &amp; <string-name><given-names>T.</given-names> <surname>Wasow</surname></string-name> (Eds.), <source>Foundational issues in natural language processing</source> (pp. <fpage>31</fpage>&#8211;<lpage>81</lpage>). <publisher-name>MIT Press</publisher-name>.</mixed-citation></ref>
<ref id="B42"><mixed-citation publication-type="thesis"><string-name><surname>Kobele</surname>, <given-names>G. M.</given-names></string-name> (<year>2006</year>). <source>Generating copies: An investigation into structural identity in language and grammar</source> [Doctoral dissertation]. <publisher-name>University of California, Los Angeles</publisher-name>.</mixed-citation></ref>
<ref id="B43"><mixed-citation publication-type="book"><string-name><surname>Kobele</surname>, <given-names>G. M.</given-names></string-name>, <string-name><surname>Gerth</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Hale</surname>, <given-names>J.</given-names></string-name> (<year>2013</year>). <chapter-title>Memory resource allocation in top-down minimalist parsing</chapter-title>. In <string-name><given-names>G.</given-names> <surname>Morrill</surname></string-name> &amp; <string-name><given-names>M.-J.</given-names> <surname>Nederhof</surname></string-name> (Eds.), <source>Formal grammar</source> (pp. <fpage>32</fpage>&#8211;<lpage>51</lpage>). <publisher-name>Springer</publisher-name>. <pub-id pub-id-type="doi">10.1007/978-3-642-39998-5_3</pub-id></mixed-citation></ref>
<ref id="B44"><mixed-citation publication-type="journal"><string-name><surname>Kuhlmann</surname>, <given-names>M.</given-names></string-name> (<year>2013</year>). <article-title>Mildly non-projective dependency grammar</article-title>. <source>Computational Linguistics</source>, <volume>39</volume>(<issue>2</issue>), <fpage>355</fpage>&#8211;<lpage>387</lpage>. <pub-id pub-id-type="doi">10.1162/COLI_a_00125</pub-id></mixed-citation></ref>
<ref id="B45"><mixed-citation publication-type="book"><string-name><surname>Levelt</surname>, <given-names>W. J. M.</given-names></string-name> (<year>1989</year>). <source>Speaking: From intention to articulation</source>. <publisher-name>MIT Press</publisher-name>. <pub-id pub-id-type="doi">10.7551/mitpress/6393.001.0001</pub-id></mixed-citation></ref>
<ref id="B46"><mixed-citation publication-type="journal"><string-name><surname>Levy</surname>, <given-names>R.</given-names></string-name> (<year>2008</year>). <article-title>Expectation-based syntactic comprehension</article-title>. <source>Cognition</source>, <volume>106</volume>(<issue>3</issue>), <fpage>1126</fpage>&#8211;<lpage>1177</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2007.05.006</pub-id></mixed-citation></ref>
<ref id="B47"><mixed-citation publication-type="book"><string-name><surname>Levy</surname>, <given-names>R.</given-names></string-name> (<year>2013</year>). <chapter-title>Memory and surprisal in human sentence comprehension</chapter-title>. In <string-name><given-names>R. P. G.</given-names> <surname>van Gompel</surname></string-name> (Ed.), <source>Sentence processing</source> (pp. <fpage>78</fpage>&#8211;<lpage>114</lpage>). <publisher-name>Psychology Press</publisher-name>.</mixed-citation></ref>
<ref id="B48"><mixed-citation publication-type="journal"><string-name><surname>Levy</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Fedorenko</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Breen</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Gibson</surname>, <given-names>E.</given-names></string-name> (<year>2012</year>). <article-title>The processing of extraposed structures in English</article-title>. <source>Cognition</source>, <volume>122</volume>(<issue>1</issue>), <fpage>12</fpage>&#8211;<lpage>36</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2011.07.012</pub-id></mixed-citation></ref>
<ref id="B49"><mixed-citation publication-type="journal"><string-name><surname>Liu</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Xu</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Liang</surname>, <given-names>J.</given-names></string-name> (<year>2017</year>). <article-title>Dependency distance: A new perspective on syntactic patterns in natural languages</article-title>. <source>Physics of Life Reviews</source>, <volume>21</volume>, <fpage>171</fpage>&#8211;<lpage>193</lpage>. <pub-id pub-id-type="doi">10.1016/j.plrev.2017.03.002</pub-id></mixed-citation></ref>
<ref id="B50"><mixed-citation publication-type="journal"><string-name><surname>MacDonald</surname>, <given-names>M. C.</given-names></string-name> (<year>2013</year>). <article-title>How language production shapes language form and comprehension</article-title>. <source>Frontiers in Psychology</source>, <volume>4</volume>, <elocation-id>226</elocation-id>. <pub-id pub-id-type="doi">10.3389/fpsyg.2013.00226</pub-id></mixed-citation></ref>
<ref id="B51"><mixed-citation publication-type="journal"><string-name><surname>Momma</surname>, <given-names>S.</given-names></string-name> (<year>2021</year>). <article-title>Filling the gap in gap-filling: Long-distance dependency formation in sentence production</article-title>. <source>Cognitive Psychology</source>, <volume>129</volume>, <elocation-id>101411</elocation-id>. <pub-id pub-id-type="doi">10.1016/j.cogpsych.2021.101411</pub-id></mixed-citation></ref>
<ref id="B52"><mixed-citation publication-type="journal"><string-name><surname>&#214;ttl</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>J&#228;ger</surname>, <given-names>G.</given-names></string-name>, &amp; <string-name><surname>Kaup</surname>, <given-names>B.</given-names></string-name> (<year>2015</year>). <article-title>Does formal complexity reflect cognitive complexity? Investigating aspects of the Chomsky hierarchy in an artificial language learning study</article-title>. <source>PLoS ONE</source>, <volume>10</volume>(<issue>4</issue>), <elocation-id>e0123059</elocation-id>. <pub-id pub-id-type="doi">10.1371/journal.pone.0123059</pub-id></mixed-citation></ref>
<ref id="B53"><mixed-citation publication-type="book"><string-name><surname>Rambow</surname>, <given-names>O.</given-names></string-name>, &amp; <string-name><surname>Joshi</surname>, <given-names>A. K.</given-names></string-name> (<year>1994</year>). <chapter-title>A processing model for free word-order languages</chapter-title>. In <string-name><given-names>C.</given-names> <surname>Clifton</surname>, <suffix>Jr.</suffix></string-name>, <string-name><given-names>L.</given-names> <surname>Frazier</surname></string-name>, &amp; <string-name><given-names>K.</given-names> <surname>Rayner</surname></string-name> (Eds.), <source>Perspectives on sentence processing</source>. <publisher-name>Psychology Press</publisher-name>.</mixed-citation></ref>
<ref id="B54"><mixed-citation publication-type="book"><string-name><surname>Rambow</surname>, <given-names>O.</given-names></string-name>, &amp; <string-name><surname>Satta</surname>, <given-names>G.</given-names></string-name> (<year>1994</year>). <chapter-title>A rewriting system for natural language syntax that is non-local and mildly context sensitive</chapter-title>. In <source>North-Holland Linguistic Series: Linguistic Variations</source> (Vol. <volume>56</volume>, pp. <fpage>121</fpage>&#8211;<lpage>130</lpage>). <publisher-name>Elsevier</publisher-name>.</mixed-citation></ref>
<ref id="B55"><mixed-citation publication-type="journal"><string-name><surname>Resnik</surname>, <given-names>P.</given-names></string-name> (<year>1992</year>). <article-title>Left-corner parsing and psychological plausibility</article-title>. In <source>Proceedings of the 14th International Conference on Computational Linguistics</source> (pp. <fpage>191</fpage>&#8211;<lpage>197</lpage>). <pub-id pub-id-type="doi">10.3115/992066.992098</pub-id></mixed-citation></ref>
<ref id="B56"><mixed-citation publication-type="book"><string-name><surname>Rochemont</surname>, <given-names>M. S.</given-names></string-name>, &amp; <string-name><surname>Culicover</surname>, <given-names>P. W.</given-names></string-name> (<year>1990</year>). <source>English focus constructions and the theory of grammar</source>. <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation></ref>
<ref id="B57"><mixed-citation publication-type="book"><string-name><surname>Sch&#228;fer</surname>, <given-names>R.</given-names></string-name> (<year>2015</year>). <chapter-title>Processing and querying large web corpora with the COW14 architecture</chapter-title>. In <source>Proceedings of the 3rd Workshop on Challenges in the Management of Large Corpora (CMLC-3)</source> (pp. <fpage>28</fpage>&#8211;<lpage>34</lpage>). <publisher-name>Institut f&#252;r Deutsche Sprache</publisher-name>.</mixed-citation></ref>
<ref id="B58"><mixed-citation publication-type="journal"><string-name><surname>Sch&#228;fer</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Bildhauer</surname>, <given-names>F.</given-names></string-name> (<year>2012</year>). <article-title>Building large corpora from the web using a new efficient tool chain</article-title>. In <source>Proceedings of the Eighth International Conference on Language Resources and Evaluation</source> (pp. <fpage>486</fpage>&#8211;<lpage>493</lpage>).</mixed-citation></ref>
<ref id="B59"><mixed-citation publication-type="journal"><string-name><surname>Scontras</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Badecker</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Shank</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Lim</surname>, <given-names>E.</given-names></string-name>, &amp; <string-name><surname>Fedorenko</surname>, <given-names>E.</given-names></string-name> (<year>2015</year>). <article-title>Syntactic complexity effects in sentence production</article-title>. <source>Cognitive Science</source>, <volume>39</volume>(<issue>3</issue>), <fpage>559</fpage>&#8211;<lpage>583</lpage>. <pub-id pub-id-type="doi">10.1111/cogs.12168</pub-id></mixed-citation></ref>
<ref id="B60"><mixed-citation publication-type="journal"><string-name><surname>Seki</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Matsumura</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Fujii</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Kasami</surname>, <given-names>T.</given-names></string-name> (<year>1991</year>). <article-title>On multiple context-free grammars</article-title>. <source>Theoretical Computer Science</source>, <volume>88</volume>(<issue>2</issue>), <fpage>191</fpage>&#8211;<lpage>229</lpage>. <pub-id pub-id-type="doi">10.1016/0304-3975(91)90374-B</pub-id></mixed-citation></ref>
<ref id="B61"><mixed-citation publication-type="book"><string-name><surname>Shieber</surname>, <given-names>S. M.</given-names></string-name> (<year>1985</year>). <chapter-title>Evidence against the context-freeness of natural language</chapter-title>. In <source>The formal complexity of natural language</source> (pp. <fpage>320</fpage>&#8211;<lpage>334</lpage>). <publisher-name>Springer</publisher-name>. <pub-id pub-id-type="doi">10.1007/978-94-009-3401-6_12</pub-id></mixed-citation></ref>
<ref id="B62"><mixed-citation publication-type="journal"><string-name><surname>Silva</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>In&#225;cio</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Rocha e Sousa</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Gaspar</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Folia</surname>, <given-names>V.</given-names></string-name>, &amp; <string-name><surname>Petersson</surname>, <given-names>K. M.</given-names></string-name> (<year>2022</year>). <article-title>Formal language hierarchy reflects different levels of cognitive complexity</article-title>. <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>, <volume>49</volume>, <fpage>642</fpage>&#8211;<lpage>660</lpage>. <pub-id pub-id-type="doi">10.1037/xlm0001182</pub-id></mixed-citation></ref>
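<ref id="B63"><mixed-citation publication-type="journal"><string-name><surname>Stabler</surname>, <given-names>E. P.</given-names></string-name> (<year>1994</year>). <article-title>The finite connectivity of linguistic structure</article-title>. In <string-name><given-names>C.</given-names> <surname>Clifton</surname>, <suffix>Jr.</suffix></string-name>, <string-name><given-names>L.</given-names> <surname>Frazier</surname></string-name>, &amp; <string-name><given-names>K.</given-names> <surname>Rayner</surname></string-name> (Eds.), <source>Perspectives on sentence processing</source> (pp. <fpage>303</fpage>&#8211;<lpage>336</lpage>). <publisher-name>Psychology Press</publisher-name>.</mixed-citation></ref>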
<ref id="B64"><mixed-citation publication-type="journal"><string-name><surname>Staub</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Foppolo</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Donati</surname>, <given-names>C.</given-names></string-name>, &amp; <string-name><surname>Cecchetto</surname>, <given-names>C.</given-names></string-name> (<year>2018</year>). <article-title>Relative clause avoidance: Evidence for a structural parsing principle</article-title>. <source>Journal of Memory and Language</source>, <volume>98</volume>, <fpage>26</fpage>&#8211;<lpage>44</lpage>. <pub-id pub-id-type="doi">10.1016/j.jml.2017.09.003</pub-id></mixed-citation></ref>
<ref id="B65"><mixed-citation publication-type="book"><string-name><surname>Torr</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Stanojevi&#263;</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Steedman</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Cohen</surname>, <given-names>S. B.</given-names></string-name> (<year>2019</year>). <chapter-title>Wide-coverage neural A* parsing for Minimalist Grammars</chapter-title>. In <string-name><given-names>A.</given-names> <surname>Korhonen</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Traum</surname></string-name>, &amp; <string-name><given-names>L.</given-names> <surname>M&#224;rquez</surname></string-name> (Eds.), <source>Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</source> (pp. <fpage>2486</fpage>&#8211;<lpage>2505</lpage>). <publisher-name>Association for Computational Linguistics</publisher-name>. <pub-id pub-id-type="doi">10.18653/v1/P19-1238</pub-id></mixed-citation></ref>
<ref id="B66"><mixed-citation publication-type="journal"><string-name><surname>van Schijndel</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Linzen</surname>, <given-names>T.</given-names></string-name> (<year>2021</year>). <article-title>Single-stage prediction models do not explain the magnitude of syntactic disambiguation difficulty</article-title>. <source>Cognitive Science</source>, <volume>45</volume>(<issue>6</issue>), <elocation-id>e12988</elocation-id>. <pub-id pub-id-type="doi">10.1111/cogs.12988</pub-id></mixed-citation></ref>
<ref id="B67"><mixed-citation publication-type="thesis"><string-name><surname>Weir</surname>, <given-names>D. J.</given-names></string-name> (<year>1988</year>). <source>Characterizing mildly context-sensitive grammar formalisms</source> [Doctoral dissertation]. <publisher-name>University of Pennsylvania</publisher-name>.</mixed-citation></ref>
<ref id="B68"><mixed-citation publication-type="journal"><string-name><surname>Yadav</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Husain</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Futrell</surname>, <given-names>R.</given-names></string-name> (<year>2021</year>). <article-title>Do dependency lengths explain constraints on crossing dependencies?</article-title> <source>Linguistics Vanguard</source>, <volume>7</volume>(<issue>s3</issue>). <pub-id pub-id-type="doi">10.1515/lingvan-2019-0070</pub-id></mixed-citation></ref>
<ref id="B69"><mixed-citation publication-type="journal"><string-name><surname>Yngve</surname>, <given-names>V. H.</given-names></string-name> (<year>1960</year>). <article-title>A model and an hypothesis for language structure</article-title>. <source>Proceedings of the American Philosophical Society</source>, <volume>104</volume>(<issue>5</issue>), <fpage>444</fpage>&#8211;<lpage>466</lpage>.</mixed-citation></ref>
</ref-list>
</back>
</article>