<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.2" xml:lang="en"
    xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
    <front>
        <journal-meta>
            <journal-id journal-id-type="issn">2767-0279</journal-id>
            <journal-title-group>
                <journal-title>Glossa Psycholinguistics</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2767-0279</issn>
            <publisher>
                <publisher-name>eScholarship Publishing</publisher-name>
            </publisher>
        </journal-meta>
        <article-meta>

            <article-id pub-id-type="doi">10.5334/glossapx.40</article-id>
            <article-categories>
                <subj-group>
                    <subject>Regular article</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>The That-Trace Effect and Island Boundary-Gap Effect are the same:
                    Demonstrating equivalence with Null Hypothesis Significance Testing and
                    psychometrics</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Morgan</surname>
                        <given-names>Adam M.</given-names>
                    </name>
                    <email>adam.morgan@nyulangone.org</email>
                    <xref ref-type="aff" rid="aff-1">1</xref>
                </contrib>
            </contrib-group>
            <aff id="aff-1"><label>1</label>NYU School of Medicine, New York NY, US</aff>
            <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2022-01-18">
                <day>18</day>
                <month>01</month>
                <year>2022</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2021</year>
            </pub-date>
            <volume>1</volume>
            <issue>1</issue>
            <elocation-id>1</elocation-id>
            <permissions>
                <copyright-statement>Copyright: &#x00A9; 2021 The Author(s)</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license license-type="open-access"
                    xlink:href="http://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open-access article distributed under the terms of the
                        Creative Commons Attribution 4.0 International License (CC-BY 4.0), which
                        permits unrestricted use, distribution, and reproduction in any medium,
                        provided the original author and source are credited. See <uri
                            xlink:href="http://creativecommons.org/licenses/by/4.0/"
                            >http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
                </license>
            </permissions>
            <self-uri
                xlink:href="https://glossapsycholinguistics.journalpub.escholarship.org/articles/10.5334/glossapx.40/"/>
            <abstract>
                <p>This paper demonstrates a novel approach in experimental syntax, leveraging
                        <italic>psychometric</italic> methods to resolve a decades-old puzzle.
                    Specifically, gaps in subject position are more acceptable than gaps in object
                    position in non-islands, while the reverse is true in islands (the
                        <italic>Island Boundary-Gap Effect</italic>). Attempts at explaining this
                    asymmetry generally attribute it to a violation of the same constraint that
                    renders gaps unacceptable after the overt complementizer
                        &#8216;<italic>that</italic>&#8217; (the <italic>That-Trace
                    Effect</italic>). However, the two effects involve distinct syntactic
                    structures, and there is no a priori reason to believe they are the same beyond
                    the elegance of a unified account. One limitation has been the difficulty of
                    testing for equivalence in the Null Hypothesis Significance Testing framework:
                    when two constructs behave similarly, it generally constitutes an
                    uninterpretable null result. Experiments 1 and 2 use standard experimental
                    methods to circumvent this problem, but ultimately provide evidence that is at
                    best just consistent with equivalence. Experiment 3 demonstrates a novel
                    approach which shows that individual differences in the That-Trace Effect
                    correlate with individual differences in the Island Boundary-Gap Effect, after
                    removing correlated variance from carefully-chosen controls. This psychometric
                    approach provides positive evidence that the two effects do indeed derive from
                    the same underlying phenomenon.</p>
            </abstract>
            <funding-group specific-use="crossref">
                <award-group>
                    <funding-source id="gs1" country="USA">
                        <institution-wrap>
                            <institution>National Science Foundation</institution>
                            <institution-id institution-id-type="doi" vocab="open-funder-registry"
                                vocab-identifier="10.13039/open_funder_registry"
                                >10.13039/100000001</institution-id>
                        </institution-wrap>
                    </funding-source>
                    <award-id>DGE-1144086</award-id>
                </award-group>
            </funding-group>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>Version Note</meta-name>
                    <meta-value>This article has been updated to include a citation that was omitted
                        due to an editing error.</meta-value>
                </custom-meta>
            </custom-meta-group>
        </article-meta>
    </front>
    <body>
        <sec>
            <title>1. Introduction</title>
            <p>Linguistic models generally aim to reduce the number of unexplained phenomena that
                must be independently accounted for. One example of this involves two syntactic
                constraints: the <italic>That-Trace Effect</italic> and what will be referred to
                here as the <italic>Island Boundary-Gap Effect</italic>. On the surface, these
                effects appear remarkably similar. A central assumption of nearly all models of
                syntax for the last half-century has therefore been that they result from the same
                underlying constraint (e.g. <xref ref-type="bibr" rid="B11">Chomsky 1981</xref>;
                    <xref ref-type="bibr" rid="B48">McDaniel et al. 2015</xref>; <xref
                    ref-type="bibr" rid="B12">Chomsky 2014</xref>).</p>
            <p>However, the That-Trace Effect and Island Boundary-Gap Effect in fact involve
                distinct structures and lexical categories, making it far from clear that a unified
                analysis is warranted. This paper provides the first evidence (to my knowledge),
                aside from surface-level similarity, that the two effects derive from the same
                underlying constraint.</p>
            <p>The That-Trace Effect and the Island Boundary-Gap Effect are both properties of
                    <italic>long-distance syntactic dependencies</italic>. In such dependencies, as
                in the relative clauses in (1), a noun appears far away from its canonical
                position:</p>
            <table-wrap>
                <table content-type="example">
                    <tbody>
                        <tr>
                            <td>(1)</td>
                            <td>a.</td>
                            <td><italic>Subject gap</italic></td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>&#160;</td>
                            <td>&#8230;the patient [that I think [__ took the pill]]</td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>b.</td>
                            <td><italic>Object gap</italic></td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>&#160;</td>
                            <td>&#8230;the pill [that I think [the patient took __]]</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>That is, whereas in ordinary sentences like <italic>The patient took the
                    pill</italic>, subjects closely precede the verb and objects closely follow it,
                in (1a), &#8216;<italic>patient</italic>&#8217; is the subject of
                    &#8216;<italic>took</italic>,&#8217; but the two words are separated by two
                clause boundaries (brackets). Similarly, in (1b),
                &#8216;<italic>pill</italic>&#8217; is the object of
                &#8216;<italic>took</italic>,&#8217; but these are also separated by two clause
                boundaries. In both cases, the canonical position for the noun is left empty. These
                empty positions are referred to as <italic>gaps</italic> (or
                <italic>traces</italic>; represented with underscores throughout). Example (1a)
                demonstrates a <italic>subject gap</italic>, or a gap in subject position, and
                example (1b) demonstrates an <italic>object gap</italic>.</p>
            <p>The That-Trace Effect is exemplified by the unacceptability of (1a) when the
                complementizer &#8216;<italic>that</italic>&#8217; appears before the gap, as in (2)
                    (<xref ref-type="bibr" rid="B53">Perlmutter 1968</xref>):</p>
            <table-wrap>
                <table content-type="example">
                    <tbody>
                        <tr>
                            <td>(2)</td>
                            <td>&#160;&#160;<italic>That-Trace Effect</italic></td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>*&#8230;the patient [that I think [<bold>that</bold> __ took the
                                pill]]</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>While &#8216;<italic>that</italic>&#8217; is typically optional in English (e.g.,
                    <italic>I think (that) the patient took the pill</italic>), in (2) its presence
                renders the structure ungrammatical.</p>
            <p>There are reasons to think the That-Trace Effect may reflect something deep about
                human language. First, despite initially seeming to be an idiosyncratic quirk of
                English grammar, it has in fact turned out to be cross-linguistically common,
                documented in a number of languages across several families including French (<xref
                    ref-type="bibr" rid="B53">Perlmutter 1968</xref>), Russian (<xref
                    ref-type="bibr" rid="B54">Pesetsky 1982</xref>), Levantine Arabic (<xref
                    ref-type="bibr" rid="B39">Kenstowicz 1989</xref>), German (<xref ref-type="bibr"
                    rid="B20">Featherston 2005</xref>), Nupe (<xref ref-type="bibr" rid="B35"
                    >Kandybowicz 2006</xref>), and Wolof (<xref ref-type="bibr" rid="B46"
                    >Martinovi&#263; 2013</xref>).</p>
            <p>Second, the distribution of the That-Trace Effect in speech poses a problem for
                language acquisition. Speakers almost never produce
                &#8216;<italic>that</italic>&#8217; in sentences like (1a) (i.e., subject relative
                clauses), where it would be ungrammatical. But, somewhat surprisingly, speakers also
                almost never produce &#8216;<italic>that</italic>&#8217; in sentences like (1b)
                (object relative clauses), even though in these cases it is in fact grammatical
                    (<xref ref-type="bibr" rid="B55">Phillips 2013</xref>). Despite similar rates of
                    &#8216;<italic>that</italic>&#8217; production in their input, children must
                learn that &#8216;<italic>that</italic>&#8217; is optional in the latter, but
                unacceptable in the former. This hints at the possibility that more information than
                is contained in the input may be brought to bear on the acquisition of the
                That-Trace Effect. (See <xref ref-type="bibr" rid="B73">Pesetsky 2017</xref> for a fuller discussion of learnability and
                the relevant cross-linguistic facts about complementizer-trace effects.)</p>
            <p>To understand the Island Boundary-Gap Effect, a bit of background on gap processing
                is relevant. Extensive literature indicates that, relative to subject gaps, object
                gaps are harder to process (<xref ref-type="bibr" rid="B21">Ford 1983</xref>; <xref
                    ref-type="bibr" rid="B29">Hakes et al. 1976</xref>; <xref ref-type="bibr"
                    rid="B34">Holmes &amp; O&#8217;Regan 1981</xref>), less frequent (<xref
                    ref-type="bibr" rid="B47">McDaniel et al. 1998</xref>), cross-linguistically
                less common (<xref ref-type="bibr" rid="B38">Keenan &amp; Comrie 1977</xref>), and
                are sometimes reported to have lower acceptability (<xref ref-type="bibr" rid="B30"
                    >Han et al. 2012</xref>; <xref ref-type="bibr" rid="B51">Morgan &amp; Wagers
                    2018</xref>). However, the pattern of acceptability reverses in structures known
                as <italic>islands</italic>.</p>
            <p>Islands, exemplified in (3), are defined by the fact that they are unacceptable when
                gaps appear inside them (<xref ref-type="bibr" rid="B60">Ross 1967</xref>). Unlike
                in non-islands like (1), in islands subject gaps are less acceptable than object
                gaps (<xref ref-type="bibr" rid="B11">Chomsky 1981</xref>; <xref ref-type="bibr"
                    rid="B54">Pesetsky 1982</xref>). This decreased acceptability of gaps when they
                are in subject position in an island &#8211; that is, when they immediately follow
                an island boundary &#8211; is the Island Boundary-Gap Effect.</p>
            <table-wrap>
                <table content-type="example">
                    <tbody>
                        <tr>
                            <td>(3)</td>
                            <td>a.</td>
                            <td>&#160;&#160;&#160;&#160;<italic>Subject gap in island</italic></td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>&#160;</td>
                            <td>**&#8230;the patient [that I wonder [when ___ took the pill]]</td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>b.</td>
                            <td>&#160;&#160;&#160;&#160;<italic>Object gap in island</italic></td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>&#160;</td>
                            <td>&#160;&#160;*&#8230;the pill [that I wonder [why the patient took
                                ___]]</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <sec>
                <title>1.1 A unified account</title>
                <p>One way to account for the puzzling asymmetry between subject and object gaps in
                    islands and non-islands might be to stipulate a specific constraint against
                    subject gaps in islands. However, researchers have long noted that the
                    That-Trace Effect and the Island Boundary-Gap Effect are suspiciously similar
                        (<xref ref-type="bibr" rid="B10">Chomsky 1980</xref>), at least on the
                    surface. They both involve an unacceptable subject gap immediately following an
                    overt clausal function word, like &#8216;<italic>that</italic>&#8217; (2) or
                        &#8216;<italic>why</italic>&#8217; (3a). Drawing on this surface similarity,
                    many attempts have been made to attribute the That-Trace Effect and the Island
                    Boundary-Gap Effect to the same underlying constraint.</p>
                <p>For instance, McDaniel et al. (<xref ref-type="bibr" rid="B48">2015</xref>) offer
                    an account based on the idea that processing pressures during production
                    eventually become grammaticized over the course of language evolution (<xref
                        ref-type="bibr" rid="B32">Hawkins 2014</xref>). They stipulate that
                    clause-initial function words like <italic>that</italic> and
                        <italic>whether</italic> demarcate separate planning units for the language
                    production system. They then argue that connecting clauses with a long-distance
                    dependency is more difficult immediately after the beginning of a new planning
                    unit. Over the course of language evolution, this difficulty results in the
                    grammaticization of a constraint against gaps immediately after a clause-initial
                    function word.</p>
                <p>Perhaps the most famous attempt at unification is a linguistic constraint
                    proposed by Chomsky (<xref ref-type="bibr" rid="B11">1981</xref>) known as the
                    Empty Category Principle (ECP). According to the ECP, which is understood to be
                    a general principle that applies in all languages, gaps are prohibited from
                    appearing immediately after clause-initial function words like <italic>that,
                        when, because</italic>, etc. The specifics of this proposal require
                    extensive knowledge of Government and Binding (<xref ref-type="bibr" rid="B11"
                        >Chomsky 1981</xref>), a prominent theoretical framework from the 1980s, but
                    are not critical for the present purposes (see <xref ref-type="bibr" rid="B11"
                        >Chomsky 1981</xref>; <xref ref-type="bibr" rid="B43">Lasnik &amp; Saito
                        1984</xref>; <xref ref-type="bibr" rid="B37">Kayne 1981 for further
                        detail</xref>). Like McDaniel et al.&#8217;s account, the ECP also explains
                    the ungrammaticality of the That-Trace Effect and the Island Boundary-Gap Effect
                    as the result of violating a single underlying constraint.</p>
                <p>In spite of the elegance of a unified explanation of the That-Trace Effect and
                    the Island Boundary-Gap Effect, there is no a priori reason to believe that the
                    two should be the same. Indeed, there are several key differences between the
                    two. While the That-Trace Effect is conditioned on the presence of a specific
                    complementizer, &#8216;<italic>that</italic>,&#8217; the Island Boundary-Gap
                    Effect is conditioned on any type of island boundary, which may or may not
                    involve a complementizer. Island boundaries <italic>can</italic> be marked with
                    a complementizer (or complementizer-like function word; the particular analysis
                    does not matter here) like <italic>if</italic> or <italic>because</italic> (4a).
                    But they can also be marked with a relative pronoun or
                    <italic>wh</italic>-operator like <italic>who, why</italic> or
                        <italic>when</italic> (4b), or even a <italic>wh</italic>-operator involving
                    an entire referential phrase like <italic>which of these staplers</italic> (4c).
                    Not only are these latter examples not complementizers, they also occupy a
                    different syntactic position within the sentence,<xref ref-type="fn" rid="n1"
                        >1</xref> as shown in Table <xref ref-type="table" rid="T1">1</xref>.</p>
                <table-wrap id="T1">
                    <label>Table 1</label>
                    <caption>
                        <p>Illustrations of the distinct positions that complementizers and relative
                            pronouns occupy.</p>
                    </caption>
                    <table>
                        <tr>
                            <th colspan="2"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top">Complementizer
                                (<italic>that</italic>)</th>
                            <th align="left" valign="top"><italic>Wh</italic>- operator
                                    (<italic>why</italic>)</th>
                        </tr>
                        <tr>
                            <td colspan="2"><hr/></td>
                        </tr>
                        <tr>
                            <td align="center" valign="top"><inline-graphic
                                    xmlns:xlink="http://www.w3.org/1999/xlink"
                                    xlink:href="glossapx-1-1-40-g7.png"/></td>
                            <td align="center" valign="top"><inline-graphic
                                    xmlns:xlink="http://www.w3.org/1999/xlink"
                                    xlink:href="glossapx-1-1-40-g8.png"/></td>
                        </tr>
                        <tr>
                            <td colspan="2"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">&#8220;<italic>&#8230;thought that the
                                    patient took the pill</italic>&#8221;</td>
                            <td align="left" valign="top">&#8220;<italic>&#8230;wondered why the
                                    patient took the pill</italic>&#8221;</td>
                        </tr>
                        <tr>
                            <td colspan="2"><hr/></td>
                        </tr>
                    </table>
                </table-wrap>
                <table-wrap>
                    <table content-type="example">
                        <tbody>
                            <tr>
                                <td>(4)</td>
                                <td colspan="2">Distinct types of island boundaries</td>
                            </tr>
                            <tr>
                                <td>&#160;</td>
                                <td>a.</td>
                                <td><italic>Complementizer</italic></td>
                            </tr>
                            <tr>
                                <td>&#160;</td>
                                <td>&#160;</td>
                                <td>I wonder [<italic>if</italic> the patient took the pill].</td>
                            </tr>
                            <tr>
                                <td>&#160;</td>
                                <td>b.</td>
                                <td><italic>Wh- operator</italic></td>
                            </tr>
                            <tr>
                                <td>&#160;</td>
                                <td>&#160;</td>
                                <td>I wonder [<italic>why</italic> the patient took the pill].</td>
                            </tr>
                            <tr>
                                <td>&#160;</td>
                                <td>c.</td>
                                <td><italic>Wh- operator with internal structure</italic></td>
                            </tr>
                            <tr>
                                <td>&#160;</td>
                                <td>&#160;</td>
                                <td>I wonder [<italic>which of these staplers</italic> the patient
                                    threw at the nurse].</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>Thus, the similarity between the That-Trace Effect and the Island Boundary-Gap
                    Effect is superficial, at best. This relatively weak evidence has nonetheless
                    been used to motivate the assumption, which is pervasive throughout the
                    literature, that the two effects are the same. This assumption is clearly
                    appealing: it reduces the number of stipulative constraints in the grammar. But
                    it only does so by one. Without stronger evidence in favor of unification, it is
                    not clear that such a small reduction is sufficient to justify such a
                    consequential assumption.</p>
                <p>The present aim is to experimentally assess whether the two effects are the same,
                    and in so doing, demonstrate a novel approach to questions about equivalence. As
                    will be seen in Experiments 1 and 2, the type of experimental paradigms and
                    statistics standardly used in experimental syntax impose a familiar constraint:
                    it is easy to design a study intended to detect <italic>differences</italic>
                    between constructs, but showing that two constructs are the same is trickier. To
                    do so, Experiment 3 leverages psychometric methods to flip the usual logic of
                    the experimental design, so that a significant result can provide evidence of
                    equivalence rather than difference.</p>
            </sec>
            <sec sec-type="methods">
                <title>1.2 The joint logic of experimental methods and Null Hypothesis Significance
                    Testing</title>
                <p>The standard experimental approach in the cognitive sciences is designed to
                    reveal differences between conditions. <italic>Equivalence</italic> between
                    conditions, even though it is often useful to know about, is harder to
                    demonstrate. This difficulty derives from the fact that <italic>Null Hypothesis
                        Significance Testing</italic>, the dominant statistical framework, is
                    tailored to look for differences between measures, not similarities.<xref
                        ref-type="fn" rid="n2">2</xref></p>
                <p>But it is not <italic>necessarily</italic> the case that Null Hypothesis
                    Significance Testing cannot be used to test for equivalence. To foreshadow,
                    Experiment 3 leverages the insight from economics and neuroscience that any
                    systematic <italic>microvariation</italic> in an underlying phenomenon should be
                    detectable in the surface phenomena it causes (<xref ref-type="bibr" rid="B72"
                        >Wiener 1956</xref>; <xref ref-type="bibr" rid="B28">Granger 1969</xref>).
                    Specifically, if the That-Trace Effect and the Island Boundary-Gap Effect do in
                    fact derive from the same underlying constraint, and if there are individual
                    differences (i.e., microvariation) in that underlying constraint, then those
                    individual differences should be present in both effects. Experiment 3 compares
                    individual differences in the acceptability of the two effects to determine
                    whether they correlate beyond what can be explained by extraneous sources of
                    individual differences in acceptability judgment data.</p>
                <p>This logic relies on a distinction between statistical hypotheses, which are
                    about variables &#8211; the measured values &#8211; and theoretical hypotheses,
                    which are about psychological constructs &#8211; the black-box entities that
                    variables are often meant to stand in for. The statistical null hypothesis in
                    Null Hypothesis Significance Testing is always that two variables do not differ,
                    and the corresponding theoretical hypothesis is <italic>usually</italic> that
                    two constructs do not differ. But the hypotheses do not have to align in this
                    way.</p>
                <p>This is easy to overlook because variables and constructs are so often
                    confounded. In the standard experimental approach, dependent variables tend to
                    be measures of cognitive constructs like grammaticality, working memory
                    capacity, or attention span. When interpreting the data, these variables are
                    treated as proxies for the corresponding construct, and the line between the two
                    is often neither clear nor consequential.</p>
                <p>By analyzing individual differences in constructs rather than the constructs
                    themselves, Experiment 3 formulates a statistical question such that a
                    difference between variables results when two psychological constructs behave in
                    the same way. In such cases, a positive statistical result can constitute
                    evidence for equivalence.</p>
                <p>The usual difficulty in assessing equivalence is therefore not an inherent
                    limitation of Null Hypothesis Significance Testing, but a limitation of the
                    joint logic of Null Hypothesis Significance Testing and the standard approach to
                    experimental design. Experiment 3 circumvents this issue by employing a
                        <italic>psychometric</italic> experimental design.</p>
                <p>Psychometrics is the subfield of psychology that deals with how to measure
                    psychological constructs. Rather than being concerned with the nature of these
                    constructs (e.g., whether language processing uses domain-general or
                    domain-specific mechanisms), its aim is to measure a construct as accurately as
                    possible (<xref ref-type="bibr" rid="B15">Cronbach &amp; Meehl 1955</xref>).
                    Psychometric experiments rarely contain experimental manipulations, and do not
                    often attempt to measure within-subjects changes. Instead, the goal is to
                    measure psychological states as they already are (<xref ref-type="bibr"
                        rid="B69">Swets et al. 2007</xref>; <xref ref-type="bibr" rid="B61">Rust
                        &amp; Golombok 2014</xref>; <xref ref-type="bibr" rid="B22">Furr
                    2017</xref>). As a field, its primary occupation has been with the development
                    of testing, as in college entrance exams, IQ tests, professional licensing,
                    workplace aptitude, etc. (<xref ref-type="bibr" rid="B14">Crocker &amp; Algina
                        1986</xref>).</p>
                <p>Psychometric methods often differ considerably from standard methods, doing away
                    with practices like counterbalancing and randomization to avoid introducing
                    unwanted sources of variability. Outside of work on individual differences,
                    psychometrics has had dwindling contact with cognitive science in recent years,
                    although it has on occasion been adopted to divide experimental subjects into
                    groups based on different cognitive abilities (<xref ref-type="bibr" rid="B69"
                        >Swets et al. 2007</xref>; <xref ref-type="bibr" rid="B44">Longo et al.
                        2008</xref>). The use of psychometric methods in the present study to
                    demonstrate equivalence is, to my knowledge, a novel contribution.</p>
            </sec>
            <sec>
                <title>1.3 The present experiments</title>
                <p>Here, three experiments with large sample sizes aim to provide further evidence
                    for unifying the two effects. Experiment 1 (<italic>N</italic> = 161) examines
                    acceptability while deconfounding the subject position in
                    <italic>wh</italic>-islands from the position immediately after clause-initial
                    function words. Experiment 2 (<italic>N</italic> = 189) aims to validate that
                    the types of stimuli from Experiment 1, which have not been previously used as
                    stimuli in experimental research, show prototypical properties of islands. While
                    the findings of Experiments 1 and 2 are <italic>consistent</italic> with a
                    unified theory of the That-Trace Effect and the Island Boundary-Gap Effect, the
                    experiments are unable to definitively demonstrate that the effects are the same
                    due to inherent limitations on the joint logic of the standard experimental
                    design and Null Hypothesis Significance Testing.</p>
                <p>Experiment 3 (<italic>N</italic> = 104) circumvents these limitations using a
                    novel approach in experimental syntax. Leveraging psychometric methods, it
                    directly tests for equivalence by analyzing individual differences in the
                    That-Trace Effect and the Island Boundary-Gap Effect to determine whether the
                    two correlate beyond what would be expected due to extraneous shared
                    properties.</p>
            </sec>
        </sec>
        <sec>
            <title>2. Experiment 1</title>
            <p>One obstacle to determining whether the That-Trace Effect and the Island Boundary-Gap
                Effect reflect the same underlying constraint is that it is difficult to dissociate
                potential causes of the Island Boundary-Gap Effect. That is, it might be that there
                is a specific constraint against subject gaps in islands. But it might also be that
                there is a general constraint against subject gaps after clause-initial function
                words like <italic>that</italic> and <italic>when</italic>, as proposed by Chomsky
                    (<xref ref-type="bibr" rid="B11">1981</xref>) and McDaniel et al. (<xref
                    ref-type="bibr" rid="B48">2015</xref>). The difficulty in distinguishing between
                these possibilities is due to the fact that clause-initial function words at island
                boundaries are obligatory (e.g., the &#8216;<italic>why</italic>&#8217; in
                &#8220;&#8230;who I wondered [why he attacked ___]&#8221;). Thus, the subject
                position in islands is confounded with the position immediately following a
                clause-initial function word.</p>
            <p>Experiment 1 circumvented this problem by adding a finite declarative clause, in
                which complementizers are optional (e.g., &#8220;you thought [(that)&#8230;&#8221;),
                inside an island (resulting in sentences like: &#8220;&#8230;who I wondered [why you
                thought [(that) he attacked ___]]&#8221;). The experiment was an online
                acceptability judgment task with a high sample size and a standard experimental
                design (i.e., including familiar features like randomization and
                counterbalancing).</p>
            <p>If subject gaps are inherently worse than object gaps in islands, then subject gaps
                should be rated worse than object gaps regardless of whether they follow a null
                (&#8709;) or overt (<italic>that</italic>) complementizer. But if the That-Trace
                Effect and the Island Boundary-Gap Effect reflect the same constraint, then subject
                gaps should be worse than object gaps following the overt complementizer, but not
                when following the null complementizer.</p>
            <sec sec-type="methods">
                <title>2.1 Method</title>
                <sec>
                    <title>2.1.1 Participants</title>
                    <p>As the acceptability of the critical stimuli was expected to be quite low on
                        the whole, a high target <italic>N</italic> of 150 was set to mitigate
                        concerns about potential floor effects. To achieve this goal, 200 slots were
                        made available online to UC San Diego undergraduates; a total of 161
                        participated for course credit.</p>
                    <p>All participants were native monolingual speakers of American English
                        (defined as not having learned any other language before the age of 7) and
                        were at least 18 years old. Participants were excluded if they answered
                        fewer than 60% of comprehension questions correctly (50 participants), or if
                        they did not rate &#8220;ceiling&#8221; fillers &#8211; designed to
                        establish an acceptability ceiling (see below) &#8211; more than two
                        Likert-scale points higher on average than &#8220;floor&#8221; fillers
                        &#8211; designed to establish an acceptability floor (30 participants,
                        including 25 who were also among the 50 reported with below 60% accuracy). A
                        total of 106 participants were included in the analysis.<xref ref-type="fn"
                            rid="n3">3</xref></p>
                </sec>
                <sec>
                    <title>2.1.2 Procedure</title>
                    <p>The procedure was the same for all experiments, which were conducted online
                        using the Ibex Farm server (<xref ref-type="bibr" rid="B18">Drummond
                            2013</xref>). Participants began by providing informed consent.
                        Subsequent instructions explained that they would be asked to rate the
                        acceptability of sentences based not on the types of formal grammatical
                        principles they may have learned in high school, but on subjective
                        intuitions. They were told to consider whether, given an appropriate
                        context, a native speaker of English might ever say such a sentence.
                        Sentences appeared in full, with no time limit, in the center of the screen
                        above a 9-point Likert scale with 1 labeled &#8220;bad&#8221; and 9
                        &#8220;good.&#8221; After two practice trials, participants proceeded to the
                        task. After every critical trial and most filler trials, participants
                        responded to a multiple-choice comprehension question. Both Experiments 1
                        and 2 took roughly 20 minutes to complete.</p>
                </sec>
                <sec>
                    <title>2.1.3 Materials</title>
                    <p>Two factors were manipulated in a fully crossed 2 &#215; 2 design. The first,
                            <sc>COMPLEMENTIZER</sc>, had levels <italic>null</italic> and
                            <italic>that</italic>. The second, <sc>GAP POSITION</sc>, had levels
                            <italic>subject</italic> and <italic>object</italic>.</p>
                    <p>Critical trials consisted of 24 item sets, an example of which is given in
                        Table <xref ref-type="table" rid="T2">2</xref>. Stimuli contained relative
                        clauses with two clausal embeddings. The first embedding (&#8220;&#8230;I
                        wondered [why&#8230;&#8221;) introduced an island. Inside the island, the
                        second embedding introduced a finite declarative clause (&#8220;&#8230;you
                        thought [{&#8709;,that}&#8230;&#8221;). This clause contained a gap in
                        either subject or object position. Because it was finite and declarative,
                        the complementizer was optional. This allowed for the manipulation of the
                        presence of a clause-initial function word inside an island.</p>
                    <table-wrap id="T2">
                        <label>Table 2</label>
                        <caption>
                            <p>Experiment 1 stimuli. Sentences appeared in a 2 &#215; 2 design. The
                                    <sc>GAP POSITION</sc> manipulation is shown in rows;
                                    <sc>COMPLEMENTIZER</sc> is manipulated in-line. Clause
                                boundaries (square brackets) and gaps (underscores) were not visible
                                to participants.</p>
                        </caption>
                        <table>
                            <tr>
                                <th colspan="2"><hr/></th>
                            </tr>
                            <tr>
                                <th align="left" valign="top">G<sc>AP</sc> P<sc>OSITION</sc></th>
                                <th align="left" valign="top">S<sc>TIMULUS</sc></th>
                            </tr>
                            <tr>
                                <td colspan="2"><hr/></td>
                            </tr>
                            <tr>
                                <td align="left" valign="top">Subject</td>
                                <td align="left" valign="top">It was the doctor</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"/>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[who
                                    I wondered</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"/>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[why
                                    you thought</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"/>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[{&#8709;,
                                    that} __ hit the lawyer with a bat]]].</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top">Object</td>
                                <td align="left" valign="top">It was the doctor</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"/>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[who
                                    I wondered</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"/>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[why
                                    you thought</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"/>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[{&#8709;,
                                    that} the lawyer hit __ with a bat]]].</td>
                            </tr>
                            <tr>
                                <td colspan="2"><hr/></td>
                            </tr>
                            <tr>
                                <td align="left" valign="top">C<sc>OMPREHENSION</sc>
                                        Q<sc>UESTION</sc></td>
                                <td align="left" valign="top">What happened? <list
                                        list-type="alpha-lower">
                                        <list-item>
                                            <p>I hit the lawyer with a bat.</p>
                                        </list-item>
                                        <list-item>
                                            <p>The lawyer hit the doctor with a bat.</p>
                                        </list-item>
                                        <list-item>
                                            <p>The doctor hit the lawyer with a bat.</p>
                                        </list-item>
                                        <list-item>
                                            <p>You hit the doctor with a bat.</p>
                                        </list-item>
                                    </list></td>
                            </tr>
                            <tr>
                                <td colspan="2"><hr/></td>
                            </tr>
                        </table>
                    </table-wrap>
                    <p>An additional 24 fillers were randomly mixed with the critical items. These
                        included 12 <italic>ceiling</italic> items, such as &#8220;Somebody
                        mentioned that you liked wine so I brought a bottle of Chardonnay.&#8221;
                            <italic>Ceiling</italic> fillers were intended to be unambiguously
                        highly acceptable so as to establish a ceiling for acceptability ratings.
                        The remaining 12 <italic>floor</italic> fillers were intended to establish a
                        lower bound for acceptability ratings in hopes of pushing the ratings of the
                        critical items up to avoid a floor effect. These consisted of
                        doubly-center-embedded relative clauses, such as &#8220;This park that the
                        landscaper that the architect hired revitalized attracted
                        tourists.&#8221;</p>
                    <p>Stimulus order was pseudo-randomized such that every consecutive group of
                        four critical trials contained one instance of each critical condition. Four
                        lists were automatically generated by the Ibex Farm software according to a
                        Latin square design, and participants were assigned to lists cyclically in
                        the order in which they began the experiment. The software was designed so
                        that the particular list the next participant was assigned to was
                        incremented only once the previous participant completed the consent form.
                        This meant that participants who began the experiment before the previous
                        participant completed the consent form were assigned to the same list as the
                        previous participant, resulting in a slightly uneven distribution across
                        lists. The lowest number of participants in a given list was 35 and the
                        highest was 43.</p>
                </sec>
                <sec>
                    <title>2.1.4 Trial exclusions and data preparation</title>
                    <p>For Experiments 1 and 2, trials completed in less time than would be required
                        to read the sentence at a rate of one word per 150 ms were excluded (25
                        critical trials, or 0.98%). Critical trials were also excluded if the
                        comprehension question was answered incorrectly (566 trials, or
                            22.25%).<xref ref-type="fn" rid="n4">4</xref> The remaining 1967
                        critical trials were included in the analysis; the number of observations
                        per cell ranged from 440 to 533.</p>
                    <p>Raw ratings were <italic>z</italic>-scored by participant after trial
                        exclusions (e.g., <xref ref-type="bibr" rid="B62">Sch&#252;tze &amp; Sprouse
                            2014</xref>). This served to reduce potential individual differences in
                        the use of the Likert scale, as well as to increase the degree to which
                        model residuals approximate a normal distribution.</p>
                </sec>
                <sec>
                    <title>2.1.5 Analysis</title>
                    <p>A linear mixed effects regression was used to model <italic>z</italic>-scores
                        as a function of <sc>COMPLEMENTIZER</sc>, and <sc>GAP POSITION</sc>, and
                        their interaction (<xref ref-type="bibr" rid="B2">Bates et al. 2015</xref>;
                            <xref ref-type="bibr" rid="B56">R Core Team 2013</xref>). Both fixed
                        effects were treatment coded. With treatment coding, the model terms reflect
                        differences between particular conditions and a &#8220;baseline&#8221;
                        condition &#8211; the model intercept. Thus, treatment coding is useful in
                        cases where the researcher wants to compare conditions. Here, the intercept
                        was the <italic>That,Subject</italic> condition and treatment coding was
                        used to make three comparisons. The simple main effect of
                            <sc>COMPLEMENTIZER</sc> reflects the difference between the
                            <italic>Null,Subject</italic> condition and the baseline condition. The
                        simple main effect of <sc>GAP POSITION</sc> reflects the difference between
                        the <italic>That,Object</italic> condition and the baseline condition.
                        Finally, the interaction term reflects the difference between the observed
                        value for the <italic>Null,Object</italic> condition and what would be
                        expected based on an additive combination of the two simple main
                        effects.</p>
                    <p>Random intercepts for items and participants were included, and all fixed
                        effects were allowed to vary within levels of the random effects. The model
                        converged with the full random effects structure. The
                            <italic>summary()</italic> function from the <italic>lmerTest</italic>
                        package was used to obtain <italic>p</italic>-values (<xref ref-type="bibr"
                            rid="B42">Kuznetsova et al. 2017</xref>). To confirm that critical
                        effects significantly contributed to model fit,
                            <italic>&#967;</italic><sup>2</sup> tests were performed to compare
                        models with and without the effect. All data and analyses are available
                        online at <ext-link ext-link-type="uri"
                            xmlns:xlink="http://www.w3.org/1999/xlink"
                            xlink:href="https://osf.io/fn5at/">https://osf.io/fn5at/</ext-link>.</p>
                </sec>
            </sec>
            <sec>
                <title>2.2 Results</title>
                <p>Results (summarized in Table <xref ref-type="table" rid="T3">3</xref> and
                    depicted in Figure <xref ref-type="fig" rid="F1">1</xref>) showed a significant
                    simple main effect of <sc>COMPLEMENTIZER</sc>. Because the factors were
                    treatment coded, this effect reflects the fact that the
                        <italic>Subject,Null</italic> condition was better than the
                        <italic>Subject,That</italic> condition (<italic>t</italic>(76.51) = 4.55,
                        <italic>p</italic> &lt; .001); model comparison confirmed the effect
                    significantly contributed to model fit (<italic>&#967;</italic><sup>2</sup>(1) =
                    18.185, <italic>p</italic> &lt; .001). There was also a significant simple main
                    effect of <sc>GAP POSITION</sc>, reflecting the fact that the
                        <italic>Object,That</italic> condition was better than the
                        <italic>Subject,That</italic> condition (<italic>t</italic>(399.78) = 2.2,
                        <italic>p</italic> = .03; model comparison:
                        <italic>&#967;</italic><sup>2</sup>(1) = 4.839, <italic>p</italic> = .028).
                    Critically, the interaction was significant, reflecting the fact that in
                    islands, subject gaps are only worse than object gaps in the presence of an
                    overt clause-initial function word (<italic>t</italic>(135.628) = &#8211;3.15,
                        <italic>p</italic> &lt; .01; model comparison:
                        <italic>&#967;</italic><sup>2</sup>(1) = 9.803, <italic>p</italic> =
                        .002).<xref ref-type="fn" rid="n5">5</xref></p>
                <table-wrap id="T3">
                    <label>Table 3</label>
                    <caption>
                        <p>Experiment 1 results.</p>
                    </caption>
                    <table>
                        <tr>
                            <th colspan="6"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top"/>
                            <th align="right" valign="top"><italic>&#946;</italic></th>
                            <th align="right" valign="top"><sc>D.F.</sc></th>
                            <th align="right" valign="top"><italic>t</italic></th>
                            <th align="right" valign="top"><italic>p</italic></th>
                            <th align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="6"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept
                                (<italic>That,Subject</italic>)</td>
                            <td align="right" valign="top">&#8211;0.426</td>
                            <td align="right" valign="top">315.598</td>
                            <td align="right" valign="top">&#8211;17.285</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>COMPLEMENTIZER</sc>
                                    (<italic>Null</italic>)</td>
                            <td align="right" valign="top">0.161</td>
                            <td align="right" valign="top">76.508</td>
                            <td align="right" valign="top">4.552</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>GAP POSITION</sc>
                                    (<italic>Object</italic>)</td>
                            <td align="right" valign="top">0.071</td>
                            <td align="right" valign="top">399.785</td>
                            <td align="right" valign="top">2.202</td>
                            <td align="right" valign="top">.028</td>
                            <td align="left" valign="top">*</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>COMPLEMENTIZER</sc> &#215; <sc>GAP
                                    POSITION</sc></td>
                            <td align="right" valign="top">&#8211;0.134</td>
                            <td align="right" valign="top">135.628</td>
                            <td align="right" valign="top">&#8211;3.150</td>
                            <td align="right" valign="top">.002</td>
                            <td align="left" valign="top">**</td>
                        </tr>
                        <tr>
                            <td colspan="6"><hr/></td>
                        </tr>
                    </table>
                </table-wrap>
                <fig id="F1">
                    <label>Figure 1</label>
                    <caption>
                        <p>Experiment 1 results: Mean <italic>z</italic>-scores per condtion. Bars
                            denote standard error.</p>
                    </caption>
                    <graphic xmlns:xlink="http://www.w3.org/1999/xlink"
                        xlink:href="glossapx-1-1-40-g1.png"/>
                </fig>
            </sec>
            <sec>
                <title>2.3 Discussion</title>
                <p>Experiment 1 tested the hypothesis that the Island Boundary-Gap Effect derives
                    from a specific constraint on subject gaps in islands, in which case it would
                    reflect a different constraint from whatever gives rise to the That-Trace
                    Effect. This hypothesis would predict that subject gaps are always less
                    acceptable than the corresponding object gaps in islands. However, the results
                    contradict this prediction. Instead, subject gaps are only worse when
                    immediately preceded by a clause-initial function word. This is consistent with
                    the idea that the Island Boundary-Gap Effect may be the same as the That-Trace
                    Effect.</p>
                <p>One potential loose end is that the Experiment 1 stimuli were atypical of stimuli
                    commonly tested in island literature in that they had an extra level of
                    embedding. In order to confirm that the findings of Experiment 1 can be brought
                    to bear on the question at hand, it should be verified that these stimuli do in
                    fact behave like islands. This was the aim of Experiment 2.</p>
            </sec>
        </sec>
        <sec>
            <title>3. Experiment 2</title>
            <p>In the experimental literature, islands have been studied by systematically isolating
                component parts of the stimuli so as to parcel out the specific effect of an island
                violation on acceptability (<xref ref-type="bibr" rid="B64">Sprouse 2007</xref>;
                    <xref ref-type="bibr" rid="B66">Sprouse et al. 2016</xref>). For instance, one
                can isolate the effect of an embedded gap on acceptability by comparing ratings of
                sentences like (5), which has a <italic>matrix</italic> (or non-embedded) gap, with
                sentences like (6), which has an embedded gap. Similarly, by comparing (5), which is
                a non-island structure, to (7), which involves an island structure, one can isolate
                the effect of islandhood.</p>
            <table-wrap>
                <table content-type="example">
                    <tbody>
                        <tr>
                            <td>(5)</td>
                            <td><italic>Non-embedded gap</italic>:</td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>That&#8217;s the judge who __ knew [that the defendant blackmailed
                                the juror].</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <table-wrap>
                <table content-type="example">
                    <tbody>
                        <tr>
                            <td>(6)</td>
                            <td><italic>Embedded gap</italic>:</td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>That&#8217;s the juror who the judge knew [that the defendant
                                blackmailed __].</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <table-wrap>
                <table content-type="example">
                    <tbody>
                        <tr>
                            <td>(7)</td>
                            <td><italic>Non-embedded gap + island</italic> (<italic>no
                                    violation</italic>):</td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>That&#8217;s the judge who __ knew [whether the defendant
                                blackmailed the juror].</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>In this literature, the defining feature of an island is that its acceptability
                combines superadditively with the acceptability of an embedded gap. Thus, when an
                embedded gap appears inside an island structure, as in (8), the resulting
                acceptability is not simply the acceptability of a non-island structure (5) minus
                penalties for an embedded gap (i.e., (6) minus (5)) and an island structure (i.e.,
                (7) minus (5)). Instead, it is even lower.</p>
            <table-wrap>
                <table content-type="example">
                    <tbody>
                        <tr>
                            <td>(8)</td>
                            <td>&#160;&#160;<italic>Embedded gap + island</italic>
                                    (<italic>violation</italic>):</td>
                        </tr>
                        <tr>
                            <td>&#160;</td>
                            <td>*That&#8217;s the juror who the judge knew [whether the defendant
                                blackmailed __].</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <p>This superadditivity is typically modeled as the statistical interaction between (a)
                whether a clause is an island or a non-island, and (b) whether a gap appears inside
                the clause or outside the clause (e.g., <xref ref-type="bibr" rid="B67">Sprouse et
                    al. 2012</xref>; <xref ref-type="bibr" rid="B66">2016</xref>). If the stimuli
                tested in Experiment 1 behave like typical islands, then they should show this
                characteristic interaction. Experiment 2 tested this prediction with another
                acceptability judgment study.</p>
            <sec sec-type="methods">
                <title>3.1 Method</title>
                <sec>
                    <title>3.1.1 Participants</title>
                    <p>An a priori target of 150 participants was set; 200 online experiment slots
                        were made available to UC San Diego undergraduates and a total of 189
                        participated for course credit. Exclusion criteria were the same as in
                        Experiment 1: participants had to answer more than 60% of comprehension
                        questions correctly (50 exclusions) and/or rate ceiling fillers more than
                        two points higher on average than floor fillers (14 partially-overlapping
                        exclusions). A total of 131 participants were included in the analysis. None
                        had participated in Experiment 1.</p>
                </sec>
                <sec>
                    <title>3.1.2 Materials</title>
                    <p>Two factors were manipulated, resulting in a fully crossed 2 &#215; 2 design.
                        The first factor, <sc>ISLANDHOOD</sc>, had levels
                            <italic>non-island</italic> and <italic>island</italic>. The second,
                            <sc>GAP EMBEDDING</sc>, had levels <italic>matrix</italic> and
                            <italic>embedded</italic>.</p>
                    <p>Materials, demonstrated in Table <xref ref-type="table" rid="T4">4</xref>,
                        were adapted from those in Experiment 1.<xref ref-type="fn" rid="n6"
                            >6</xref> They had the same global structure and same lexicalizations
                        (e.g., &#8220;<italic>the doctor said</italic>,&#8221; &#8220;<italic>you
                            thought</italic>,&#8221; etc.), but differed in the particular factors
                        manipulated. All conditions contained subject gaps, but the subject gap was
                        either in a matrix or an embedded clause. The null complementizer was always
                        used immediately before the embedded gap to avoid the That-Trace Effect. The
                        same 24 fillers from Experiment 1 were used.</p>
                    <table-wrap id="T4">
                        <label>Table 4</label>
                        <caption>
                            <p>Experiment 2 stimuli. Sentences appeared in a 2 &#215; 2 design. The
                                    <sc>GAP EMBEDDING</sc> and <sc>ISLANDHOOD</sc> manipulations are
                                shown in rows. Clause boundaries (square brackets) and gaps
                                (underscores) were not visible to participants.</p>
                        </caption>
                        <table>
                            <tr>
                                <th colspan="3"><hr/></th>
                            </tr>
                            <tr>
                                <th align="left" valign="top">G<sc>AP</sc></th>
                                <th align="left" valign="top">I<sc>SLANDHOOD</sc></th>
                                <th align="left" valign="top">S<sc>TIMULUS</sc></th>
                            </tr>
                            <tr>
                                <td colspan="3"><hr/></td>
                            </tr>
                            <tr>
                                <td align="left" valign="top" rowspan="8"
                                    ><italic>Matrix</italic></td>
                                <td align="left" valign="top" rowspan="4"
                                        ><italic>Non-island</italic></td>
                                <td align="left" valign="top">It was the doctor</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[who
                                    __ said</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[that
                                    you thought</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[the
                                    lawyer hit me with a bat]]].</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top" rowspan="4"
                                    ><italic>Island</italic></td>
                                <td align="left" valign="top">It was the doctor</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[who
                                    __ wondered</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[why
                                    you thought</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[the
                                    lawyer hit me with a bat]]].</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top" rowspan="8"
                                    ><italic>Embedded</italic></td>
                                <td align="left" valign="top" rowspan="4"
                                        ><italic>Non-island</italic></td>
                                <td align="left" valign="top">It was the lawyer</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[who
                                    the doctor said</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[that
                                    you thought</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[__
                                    hit me with a bat]]].</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top" rowspan="4"
                                    ><italic>Island</italic></td>
                                <td align="left" valign="top">It was the lawyer</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[who
                                    the doctor wondered</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[why
                                    you thought</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    >&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;[__
                                    hit me with a bat]]].</td>
                            </tr>
                            <tr>
                                <td colspan="3"><hr/></td>
                            </tr>
                        </table>
                    </table-wrap>
                    <p>Stimulus order randomization and list assignment followed the same procedure
                        outlined for Experiment 1. The lowest number of participants in a given list
                        was 44 and the highest was 51.</p>
                </sec>
                <sec>
                    <title>3.1.3 Trial exclusions and data preparation</title>
                    <p>A total of 820 critical trials (26.08%) were excluded: 814 for having the
                        comprehension question answered incorrectly and a partially overlapping set
                        of 15 for being read too quickly. The remaining 2324 critical trials were
                        included in the analysis; the number of observations per cell ranged from
                        547 to 628. All other details were the same as in Experiment 1.</p>
                </sec>
                <sec>
                    <title>3.1.4 Analysis</title>
                    <p>All details of the analysis were identical to those in Experiment 1.</p>
                </sec>
            </sec>
            <sec>
                <title>3.2 Results</title>
                <p>Results, summarized in Table <xref ref-type="table" rid="T5">5</xref> and
                    depicted in Figure <xref ref-type="fig" rid="F2">2</xref>, showed a significant
                    simple main effect of <sc>ISLANDHOOD</sc>, reflecting the fact that sentences
                    with islands were better than sentences without islands when the gap appeared in
                    the matrix position (i.e., outside the island; <italic>t</italic>(57.58) = 2.27,
                        <italic>p</italic> = .03); model comparison confirmed that the effect
                    significantly contributed to model fit (<italic>&#967;</italic><sup>2</sup>(1) =
                    5.064, <italic>p</italic> = .024). There was also a significant simple main
                    effect of <sc>GAP EMBEDDING</sc>, reflecting the fact that in non-islands,
                    embedded gaps were less acceptable than matrix gaps (<italic>t</italic>(66.25) =
                    &#8211;11.57, <italic>p</italic> &lt; .001; model comparison:
                        <italic>&#967;</italic><sup>2</sup>(1) = 55.868, <italic>p</italic> &lt;
                    .001). Crucially, the characteristic island interaction was also significant,
                    indicating that the difference in acceptability between a gap in an embedded
                    versus a matrix clause is larger for island sentences than for non-island
                    sentences (<italic>t</italic>(604.4) = &#8211;3.73, <italic>p</italic> &lt;
                    .001; model comparison: <italic>&#967;</italic><sup>2</sup>(1) = 13.068,
                        <italic>p</italic> &lt; .001).</p>
                <table-wrap id="T5">
                    <label>Table 5</label>
                    <caption>
                        <p>Experiment 2 results.</p>
                    </caption>
                    <table>
                        <tr>
                            <th colspan="6"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top"/>
                            <th align="right" valign="top"><italic>&#946;</italic></th>
                            <th align="right" valign="top"><sc>D.F.</sc></th>
                            <th align="right" valign="top"><italic>t</italic></th>
                            <th align="right" valign="top"><italic>p</italic></th>
                            <th align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="6"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept
                                    (<italic>Non-island,Matrix</italic>)</td>
                            <td align="right" valign="top">0.054</td>
                            <td align="right" valign="top">41.076</td>
                            <td align="right" valign="top">1.333</td>
                            <td align="right" valign="top">.190</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>ISLANDHOOD</sc>
                                    (<italic>Island</italic>)</td>
                            <td align="right" valign="top">0.096</td>
                            <td align="right" valign="top">57.584</td>
                            <td align="right" valign="top">2.267</td>
                            <td align="right" valign="top">.027</td>
                            <td align="left" valign="top">*</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>GAP POSITION</sc>
                                    (<italic>Embedded</italic>)</td>
                            <td align="right" valign="top">&#8211;0.453</td>
                            <td align="right" valign="top">66.249</td>
                            <td align="right" valign="top">&#8211;11.569</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>ISLANDHOOD</sc> &#215; <sc>GAP
                                    POSITION</sc></td>
                            <td align="right" valign="top">&#8211;0.194</td>
                            <td align="right" valign="top">604.399</td>
                            <td align="right" valign="top">&#8211;3.733</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td colspan="6"><hr/></td>
                        </tr>
                    </table>
                </table-wrap>
                <fig id="F2">
                    <label>Figure 2</label>
                    <caption>
                        <p>Experiment 2 results: Mean <italic>z</italic>-scores per condition. Bars
                            denote standard error.</p>
                    </caption>
                    <graphic xmlns:xlink="http://www.w3.org/1999/xlink"
                        xlink:href="glossapx-1-1-40-g2.png"/>
                </fig>
            </sec>
            <sec>
                <title>3.3 Discussion</title>
                <p>Experiment 2 demonstrates that the sentences tested in Experiment 1 do in fact
                    behave like islands, showing the defining statistical interaction between
                        <sc>ISLANDHOOD</sc> and <sc>GAP EMBEDDING</sc>. However, two aspects of the
                    current findings are atypical with respect to the commonly found pattern in the
                    literature. First, when gaps were in matrix position, the
                        <italic>Island</italic> condition was rated higher than the
                        <italic>Non-island</italic> condition. Second, while significant, the
                    difference between the <italic>Non-island,Embedded</italic> condition and the
                        <italic>Island,Embedded</italic> condition was not as big as is commonly
                    observed (see, for instance, the many results in <xref ref-type="bibr" rid="B66"
                        >Sprouse et al. 2016</xref>).</p>
                <p>The first atypicality &#8211; that islands were more acceptable than non-islands
                    in <italic>Matrix</italic> conditions &#8211; may reflect the increased
                    syntactic diversity of the clause types in the sentence. It has previously been
                    demonstrated that increasing the heterogeneity of syntactic properties of
                    complex sentences can increase their acceptability. For instance, (9a) contains
                    three full Determiner Phrases (<italic>the mouse, the cat</italic>, and
                        <italic>the dog</italic>), and is highly unacceptable. However, (9b), which
                    has the same global structure but contains one quantified nominal
                        (<italic>someone</italic>), one local pronoun (<italic>I</italic>), and one
                    full Determiner Phrase (<italic>the mouse</italic>), is much more acceptable.
                    This is thought to be the result of the fact that mutual distinctiveness among
                    aspects of a stimulus can facilitate working memory processes (<xref
                        ref-type="bibr" rid="B3">Bever 1974</xref>).</p>
                <table-wrap>
                    <table content-type="example">
                        <tbody>
                            <tr>
                                <td>(9)</td>
                                <td>a.</td>
                                <td>&#160;&#160;<italic>Similar nominals</italic></td>
                            </tr>
                            <tr>
                                <td>&#160;</td>
                                <td>&#160;</td>
                                <td>*The mouse [that the cat [the dog chased] stalked]
                                    squeaked.</td>
                            </tr>
                            <tr>
                                <td>&#160;</td>
                                <td>b.</td>
                                <td>&#160;&#160;<italic>Distinctive nominals</italic></td>
                            </tr>
                            <tr>
                                <td>&#160;</td>
                                <td>&#160;</td>
                                <td>&#160;&#160;The mouse [that someone [I know] adopted]
                                    squeaked.</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>In the Experiment 2 stimuli, the types of nominal elements did not vary
                    systematically across conditions as in (9), but the types of clause structures
                    did. In <italic>Island</italic> conditions, stimuli consisted of two declarative
                    clauses and one embedded interrogative clause, whereas
                        <italic>Non-island</italic> stimuli consisted of three declarative clauses.
                    It stands to reason that in these highly demanding sentences, structural
                    diversity may have facilitated working memory in the same way that nominal
                    distinctiveness aids working memory in (9), resulting in the higher
                    acceptability of the <italic>Island,Matrix</italic> condition relative to the
                        <italic>Non-island,Matrix</italic> condition.</p>
                <p>An anonymous reviewer pointed out the second atypical feature of the Experiment 2
                    results: that the difference between the two embedded conditions was smaller
                    than is often reported in the literature. This is potentially concerning
                    because, if the size of this difference is indeed smaller than for the types of
                    island structures that have been previously studied, it may mean that these
                    stimuli are somehow different from &#8220;ordinary&#8221; island violations. If
                    that were true, then the conclusion of Experiment 1 &#8211; that in islands,
                    subject gaps are not inherently worse than object gaps &#8211; may not extend to
                    islands in general.</p>
                <p>However, there are at least three reasons not to be overly concerned about this
                    discrepancy. First, if the structural diversity explanation given above is
                    correct, then the <italic>Island,Embedded</italic> condition is worse than the
                        <italic>Non-island,Embedded</italic> condition <italic>in spite of</italic>
                    an acceptability boost the former receives from containing mutually distinctive
                    clause types. Without this boost, the difference would be bigger.</p>
                <p>Second, the absolute size of a difference in <italic>z</italic>-scores should not
                    generally be compared across experiments. This is because a unit in
                        <italic>z</italic>-scores represents one standard deviation in the
                    distribution of ratings of all stimuli, and therefore depends on the spread of
                    acceptabilities in the particular stimulus set. Thus, in a hypothetical
                    experiment with the same critical items but different fillers, the size of this
                    difference could be larger or smaller.</p>
                <p>Third, and most importantly, the ratings in the <italic>Island,Embedded</italic>
                    condition exhibit signs of a floor effect. That is, differences that we might
                    have expected to be bigger are difficult or impossible to detect because there
                    simply wasn&#8217;t enough room at the bottom of the Likert scale to distinguish
                    between, e.g., <italic>very unacceptable</italic> and <italic>very, very
                        unacceptable</italic>. For instance, the <italic>floor</italic> fillers were
                    included to establish a floor in the study. Thus, if the <italic>floor</italic>
                    fillers were rated lower than the <italic>Island,Embedded</italic> fillers, then
                    it suggests there <italic>was</italic> in fact room to distinguish even lower
                    degrees of acceptability. But this was not the case. The mean
                    <italic>z</italic>-score for <italic>floor</italic> stimuli was
                    &#8211;0.521(S.E.: 0.055), and the mean <italic>z</italic>-score of the
                        <italic>Island,Embedded</italic> condition was &#8211;0.482(S.E.: 0.049). An
                    uncorrected (i.e., liberal) <italic>t</italic>-test shows that the difference is
                    not significant (<italic>t</italic>(1056.1) = &#8211;1.331, <italic>p</italic> =
                    .184). Thus, if the <italic>floor</italic> items did in fact establish a floor
                    in this experiment, then there is no evidence that the
                        <italic>Island,Embedded</italic> condition was not also effectively at
                    floor.</p>
                <p>If the <italic>Island,Embedded</italic> condition was in fact confounded by a
                    floor effect, this does not change the interpretation of the Experiment 2
                    results. In fact, any floor effect in this condition would only have made the
                    interaction smaller, meaning that the critical prediction was borne out
                        <italic>in spite of</italic> another potential effect that worked against
                    it, making the interaction even more credible.</p>
                <p>Thus, despite these atypical features, the results of Experiment 2 verify that
                    these stimuli behave like normal islands. This finding suggests that the
                    conclusion of Experiment 1 &#8211; that subject gaps are not inherently worse
                    than object gaps in islands &#8211; likely generalizes to the types of island
                    structures more commonly tested in the experimental literature.</p>
                <p>While the conclusion of Experiment 1 is consistent with the Island Boundary-Gap
                    Effect and the That-Trace Effect resulting from the same underlying violation,
                    it does not mean that this <italic>must</italic> be the case. Indeed, with a
                    standard experimental approach such as those in Experiments 1 and 2, it is not
                    generally possible to show that two constructs are the same in the Null
                    Hypothesis Significance Testing framework.</p>
                <p>As outlined in the introduction, one tactic for circumventing this limitation is
                    to look for a relationship between individual differences in the That-Trace
                    Effect and the Island Boundary-Gap Effect. In such an analysis, a significant
                    result can constitute evidence for equivalence.</p>
            </sec>
        </sec>
        <sec>
            <title>4. Experiment 3</title>
            <p>One peculiarity of the That-Trace Effect is that it has long been reported that the
                effect varies considerably across speakers (<xref ref-type="bibr" rid="B13">Chomsky
                    &amp; Lasnik 1977</xref>; <xref ref-type="bibr" rid="B54">Pesetsky 1982</xref>;
                    <xref ref-type="bibr" rid="B63">Sobin 1987</xref>; <xref ref-type="bibr"
                    rid="B20">Featherston 2005</xref>). Experiment 3 takes a psychometric approach
                in order to measure this variability and compare it to variability in the Island
                Boundary-Gap Effect. If the That-Trace Effect and the Island Boundary-Gap Effect
                reflect the same constraint, and if there are meaningful individual differences in
                the strength of that constraint, then an individual who perceives the That-Trace
                Effect to be more unacceptable than the average English speaker should also perceive
                the Island Boundary-Gap Effect to be more unacceptable than the average English
                speaker. The effects should therefore correlate across individuals.</p>
            <p>However, a difficulty with this type of design is identifying the source(s) of
                individual differences. Here, even if the two effects are not underlyingly the same,
                individual differences in their ratings are still likely to correlate. This is
                because a host of other processes that vary across individuals underlie behaviors
                like acceptability judgment. For instance, individual differences in pickiness may
                result in overall lower ratings on all sentences for some particularly highfalutin
                individuals. This could lead to a correlation that reflects individual differences
                in pickiness, but not in some single underlying linguistic constraint.</p>
            <p>Thus, while one may surmise that if the two effects are underlyingly the same there
                should be a correlation in individual differences, the reverse inference &#8211;
                that a correlation is evidence of equivalence &#8211; is not necessarily true. In
                order to interpret a correlation as equivalence between two constructs, strong
                control measures must be in place so as to remove all other sources of covariance
                    (<xref ref-type="bibr" rid="B22">Furr 2017</xref>; <xref ref-type="bibr"
                    rid="B14">Crocker &amp; Algina 1986</xref>). If such controls are adequately
                employed, as demonstrated below, then a correlation between individual differences
                in two variables may be regarded as evidence that the two share some processing
                machinery.</p>
            <p>In the present study, one likely source of extraneous correlated individual
                differences is the shared syntactic makeup of the That-Trace Effect and Island
                Boundary-Gap Effect. For instance, both effects involve long-distance dependencies.
                If there are individual differences in the acceptability of long-distance
                dependencies, then these could drive the predicted relationship between individual
                differences in the That-Trace Effect and the Island Boundary-Gap Effect. To mitigate
                this and similar concerns, two control conditions were included in the stimuli (see
                4.1.2).</p>
            <p>There are also nonlinguistic individual differences which could drive such a
                correlation. To minimize attentional differences, comprehension questions were
                included after every trial. To control for different uses of the Likert scale,
                control conditions were included in the analysis as covariates (see 4.2). Finally,
                to avoid inadvertently inducing individual differences by exposing participants to
                different sentences in different orders, stimuli were not counterbalanced or
                randomized, meaning that each participant saw the exact same sentences in the exact
                same order, following common practice in psychometric studies (<xref ref-type="bibr"
                    rid="B69">Swets et al. 2007</xref>).</p>
            <sec sec-type="methods">
                <title>4.1 Method</title>
                <sec>
                    <title>4.1.1 Participants</title>
                    <p>An a priori stopping point for running participants was set at either 150
                        participants or the end of the academic year at UC San Diego, whichever came
                        first. Running was stopped at the end of the year with a total of 104 UC San
                        Diego undergraduates having participated for course credit. None had
                        participated in Experiments 1 or 2.</p>
                    <p>In Experiments 1 and 2, bilinguals had been excluded to avoid transfer
                        effects, where aspects of one grammar bleed over into another (<xref
                            ref-type="bibr" rid="B23">Gas 1979</xref>). If, for instance, the
                        bilingual&#8217;s other language does not have a That-Trace Effect, this
                        might have reduced the size of the effect in Experiment 1. However, in
                        Experiment 3 this was not a concern because, even if a bilingual&#8217;s
                        non-English grammar affects their English grammar, if the That-Trace Effect
                        and the Island Boundary-Gap Effect are indeed the same, then whatever
                        influence the other grammar exerts should be the same for both effects.
                        (Indeed, to foreshadow, bilinguals&#8217; Experiment 3 data patterned with
                        those of monolinguals.)</p>
                    <p>Speaking another language was therefore deemed orthogonal to the question of
                        interest, so bilinguals were allowed to participate so long as they
                        responded to questions in a demographics questionnaire indicating that they
                        (a) self-identified as native English speakers, (b) learned English before
                        the age of 7, and (c) were not aware of having any perceptible non-native
                        accent. Of the 23 bilingual participants, 5 were excluded for not meeting
                        these criteria. As in Experiments 1 and 2, participants were also excluded
                        for answering fewer than 60% of comprehension questions correctly (4
                        exclusions) and/or for not rating ceiling fillers more than two points
                        higher on average than floor fillers (5 exclusions). One additional
                        participant was excluded due to data loss from trial exclusions (see below).
                        A total of 93 participants were included in the analysis.</p>
                </sec>
                <sec>
                    <title>4.1.2 Materials</title>
                    <p>Materials, summarized in Table <xref ref-type="table" rid="T6">6</xref>,
                        consisted of two types of critical items (10 each), two types of control
                        items (10 each), and three types of fillers (12 ceiling, 5 floor, and 10
                        &#8220;middle&#8221; fillers described below). Unlike in previous
                        experiments, conditions were not counterbalanced across items. That is,
                        items consisted of only one sentence rather than a set of four, so the
                        particular lexicalizations in the stimuli were confounded with
                            conditions.<xref ref-type="fn" rid="n7">7</xref> Furthermore, the
                        stimulus order was randomized just once, and then fixed so that all
                        participants saw all stimuli in the same order. This removed the possibility
                        that any observed individual differences might derive from differences in
                        the particular sentences or the particular order that participants saw.</p>
                    <table-wrap id="T6">
                        <label>Table 6</label>
                        <caption>
                            <p>Experiment 3: Sample stimuli.</p>
                        </caption>
                        <table>
                            <tr>
                                <th colspan="2"><hr/></th>
                            </tr>
                            <tr>
                                <th align="left" valign="top" colspan="2">C<sc>RITICAL</sc>
                                        C<sc>ONDITIONS</sc></th>
                            </tr>
                            <tr>
                                <td colspan="2"><hr/></td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"><italic>TTE</italic></td>
                                <td align="left" valign="top">It was the one-legged pirate [who
                                    everyone was saying [that __ kidnapped the princess]].</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"><italic>IBGE</italic></td>
                                <td align="left" valign="top">That&#8217;s the royal [who everyone
                                    is wondering [whether __ will succeed the gravely ill
                                    king]].</td>
                            </tr>
                            <tr>
                                <td colspan="2"><hr/></td>
                            </tr>
                            <tr>
                                <td align="left" valign="top" colspan="2"><bold>C<sc>ONTROL</sc>
                                            C<sc>ONDITIONS</sc></bold></td>
                            </tr>
                            <tr>
                                <td colspan="2"><hr/></td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                    ><italic>Control<sub>TTE</sub></italic></td>
                                <td align="left" valign="top">It must have been someone [who they
                                    think [&#8709;__ had access to sensitive information]].</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"
                                        ><italic>Control<sub>IBGE</sub></italic></td>
                                <td align="left" valign="top">It was the math professor [who the
                                    students all wondered [how you knew [&#8709; __ had a crush on
                                    the history professor]]].</td>
                            </tr>
                            <tr>
                                <td colspan="2"><hr/></td>
                            </tr>
                            <tr>
                                <td align="left" valign="top" colspan="2"
                                        ><bold>F<sc>ILLERS</sc></bold></td>
                            </tr>
                            <tr>
                                <td colspan="2"><hr/></td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"><italic>Ceiling</italic></td>
                                <td align="left" valign="top">There was an enormous painting of a
                                    Teddy bear next to the fireplace.</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"><italic>Middle</italic></td>
                                <td align="left" valign="top">The visitor&#8217;s log from the
                                    secretary&#8217;s secret desk drawer was taken out by Ming.</td>
                            </tr>
                            <tr>
                                <td align="left" valign="top"><italic>Floor</italic></td>
                                <td align="left" valign="top">The bug that the kid that the teacher
                                    scolded trapped ran in circles.</td>
                            </tr>
                            <tr>
                                <td colspan="2"><hr/></td>
                            </tr>
                        </table>
                    </table-wrap>
                    <p>The two critical conditions both contained subject gaps immediately following
                        overt clause-initial function words. In the <italic>TTE</italic> condition,
                        meant to measure the strength of the That-Trace Effect, the gap was in a
                        non-island and the function word was <italic>that</italic>. In the
                            <italic>IBGE</italic> condition, meant to measure the Island
                        Boundary-Gap Effect, the gap was in an island and the function word was
                        either <italic>why</italic> or <italic>whether</italic>.</p>
                    <p>The two control conditions matched the two critical conditions for as many
                        features as possible, except for having a gap immediately after a
                        clause-initial function word. These conditions were designed to remove
                        various potential sources of individual differences, isolating the
                        contribution of a gap immediately following a clause-initial function word.
                        The <italic>Control<sub>TTE</sub></italic> condition matched the
                            <italic>TTE</italic> condition in everything except having an overt
                        clause-initial function word. Similarly, the
                            <italic>Control<sub>IBGE</sub></italic> condition differed from the
                            <italic>IBGE</italic> condition in that it did not have an overt
                        clause-initial function word. However, in order to manipulate the
                        presence/absence of this function word, an additional declarative clause had
                        to be embedded in <italic>Control<sub>IBGE</sub></italic> items, as in
                        Experiment 1.</p>
                    <p>The extra level of embedding in <italic>Control<sub>IBGE</sub></italic>
                        conditions relative to <italic>IBGE</italic> conditions meant that these
                        conditions differed in two ways. This additional difference is not ideal,
                        but poses minimal risk to the interpretation of the results. The critical
                        prediction is a correlation between (a) whatever individual differences
                        exist in <italic>TTE</italic> stimuli above and beyond individual
                        differences in <italic>Control<sub>TTE</sub></italic> stimuli, and (b)
                        whatever individual differences exist in <italic>IBGE</italic> stimuli above
                        and beyond individual differences in <italic>Control<sub>IBGE</sub></italic>
                        stimuli. There is no reason to expect that individual differences in the
                        acceptability of clausal embedding correlate with the strength of the
                        That-Trace Effect. Therefore, any observed correlation can safely be assumed
                        to derive from the only shared property of the two measures: the
                        presence/absence of a gap immediately after a clause-initial function
                        word.</p>
                    <p>Finally, an additional type of filler was added to this study because several
                        participants in Experiments 1 and 2 reported anxiety about correctly
                        responding to comprehension questions after <italic>Floor</italic> filler
                        items, which were difficult to comprehend. The number of
                            <italic>Floor</italic> items was therefore reduced to five, and ten
                            &#8220;<italic>Middle</italic>&#8221; fillers were added. These were
                        composed of an inanimate complex NP subject (e.g., &#8220;The cookies with
                        dried rose petals and organic whole oats&#8230;&#8221;) in a passive
                        sentence with an explicit animate agent (&#8220;&#8230;were eaten by the
                        customer.&#8221;). Heavy NPs tend to be uttered later in English sentences
                            (<xref ref-type="bibr" rid="B60">Ross 1967</xref>), and animate NPs tend
                        to be uttered earlier (<xref ref-type="bibr" rid="B4">Bock &amp; Warren
                            1985</xref>). These sentences therefore violated two norms, and were
                        expected to be rated relatively low, but to be more acceptable and less
                        stress-inducing than the <italic>Floor</italic> fillers.</p>
                </sec>
            </sec>
            <sec>
                <title>4.2 Analysis</title>
                <p>In addition to not randomizing trial order, two other psychometric practices were
                    adopted. First, no trials were removed from the data prior to analysis. This is
                    because excluding trials means that individuals&#8217; mean ratings reflect the
                    means of different items, which could create artifactual individual
                    differences.</p>
                <p>Second, Experiment 3 analyzed raw ratings rather than <italic>z</italic>-scores.
                    This is because <italic>z</italic>-scores remove individual differences by
                    design. To calculate the <italic>z</italic>-score, the mean of all of a
                    participant&#8217;s ratings is subtracted from each of that participant&#8217;s
                    ratings to set each participant&#8217;s mean rating to 0. Each of the
                    participants&#8217; mean-adjusted ratings are then divided by the standard
                    deviation of all of that participant&#8217;s ratings, setting the standard
                    deviation of each participant&#8217;s ratings to 1. Typically, this is done to
                    eliminate differences in the use of the Likert scale. While different uses of
                    the scale are certainly a concern in Experiment 3, <italic>z</italic>-scoring
                    can have the unintended consequence of reducing the individual differences we
                    aim to study.</p>
                <p>To see this, consider a participant who likes <italic>TTE</italic> stimuli more
                    than the average person. For the sake of argument, let us also momentarily
                    assume that there are no individual differences in the acceptability of the
                    other conditions, nor in the use of the Likert scale. The participant who likes
                        <italic>TTE</italic> stimuli more will have a higher overall mean rating
                    (i.e., across all stimuli) than a participant who likes <italic>TTE</italic>
                    stimuli less. In calculating these participants&#8217;
                    <italic>z</italic>-scores, a higher mean will therefore be subtracted from the
                    ratings of the participant who likes <italic>TTE</italic> stimuli more, and a
                    lower mean will be subtracted from the ratings of the participant who likes the
                        <italic>TTE</italic> less, thus reducing the difference between these two
                    participants&#8217; ratings of <italic>TTE</italic> stimuli and obscuring the
                    individual differences. Now, individual differences in other conditions and in
                    the use of the Likert scale probably do exist, so the picture is somewhat
                    noisier. But if these individual differences are not correlated with the
                    individual differences in the <italic>TTE</italic> and <italic>IBGE</italic>
                    (and there is no reason to expect that they are), then, across a large enough
                    sample size, individual differences in other conditions and in the use of the
                    Likert scale should cancel out, and the same logic holds.<xref ref-type="fn"
                        rid="n8">8</xref></p>
                <p>To understand whether the That-Trace Effect and the Island Boundary-Gap Effect
                    derive from the same constraint, it is not enough to show a correlation between
                    participants&#8217; mean ratings of the two critical conditions, because a
                    correlation could be driven by any number of extraneous things. Thus, rather
                    than simply correlating ratings of these conditions with one another, each
                    critical condition was modeled with a separate multiple linear regression in
                    which the control conditions served as covariates. In Model 1,
                    participants&#8217; mean ratings of <italic>TTE</italic> items were modeled as a
                    function of participants&#8217; mean ratings of <italic>IBGE</italic> items, as
                    well as <italic>Control<sub>TTE</sub></italic> items and
                            <italic>Control<sub>IBGE</sub></italic> items. In Model 2,
                    participants&#8217; mean ratings of <italic>IBGE</italic> items were modeled as
                    a function of participants&#8217; mean ratings of <italic>TTE</italic> items,
                            <italic>Control<sub>IBGE</sub></italic> items, and
                            <italic>Control<sub>TTE</sub></italic> items.<xref ref-type="fn"
                        rid="n9">9</xref></p>
                <p>In the particular type of multiple regression employed here (R&#8217;s default
                        <italic>lm()</italic> function), multicollinearity (when two or more of the
                    regressors are correlated) is dealt with by ignoring the shared variance (<xref
                        ref-type="bibr" rid="B49">McElreath 2020</xref>; Ch. 6). Thus, by including
                    the control conditions as covariates, it is ensured that any significant
                    relationship between the two critical terms reflects individual differences in
                    properties shared only by those two conditions.</p>
                <p>This has a similar effect to simply subtracting each participant&#8217;s mean
                    rating of <italic>Control<sub>TTE</sub></italic> stimuli from their mean rating
                    of <italic>TTE</italic> stimuli and correlating that difference with the
                    difference between <italic>IBGE</italic> and
                        <italic>Control<sub>IBGE</sub></italic> stimuli. While subtraction may be
                    somewhat more intuitive, multiple regression offers a few additional benefits,
                    including producing statistics indicating whether these
                    &#8220;subtractions&#8221; remove meaningful variance.</p>
                <p>To make this point more concrete, consider two individuals who map the same
                    degree of subjective unacceptability to different points along the Likert scale.
                    One may assign this degree of unacceptability a 4, and the other may assign it a
                    6. This baseline difference in ratings will likely mean that for each type of
                    structure, the first person&#8217;s ratings will be lower than the second
                    person&#8217;s. This alone is enough to drive a correlation between individual
                    differences in two structures. One might therefore expect that
                        <italic>all</italic> of the independent variables should significantly
                    predict the dependent variables in Models 1 and 2.</p>
                <p>However, because this variance is shared by all of the independent variables, the
                    model cannot tell which of them to attribute it to, so it attributes it to none.
                    Thus, individual differences that are common to all of a participant&#8217;s
                    acceptability ratings, such as those that might arise from differences in the
                    use of the Likert scale, cannot drive a significant effect in the present
                    analyses. If the <italic>IBGE</italic> term is significant in Model 1, it means
                    that individual differences in the Island Boundary-Gap Effect correlate with
                    individual differences in the That-Trace Effect above and beyond what can be
                    explained by individual differences present in the ratings of
                            <italic>Control<sub>TTE</sub></italic> and/or
                            <italic>Control<sub>IBGE</sub></italic>.</p>
                <p>Another benefit of this analysis approach is that the models have built-in tests
                    of two key concepts in psychometric research: <italic>reliability</italic>, or
                    the accuracy of a measure, and <italic>validity</italic>, or the degree to which
                    a measure reflects the construct it is meant to measure (<xref ref-type="bibr"
                        rid="B15">Cronbach &amp; Meehl 1955</xref>). Without high validity and
                    reliability, individual differences are impossible to distinguish from naturally
                    occurring noise.</p>
                <p>The data can be said to have high reliability if a significant relationship is
                    found between <italic>TTE</italic> and <italic>IBGE</italic> because this would
                    indicate that acceptability ratings were an accurate enough measure to reveal
                    individual differences in a shared underlying constraint, despite other
                    differences between these two conditions.</p>
                <p>If acceptability ratings have high enough validity, then the distinct properties
                    of the two control conditions should lead to different patterns of significance
                    in the two models. In Model 1, where the dependent variable is
                        <italic>TTE</italic>, a significant effect of
                        <italic>Control<sub>TTE</sub></italic> is expected, but no effect of
                            <italic>Control<sub>IBGE</sub></italic> because, although it also shares
                    some structural properties with <italic>TTE</italic>, all of these properties
                    are also shared by <italic>Control<sub>TTE</sub></italic>. Similarly, in Model
                    2, <italic>Control<sub>IBGE</sub></italic> is expected to be significant, but
                            <italic>Control<sub>TTE</sub></italic> is not because the former shares
                    all of the same properties with <italic>IBGE</italic> as the latter.</p>
                <p>One final difference between this analysis and that of Experiments 1 and 2 is
                    that Experiment 3 employs a fixed-effects-only model. This is because there are
                    no random effects to feed the model: random effects for participants are
                    intended to remove individual differences, so these are not included, and random
                    effects for items require that the model have access to item-specific data, but
                    the present models analyzed aggregate statistics (i.e., participants&#8217;
                    condition means).</p>
            </sec>
            <sec>
                <title>4.3 Results</title>
                <p>Mean ratings for each condition are shown in Figure <xref ref-type="fig" rid="F3"
                        >3</xref>. Because particular lexicalizations were not counterbalanced
                    across items (i.e., the sentence about a pirate kidnapping a princess only ever
                    appeared in the <italic>Non-island,Overt</italic> condition), any differences
                    between conditions may reflect differences in the acceptability of the
                    particular sentences in a condition. Comparing condition means therefore does
                    not necessarily reveal anything about the true differences between the
                    structures&#8217; acceptability, so no such statistical analysis was
                    performed.</p>
                <fig id="F3">
                    <label>Figure 3</label>
                    <caption>
                        <p>Experiment 3 results: Mean ratings per condition. Bars denote standard
                            error.</p>
                    </caption>
                    <graphic xmlns:xlink="http://www.w3.org/1999/xlink"
                        xlink:href="glossapx-1-1-40-g3.png"/>
                </fig>
                <p>Nonetheless, the pattern is what would be expected had the study included
                    counterbalancing and randomization, perhaps indicating that any
                    lexicalization-specific acceptability differences between conditions were
                    relatively small. The <italic>TTE</italic> items were rated worse than the
                            <italic>Control<sub>TTE</sub></italic> stimuli, as expected due to the
                    That-Trace Effect. In islands, this pattern reversed, and <italic>IBGE</italic>
                    stimuli were rated better than <italic>Control<sub>IBGE</sub></italic> stimuli.
                    This is not surprising given that the <italic>Control<sub>IBGE</sub></italic>
                    condition contained one more clausal embedding than the <italic>IBGE</italic>
                    condition, making it longer and more complex.</p>
                <p>Results of both models are summarized in Table <xref ref-type="table" rid="T7"
                        >7</xref>. Both revealed a strong positive relationship across participants
                    between the strength of the That-Trace Effect and the Island Boundary-Gap Effect
                        (<italic>t</italic>(89) = 6.44, <italic>p</italic> &lt; .001; see Figure
                        <xref ref-type="fig" rid="F4">4</xref>). Note that this is the same effect
                    in both models (and as such has the same <italic>t</italic>-value). The
                    difference between Models 1 and 2 is in the relationships between the dependent
                    variable and the control conditions. In Model 1, <italic>TTE</italic> was
                    significantly predicted by <italic>Control<sub>TTE</sub></italic>
                        (<italic>t</italic>(89) = 3.82, <italic>p</italic> &lt; .001), but not by
                            <italic>Control<sub>IBGE</sub></italic>. In Model 2, the pattern was
                    flipped, and <italic>IBGE</italic> was significantly predicted by
                            <italic>Control<sub>IBGE</sub></italic> (<italic>t</italic>(89) = 7.562,
                        <italic>p</italic> &lt; .001), but not by
                        <italic>Control<sub>TTE</sub></italic>.<xref ref-type="fn" rid="n10"
                        >10</xref></p>
                <table-wrap id="T7">
                    <label>Table 7</label>
                    <caption>
                        <p>Experiment 3 results.</p>
                    </caption>
                    <table>
                        <tr>
                            <th colspan="5"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top" colspan="5">Model 1: TTE as function of
                                IBGE, Control<italic><sub>TTE</sub></italic>, &amp;
                                        Control<italic><sub>IBGE</sub></italic></th>
                        </tr>
                        <tr>
                            <th colspan="5"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top"/>
                            <th align="right" valign="top">E<sc>STIMATE</sc></th>
                            <th align="right" valign="top"><italic>t</italic></th>
                            <th align="right" valign="top"><italic>p</italic></th>
                            <th align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept</td>
                            <td align="right" valign="top">0.788</td>
                            <td align="right" valign="top">1.533</td>
                            <td align="right" valign="top">.139</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><italic>IBGE</italic></td>
                            <td align="right" valign="top">0.631</td>
                            <td align="right" valign="top">6.439</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>TTE</sub></italic></td>
                            <td align="right" valign="top">0.346</td>
                            <td align="right" valign="top">3.818</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>IBGE</sub></italic></td>
                            <td align="right" valign="top">&#8211;0.065</td>
                            <td align="right" valign="top">&#8211;0.583</td>
                            <td align="right" valign="top">.562</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top" colspan="5"><bold>Model 2: IBGE as
                                    function of TTE, Control<italic><sub>TTE</sub></italic>, &amp;
                                            Control<italic><sub>IBGE</sub></italic></bold></td>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"/>
                            <td align="right" valign="top"><bold>E<sc>STIMATE</sc></bold></td>
                            <td align="right" valign="top"><bold><italic>t</italic></bold></td>
                            <td align="right" valign="top"><bold><italic>p</italic></bold></td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept</td>
                            <td align="right" valign="top">&#8211;0.299</td>
                            <td align="right" valign="top">&#8211;0.644</td>
                            <td align="right" valign="top">.521</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><italic>TTE</italic></td>
                            <td align="right" valign="top">0.504</td>
                            <td align="right" valign="top">6.439</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>TTE</sub></italic></td>
                            <td align="right" valign="top">&#8211;0.060</td>
                            <td align="right" valign="top">&#8211;0.686</td>
                            <td align="right" valign="top">.495</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>IBGE</sub></italic></td>
                            <td align="right" valign="top">0.588</td>
                            <td align="right" valign="top">7.562</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                    </table>
                </table-wrap>
                <fig id="F4">
                    <label>Figure 4</label>
                    <caption>
                        <p>Experiment 3 results: Dots represent individuals; bilinguals have blue
                            interiors. In order to show just the relationship between the That-Trace
                            Effect and the Island Boundary-Gap Effect, the <italic>TTE</italic> and
                                <italic>IBGE</italic> values here were both residualized on both
                            control conditions. The dark grey line represents the best-fit line and
                            density plots appear in light grey. A Pearson&#8217;s correlation for
                            the data in this plot reveals a strong, significant relationship:
                                <italic>r</italic> = .564, <italic>t</italic>(91) = 6.511,
                                <italic>p</italic> &lt; .001).</p>
                    </caption>
                    <graphic xmlns:xlink="http://www.w3.org/1999/xlink"
                        xlink:href="glossapx-1-1-40-g4.png"/>
                </fig>
            </sec>
            <sec>
                <title>4.4 Discussion</title>
                <p>The main analysis revealed a significant relationship between
                        <italic>TTE</italic> and <italic>IBGE</italic>. That is, a person who finds
                    the That-Trace Effect to be especially bad is more likely to also find the
                    Island Boundary-Gap Effect especially bad. This relationship was present even
                    though the models removed variance associated with control conditions,
                    indicating that it could not have been driven by some extraneous individual
                    differences such as different uses of the Likert scale.</p>
                <p>It also could not have been driven by individual differences in how people rate
                    ungrammatical structures in general, because
                        <italic>Control<sub>IBGE</sub></italic>, which is ungrammatical due to an
                    island violation, did not significantly predict <italic>TTE</italic> in Model 1.
                    The results therefore support the idea that the That-Trace Effect and the Island
                    Boundary-Gap Effect reflect the same underlying violation.</p>
            </sec>
        </sec>
        <sec>
            <title>5. General Discussion</title>
            <p>This paper investigated a decades-old question: whether the Island Boundary-Gap
                Effect derives from the same underlying constraint as the That-Trace Effect. The two
                phenomena share surface similarities, but they differ in their underlying
                structures. Specifically, the offending word in the That-Trace Effect is a
                complementizer, <italic>that</italic>, while the offending word in the Island
                Boundary-Gap Effect can be a complementizer-like function word (<italic>because,
                    after, wherever</italic>, &#8230;), relative pronoun (<italic>who, why, whether,
                    etc</italic>.), or even an entire referential phrase (<italic>which of these
                    staplers</italic>). Three experiments found evidence supporting the idea that
                the Island Boundary-Gap Effect, at least as instantiated by
                <italic>wh</italic>-islands, shares a common cause with the That-Trace Effect.</p>
            <p>Experiment 1 showed that the That-Trace Effect and the Island Boundary-Gap Effect are
                subject to the same licensing conditions. Specifically, it aimed to rule out the
                competing hypothesis that the Island Boundary-Gap Effect is the result of a specific
                constraint against subject gaps in islands. If instead the Island Boundary-Gap
                Effect has the same underlying cause as the That-Trace Effect, then subject gaps
                should not be less acceptable than object gaps in islands when they are not
                immediately preceded by a clause-initial function word. However, because
                clause-initial function words that introduce islands are never optional, subjecthood
                is almost always confounded with the position immediately after the clause-initial
                function word.</p>
            <p>To remove this confound, the gap was embedded inside yet another clause, headed by
                the optional &#8216;<italic>that</italic>,&#8217; and the effect of a null vs. overt
                complementizer was assessed. Sentences with subject gaps were not rated worse than
                sentences with object gaps after a null complementizer, consistent with the idea
                that the Island Boundary-Gap Effect derives from whatever constraint also gives rise
                to the That-Trace Effect.</p>
            <p>Experiment 2 validated the types of sentences used in Experiment 1 by demonstrating
                that they show the defining characteristic of islands: an interaction between
                islandhood and gap position.</p>
            <p>The standard experimental approach thus demonstrated that the That-Trace Effect and
                the Island Boundary-Gap Effect are subject to the same distributional conditions.
                However, having similar licensing conditions is not proof that two constructs are
                the same. The standard approach leaves open the possibility that the two effects
                derive from different constraints, both of which happen to be alleviated in the
                presence of a null complementizer.</p>
            <p>Experiment 3 employed psychometric methods to provide stronger evidence for
                equivalence. This experiment showed that individual differences in the acceptability
                of the Island Boundary-Gap Effect track with those of the That-Trace Effect, above
                and beyond any correlations with closely matched control structures. This indicates
                that the two effects share a common cause, at least with the Island Boundary-Gap
                Effect instantiated with <italic>wh</italic>-islands. It is left to future work to
                determine whether this finding holds for other types of islands.</p>
            <sec>
                <title>5.1 Individual differences, reliability, and validity</title>
                <p>In addition to demonstrating a novel approach to assessing equivalence between
                    psychological constructs, Experiment 3 made another important contribution: the
                    finding of individual differences in a syntactic representation. Contrary to the
                    idealized notion that native speakers of the same language all have the exact
                    same grammar, speakers in fact have slightly different grammars (<xref
                        ref-type="bibr" rid="B17">D&#261;browska 2012</xref>). However, there is not
                    much prior evidence for individual differences in syntax.</p>
                <p>The clearest such data come from Han et al. (<xref ref-type="bibr" rid="B31"
                        >2007</xref>), who looked for evidence of different grammars among Korean
                    speakers. Specifically, there are multiple syntactic positions that a verb might
                    occupy due to the presence of a phenomenon known as <italic>verb
                        raising</italic>. While in languages with verb-medial word order, like
                    English, verb-raising results in a different surface word order, in verb-final
                    languages like Korean, verb raising would not alter the surface word order. It
                    is therefore possible that some Korean speakers might acquire a grammar with
                    verb raising, while others acquire one without verb raising. Han et al. (<xref
                        ref-type="bibr" rid="B31">2007</xref>) cleverly asked native Korean-speaking
                    children and adults to interpret sentences with negation to determine whether
                    the verb fell within the scope of negation, consistent with a non-verb raising
                    grammar, or outside of the scope of negation, consistent with verb raising.
                    Interestingly, results in both children and adults were bimodal: some speakers
                    appeared to have acquired one grammar, while others had acquired a different
                    one. (See also <xref ref-type="bibr" rid="B16">D&#261;browska 2008 for parallel
                        work demonstrating bimodal individual differences in Polish
                        morphology</xref>.)</p>
                <p>One reason for the paucity of evidence for individual differences like these is
                    the difficulty of controlling for confounds in this kind of research. For
                    instance, individual differences in reading times in language comprehension
                    tasks have been claimed to reflect individual differences in processing (<xref
                        ref-type="bibr" rid="B40">Kidd et al. 2018</xref>). But it is trivially true
                    that individuals&#8217; mean reading times will differ from one another given
                    the presence of noise in the data. To credibly demonstrate individual
                    differences, one must also demonstrate <italic>reliability</italic>. This was
                    achieved in Experiment 3 by showing that individual differences correlated
                    between the <italic>TTE</italic> and <italic>IBGE</italic> conditions.</p>
                <p>But even when reliable individual differences are identified, it is often not
                    possible to attribute them to any particular source. Reading time differences
                    could reflect differences in language processing mechanisms, as Kidd et al.
                        (<xref ref-type="bibr" rid="B40">2018</xref>) argue; but they could also
                    reflect differences in visual processing, attention, or even different computer
                    hardware (<xref ref-type="bibr" rid="B19">Enochson &amp; Culbertson
                    2015</xref>). In addition to reliability, one must therefore also demonstrate
                    that the measure is <italic>valid</italic>. Validity was demonstrated in
                    Experiment 3 by showing that these conditions correlated with related control
                    conditions, but not with unrelated ones.</p>
                <p>The present work demonstrates a clear case of individual differences in native
                    speakers&#8217; grammars. While individual differences in acceptability ratings
                    are generally disregarded as noise, these findings show that they contain
                    meaningful between-subjects variance. Individuals in Experiment 3 differed not
                    only in how strongly they dislike the That-Trace Effect/Island Boundary-Gap
                    Effect, consistent with decades of speculation along these lines (<xref
                        ref-type="bibr" rid="B13">Chomsky &amp; Lasnik 1977</xref>; <xref
                        ref-type="bibr" rid="B54">Pesetsky 1982</xref>; <xref ref-type="bibr"
                        rid="B63">Sobin 1987</xref>), but also in how much they (dis)like island
                    violations and ordinary relative clauses, as evidenced by the significant
                    relationships between the critical items and control items.</p>
            </sec>
            <sec>
                <title>5.2 Why is there a That-Trace Effect at all?</title>
                <p>The finding that the That-Trace Effect and the Island Boundary-Gap Effect derive
                    from the same underlying constraint makes an intriguing typological prediction:
                    that languages should either show sensitivity to both effects or neither (see
                    Goodall (<xref ref-type="bibr" rid="B26">2022</xref>) for similar reasoning
                    about island phenomena). Indeed, this prediction appears to be borne out in
                    Spanish (<xref ref-type="bibr" rid="B70">Torrego 1984</xref>; <xref
                        ref-type="bibr" rid="B68">Su&#241;er 1991</xref>) and Norwegian (<xref
                        ref-type="bibr" rid="B41">Kush &amp; Dahl 2020</xref>), both of which lack
                    the That-Trace Effect and the Island Boundary-Gap Effect. A promising avenue for
                    future work will be to establish whether this pattern holds for a wider set of
                    languages. Such a finding would provide important validation for the unified
                    analysis.</p>
                <p>But a fundamental question still stands: Why should a constraint that prohibits
                    gaps after clause-initial function words exist at all? And why in so many
                    languages? One might posit that something about the semantic or pragmatic
                    structure of these sentences is hard to process or ill-formed, as has been
                    suggested for islands (<xref ref-type="bibr" rid="B24">Goldberg 1995</xref>).
                    But this cannot be the case, as a sentence that is identical to one with a
                    That-Trace Effect violation except in that it has no <italic>that</italic> is
                    perfectly grammatical and has the same meaning.</p>
                <p>This is all the more puzzling when one considers the fact that the existence of
                    this constraint imposes a limit on the expressive capacity of a language. That
                    is, in cases where clause-initial function words are obligatory (as in islands),
                    English provides no good way to form a subject relative clause, as exemplified
                    by (10):</p>
                <table-wrap>
                    <table content-type="example">
                        <tbody>
                            <tr>
                                <td>(10)</td>
                                <td>&#8230;the doctor who I wondered why ___ was always running so
                                        late.<xref ref-type="fn" rid="n11">11</xref></td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>As it is quite common to talk about doctors and why they are running late, it
                    stands to reason that being able to express the intended meaning of this
                    sentence, and similar ones, could be useful for speakers of languages that
                    disallow it like English and Wolof. Why, then, should this particular structure
                    be prohibited?</p>
                <p>This study did not directly investigate the reason for the constraint, but the
                    finding of individual differences in Experiment 3 may provide a clue. That is,
                    whatever the underlying reason for the That-Trace Effect, it must be something
                    that can vary across individuals. One possibility consistent with this is that
                    the That-Trace Effect is an artifact of domain-general processing difficulty, as
                    argued by McDaniel et al. (<xref ref-type="bibr" rid="B48">2015</xref>).</p>
                <p>Another possibility is that the That-Trace Effect simply reflects the low
                    frequency with which English speakers hear the violation (<xref ref-type="bibr"
                        rid="B55">Phillips 2013</xref>). A child learning English might surmise
                    that, if it were allowed in English, she would hear it more frequently. Because
                    she doesn&#8217;t, she implicitly learns that it is unacceptable, rates it as
                    such when participating in a psycholinguistic experiment, and perpetuates this
                    cycle by producing it infrequently. Children who happen to hear more of these
                    structures early in life may go on to report that they are relatively more
                    acceptable later in life, and children who hear fewer should show greater
                    sensitivity. This would be consistent with the fact that the constraint appears
                    to vary across languages (<xref ref-type="bibr" rid="B45">Maling &amp; Zaenen
                        1978</xref>; <xref ref-type="bibr" rid="B52">Nicolis &amp; Biberauer
                        2008</xref>; <xref ref-type="bibr" rid="B9">Chac&#243;n 2015</xref>; <xref
                        ref-type="bibr" rid="B41">Kush &amp; Dahl 2020</xref>), and it converges
                    with recent ideas that a similar process may be at least partially responsible
                    for the idiosyncratic variation in which structures constitute islands across
                    languages (<xref ref-type="bibr" rid="B25">Goodall 2019</xref>).</p>
            </sec>
            <sec>
                <title>5.3 Causal inference</title>
                <p>This paper has relied heavily on a correlation to infer a causal relationship.
                    This inference is not generally licensed given that, if A correlates with B, it
                    could be either because A causes B or because some latent variable, C, causes
                    both A and B. Here it is assumed that a latent variable &#8211; maybe a
                    constraint like (11) &#8211; causes both the That-Trace Effect and the Island
                    Boundary-Gap Effect.</p>
                <table-wrap>
                    <table content-type="example">
                        <tbody>
                            <tr>
                                <td>(11)</td>
                                <td>A gap cannot immediately follow a clause-initial function
                                    word/phrase.</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>(See <xref ref-type="bibr" rid="B6">Bresnan 1972</xref>; <xref ref-type="bibr"
                        rid="B7">1977</xref>; <xref ref-type="bibr" rid="B13">Chomsky &amp; Lasnik
                        1977 for similar proposals</xref>.) It is this constraint that is understood
                    to vary across individuals, leading to the correlation between individual
                    differences in the That-Trace Effect and the Island Boundary-Gap Effect.</p>
                <p>In this case, the conclusion that a latent variable causes the correlation
                    derives not from the data, but from reasoning about the problem. That is, it is
                    not clear how the That-Trace Effect, a property of one type of sentence, could
                    cause a distinct effect in different kinds of sentences. Thus, only the latent
                    variable explanation makes sense.</p>
                <p>In some cases, however, reasoning may not provide a clear answer. For instance,
                    visual and verbal working memory appear to use partially overlapping resources
                        (<xref ref-type="bibr" rid="B50">Morey &amp; Cowan 2005</xref>). It is
                    therefore possible that some component of visual working memory relies causally
                    on the verbal working memory architecture (or vice versa). Here, however, the
                    standard experimental approach can suffice to demonstrate both equivalence and
                    causation. This is because working memory can be manipulated. For instance, if a
                    researcher reduces verbal working memory capacity &#8211; for example by varying
                    memory load, noise, attentional demands, or with transcranial magnetic brain
                    stimulation &#8211; then it is relatively straightforward to infer the
                    underlying relational architecture.</p>
                <p>For linguistic constructs like the ones investigated here, it is not obvious how
                    one might manipulate a syntactic &#8220;rule&#8221; like that in (11) to test
                    for causal consequences. While the psychometric approach exemplified in
                    Experiment 3 cannot definitively establish causality, a correlation may be
                    enough in cases where causality can be inferred from the logic of the problem.
                    For language research, then, the Experiment 3 approach may constitute a
                    particularly useful tool.</p>
            </sec>
        </sec>
        <sec>
            <title>6. Conclusion</title>
            <p>In three experiments, this paper addressed a decades-old question in
                psycholinguistics: whether two ungrammatical constructions, the That-Trace Effect
                and the Island Boundary-Gap Effect, both derive from the same underlying constraint.
                It did so while contrasting two approaches to experimental design to demonstrate how
                researchers might use Null Hypothesis Significance Testing to demonstrate
                    <italic>equivalence</italic> between psychological constructs.</p>
            <p>Experiments 1 and 2 looked for evidence of equivalence using the standard framework,
                and demonstrate that the two effects are subject to the same distributional
                conditions: in both cases, acceptability is only reduced in the presence of an overt
                clause-initial function word. While this finding is consistent with the idea that
                the two reflect the same constraint, it in no way requires this to be the case.
                Experiment 3 uses a psychometric approach and demonstrates that individual
                differences in the That-Trace Effect correlate with individual differences in the
                Island Boundary-Gap Effect, above and beyond what would be expected on the basis of
                individual differences in closely-matched structures. This would not be the case if
                the two did not share some underlying cause.</p>
            <p>These experiments show that the That-Trace Effect and the Island Boundary-Gap Effect,
                at least as instantiated in <italic>wh</italic>-islands, derive from the same
                underlying constraint. The experiments also demonstrate that psychometrics is a
                powerful tool with which to address the often difficult question of equivalence
                within the Null Hypothesis Significance Testing framework. This approach may be
                particularly useful when dealing with constructs like grammatical constraints, which
                cannot easily be manipulated in a standard experimental paradigm.</p>
        </sec>
    </body>
    <back>
        <sec>
            <title>Appendix</title>
            <sec>
                <title>A. Supplementary Analysis: Trial exclusion criteria in Experiments 1 and
                    2</title>
                <p>An anonymous reviewer wondered whether excluding trials in which participants
                    responded incorrectly to a comprehension question for a sentence with an island
                    violation might be overzealous given that the stimuli were ungrammatical. The
                    analyses in Experiments 1 and 2 were therefore repeated, but with different
                    exclusion criteria.</p>
                <p>For Experiment 1, participants were removed if they responded to fewer than 60%
                    of the unambiguous <italic>ceiling</italic> fillers correctly (32 exclusions) or
                    if they did not rate <italic>ceiling</italic> fillers more than two standard
                    deviations higher than <italic>floor</italic> fillers, as before (8 more
                    exclusions). Also as above, trials were removed if they were responded to in
                    less time than it would take to read at a rate of 1 word per 150 ms (71 critical
                    trials) and <italic>ceiling</italic> trials were removed if the comprehension
                    question was responded to incorrectly (103 trials). No critical trials were
                    excluded as all of these involved island violations. A total of 2833 remaining
                    trials from 121 participants were analyzed (as compared to 1967 trials from 106
                    participants in the main analysis). As before, <italic>z</italic>-scores were
                    computed by participant. All other analysis details were the same as in the main
                    analysis.</p>
                <p>Condition means are shown in Figure <xref ref-type="fig" rid="F5">5</xref>. Model
                    results, reported in Table <xref ref-type="table" rid="T8">8</xref>, were
                    qualitatively identical to those reported in the main analysis in the paper
                    (Table <xref ref-type="table" rid="T3">3</xref>), suggesting that the Experiment
                    1 results were relatively robust to particular choices of exclusion
                    criteria.</p>
                <fig id="F5">
                    <label>Figure 5</label>
                    <caption>
                        <p>Experiment 1 condition means without excluding island violation trials
                            for accuracy.</p>
                    </caption>
                    <graphic xmlns:xlink="http://www.w3.org/1999/xlink"
                        xlink:href="glossapx-1-1-40-g5.png"/>
                </fig>
                <table-wrap id="T8">
                    <label>Table 8</label>
                    <caption>
                        <p>Experiment 1: Results of supplementary analysis. The model was the same
                            as the main model reported in the paper (see Table <xref
                                ref-type="table" rid="T3">3</xref>), but data included all trials
                            that had previously been excluded because the comprehension question was
                            answered incorrectly.</p>
                    </caption>
                    <table>
                        <tr>
                            <th colspan="6"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top"/>
                            <th align="right" valign="top"><italic>&#946;</italic></th>
                            <th align="right" valign="top"><sc>D.F.</sc></th>
                            <th align="right" valign="top"><italic>t</italic></th>
                            <th align="right" valign="top"><italic>p</italic></th>
                            <th align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="6"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept
                                (<italic>That,Subject</italic>)</td>
                            <td align="right" valign="top">&#8211;0.551</td>
                            <td align="right" valign="top">89.349</td>
                            <td align="right" valign="top">&#8211;24.427</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>COMPLEMENTIZER</sc>
                                    (<italic>Null</italic>)</td>
                            <td align="right" valign="top">0.143</td>
                            <td align="right" valign="top">55.393</td>
                            <td align="right" valign="top">4.938</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>GAP POSITION</sc>
                                    (<italic>Object</italic>)</td>
                            <td align="right" valign="top">0.054</td>
                            <td align="right" valign="top">549.764</td>
                            <td align="right" valign="top">2.085</td>
                            <td align="right" valign="top">.038</td>
                            <td align="left" valign="top">*</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>COMPLEMENTIZER</sc> &#215; <sc>GAP
                                    POSITION</sc></td>
                            <td align="right" valign="top">&#8211;0.126</td>
                            <td align="right" valign="top">99.399</td>
                            <td align="right" valign="top">&#8211;3.367</td>
                            <td align="right" valign="top">.001</td>
                            <td align="left" valign="top">**</td>
                        </tr>
                        <tr>
                            <td colspan="6"><hr/></td>
                        </tr>
                    </table>
                </table-wrap>
                <p>For Experiment 2, participants were excluded if they responded correctly to fewer
                    than 60% of comprehension questions to &#8220;unambiguous&#8221; stimuli
                    (defined as <italic>ceiling</italic> fillers and the three critical conditions
                    that did not involve an island violation: <italic>Matrix,Non-island,
                        Matrix,Island</italic>, and <italic>Embedded,Non-island</italic>; 39
                    exclusions). Participants were also excluded if they did not rate
                        <italic>ceiling</italic> fillers more than two standard deviations higher
                    than <italic>floor</italic> fillers, as before (8 exclusions). Trials were
                    excluded if they were read in less time than it would take to read at a rate of
                    1 word per 150 ms (20 critical trials). A total of 647 critical trials that did
                    not involve island violations (19.1%) were excluded because the comprehension
                    question was answered incorrectly. All other analysis details were the same as
                    in the main analysis.</p>
                <p>Condition means are shown in Figure <xref ref-type="fig" rid="F6">6</xref>.
                    Again, model results, given in Table <xref ref-type="table" rid="T9">9</xref>,
                    were similar to those in the main analysis (Table <xref ref-type="table"
                        rid="T5">5</xref>), suggesting that the results of this experiment were also
                    relatively robust.</p>
                <fig id="F6">
                    <label>Figure 6</label>
                    <caption>
                        <p>Experiment 2 condition means without excluding island violation trials
                            for accuracy.</p>
                    </caption>
                    <graphic xmlns:xlink="http://www.w3.org/1999/xlink"
                        xlink:href="glossapx-1-1-40-g6.png"/>
                </fig>
                <table-wrap id="T9">
                    <label>Table 9</label>
                    <caption>
                        <p>Experiment 2: Results of supplementary analysis. The model was the same
                            as the main model reported in the paper (see Table <xref
                                ref-type="table" rid="T5">5</xref>), but data included all
                            island-violation trials that had previously been excluded because the
                            comprehension question was answered incorrectly.</p>
                    </caption>
                    <table>
                        <tr>
                            <th colspan="6"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top"/>
                            <th align="right" valign="top"><italic>&#946;</italic></th>
                            <th align="right" valign="top"><sc>D.F.</sc></th>
                            <th align="right" valign="top"><italic>t</italic></th>
                            <th align="right" valign="top"><italic>p</italic></th>
                            <th align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="6"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept
                                    (<italic>Non-island,Matrix</italic>)</td>
                            <td align="right" valign="top">0.093</td>
                            <td align="right" valign="top">35.781</td>
                            <td align="right" valign="top">2.313</td>
                            <td align="right" valign="top">.027</td>
                            <td align="left" valign="top">*</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>ISLANDHOOD</sc>
                                    (<italic>Island</italic>)</td>
                            <td align="right" valign="top">0.091</td>
                            <td align="right" valign="top">49.959</td>
                            <td align="right" valign="top">2.160</td>
                            <td align="right" valign="top">.036</td>
                            <td align="left" valign="top">*</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>GAP POSITION</sc>
                                    (<italic>Embedded</italic>)</td>
                            <td align="right" valign="top">&#8211;0.468</td>
                            <td align="right" valign="top">59.814</td>
                            <td align="right" valign="top">&#8211;12.070</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><sc>ISLANDHOOD</sc> &#215; <sc>GAP
                                    POSITION</sc></td>
                            <td align="right" valign="top">&#8211;0.228</td>
                            <td align="right" valign="top">95.858</td>
                            <td align="right" valign="top">&#8211;4.577</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td colspan="6"><hr/></td>
                        </tr>
                    </table>
                </table-wrap>
            </sec>
            <sec>
                <title>B. Supplementary Analysis: Analyzing z-scores in Experiment 3</title>
                <p>The Experiment 3 models reported in Table <xref ref-type="table" rid="T7"
                        >7</xref> analyzed raw ratings. As explained in Section 4.2,
                        <italic>z</italic>-scoring the ratings from Experiment 3 could have had the
                    unintended consequence of reducing individual differences in
                        <italic>TTE</italic> and <italic>IBGE</italic> stimuli. Specifically,
                    participants who like <italic>TTE</italic> and <italic>IBGE</italic> stimuli
                    more than the average person will likely have higher overall mean ratings on
                    average. (<italic>TTE</italic> and <italic>IBGE</italic> stimuli made up roughly
                    30% of all stimuli in Experiment 3.) Subtracting these higher means from their
                    ratings will reduce their ratings of <italic>TTE</italic> stimuli more than it
                    would for a person who finds the <italic>TTE</italic> worse than the average
                    person. Indeed, people who like <italic>TTE</italic> stimuli less will likely
                    have lower overall mean ratings, and subtracting these means from their ratings
                    will increase their ratings relative to participants who like
                        <italic>TTE</italic> stimuli more. The end result is a reduction in the
                    difference between these two types of participants, obscuring the individual
                    differences of interest.</p>
                <p>One potential way to avoid this would be to calculate <italic>z</italic>-scores
                    with the means and standard deviations of just filler trials. In this way,
                    participants&#8217; ratings of the critical and control conditions do not
                    contribute to scaling their ratings. Such an approach comes with its own risks.
                    For instance, participants&#8217; mean and standard deviations are calculated
                    from only 27 ratings (just filler trials), meaning that these estimates are
                    likely to be noisier than in an ordinary z-score approach (which would estimate
                    these statistics using all 67 trials&#8217; ratings). The approach furthermore
                    assumes that ratings of filler conditions are independent of ratings of critical
                    and control conditions, which is probably not true. Sprouse (<xref
                        ref-type="bibr" rid="B65">2009</xref>) demonstrated that participants in a
                    binary forced choice task aim to give roughly the same number of each of the two
                    types of response. It is easy to imagine that something similar happens with
                    Likert scale rating tasks. For example, participants may aim to make their mean
                    rating across stimuli a 5. If a participant strongly dislikes
                        <italic>TTE</italic> stimuli, they may give fillers higher ratings to
                    compensate for the lower ratings they assign to <italic>TTE</italic> items.</p>
                <p>To determine whether anything hinged on the choice to analyze raw ratings in
                    Experiment 3, the models reported in Table <xref ref-type="table" rid="T7"
                        >7</xref> were re-run using <italic>z</italic>-scores calculated using
                    participants&#8217; means and standard deviations calculated using just filler
                    trials. Results, reported in Table <xref ref-type="table" rid="T10">10</xref>,
                    largely pattern with those reported in the main analysis.</p>
                <table-wrap id="T10">
                    <label>Table 10</label>
                    <caption>
                        <p>Experiment 3: Results of supplementary analysis modeling modified
                                <italic>z</italic>-scores.</p>
                    </caption>
                    <table>
                        <tr>
                            <th colspan="5"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top" colspan="5">Model 1: <italic>TTE &#8764;
                                    IBGE</italic> + <italic>Control<sub>TTE</sub></italic> +
                                        <italic>Control<sub>IBGE</sub></italic></th>
                        </tr>
                        <tr>
                            <th colspan="5"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top"/>
                            <th align="right" valign="top">E<sc>STIMATE</sc></th>
                            <th align="right" valign="top"><italic>t</italic></th>
                            <th align="right" valign="top"><italic>p</italic></th>
                            <th align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept</td>
                            <td align="right" valign="top">0.080</td>
                            <td align="right" valign="top">0.740</td>
                            <td align="right" valign="top">.461</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><italic>IBGE</italic></td>
                            <td align="right" valign="top">0.539</td>
                            <td align="right" valign="top">5.884</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>TTE</sub></italic></td>
                            <td align="right" valign="top">0.078</td>
                            <td align="right" valign="top">0.773</td>
                            <td align="right" valign="top">.442</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>IBGE</sub></italic></td>
                            <td align="right" valign="top">&#8211;0.033</td>
                            <td align="right" valign="top">&#8211;0.280</td>
                            <td align="right" valign="top">.780</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" colspan="5"><bold>Model 2: <italic>IBGE &#8764;
                                        TTE</italic> + <italic>Control<sub>TTE</sub></italic> +
                                            <italic>Control<sub>IBGE</sub></italic></bold></td>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"/>
                            <td align="right" valign="top"><bold>E<sc>STIMATE</sc></bold></td>
                            <td align="right" valign="top"><bold><italic>t</italic></bold></td>
                            <td align="right" valign="top"><bold><italic>p</italic></bold></td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept</td>
                            <td align="right" valign="top">&#8211;0.045</td>
                            <td align="right" valign="top">&#8211;0.420</td>
                            <td align="right" valign="top">.675</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><italic>TTE</italic></td>
                            <td align="right" valign="top">0.519</td>
                            <td align="right" valign="top">5.884</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>TTE</sub></italic></td>
                            <td align="right" valign="top">&#8211;0.074</td>
                            <td align="right" valign="top">&#8211;0.747</td>
                            <td align="right" valign="top">.457</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>IBGE</sub></italic></td>
                            <td align="right" valign="top">0.569</td>
                            <td align="right" valign="top">5.709</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                    </table>
                </table-wrap>
                <p>Critically, as in the main analyses, <italic>IBGE</italic> significantly predicts
                        <italic>TTE</italic> in Model 1 and <italic>TTE</italic> significantly
                    predicts <italic>IBGE</italic> in Model 2. Also as in the main analyses,
                            <italic>Control<sub>IBGE</sub> z</italic>-scores significantly predicted
                        <italic>IBGE z</italic>-scores in Model 2. The only difference in the
                    pattern of significant/non-significant results between these analyses and the
                    main ones reported in Table <xref ref-type="table" rid="T7">7</xref> is that
                            <italic>Control<sub>TTE</sub></italic> did not significantly predict
                        <italic>TTE</italic> in Model 1. Note that this effect is not necessary for
                    concluding that individual differences in the That-Trace Effect correlate with
                    Individual Differences in the Island Boundary-Gap Effect. Thus, the results of
                    this analysis suggest that the findings of Experiment 3 are relatively robust to
                    Experimenter degrees of freedom in data preprocessing.</p>
            </sec>
            <sec>
                <title>C. Supplementary Analysis: Accounting for measurement error in Experiment
                    3</title>
                <p>The Experiment 3 models analyzed participants&#8217; means in each of the
                    critical and control conditions, thus ignoring measurement error associated with
                    the means. There exist models, commonly used in biological and economic
                    research, which aim to take into account the uncertainty in the predictors,
                    giving better estimates of the slopes (<italic>&#946;</italic>s). However, it is
                    important to note that measurement error in the independent variable(s) affects
                    model results in a different way from measurement error in the dependent
                    variable, the latter of which is a much more familiar problem in
                    psycholinguistics. With increased measurement error in the dependent variable,
                    the result is increased uncertainty in the estimated slope, but not bias in any
                    particular direction. However, when there is measurement error in the
                    independent variable(s), as in the present case, it causes dilution (or
                    attenuation bias): the systematic underestimation of the slopes and
                    corresponding increase in <italic>p</italic>-values (<xref ref-type="bibr"
                        rid="B27">Gorrod et al. 2013</xref>; <xref ref-type="bibr" rid="B5">Bound et
                        al. 2001</xref>; <xref ref-type="bibr" rid="B57">Riggs et al. 1978</xref>;
                        <xref ref-type="bibr" rid="B59">Rosner et al. 1992</xref>). Thus, if an
                    accurate estimate of the effect size is the goal, then it is very important to
                    take measurement error in the independent variables into account (although this
                    is almost never done in psycholinguistics). However, if the goal is simply to
                    establish a relationship, as in the present case, ignoring measurement error in
                    the independent variables is safe, and in fact a more conservative approach as
                    the result will be an underestimation of the slope(s).</p>
                <p>A clever suggestion by a reviewer was to analyze the Experiment 3 data by
                    modeling individual differences as random effects in a linear mixed-effects
                    model. The advantage of this approach is that it accounts for measurement error
                    while simultaneously modeling item-based differences, thus removing another
                    extraneous source of variance. A linear mixed effects model of raw ratings was
                    fit with a four-level fixed effect term for <sc>CONDITION</sc> (levels:
                        <italic>TTE, IBGE, Control<sub>TTE</sub></italic>, and
                            <italic>Control<sub>IBGE</sub></italic>) with random effects for
                    participants and items, and a random slope for condition nested within
                    participants. The frequentist model would not converge, so the same model was
                    fit in the Bayesian framework using the <italic>brms</italic> package (<xref
                        ref-type="bibr" rid="B8">B&#252;rkner 2017</xref>). The random effects
                    matrix for participants was used as estimates of individual differences per
                    condition.</p>
                <p>These estimates were then plugged into the same simple linear models used in the
                    manuscript (Table <xref ref-type="table" rid="T7">7</xref>). The results,
                    reported in Table <xref ref-type="table" rid="T11">11</xref>, revealed a
                    slightly different pattern of results from that reported in the paper.
                    Critically, however, the relationship between the <italic>TTE</italic> and
                        <italic>IBGE</italic> conditions remained positive and significant in both
                    models.</p>
                <table-wrap id="T11">
                    <label>Table 11</label>
                    <caption>
                        <p>Experiment 3 results: Models of individual differences, as estimated by a
                            Bayesian linear mixed-effects regression.</p>
                    </caption>
                    <table>
                        <tr>
                            <th colspan="5"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top" colspan="5">Model 1: <italic>TTE &#8764;
                                    IBGE</italic> + <italic>Control<sub>TTE</sub></italic> +
                                        <italic>Control<sub>IBGE</sub></italic></th>
                        </tr>
                        <tr>
                            <th colspan="5"><hr/></th>
                        </tr>
                        <tr>
                            <th align="left" valign="top"/>
                            <th align="right" valign="top">E<sc>STIMATE</sc></th>
                            <th align="right" valign="top"><italic>t</italic></th>
                            <th align="right" valign="top"><italic>p</italic></th>
                            <th align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept</td>
                            <td align="right" valign="top">0.001</td>
                            <td align="right" valign="top">0.013</td>
                            <td align="right" valign="top">.990</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><italic>IBGE</italic></td>
                            <td align="right" valign="top">3.624</td>
                            <td align="right" valign="top">4.685</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>TTE</sub></italic></td>
                            <td align="right" valign="top">0.3688</td>
                            <td align="right" valign="top">1.118</td>
                            <td align="right" valign="top">&lt;.267</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>IBGE</sub></italic></td>
                            <td align="right" valign="top">&#8211;2.678</td>
                            <td align="right" valign="top">&#8211;4.990</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" colspan="5"><bold>Model 2: <italic>IBGE &#8764;
                                        TTE</italic> + <italic>Control<sub>TTE</sub></italic> +
                                            <italic>Control<sub>IBGE</sub></italic></bold></td>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"/>
                            <td align="right" valign="top"><bold>E<sc>STIMATE</sc></bold></td>
                            <td align="right" valign="top"><bold><italic>t</italic></bold></td>
                            <td align="right" valign="top"><bold><italic>p</italic></bold></td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                        <tr>
                            <td align="left" valign="top">Intercept</td>
                            <td align="right" valign="top">0.000</td>
                            <td align="right" valign="top">&#8211;0.062</td>
                            <td align="right" valign="top">.950</td>
                            <td align="left" valign="top"/>
                        </tr>
                        <tr>
                            <td align="left" valign="top"><italic>TTE</italic></td>
                            <td align="right" valign="top">0.055</td>
                            <td align="right" valign="top">4.685</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>TTE</sub></italic></td>
                            <td align="right" valign="top">&#8211;0.330</td>
                            <td align="right" valign="top">&#8211;15.732</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td align="left" valign="top"
                                ><italic>Control<sub>IBGE</sub></italic></td>
                            <td align="right" valign="top">0.681</td>
                            <td align="right" valign="top">36.869</td>
                            <td align="right" valign="top">&lt;.001</td>
                            <td align="left" valign="top">***</td>
                        </tr>
                        <tr>
                            <td colspan="5"><hr/></td>
                        </tr>
                    </table>
                </table-wrap>
            </sec>
        </sec>
        <sec>
            <title>Data Accessibility Statement</title>
            <p>Data and analyses are publicly available online. <ext-link ext-link-type="uri"
                    xmlns:xlink="http://www.w3.org/1999/xlink"
                    xlink:href="https://doi.org/10.17605/OSF.IO/FN5AT"
                    >https://doi.org/10.17605/OSF.IO/FN5AT</ext-link>.</p>
        </sec>
        <fn-group>
            <fn id="n1">
                <p>Specifically, while complementizers are the head (C<sup>0</sup>) of a
                    complementizer phrase (CP), relative pronouns and wh- operators like
                        <italic>who, which</italic> and <italic>why</italic> appear in the specifier
                    position of CP, which is structurally above C<sup>0</sup>. The existence of
                    these two positions is made especially clear by dialects of English like that
                    spoken in Belfast, which allows relative pronouns and complementizers to
                    co-occur, as in &#8220;They didn&#8217;t know which model that they
                    discussed&#8221; (<xref ref-type="bibr" rid="B1">Baltin 2010: 331, Ex. 1</xref>;
                    see also <xref ref-type="bibr" rid="B71">van Gelderen 2013</xref> and <xref
                        ref-type="bibr" rid="B58">Rizzi &amp; Shlonsky 2008: 131, Ex. 30 for
                        Qu&#233;bec French</xref>).</p>
            </fn>
            <fn id="n2">
                <p>There do exist frameworks for assessing the null hypothesis, such as Bayes Factor
                    Analysis (<xref ref-type="bibr" rid="B36">Kass &amp; Raftery 1995</xref>)
                    (although it is hard to imagine how one might apply Bayes Factor here due to the
                    many extraneous differences between the structures). There are also some
                    existing approaches to demonstrating equivalence with Null Hypothesis
                    Significance Testing. In pharmacological research, the Two One-Sided
                        <italic>t</italic>-Test is commonly used to provide evidence for
                    equivalence. However, a more accurate characterization of this test is that it
                    provides evidence that two conditions are similar enough so as to be
                    inconsequential, but not necessarily that they are equivalent. See also Hoenig
                    &amp; Heisey (<xref ref-type="bibr" rid="B33">2001</xref>) on arguing for the
                    null when power is sufficiently high.</p>
            </fn>
            <fn id="n3">
                <p>Note that this reflects an unusually high number of participant exclusions. This
                    was deemed acceptable in all experiments because the exclusion criteria seemed a
                    priori reasonable, and even without these participants the experiments still had
                    more participants than the typical linguistic experiment. Post-hoc analyses
                    revealed that critical findings are robust to various exclusion criteria;
                    original data are available online at <ext-link ext-link-type="uri"
                        xmlns:xlink="http://www.w3.org/1999/xlink"
                        xlink:href="https://osf.io/fn5at/">https://osf.io/fn5at/</ext-link>.</p>
            </fn>
            <fn id="n4">
                <p>As with any analysis, the approach outlined here includes a number of
                    experimenter degrees of freedom, including the choice to exclude trials with
                    incorrect responses. A series of supplementary analyses, reported in the
                    appendices, show that the patterns of results across experiments were relatively
                    robust to these choices. Appendix A reports the results of a model analyzing
                    data from Experiments 1 and 2 in which trials were not excluded on the basis of
                    an incorrect comprehension question response if the stimulus involved an island
                    violation.</p>
            </fn>
            <fn id="n5">
                <p>Unexpectedly, ratings of the <italic>That,Object</italic> condition appeared to
                    be lower than those of the <italic>Null,Object</italic> condition. However,
                    pairwise comparisons revealed no significant difference between the two
                    conditions (<italic>That,Object</italic> &#8211; <italic>Null,Object</italic> =
                    &#8211;0.027, <italic>t</italic>(47.1) = &#8211;0.700, <italic>p</italic> =
                    .897), suggesting that this numerical difference was just noise.</p>
            </fn>
            <fn id="n6">
                <p>Due to an oversight, the design of the stimuli was slightly different from that
                    in Experiment 1. It is unlikely that this difference affected the results in any
                    important way, but in the interest of transparency it is explained here. In
                    Experiment 1, stimuli were designed so that the head noun was the same across
                    all four conditions within a given item (i.e., all four conditions for the first
                    item began with &#8220;It was the doctor who;&#8221; see Table <xref
                        ref-type="table" rid="T2">2</xref>). The trade-off for keeping the head noun
                    consistent across conditions was that the meaning of the proposition in the
                    lowest clause differed across conditions as a function of the <sc>GAP
                        POSITION</sc> manipulation. In the <italic>subject</italic> gap condition
                    (&#8220;(that)___hit the lawyer with a bat&#8221;), the doctor is the agent of
                    the hitting event. In the <italic>object</italic> gap condition (&#8220;(that)
                    the lawyer hit___with a bat&#8221;), the doctor is the patient. In contrast, in
                    Experiment 2, stimuli were designed so that the event remained the same across
                    all four conditions. However, this meant that the head noun differed as a
                    function of the <sc>GAP POSITION</sc> manipulation (see Table <xref
                        ref-type="table" rid="T4">4</xref>). Fortunately, this seems unlikely to
                    impact the generalizability of Experiment 2 findings to Experiment 1 given that
                    the global structure of the stimuli was the same across experiments. One might
                    be concerned that changing the role of the gap across conditions in Experiment 1
                    would result in different conditions having different degrees of plausibility.
                    However, the stimuli were designed so that any ordering of the nouns in the
                    sentence was roughly equally (im)plausible in order to ensure participants had
                    to parse the dependency to correctly respond to the comprehension question
                    (rather than reasoning over world knowledge). That is, there were no sentences
                    with events like <italic>lawyer defends burglar</italic>, where
                        <italic>lawyer</italic> is a priori much more likely to be the agent of the
                    verb <italic>defend</italic> than <italic>burglar</italic>, so the plausibility
                    of the events remained relatively constant across conditions.</p>
            </fn>
            <fn id="n7">
                <p>Given no a priori reason to believe that properties like the particular
                    tense/aspect or the use of modals might be able to drive the critical effect in
                    this study, the stimuli were diversified with respect to these properties to
                    make the task less monotonous. The sample stimuli in Table <xref
                        ref-type="table" rid="T6">6</xref> are random samples from the full set, so,
                    for instance, not all <italic>Control<sub>TTE</sub></italic> conditions
                    contained modals, and some items in other conditions did contain modals. In
                    order for this to be able to drive a correlation between individual differences
                    in <italic>TTE</italic> and <italic>IBGE</italic> above and beyond what can be
                    accounted for by control conditions, differences in the use of modals (for
                    example) would have to be correlated in the two critical conditions, but not in
                    the control conditions. This was not the case.</p>
            </fn>
            <fn id="n8">
                <p>There are still good arguments for <italic>z</italic>-scoring, even considering
                    the concerns outlined here. One middle-ground approach may be to calculate
                        <italic>z</italic>-scores based on ratings of filler items only, so that the
                    individual differences of interest do not contribute to the participant
                    means/standard deviations used to scale ratings. Analyses of ratings transformed
                    in this way are reported in Appendix B. Results largely pattern with those of
                    the main analyses, suggesting that the findings are relatively robust to
                    experimenter degrees of freedom in data preprocessing.</p>
            </fn>
            <fn id="n9">
                <p>Note that this approach does not take into account measurement error in the
                    participant means. However, measurement error in independent variables leads to
                    a systematic <italic>under</italic>-estimation of effect sizes and
                    correspondingly higher <italic>p</italic>-values (<xref ref-type="bibr"
                        rid="B27">Gorrod et al. 2013</xref>; <xref ref-type="bibr" rid="B5">Bound et
                        al. 2001</xref>), thus making models that ignore this error (like the
                    present ones) more conservative. One approach to accounting for measurement
                    error here is demonstrated in Appendix C with Bayesian mixed-effects models.</p>
            </fn>
            <fn id="n10">
                <p>The <italic>IBGE</italic> and <italic>Control<sub>IBGE</sub></italic> stimuli in
                    Experiment 3 contained &#8220;why&#8221; and &#8220;whether&#8221; as relative
                    pronouns introducing the island clause. An anonymous reviewer wondered whether
                    these different types of wh- words might behave differently. Supplementary
                    subset analyses, which can be found on the OSF repository, show that the pattern
                    of significant results remains exactly the same for the models in Table <xref
                        ref-type="table" rid="T7">7</xref> when run on subsets of the data with only
                    the relativizer <italic>why</italic> or <italic>whether</italic>.</p>
            </fn>
            <fn id="n11">
                <p>Of course, there is no <italic>great</italic> way of expressing this relative
                    clause in the first place because of the island violation. But some islands,
                    including <italic>wh</italic>-islands like the ones used throughout this paper,
                    have been shown to be moderately acceptable with gaps in object positions, as in
                    (8). The finding of Experiment 1 &#8211; that subject gaps are not worse than
                    object gaps in islands when not immediately preceded by a clause-initial
                    function word &#8211; suggests that, without the Island Boundary-Gap Effect,
                    (10) might be a reasonably acceptable sentence.</p>
            </fn>
        </fn-group>
        <sec>
            <title>Ethics and consent</title>
            <p>The experiments presented here were approved by the UC San Diego IRB. All
                participants provided informed consent prior to participation.</p>
        </sec>
        <ack>
            <title>Acknowledgements</title>
            <p>Thanks to Matt Wagers, Brianna Kaufman, and Victor S. Ferreira for invaluable help in
                the preparation of this manuscript.</p>
        </ack>
        <sec>
            <title>Funding information</title>
            <p>This research was supported in part by the Graduate Research Fellowship Program at
                the National Science Foundation under grant DGE-1144086 to the author.</p>
        </sec>
        <sec>
            <title>Competing interests</title>
            <p>The author has no competing interests to declare.</p>
        </sec>
        <sec>
            <title>Author contributions</title>
            <p>All authors contributed exactly equally.</p>
        </sec>
        <ref-list>
            <ref id="B1">
                <label>1</label>
                <mixed-citation publication-type="journal"><string-name><surname>Baltin</surname>,
                            <given-names>Mark</given-names></string-name>. <year>2010</year>.
                        <article-title>The nonreality of doubly filled comps</article-title>.
                        <source>Linguistic Inquiry</source>
                    <volume>41</volume>(<issue>2</issue>).
                        <fpage>331</fpage>&#8211;<lpage>335</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1162/ling.2010.41.2.331</pub-id></mixed-citation>
            </ref>
            <ref id="B2">
                <label>2</label>
                <mixed-citation publication-type="journal"><string-name><surname>Bates</surname>,
                            <given-names>Douglas</given-names></string-name>,
                            <string-name><given-names>Martin</given-names>
                        <surname>M&#228;chler</surname></string-name>,
                            <string-name><given-names>Ben</given-names>
                        <surname>Bolker</surname></string-name> &amp;
                            <string-name><given-names>Steve</given-names>
                        <surname>Walker</surname></string-name>. <year>2015</year>.
                        <article-title>Fitting linear mixed-effects models using
                        lme4</article-title>. <source>Journal of Statistical Software</source>
                    <volume>67</volume>(<issue>1</issue>). <fpage>1</fpage>&#8211;<lpage>48</lpage>.
                    DOI: <pub-id pub-id-type="doi">10.18637/jss.v067.i01</pub-id></mixed-citation>
            </ref>
            <ref id="B3">
                <label>3</label>
                <mixed-citation publication-type="book"><string-name><surname>Bever</surname>,
                            <given-names>Thomas G.</given-names></string-name>
                    <year>1974</year>. <chapter-title>The ascent of the specious; or, There&#8217;s
                        a lot we don&#8217;t know about mirrors</chapter-title>. In
                            <string-name><given-names>David</given-names>
                        <surname>Cohen</surname></string-name> (ed.), <source>Explaining linguistic
                        phenomena</source>, <fpage>173</fpage>&#8211;<lpage>200</lpage>.
                        <publisher-loc>Washington, DC</publisher-loc>: <publisher-name>John
                        Wiley</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B4">
                <label>4</label>
                <mixed-citation publication-type="journal"><string-name><surname>Bock</surname>,
                            <given-names>J. Kathryn</given-names></string-name> &amp;
                            <string-name><given-names>Richard K.</given-names>
                        <surname>Warren</surname></string-name>. <year>1985</year>.
                        <article-title>Conceptual accessibility and syntactic structure in sentence
                        formulation</article-title>. <source>Cognition</source>
                    <volume>21</volume>(<issue>1</issue>).
                    <fpage>47</fpage>&#8211;<lpage>67</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1016/0010-0277(85)90023-X</pub-id></mixed-citation>
            </ref>
            <ref id="B5">
                <label>5</label>
                <mixed-citation publication-type="book"><string-name><surname>Bound</surname>,
                            <given-names>John</given-names></string-name>,
                            <string-name><given-names>Charles</given-names>
                        <surname>Brown</surname></string-name> &amp;
                            <string-name><given-names>Nancy</given-names>
                        <surname>Mathiowetz</surname></string-name>. <year>2001</year>.
                        <chapter-title>Measurement error in survey data</chapter-title>. In
                            <string-name><given-names>James</given-names>
                        <surname>Heckman</surname></string-name> &amp;
                            <string-name><given-names>Ed</given-names>
                        <surname>Learner</surname></string-name> (eds.), <source>Handbook of
                        econometrics</source>
                    <volume>5</volume>. <fpage>3705</fpage>&#8211;<lpage>3843</lpage>.
                        <publisher-loc>New York</publisher-loc>:
                        <publisher-name>Springer-Verlag</publisher-name>. DOI: <pub-id
                        pub-id-type="doi">10.1016/S1573-4412(01)05012-7</pub-id></mixed-citation>
            </ref>
            <ref id="B6">
                <label>6</label>
                <mixed-citation publication-type="thesis"><string-name><surname>Bresnan</surname>,
                            <given-names>Joan W.</given-names></string-name>
                    <year>1972</year>. <source>Theory of complementation in English syntax</source>.
                        <publisher-loc>Cambridge, MA</publisher-loc>:
                        <publisher-name>MIT</publisher-name> dissertation.</mixed-citation>
            </ref>
            <ref id="B7">
                <label>7</label>
                <mixed-citation publication-type="book"><string-name><surname>Bresnan</surname>,
                            <given-names>Joan W.</given-names></string-name>
                    <year>1977</year>. <chapter-title>Variables in the theory of
                        transformations</chapter-title>. In <string-name><given-names>P.
                            P.</given-names>
                        <surname>Culicover</surname></string-name>,
                            <string-name><given-names>T.</given-names>
                        <surname>Wasow</surname></string-name> &amp;
                            <string-name><given-names>A.</given-names>
                        <surname>Akmajian</surname></string-name> (eds.), <source>Formal
                        syntax</source>. <publisher-loc>New York</publisher-loc>:
                        <publisher-name>Academic Press</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B8">
                <label>8</label>
                <mixed-citation publication-type="journal"
                            ><string-name><surname>B&#252;rkner</surname>,
                            <given-names>Paul-Christian</given-names></string-name>.
                        <year>2017</year>. <article-title>brms: An R package for Bayesian multilevel
                        models using Stan</article-title>. <source>Journal of statistical
                        software</source>
                    <volume>80</volume>(<issue>1</issue>). <fpage>1</fpage>&#8211;<lpage>28</lpage>.
                    DOI: <pub-id pub-id-type="doi">10.18637/jss.v080.i01</pub-id></mixed-citation>
            </ref>
            <ref id="B9">
                <label>9</label>
                <mixed-citation publication-type="thesis"
                            ><string-name><surname>Chac&#243;n</surname>, <given-names>Dustin
                            Alfonso</given-names></string-name>. <year>2015</year>.
                        <source>Comparative psychosyntax</source>. <publisher-loc>College Park,
                        MD</publisher-loc>: <publisher-name>University of Maryland</publisher-name>
                    dissertation.</mixed-citation>
            </ref>
            <ref id="B10">
                <label>10</label>
                <mixed-citation publication-type="journal"><string-name><surname>Chomsky</surname>,
                            <given-names>Noam</given-names></string-name>. <year>1980</year>.
                        <article-title>On binding</article-title>. <source>Linguistic
                        inquiry</source>
                    <volume>11</volume>(<issue>1</issue>).
                    <fpage>1</fpage>&#8211;<lpage>46</lpage>.</mixed-citation>
            </ref>
            <ref id="B11">
                <label>11</label>
                <mixed-citation publication-type="book"><string-name><surname>Chomsky</surname>,
                            <given-names>Noam</given-names></string-name>. <year>1981</year>.
                        <source>Lectures on government and binding: The Pisa lectures</source>.
                        <publisher-name>Walter de Gruyter</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B12">
                <label>12</label>
                <mixed-citation publication-type="book"><string-name><surname>Chomsky</surname>,
                            <given-names>Noam</given-names></string-name>. <year>2014</year>.
                        <source>The minimalist program</source>. <publisher-name>MIT
                        press</publisher-name>. DOI: <pub-id pub-id-type="doi"
                        >10.7551/mitpress/9780262527347.001.0001</pub-id></mixed-citation>
            </ref>
            <ref id="B13">
                <label>13</label>
                <mixed-citation publication-type="journal"><string-name><surname>Chomsky</surname>,
                            <given-names>Noam</given-names></string-name> &amp;
                            <string-name><given-names>Howard</given-names>
                        <surname>Lasnik</surname></string-name>. <year>1977</year>.
                        <article-title>Filters and control</article-title>. <source>Linguistic
                        Inquiry</source>
                    <volume>8</volume>(<issue>3</issue>).
                        <fpage>425</fpage>&#8211;<lpage>504</lpage>.</mixed-citation>
            </ref>
            <ref id="B14">
                <label>14</label>
                <mixed-citation publication-type="book"><string-name><surname>Crocker</surname>,
                            <given-names>Linda</given-names></string-name> &amp;
                            <string-name><given-names>James</given-names>
                        <surname>Algina</surname></string-name>. <year>1986</year>.
                        <source>Introduction to classical and modern test theory</source>.
                        <publisher-name>ERIC</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B15">
                <label>15</label>
                <mixed-citation publication-type="journal"><string-name><surname>Cronbach</surname>,
                            <given-names>Lee J.</given-names></string-name> &amp;
                            <string-name><given-names>Paul E.</given-names>
                        <surname>Meehl</surname></string-name>. <year>1955</year>.
                        <article-title>Construct validity in psychological tests</article-title>.
                        <source>Psychological Bulletin</source>
                    <volume>52</volume>(<issue>4</issue>). <fpage>281</fpage>. DOI: <pub-id
                        pub-id-type="doi">10.1037/h0040957</pub-id></mixed-citation>
            </ref>
            <ref id="B16">
                <label>16</label>
                <mixed-citation publication-type="journal"
                            ><string-name><surname>D&#261;browska</surname>,
                            <given-names>Ewa</given-names></string-name>. <year>2008</year>.
                        <article-title>The later development of an early-emerging system: The
                        curious case of the polish genitive</article-title>. DOI: <pub-id
                        pub-id-type="doi">10.1515/LING.2008.021</pub-id></mixed-citation>
            </ref>
            <ref id="B17">
                <label>17</label>
                <mixed-citation publication-type="journal"
                            ><string-name><surname>D&#261;browska</surname>,
                            <given-names>Ewa</given-names></string-name>. <year>2012</year>.
                        <article-title>Different speakers, different grammars: Individual
                        differences in native language attainment</article-title>.
                        <source>Linguistic Approaches to Bilingualism</source>
                    <volume>2</volume>(<issue>3</issue>).
                        <fpage>219</fpage>&#8211;<lpage>253</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1075/lab.2.3.01dab</pub-id></mixed-citation>
            </ref>
            <ref id="B18">
                <label>18</label>
                <mixed-citation publication-type="webpage"><string-name><surname>Drummond</surname>,
                            <given-names>Alex</given-names></string-name>. <year>2013</year>.
                        <article-title>Ibex farm</article-title>.
                        <uri>http://spellout.net/ibexfarm</uri>. Online server.</mixed-citation>
            </ref>
            <ref id="B19">
                <label>19</label>
                <mixed-citation publication-type="journal"><string-name><surname>Enochson</surname>,
                            <given-names>Kelly</given-names></string-name> &amp;
                            <string-name><given-names>Jennifer</given-names>
                        <surname>Culbertson</surname></string-name>. <year>2015</year>.
                        <article-title>Collecting psycholinguistic response time data using Amazon
                        Mechanical Turk</article-title>. <source>PloS One</source>
                    <volume>10</volume>(<issue>3</issue>). <elocation-id>e0116946</elocation-id>.
                    DOI: <pub-id pub-id-type="doi"
                    >10.1371/journal.pone.0116946</pub-id></mixed-citation>
            </ref>
            <ref id="B20">
                <label>20</label>
                <mixed-citation publication-type="journal"
                            ><string-name><surname>Featherston</surname>,
                            <given-names>Sam</given-names></string-name>. <year>2005</year>.
                        <article-title>That-trace in German</article-title>. <source>Lingua</source>
                    <volume>115</volume>(<issue>9</issue>).
                        <fpage>1277</fpage>&#8211;<lpage>1302</lpage>. DOI: <pub-id
                        pub-id-type="doi">10.1016/j.lingua.2004.04.001</pub-id></mixed-citation>
            </ref>
            <ref id="B21">
                <label>21</label>
                <mixed-citation publication-type="journal"><string-name><surname>Ford</surname>,
                            <given-names>Marily</given-names></string-name>. <year>1983</year>.
                        <article-title>A method for obtaining measures of local parsing complexity
                        throughout sentences</article-title>. <source>Journal of Verbal Learning and
                        Verbal Behavior</source>
                    <volume>22</volume>(<issue>2</issue>).
                        <fpage>203</fpage>&#8211;<lpage>218</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1016/S0022-5371(83)90156-1</pub-id></mixed-citation>
            </ref>
            <ref id="B22">
                <label>22</label>
                <mixed-citation publication-type="book"><string-name><surname>Furr</surname>,
                            <given-names>R. Michael</given-names></string-name>. <year>2017</year>.
                        <source>Psychometrics: An introduction</source>. <publisher-loc>Thousand
                        Oaks, CA</publisher-loc>: <publisher-name>Sage
                    Publications</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B23">
                <label>23</label>
                <mixed-citation publication-type="journal"><string-name><surname>Gas</surname>,
                            <given-names>Susan</given-names></string-name>. <year>1979</year>.
                        <article-title>Language transfer and universal grammatical
                        relations</article-title>. <source>Language learning</source>
                    <volume>29</volume>(<issue>2</issue>).
                        <fpage>327</fpage>&#8211;<lpage>344</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1111/j.1467-1770.1979.tb01073.x</pub-id></mixed-citation>
            </ref>
            <ref id="B24">
                <label>24</label>
                <mixed-citation publication-type="book"><string-name><surname>Goldberg</surname>,
                            <given-names>Adele E.</given-names></string-name>
                    <year>1995</year>. <source>Constructions: A construction grammar approach to
                        argument structure</source>. <publisher-name>University of Chicago
                        Press</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B25">
                <label>25</label>
                <mixed-citation publication-type="confproc"><string-name><surname>Goodall</surname>,
                            <given-names>Grant</given-names></string-name>. <year>2019</year>.
                        <article-title>Predicting the severity of island violations across
                        languages: Some first steps</article-title>. <conf-name>Talk presented at
                        the 2019 LSA Annual Meeting session: Experimental approaches to
                        cross-linguistic variation in island phenomena</conf-name>.</mixed-citation>
            </ref>
            <ref id="B26">
                <label>26</label>
                <mixed-citation publication-type="book"><string-name><surname>Goodall</surname>,
                            <given-names>Grant</given-names></string-name>. <year>2022</year>.
                        <chapter-title>D-linking, non-finiteness, and cross-linguistic variation in
                        island phenomena</chapter-title>. In <source>Theory and experiment in
                        syntax</source>, <publisher-loc>Abingdon, UK</publisher-loc>:
                        <publisher-name>Routledge</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B27">
                <label>27</label>
                <mixed-citation publication-type="journal"><string-name><surname>Gorrod</surname>,
                            <given-names>Emma J.</given-names></string-name>,
                            <string-name><given-names>Michael</given-names>
                        <surname>Bedward</surname></string-name>, <string-name><given-names>David
                            A.</given-names>
                        <surname>Keith</surname></string-name> &amp;
                            <string-name><given-names>Murray V.</given-names>
                        <surname>Ellis</surname></string-name>. <year>2013</year>.
                        <article-title>Systematic underestimation resulting from measurement error
                        in score-based ecological indices</article-title>. <source>Biological
                        conservation</source>
                    <volume>157</volume>. <fpage>266</fpage>&#8211;<lpage>276</lpage>. DOI: <pub-id
                        pub-id-type="doi">10.1016/j.biocon.2012.09.002</pub-id></mixed-citation>
            </ref>
            <ref id="B28">
                <label>28</label>
                <mixed-citation publication-type="journal"><string-name><surname>Granger</surname>,
                            <given-names>Clive W. J.</given-names></string-name>
                    <year>1969</year>. <article-title>Investigating causal relations by econometric
                        models and cross-spectral methods</article-title>. <source>Econometrica:
                        journal of the Econometric Society</source>,
                        <fpage>424</fpage>&#8211;<lpage>438</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.2307/1912791</pub-id></mixed-citation>
            </ref>
            <ref id="B29">
                <label>29</label>
                <mixed-citation publication-type="journal"><string-name><surname>Hakes</surname>,
                            <given-names>David T.</given-names></string-name>,
                            <string-name><given-names>Judith S.</given-names>
                        <surname>Evans</surname></string-name> &amp; <string-name><given-names>Linda
                            L.</given-names>
                        <surname>Brannon</surname></string-name>. <year>1976</year>.
                        <article-title>Understanding sentences with relative
                    clauses</article-title>. <source>Memory &amp; Cognition</source>
                    <volume>4</volume>(<issue>3</issue>).
                        <fpage>283</fpage>&#8211;<lpage>290</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.3758/BF03213177</pub-id></mixed-citation>
            </ref>
            <ref id="B30">
                <label>30</label>
                <mixed-citation publication-type="confproc"><string-name><surname>Han</surname>,
                            <given-names>Chung-hye</given-names></string-name>,
                            <string-name><given-names>Noureddine</given-names>
                        <surname>Elouazizi</surname></string-name>,
                            <string-name><given-names>Christina</given-names>
                        <surname>Galeano</surname></string-name>,
                            <string-name><given-names>Emrah</given-names>
                        <surname>G&#246;rg&#252;l&#252;</surname></string-name>,
                            <string-name><given-names>Nancy</given-names>
                        <surname>Hedberg</surname></string-name>,
                            <string-name><given-names>Jennifer</given-names>
                        <surname>Hinnell</surname></string-name>,
                            <string-name><given-names>Meghan</given-names>
                        <surname>Jeffrey</surname></string-name>,
                            <string-name><given-names>Kyeong-min</given-names>
                        <surname>Kim</surname></string-name> &amp;
                            <string-name><given-names>Susannah</given-names>
                        <surname>Kirby</surname></string-name>. <year>2012</year>.
                        <article-title>Processing strategies and resumptive pronouns in
                        English</article-title>. In <conf-name>Proceedings of the 30th West Coast
                        Conference on Formal Linguistics</conf-name>,
                        <fpage>153</fpage>&#8211;<lpage>161</lpage>. <conf-sponsor>Cascadilla
                        Proceedings Project Somerville</conf-sponsor>,
                    <conf-loc>MA</conf-loc>.</mixed-citation>
            </ref>
            <ref id="B31">
                <label>31</label>
                <mixed-citation publication-type="journal"><string-name><surname>Han</surname>,
                            <given-names>Chung-hye</given-names></string-name>,
                            <string-name><given-names>Jeffrey</given-names>
                        <surname>Lidz</surname></string-name> &amp;
                            <string-name><given-names>Julien</given-names>
                        <surname>Musolino</surname></string-name>. <year>2007</year>.
                        <article-title>V-raising and grammar competition in korean: Evidence from
                        negation and quantifier scope</article-title>. <source>Linguistic
                        Inquiry</source>
                    <volume>38</volume>(<issue>1</issue>). <fpage>1</fpage>&#8211;<lpage>47</lpage>.
                    DOI: <pub-id pub-id-type="doi"
                    >10.1162/ling.2007.38.1.1</pub-id></mixed-citation>
            </ref>
            <ref id="B32">
                <label>32</label>
                <mixed-citation publication-type="book"><string-name><surname>Hawkins</surname>,
                            <given-names>John A.</given-names></string-name>
                    <year>2014</year>. <source>Cross-linguistic variation and efficiency</source>.
                        <publisher-name>OUP Oxford</publisher-name>. DOI: <pub-id pub-id-type="doi"
                        >10.1093/acprof:oso/9780199664993.001.0001</pub-id></mixed-citation>
            </ref>
            <ref id="B33">
                <label>33</label>
                <mixed-citation publication-type="journal"><string-name><surname>Hoenig</surname>,
                            <given-names>John M.</given-names></string-name> &amp;
                            <string-name><given-names>Dennis M.</given-names>
                        <surname>Heisey</surname></string-name>. <year>2001</year>.
                        <article-title>The abuse of power: the pervasive fallacy of power
                        calculations for data analysis</article-title>. <source>The American
                        Statistician</source>
                    <volume>55</volume>(<issue>1</issue>).
                    <fpage>19</fpage>&#8211;<lpage>24</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1198/000313001300339897</pub-id></mixed-citation>
            </ref>
            <ref id="B34">
                <label>34</label>
                <mixed-citation publication-type="journal"><string-name><surname>Holmes</surname>,
                            <given-names>Virginia M.</given-names></string-name> &amp;
                            <string-name><given-names>J. Kevin</given-names>
                        <surname>O&#8217;Regan</surname></string-name>. <year>1981</year>.
                        <article-title>Eye fixation patterns during the reading of relative-clause
                        sentences</article-title>. <source>Journal of Verbal Learning and Verbal
                        Behavior</source>
                    <volume>20</volume>(<issue>4</issue>).
                        <fpage>417</fpage>&#8211;<lpage>430</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1016/S0022-5371(81)90533-8</pub-id></mixed-citation>
            </ref>
            <ref id="B35">
                <label>35</label>
                <mixed-citation publication-type="confproc"
                            ><string-name><surname>Kandybowicz</surname>,
                            <given-names>Jason</given-names></string-name>. <year>2006</year>.
                        <article-title>Comp-trace effects explained away</article-title>. In
                            <string-name><given-names>Donald</given-names>
                        <surname>Baumer</surname></string-name>,
                            <string-name><given-names>David</given-names>
                        <surname>Montero</surname></string-name> &amp;
                            <string-name><given-names>Michael</given-names>
                        <surname>Scanlon</surname></string-name> (eds.), <conf-name>Proceedings of
                        the 25th West Coast Conference on Formal Linguistics 220</conf-name>.
                        <fpage>228</fpage>. <conf-loc>Somerville, MA</conf-loc>:
                        <conf-sponsor>Cascadilla Proceedings
                    Project</conf-sponsor>.</mixed-citation>
            </ref>
            <ref id="B36">
                <label>36</label>
                <mixed-citation publication-type="journal"><string-name><surname>Kass</surname>,
                            <given-names>Robert E.</given-names></string-name> &amp;
                            <string-name><given-names>Adrian E.</given-names>
                        <surname>Raftery</surname></string-name>. <year>1995</year>.
                        <article-title>Bayes factors</article-title>. <source>Journal of the
                        American Statistical Association</source>
                    <volume>90</volume>(<issue>430</issue>).
                        <fpage>773</fpage>&#8211;<lpage>795</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1080/01621459.1995.10476572</pub-id></mixed-citation>
            </ref>
            <ref id="B37">
                <label>37</label>
                <mixed-citation publication-type="journal"><string-name><surname>Kayne</surname>,
                            <given-names>Richard S.</given-names></string-name>
                    <year>1981</year>. <article-title>ECP extensions</article-title>.
                        <source>Linguistic inquiry</source>
                    <volume>12</volume>(<issue>1</issue>).
                        <fpage>93</fpage>&#8211;<lpage>133</lpage>.</mixed-citation>
            </ref>
            <ref id="B38">
                <label>38</label>
                <mixed-citation publication-type="journal"><string-name><surname>Keenan</surname>,
                            <given-names>Edward L.</given-names></string-name> &amp;
                            <string-name><given-names>Bernard</given-names>
                        <surname>Comrie</surname></string-name>. <year>1977</year>.
                        <article-title>Noun phrase accessibility and universal
                        grammar</article-title>. <source>Linguistic Inquiry</source>
                    <volume>8</volume>(<issue>1</issue>).
                    <fpage>63</fpage>&#8211;<lpage>99</lpage>.</mixed-citation>
            </ref>
            <ref id="B39">
                <label>39</label>
                <mixed-citation publication-type="book"><string-name><surname>Kenstowicz</surname>,
                            <given-names>Michael</given-names></string-name>. <year>1989</year>.
                        <chapter-title>The null subject parameter in modern Arabic
                        dialects</chapter-title>. In <string-name><given-names>Osvaldo</given-names>
                        <surname>Jaeggli</surname></string-name> &amp;
                            <string-name><given-names>Kenneth J.</given-names>
                        <surname>Safi</surname></string-name> (eds.), <source>The null subject
                        parameter</source>, <fpage>263</fpage>&#8211;<lpage>275</lpage>.
                        <publisher-loc>Dordrecht</publisher-loc>:
                        <publisher-name>Kluwer</publisher-name>. DOI: <pub-id pub-id-type="doi"
                        >10.1007/978-94-009-2540-3_9</pub-id></mixed-citation>
            </ref>
            <ref id="B40">
                <label>40</label>
                <mixed-citation publication-type="journal"><string-name><surname>Kidd</surname>,
                            <given-names>Evan</given-names></string-name>,
                            <string-name><given-names>Seamus</given-names>
                        <surname>Donnelly</surname></string-name> &amp;
                            <string-name><given-names>Morten H.</given-names>
                        <surname>Christiansen</surname></string-name>. <year>2018</year>.
                        <article-title>Individual differences in language acquisition and
                        processing</article-title>. <source>Trends in Cognitive Sciences</source>
                    <volume>22</volume>(<issue>2</issue>).
                        <fpage>154</fpage>&#8211;<lpage>169</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1016/j.tics.2017.11.006</pub-id></mixed-citation>
            </ref>
            <ref id="B41">
                <label>41</label>
                <mixed-citation publication-type="journal"><string-name><surname>Kush</surname>,
                            <given-names>Dave</given-names></string-name> &amp;
                            <string-name><given-names>Anne</given-names>
                        <surname>Dahl</surname></string-name>. <year>2020</year>. <article-title>L2
                        transfer of l1 island-insensitivity: The case of norwegian</article-title>.
                        <source>Second Language Research</source>. DOI: <pub-id pub-id-type="doi"
                        >10.1177/0267658320956704</pub-id></mixed-citation>
            </ref>
            <ref id="B42">
                <label>42</label>
                <mixed-citation publication-type="journal"
                            ><string-name><surname>Kuznetsova</surname>,
                            <given-names>Alexandra</given-names></string-name>,
                            <string-name><given-names>Per B.</given-names>
                        <surname>Brockhoff</surname></string-name> &amp;
                            <string-name><given-names>Rune H. B.</given-names>
                        <surname>Christensen</surname></string-name>. <year>2017</year>.
                        <article-title>lmerTest package: Tests in linear mixed effects
                        models</article-title>. <source>Journal of Statistical Software</source>
                    <volume>82</volume>(<issue>13</issue>).
                    <fpage>1</fpage>&#8211;<lpage>26</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.18637/jss.v082.i13</pub-id></mixed-citation>
            </ref>
            <ref id="B43">
                <label>43</label>
                <mixed-citation publication-type="journal"><string-name><surname>Lasnik</surname>,
                            <given-names>Howard</given-names></string-name> &amp;
                            <string-name><given-names>Mamoru</given-names>
                        <surname>Saito</surname></string-name>. <year>1984</year>. <article-title>On
                        the nature of proper government</article-title>. <source>Linguistic
                        inquiry</source>
                    <volume>15</volume>(<issue>2</issue>).
                        <fpage>235</fpage>&#8211;<lpage>290</lpage>.</mixed-citation>
            </ref>
            <ref id="B44">
                <label>44</label>
                <mixed-citation publication-type="journal"><string-name><surname>Longo</surname>,
                            <given-names>Matthew R.</given-names></string-name>,
                            <string-name><given-names>Friederike</given-names>
                        <surname>Sch&#252;&#252;r</surname></string-name>,
                            <string-name><given-names>Marjolein P. M.</given-names>
                        <surname>Kammers</surname></string-name>,
                            <string-name><given-names>Manos</given-names>
                        <surname>Tsakiris</surname></string-name> &amp;
                            <string-name><given-names>Patrick</given-names>
                        <surname>Haggard</surname></string-name>. <year>2008</year>.
                        <article-title>What is embodiment? a psychometric approach</article-title>.
                        <source>Cognition</source>
                    <volume>107</volume>(<issue>3</issue>).
                        <fpage>978</fpage>&#8211;<lpage>998</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1016/j.cognition.2007.12.004</pub-id></mixed-citation>
            </ref>
            <ref id="B45">
                <label>45</label>
                <mixed-citation publication-type="journal"><string-name><surname>Maling</surname>,
                            <given-names>Joan</given-names></string-name> &amp;
                            <string-name><given-names>Annie</given-names>
                        <surname>Zaenen</surname></string-name>. <year>1978</year>.
                        <article-title>The nonuniversality of a surface filter</article-title>.
                        <source>Linguistic Inquiry</source>
                    <volume>9</volume>(<issue>3</issue>).
                        <fpage>475</fpage>&#8211;<lpage>497</lpage>.</mixed-citation>
            </ref>
            <ref id="B46">
                <label>46</label>
                <mixed-citation publication-type="journal"
                            ><string-name><surname>Martinovi&#263;</surname>,
                            <given-names>Martina</given-names></string-name>. <year>2013</year>.
                        <article-title>Wh-morphology and cyclicity in
                    Wolof</article-title>.</mixed-citation>
            </ref>
            <ref id="B47">
                <label>47</label>
                <mixed-citation publication-type="journal"><string-name><surname>McDaniel</surname>,
                            <given-names>Dana</given-names></string-name>,
                            <string-name><given-names>Cecile</given-names>
                        <surname>McKee</surname></string-name> &amp; <string-name><given-names>Judy
                            B.</given-names>
                        <surname>Bernstein</surname></string-name>. <year>1998</year>.
                        <article-title>How children&#8217;s relatives solve a problem for
                        minimalism</article-title>. <source>Language</source>
                    <volume>74</volume>(<issue>2</issue>).
                        <fpage>308</fpage>&#8211;<lpage>334</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1353/lan.1998.0177</pub-id></mixed-citation>
            </ref>
            <ref id="B48">
                <label>48</label>
                <mixed-citation publication-type="journal"><string-name><surname>McDaniel</surname>,
                            <given-names>Dana</given-names></string-name>,
                            <string-name><given-names>Cecile</given-names>
                        <surname>McKee</surname></string-name>,
                            <string-name><given-names>Wayne</given-names>
                        <surname>Cowart</surname></string-name> &amp;
                            <string-name><given-names>Merrill F.</given-names>
                        <surname>Garrett</surname></string-name>. <year>2015</year>.
                        <article-title>The role of the language production system in shaping
                        grammars</article-title>. <source>Language</source>
                    <volume>91</volume>(<issue>2</issue>).
                        <fpage>415</fpage>&#8211;<lpage>441</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1353/lan.2015.0021</pub-id></mixed-citation>
            </ref>
            <ref id="B49">
                <label>49</label>
                <mixed-citation publication-type="book"><string-name><surname>McElreath</surname>,
                            <given-names>Richard</given-names></string-name>. <year>2020</year>.
                        <source>Statistical rethinking: A Bayesian course with examples in R and
                        Stan</source>. <publisher-name>CRC press</publisher-name>. DOI: <pub-id
                        pub-id-type="doi">10.1201/9780429029608</pub-id></mixed-citation>
            </ref>
            <ref id="B50">
                <label>50</label>
                <mixed-citation publication-type="journal"><string-name><surname>Morey</surname>,
                            <given-names>Candice C.</given-names></string-name> &amp;
                            <string-name><given-names>Nelson</given-names>
                        <surname>Cowan</surname></string-name>. <year>2005</year>.
                        <article-title>When do visual and verbal memories conflict? the importance
                        of working-memory load and retrieval</article-title>. <source>Journal of
                        Experimental Psychology: Learning, Memory, and Cognition</source>
                    <volume>31</volume>(<issue>4</issue>).
                        <fpage>703</fpage>&#8211;<lpage>713</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1037/0278-7393.31.4.703</pub-id></mixed-citation>
            </ref>
            <ref id="B51">
                <label>51</label>
                <mixed-citation publication-type="journal"><string-name><surname>Morgan</surname>,
                            <given-names>Adam Milton</given-names></string-name> &amp;
                            <string-name><given-names>Matthew W.</given-names>
                        <surname>Wagers</surname></string-name>. <year>2018</year>.
                        <article-title>English resumptive pronouns are more common where gaps are
                        less acceptable</article-title>. <source>Linguistic Inquiry</source>
                    <volume>49</volume>(<issue>4</issue>).
                        <fpage>861</fpage>&#8211;<lpage>876</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1162/ling_a_00293</pub-id></mixed-citation>
            </ref>
            <ref id="B52">
                <label>52</label>
                <mixed-citation publication-type="book"><string-name><surname>Nicolis</surname>,
                            <given-names>Marco</given-names></string-name> &amp;
                            <string-name><given-names>Theresa</given-names>
                        <surname>Biberauer</surname></string-name>. <year>2008</year>.
                        <chapter-title>The null subject parameter and correlating properties: The
                        case of Creole languages</chapter-title>. In
                            <string-name><given-names>Theresa</given-names>
                        <surname>Biberauer</surname></string-name> (ed.), <source>The limits of
                        syntactic variation</source>
                    <volume>132</volume>. <fpage>271</fpage>&#8211;<lpage>294</lpage>.
                        <publisher-loc>Philadelphia, PA</publisher-loc>: <publisher-name>John
                        Benjamins</publisher-name>. DOI: <pub-id pub-id-type="doi"
                        >10.1075/la.132.11nic</pub-id></mixed-citation>
            </ref>
            <ref id="B53">
                <label>53</label>
                <mixed-citation publication-type="thesis"
                        ><string-name><surname>Perlmutter</surname>, <given-names>David
                            M.</given-names></string-name>
                    <year>1968</year>. <source>Deep and surface structure constraints in
                        syntax</source>. <publisher-loc>Cambridge, MA</publisher-loc>:
                        <publisher-name>MIT</publisher-name> dissertation.</mixed-citation>
            </ref>
            <ref id="B54">
                <label>54</label>
                <mixed-citation publication-type="journal"><string-name><surname>Pesetsky</surname>,
                            <given-names>David</given-names></string-name>. <year>1982</year>.
                        <article-title>Complementizer-trace phenomena and the Nominative Island
                        Condition</article-title>. <source>The Linguistic Review</source>
                    <volume>1</volume>(<issue>3</issue>).
                        <fpage>297</fpage>&#8211;<lpage>344</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1515/tlir.1982.1.3.297</pub-id></mixed-citation>
            </ref>
            <ref id="B73">
                <label>73</label>
                <mixed-citation publication-type="book"
                        ><string-name><surname>Persetsky</surname>,
                        <given-names>David</given-names></string-name>. <year>2017</year>.
                    <chapter-title>Complementizer-trace effects</chapter-title>. In <source>Blackwell Companion to Syntax, 2nd Edition</source>.
                    <publisher-name>Oxford: Wiley-Blackwell</publisher-name>. DOI:<pub-id></pub-id> 10.1002/9781118358733 </mixed-citation>

            </ref>
            <ref id="B55">
                <label>55</label>
                <mixed-citation publication-type="book"><string-name><surname>Phillips</surname>,
                            <given-names>Colin</given-names></string-name>. <year>2013</year>.
                        <chapter-title>On the nature of island constraints II: Language learning and
                        innateness</chapter-title>. In <source>Experimental syntax and island
                        effects</source>, <fpage>132</fpage>&#8211;<lpage>157</lpage>.
                        <publisher-name>Cambridge University Press</publisher-name>. DOI: <pub-id
                        pub-id-type="doi">10.1017/CBO9781139035309.007</pub-id></mixed-citation>
            </ref>
            <ref id="B56">
                <label>56</label>
                <mixed-citation publication-type="book"><collab>R Core Team</collab>.
                        <year>2013</year>. <source>R: A language and environment for statistical
                        computing</source>. <publisher-name>R Foundation for Statistical
                        Computing</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B57">
                <label>57</label>
                <mixed-citation publication-type="journal"><string-name><surname>Riggs</surname>,
                            <given-names>Douglas S.</given-names></string-name>,
                            <string-name><given-names>Joseph A.</given-names>
                        <surname>Guarnieri</surname></string-name> &amp;
                            <string-name><given-names>Sidney</given-names>
                        <surname>Addelman</surname></string-name>. <year>1978</year>.
                        <article-title>Fitting straight lines when both variables are subject to
                        error</article-title>. <source>Life sciences</source>
                    <volume>22</volume>(<issue>1315</issue>).
                        <fpage>1305</fpage>&#8211;<lpage>1360</lpage>. DOI: <pub-id
                        pub-id-type="doi">10.1016/0024-3205(78)90098-X</pub-id></mixed-citation>
            </ref>
            <ref id="B58">
                <label>58</label>
                <mixed-citation publication-type="book"><string-name><surname>Rizzi</surname>,
                            <given-names>Luigi</given-names></string-name> &amp;
                            <string-name><given-names>Ur</given-names>
                        <surname>Shlonsky</surname></string-name>. <year>2008</year>.
                        <chapter-title>Strategies of subject extraction</chapter-title>. In
                        <source>Interfaces + recursion = language?</source>
                    <fpage>115</fpage>&#8211;<lpage>160</lpage>.
                        <publisher-loc>Berlin</publisher-loc>: <publisher-name>De Gruyter
                        Mouton</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B59">
                <label>59</label>
                <mixed-citation publication-type="journal"><string-name><surname>Rosner</surname>,
                            <given-names>B.</given-names></string-name>,
                            <string-name><given-names>D.</given-names>
                        <surname>Spiegelman</surname></string-name> &amp;
                            <string-name><given-names>Walter C.</given-names>
                        <surname>Willett</surname></string-name>. <year>1992</year>.
                        <article-title>Correction of logistic regression relative risk estimates and
                        confidence intervals for random within-person measurement
                        error</article-title>. <source>American journal of epidemiology</source>
                    <volume>136</volume>(<issue>11</issue>).
                        <fpage>1400</fpage>&#8211;<lpage>1413</lpage>. DOI: <pub-id
                        pub-id-type="doi"
                    >10.1093/oxfordjournals.aje.a116453</pub-id></mixed-citation>
            </ref>
            <ref id="B60">
                <label>60</label>
                <mixed-citation publication-type="thesis"><string-name><surname>Ross</surname>,
                            <given-names>John Robert</given-names></string-name>. <year>1967</year>.
                        <source>Constraints on variables in syntax</source>.
                        <publisher-loc>Cambridge, MA</publisher-loc>:
                        <publisher-name>MIT</publisher-name> dissertation.</mixed-citation>
            </ref>
            <ref id="B61">
                <label>61</label>
                <mixed-citation publication-type="book"><string-name><surname>Rust</surname>,
                            <given-names>John</given-names></string-name> &amp;
                            <string-name><given-names>Susan</given-names>
                        <surname>Golombok</surname></string-name>. <year>2014</year>. <source>Modern
                        psychometrics: The science of psychological assessment</source>.
                        <publisher-loc>Abingdon, U.K.</publisher-loc>:
                        <publisher-name>Routledge</publisher-name>. DOI: <pub-id pub-id-type="doi"
                        >10.4324/9781315787527</pub-id></mixed-citation>
            </ref>
            <ref id="B62">
                <label>62</label>
                <mixed-citation publication-type="book"
                        ><string-name><surname>Sch&#252;tze</surname>, <given-names>Carson
                            T.</given-names></string-name> &amp;
                            <string-name><given-names>Jon</given-names>
                        <surname>Sprouse</surname></string-name>. <year>2014</year>.
                        <chapter-title>Judgment data</chapter-title>. In
                            <string-name><given-names>Robert J.</given-names>
                        <surname>Pode&#353;va</surname></string-name> &amp;
                            <string-name><given-names>Devyani</given-names>
                        <surname>Sharma</surname></string-name> (eds.), <source>Research methods in
                        linguistics</source>
                    <volume>27</volume>. <fpage>27</fpage>&#8211;<lpage>50</lpage>.
                        <publisher-name>Cambridge University Press</publisher-name>. DOI: <pub-id
                        pub-id-type="doi">10.1017/CBO9781139013734.004</pub-id></mixed-citation>
            </ref>
            <ref id="B63">
                <label>63</label>
                <mixed-citation publication-type="journal"><string-name><surname>Sobin</surname>,
                            <given-names>Nicholas</given-names></string-name>. <year>1987</year>.
                        <article-title>The variable status of Comp-trace phenomena</article-title>.
                        <source>Natural Language &amp; Linguistic Theory</source>
                    <volume>5</volume>(<issue>1</issue>). <fpage>33</fpage>&#8211;<lpage>60</lpage>.
                    DOI: <pub-id pub-id-type="doi">10.1007/BF00161867</pub-id></mixed-citation>
            </ref>
            <ref id="B64">
                <label>64</label>
                <mixed-citation publication-type="thesis"><string-name><surname>Sprouse</surname>,
                            <given-names>Jon</given-names></string-name>. <year>2007</year>.
                        <source>A program for experimental syntax: Finding the relationship between
                        acceptability and grammatical knowledge</source>. <publisher-loc>College
                        Park, MD</publisher-loc>: <publisher-name>University of
                        Maryland</publisher-name> dissertation.</mixed-citation>
            </ref>
            <ref id="B65">
                <label>65</label>
                <mixed-citation publication-type="journal"><string-name><surname>Sprouse</surname>,
                            <given-names>Jon</given-names></string-name>. <year>2009</year>.
                        <article-title>Revisiting satiation: Evidence for an equalization response
                        strategy</article-title>. <source>Linguistic Inquiry</source>
                    <volume>40</volume>(<issue>2</issue>).
                        <fpage>329</fpage>&#8211;<lpage>341</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1162/ling.2009.40.2.329</pub-id></mixed-citation>
            </ref>
            <ref id="B66">
                <label>66</label>
                <mixed-citation publication-type="journal"><string-name><surname>Sprouse</surname>,
                            <given-names>Jon</given-names></string-name>,
                            <string-name><given-names>Ivano</given-names>
                        <surname>Caponigro</surname></string-name>,
                            <string-name><given-names>Ciro</given-names>
                        <surname>Greco</surname></string-name> &amp;
                            <string-name><given-names>Carlo</given-names>
                        <surname>Cecchetto</surname></string-name>. <year>2016</year>.
                        <article-title>Experimental syntax and the variation of island effects in
                        english and italian</article-title>. <source>Natural Language &amp;
                        Linguistic Theory</source>
                    <volume>34</volume>(<issue>1</issue>).
                        <fpage>307</fpage>&#8211;<lpage>344</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1007/s11049-015-9286-8</pub-id></mixed-citation>
            </ref>
            <ref id="B67">
                <label>67</label>
                <mixed-citation publication-type="journal"><string-name><surname>Sprouse</surname>,
                            <given-names>Jon</given-names></string-name>,
                            <string-name><given-names>Matt</given-names>
                        <surname>Wagers</surname></string-name> &amp;
                            <string-name><given-names>Colin</given-names>
                        <surname>Phillips</surname></string-name>. <year>2012</year>.
                        <article-title>A test of the relation between working-memory capacity and
                        syntactic island effects</article-title>. <source>Language</source>
                    <volume>88</volume>(<issue>1</issue>).
                        <fpage>82</fpage>&#8211;<lpage>123</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1353/lan.2012.0004</pub-id></mixed-citation>
            </ref>
            <ref id="B68">
                <label>68</label>
                <mixed-citation publication-type="book"><string-name><surname>Su&#241;er</surname>,
                            <given-names>Margarita</given-names></string-name>. <year>1991</year>.
                        <chapter-title>Indirect questions and the structure of CP: Some
                        consequences</chapter-title>. In
                            <string-name><given-names>Hector</given-names>
                        <surname>Campos</surname></string-name> &amp;
                            <string-name><given-names>Fernando</given-names>
                        <surname>Martfnez-Gil</surname></string-name> (eds.), <source>Current
                        studies in spanish linguistics</source>,
                        <fpage>283</fpage>&#8211;<lpage>312</lpage>. <publisher-loc>Washington,
                        D.C.</publisher-loc>: <publisher-name>Georgetown University
                        Press</publisher-name>.</mixed-citation>
            </ref>
            <ref id="B69">
                <label>69</label>
                <mixed-citation publication-type="journal"><string-name><surname>Swets</surname>,
                            <given-names>Benjamin</given-names></string-name>,
                            <string-name><given-names>Timothy</given-names>
                        <surname>Desmet</surname></string-name>, <string-name><given-names>David
                            Z.</given-names>
                        <surname>Hambrick</surname></string-name> &amp;
                            <string-name><given-names>Fernanda</given-names>
                        <surname>Ferreira</surname></string-name>. <year>2007</year>.
                        <article-title>The role of working memory in syntactic ambiguity resolution:
                        A psychometric approach</article-title>. <source>Journal of Experimental
                        Psychology: General</source>
                    <volume>136</volume>(<issue>1</issue>).
                        <fpage>64</fpage>&#8211;<lpage>81</lpage>. DOI: <pub-id pub-id-type="doi"
                        >10.1037/0096-3445.136.1.64</pub-id></mixed-citation>
            </ref>
            <ref id="B70">
                <label>70</label>
                <mixed-citation publication-type="journal"><string-name><surname>Torrego</surname>,
                            <given-names>Esther</given-names></string-name>. <year>1984</year>.
                        <article-title>On inversion in spanish and some of its
                        effects</article-title>. <source>Linguistic inquiry</source>
                    <volume>15</volume>(<issue>1</issue>).
                        <fpage>103</fpage>&#8211;<lpage>129</lpage>.</mixed-citation>
            </ref>
            <ref id="B71">
                <label>71</label>
                <mixed-citation publication-type="book"><string-name><surname>van
                        Gelderen</surname>, <given-names>Elly</given-names></string-name>.
                        <year>2013</year>. <source>Clause structure</source>.
                        <publisher-name>Cambridge University Press</publisher-name>. DOI: <pub-id
                        pub-id-type="doi">10.1017/CBO9781139084628</pub-id></mixed-citation>
            </ref>
            <ref id="B72">
                <label>72</label>
                <mixed-citation publication-type="book"><string-name><surname>Wiener</surname>,
                            <given-names>Norbert</given-names></string-name>. <year>1956</year>.
                        <chapter-title>The theory of prediction</chapter-title>. In <source>Modern
                        mathematics for engineers</source>. <publisher-loc>New York</publisher-loc>:
                        <publisher-name>McGraw-Hill</publisher-name>.</mixed-citation>
            </ref>
        </ref-list>
    </back>
</article>
