<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.2" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id journal-id-type="issn">2767-0279</journal-id>
<journal-title-group>
<journal-title>Glossa Psycholinguistics</journal-title>
</journal-title-group>
<issn pub-type="epub">2767-0279</issn>
<publisher>
<publisher-name>eScholarship Publishing</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5070/G6011185</article-id>
<article-categories>
<subj-group>
<subject>Regular article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Biased inferences about gender from names</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Gardner</surname>
<given-names>Bethany</given-names>
</name>
<email>bethany.gardner@vanderbilt.edu</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Brown-Schmidt</surname>
<given-names>Sarah</given-names>
</name>
<email>sarah.brown-schmidt@vanderbilt.edu</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
</contrib-group>
<aff id="aff-1"><label>1</label>Department of Psychology and Human Development, Vanderbilt University, US</aff>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2024-02-07">
<day>07</day>
<month>02</month>
<year>2024</year>
</pub-date>
<pub-date pub-type="collection">
<year>2024</year>
</pub-date>
<volume>3</volume>
<issue>1</issue>
<elocation-id>2</elocation-id>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2024 The Author(s)</copyright-statement>
<copyright-year>2024</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="https://glossapsycholinguistics.journalpub.escholarship.org/articles/10.5070/G6011185/"/>
<abstract>
<p>How do alternative forms of reference to individuals &#8211; first, last, and full names &#8211; guide inferences about the gender of the referent? Given distributional correspondences between English first names and gender, first names provide probabilistic information about an individual&#8217;s gender. While English last names do not vary with gender, men are more likely to be referred to by last name alone. Across four experiments, we demonstrate that inferences about gender are shaped by a persistent bias to infer that people are male, along with probabilistic information carried by the first name. When an individual was introduced by last name alone, participants overwhelmingly used <italic>he</italic> to subsequently refer to the person, suggesting that participants inferred that the person was male. This bias was still present when the individual was introduced using a first or full name, with participants less likely to use <italic>she</italic> than the distributional characteristics of the first names would predict. When explicitly asked to recall the gender of an individual who was introduced by last name alone, participants preferentially responded that the person was male. This bias persisted even when the person was introduced using a first or full name. Repeated reference attenuated, but did not eliminate, this bias. We discuss implications for models of how world knowledge is linked to language use.</p>
</abstract>
</article-meta>
</front>
<body>
<sec>
<title>1. Introduction</title>
<p>When we talk about people, the way we talk about them can shape or support beliefs and inferences about the person. For example, if I state that &#8220;Jane ordered pizza,&#8221; this asserts some new information about Jane (that Jane ordered pizza), but also supports some inferences about Jane (that Jane is the sort of person that likes pizza). In addition, given that the name <italic>Jane</italic> is probabilistically associated with individuals who are female, the use of this name may lead to the inference that Jane is female. How might this inference about gender change if we introduced Jane using a full name, <italic>Jane Smith</italic>, or just a last name, <italic>Smith</italic>?</p>
<p>Making inferences about gender is rapid and automatic whenever a new person is introduced into a conversation or a story (<xref ref-type="bibr" rid="B18">Duffy &amp; Keir, 2004</xref>; <xref ref-type="bibr" rid="B22">Garnham et al., 2002</xref>; <xref ref-type="bibr" rid="B30">Kennison &amp; Trofe, 2003</xref>; <xref ref-type="bibr" rid="B41">Reynolds et al., 2006</xref>), even when gender is not relevant to the current context (<xref ref-type="bibr" rid="B10">Carreiras et al., 1996</xref>). Specifically, when a character is introduced using a gender-stereotyped role noun, then referred to later with a pronoun, readers are slower when the pronoun is incongruent with the gender stereotype (e.g., <italic>engineer&#8230;she, nurse&#8230;he</italic>) than when it is congruent (e.g., <italic>engineer&#8230;he</italic>, nurse&#8230;<italic>she</italic>). This suggests that readers infer the character&#8217;s gender based on gender cues associated with the role noun, then have to revise this inference based on new information from the pronoun, which incurs a processing cost (<xref ref-type="bibr" rid="B38">Oakhill et al., 2005</xref>; <xref ref-type="bibr" rid="B39">Osterhout et al., 1997</xref>; <xref ref-type="bibr" rid="B40">Pyykk&#246;nen et al., 2010</xref>; <xref ref-type="bibr" rid="B51">Sturt, 2003</xref>).</p>
<p>When the gender of a referent is unspecified, the person is often assumed to be male (<xref ref-type="bibr" rid="B23">Gastil, 1990</xref>; <xref ref-type="bibr" rid="B27">Hamilton, 1991</xref>; <xref ref-type="bibr" rid="B36">Moulton et al., 1978</xref>; <xref ref-type="bibr" rid="B49">Silveira, 1980</xref>). When reading a story about a character with a gender-neutral name who was never referred to with third-person pronouns or other gendered terms, about 75% of participants labelled the character as male (<xref ref-type="bibr" rid="B12">Davis Merritt &amp; Kok, 1995</xref>; <xref ref-type="bibr" rid="B13">Davis Merritt &amp; Wells Harrison, 2006</xref>). Despite knowing that around half of people are women, it has been argued that comprehenders tend to have a <italic>people=male</italic> bias, where the default person is male, and men belong to the unmarked category (<xref ref-type="bibr" rid="B49">Silveira, 1980</xref>).</p>
<p>Similar biases are evident in language production. When participants read short stories that used role nouns (e.g., <italic>After the shop on High Street closed for the night, a baker stayed to tidy up. Before the baker took out the trash&#8230;</italic>), then wrote continuations of the story, they were less likely to use <italic>she</italic> to refer to the character than the distributional statistics about the role nouns would predict (<xref ref-type="bibr" rid="B8">Boyce et al., 2019</xref>). While participants in a separate norming study estimated that 49% of bakers are women, participants in the sentence completion study used <italic>she</italic> to refer to the baker only around 30% of the time. A third experiment that probed memory for these characters found that participants were less likely to recall characters as female than would be expected given the norming study&#8217;s estimates of the gender distributions.</p>
<p>In a related study during the 2016 US presidential election cycle (<xref ref-type="bibr" rid="B58">von der Malsburg et al., 2020</xref>), one group of participants estimated the likelihoods of Clinton, Trump, and Sanders winning; a second group completed a sentence about the next US president (<italic>The next US president will be sworn into office in January 2017. After moving into the Oval Office, one of the first things that&#8230;</italic>); and a third group completed a self-paced reading task where <italic>she, he</italic>, or <italic>they</italic> referred to the next president. Data were collected at multiple intervals during the election cycle. While participants throughout the election cycle estimated a 50&#8211;60% chance Clinton would win, participants in the sentence completion task used <italic>she</italic> to refer to the next president only around 10% of the time and <italic>they</italic> around 50% of the time. Participants also showed significant delays when reading sentences that used <italic>she</italic> as compared to <italic>he</italic> and <italic>they</italic> when referring to the next US president. An auxiliary experiment found no general reading time penalty for <italic>she</italic> vs. <italic>he</italic>, indicating that these results were driven by difficulty in interpreting <italic>she</italic> when it co-referred with <italic>the president</italic>, as opposed to <italic>she</italic> being intrinsically slower to process. Similarly, when participants were asked to write about a generic person (e.g., <italic>Before a pedestrian crosses the street&#8230;</italic>) and then describe the person they imagined, participants imagined men two times as often as women, but were two and a half times as likely to use masculine names to refer to the characters (<xref ref-type="bibr" rid="B26">Hamilton, 1988</xref>). In all three studies, participants used masculine language forms (<italic>he</italic>, strongly masculine names) at higher rates than they inferred the referent to be male. These findings point to a bias in favor of producing masculine language forms, above and beyond a bias to infer gender-unspecified people as male.</p>
<p>Separate evidence suggests that people&#8217;s estimates of how gender is distributed within different contexts generally reflects real world distributions. Participant estimates of the gender ratios in different occupations were strongly correlated with UK government data (<xref ref-type="bibr" rid="B21">Garnham et al., 2015</xref>; <xref ref-type="bibr" rid="B35">Misersky et al., 2014</xref>). In the cases where the estimated and actual gender ratios diverged the most, it was in the direction of overestimating the proportion of men. This suggests that it is unlikely that the observed differences between beliefs about and language for role nouns (<xref ref-type="bibr" rid="B8">Boyce et al., 2019</xref>; <xref ref-type="bibr" rid="B58">von der Malsburg et al., 2020</xref>) are primarily driven by beliefs about role nouns that overestimate the prevalence of women, instead of by a bias towards masculine language forms.</p>
<p>The studies discussed so far have investigated gender inferences from role nouns, which carry probabilistic information about gender through corresponding knowledge of gender distributions in the world (e.g., what proportion of presidents are women). Personal names can also carry probabilistic information about a person&#8217;s gender. In English, for example, most first names have strong gender associations. Androgynous first names are relatively infrequent in the US, and specific names rarely maintain an androgynous gender association over time (<xref ref-type="bibr" rid="B32">Lieberson et al., 2000</xref>). While English last names do not mark gender per se, men are more likely than women to be referred to by last name, particularly in professional contexts (<xref ref-type="bibr" rid="B5">Atir &amp; Ferguson, 2018</xref>; <xref ref-type="bibr" rid="B19">Files et al., 2017</xref>; <xref ref-type="bibr" rid="B45">Rubin, 1981</xref>; <xref ref-type="bibr" rid="B50">Stewart et al., 2003</xref>; <xref ref-type="bibr" rid="B52">Takiff et al., 2001</xref>; <xref ref-type="bibr" rid="B56">Uscinski &amp; Goren, 2011</xref>). Speakers have a range of choices when referring to a person, including pronouns, first names, last names, role nouns, titles, and honorifics. These referential alternatives provide different types of probabilistic cues to the person&#8217;s gender that may guide the inferences about gender that a comprehender makes.</p>
<p>In addition to shaping inferences about gender, these referential choices also impact evaluations of the person. For example, when scientists were referred to by last names as opposed to gender-neutral full names, they were subsequently judged as more eminent, famous, and deserving of awards (<xref ref-type="bibr" rid="B5">Atir &amp; Ferguson, 2018</xref>). When students evaluated a transcript of a class introduction, professors who were referred to by title were afforded higher status than those referred to by first name (<xref ref-type="bibr" rid="B50">Stewart et al., 2003</xref>; <xref ref-type="bibr" rid="B52">Takiff et al., 2001</xref>). However, when female professors were referred to by title, they were perceived as less accessible; this double bind between respect and accessibility was not found for male professors (<xref ref-type="bibr" rid="B52">Takiff et al., 2001</xref>). One explanation for why referring to people by last name or title affords them higher status is that it makes them, overall, seem more masculine.</p>
<p>The aim of the present research is to examine how alternative ways of referring to people affect inferences about the person&#8217;s gender. Focusing on the use of a person&#8217;s name in English, we leverage the fact that first names (e.g., <italic>Jordan, Mary, Brian</italic>) are probabilistically associated with different genders. The <italic>use</italic> of last names is also probabilistically associated with gender, but indirectly, by virtue of the fact that men are more likely to be referred to by last name than women, particularly in professional settings (<xref ref-type="bibr" rid="B5">Atir &amp; Ferguson, 2018</xref>; <xref ref-type="bibr" rid="B19">Files et al., 2017</xref>; <xref ref-type="bibr" rid="B45">Rubin, 1981</xref>; <xref ref-type="bibr" rid="B50">Stewart et al., 2003</xref>; <xref ref-type="bibr" rid="B52">Takiff et al., 2001</xref>; <xref ref-type="bibr" rid="B56">Uscinski &amp; Goren, 2011</xref>). More generally, there is a people=male bias, where people are assumed to be male by default (<xref ref-type="bibr" rid="B23">Gastil, 1990</xref>; <xref ref-type="bibr" rid="B27">Hamilton, 1991</xref>; <xref ref-type="bibr" rid="B36">Moulton et al., 1978</xref>; <xref ref-type="bibr" rid="B49">Silveira, 1980</xref>). Here, we contrast two hypotheses about how these three different sources of information shape inferences about gender.</p>
<p>One hypothesis is that the people=male bias (<xref ref-type="bibr" rid="B23">Gastil, 1990</xref>; <xref ref-type="bibr" rid="B27">Hamilton, 1991</xref>; <xref ref-type="bibr" rid="B36">Moulton et al., 1978</xref>; <xref ref-type="bibr" rid="B49">Silveira, 1980</xref>) is present only when the referential form itself does not provide direct, probabilistic information about gender. If so, a person introduced by last name would be much more likely to be assumed to be male than female, due to both the people=male bias and the fact that that people referred to by last name tend to be male. In contrast, when a person is introduced with a first name, probabilistic information carried by the gender distribution of that name would guide gender inferences, instead of the people=male assumption. While this pattern would differ from findings for role nouns (<xref ref-type="bibr" rid="B8">Boyce et al., 2019</xref>; <xref ref-type="bibr" rid="B58">von der Malsburg et al., 2020</xref>), such a pattern of findings may be expected, given that gender associations for English first names cluster at the endpoints (<xref ref-type="bibr" rid="B32">Lieberson et al., 2000</xref>) more than gender associations for job-related role nouns (<xref ref-type="bibr" rid="B21">Garnham et al., 2015</xref>; <xref ref-type="bibr" rid="B35">Misersky et al., 2014</xref>). If, on average, first names carry stronger gender cues than role nouns, people may form inferences based primarily on names, without defaulting to the &#8220;people=male&#8221; assumption.</p>
<p>Alternatively, if the people=male bias persists in the face of probabilistic information about gender, inferences about gender should result from a combination of these cues. On this hypothesis, we would not expect introducing people with a first name to eliminate the <italic>people=male</italic> bias. Instead, a person would be less likely to be inferred to be female than predicted by the gender distribution of the first names. A series of four experiments tests these competing hypotheses in a paradigm where participants were introduced to characters using first, last, or full names. We consider two measures of gender inferences about the characters: use of gendered third-person pronouns to refer to the characters (Experiments 1 and 3) and explicit questions about the gender of the characters (Experiments 2 and 4).</p>
</sec>
<sec>
<title>2. Experiment 1</title>
<p>The aim of Experiment 1 was to examine the relationship between how a character in a sentence is introduced &#8211; by their first, last, or full name &#8211; and inferences about that character&#8217;s gender. Participants read sentences that introduced a character by name (e.g., Jordan, Smith, or Jordan Smith) and continued with fragments that invited continuation with a pronoun referring to the character. Participants wrote completions for the fragments, and we used the gender information that was (or was not) carried on the pronoun as a measure of the participants&#8217; inferences about that character&#8217;s gender.</p>
<sec sec-type="methods">
<title>2.1 Methods</title>
<sec>
<title>2.1.1 Participants</title>
<p>457 participants were included in the dataset, with each participant assigned to 1 of 3 between-participants conditions (First = 152, Full = 153, Last = 152). The sample size was selected a priori, based on Boyce et al. (<xref ref-type="bibr" rid="B8">2019</xref>). Participants were recruited on Amazon Mechanical Turk and required to be over the age of 18, located in the US, and have started learning English before the age of 5; they were paid $1.50 for a task that took approximately 10 minutes. A total of 570 participant responses were collected, and exclusion rationales and participant demographics are reported in Supplement &#167;2.1.<xref ref-type="fn" rid="n1">1</xref> Critically, participants who guessed that the study was about gender bias were excluded.</p>
</sec>
<sec>
<title>2.1.2 Norming study</title>
<p>In order to select a set of first names that range from feminine to androgynous to masculine, we first conducted a norming study on a set of 90 names. 30 masculine and 30 feminine names were selected from lists of the most common names for assigned male at birth (AMAB) and assigned female at birth (AFAB)<xref ref-type="fn" rid="n2">2</xref> babies in the US (<xref ref-type="bibr" rid="B53">USSSA, 2019</xref>). An additional 30 androgynous names were selected from a list of names that were given at least one third of the time to AFAB children in the US and also at least one third of the time to AMAB children (<xref ref-type="bibr" rid="B20">Flowers, 2015</xref>). 50 participants on Amazon Mechanical Turk, following the same inclusion criteria as Experiment 1, were asked to rate the 90 names on a scale of 1&#8211;7, with 1 being &#8220;definitely masculine&#8221; and 7 being &#8220;definitely feminine.&#8221; From these results, we selected 21 names to represent a range of ratings from masculine to feminine, with different levels of androgyny in between. The 21 names were not perfectly centered (<italic>M =</italic> 4.19), partially due to the fact that androgynous names that lean masculine are much more frequent than androgynous names that lean feminine (<xref ref-type="bibr" rid="B32">Lieberson et al., 2000</xref>). The norming data were compared to US census data from 1930&#8211;2015 (<xref ref-type="bibr" rid="B54">USSSA, 2020</xref>). The proportion given to AFAB children in the census data and gender rating from the norming data showed a very strong positive correlation, <italic>r</italic>(19) = .92, p &lt; .001, and differences between the measures did not consistently over- or underestimate the femininity of the names (Supplement &#167;1).</p>
</sec>
<sec>
<title>2.1.3 Materials and procedure</title>
<p>We created 21 prompts that introduced a human character by name and ended with a fragment that was easiest to continue with a pronoun referring to the character. The prompts did not include gendered pronouns, other names, or additional human characters. 3 between-participant conditions manipulated the form of name used to refer to the character: <italic>First Name</italic> (e.g., <italic>Jordan woke up early to walk the dog. After making coffee</italic>), <italic>Last Name</italic> (e.g., <italic>Smith woke up&#8230;</italic>), and <italic>Full Name</italic> (e.g., <italic>Jordan Smith woke up&#8230;</italic>). The participants&#8217; task was to read a sentence using one of these names and then write a completion to the continuing fragment. We measured which pronouns, if any, were used to refer to the character as a measure of the participant&#8217;s inference about the character&#8217;s gender.</p>
<p>The combinations of names and prompts were counterbalanced by creating 3 lists for each condition; each participant was randomly assigned to 1 of the resulting 9 lists. Each list for the First Name condition included the 21 first names selected from the norming study and all 21 prompts, and the 3 lists counterbalanced which names went with which prompts. Each list for the Last Name condition included 21 last names and all 21 prompts, and again the 3 lists counterbalanced which names went with which prompts. The last names were selected from a list of the most common surnames in the US (<xref ref-type="bibr" rid="B55">US Census Bureau, 2016</xref>), avoiding last names that are also commonly used as first names (Supplement &#167;1). Each of the 3 lists for the Full Name condition included 21 full names and all 21 prompts, and the 3 lists counterbalanced which names went with which prompts, as well as the combinations of first and last names. Each Full Name list had a different combination of first and last names; however, due to experimenter error, there was 1 combination that appeared in 2 lists (this item was included in the analysis) and 1 first name missing from 1 list in the Full Name condition. This resulted in 104 name combinations. In addition to the critical stimuli, each participant saw 8 filler items using the names of 26 US presidential 2020 candidates in May 2019. These fillers (8&#8211;9 per list) served two purposes: first, they were used as a distraction from the focus of the study. Second, they were used to pilot items for an unrelated study about forms of reference in political language. After completing the production task, participants were asked for basic demographic information: gender, age, race/ethnicity, and education level. The participant gender question was written as an open-ended response, following best practices for trans-inclusive study design (<xref ref-type="bibr" rid="B9">Cameron &amp; Stinson, 2019</xref>; <xref ref-type="bibr" rid="B37">NASEM, 2022</xref>; <xref ref-type="bibr" rid="B57">Vincent, 2018</xref>).</p>
</sec>
</sec>
<sec>
<title>2.2 Predictions</title>
<p>If bias in gender inferences is limited to cases when no other direct information about gender is provided, we would expect people to use probabilistic gender information to infer gender when this information is available (First and Full Name conditions), resulting in rates of <italic>she</italic> responses that match the gender distributions of the first names. A bias to infer characters as male would only appear when a direct cue to gender is not provided, resulting in a bias towards <italic>he</italic> responses only in the Last Name condition.</p>
<p>Alternatively, if the bias to assume people are male occurs even when probabilistic cues to gender are available, we would expect characters to be more likely to be inferred as male than female in all three conditions, with combined effects of the first name&#8217;s gender associations and a bias in gender inferences in the First and Full Name conditions. If this is the case, while <italic>she</italic> responses will increase as the first names become more feminine, the rate of <italic>she</italic> responses will be lower than predicted by the gender associations of the first names.</p>
<p>A secondary question was whether introducing a character with the full name (e.g., Jordan Smith) rather than the first name only (e.g., Jordan) would attenuate the influence of the gender information carried by the first name. This question was motivated by the observation that, in English, it is more common to refer to men than women by their last names (<xref ref-type="bibr" rid="B19">Files et al., 2017</xref>; <xref ref-type="bibr" rid="B45">Rubin, 1981</xref>; <xref ref-type="bibr" rid="B50">Stewart et al., 2003</xref>; <xref ref-type="bibr" rid="B52">Takiff et al., 2001</xref>; <xref ref-type="bibr" rid="B56">Uscinski &amp; Goren, 2011</xref>), and thus the full name may act as a cue to masculinity.</p>
</sec>
<sec>
<title>2.3 Results</title>
<p>Responses that used <italic>he/him/his</italic> pronouns to refer to the named character were categorized together and will be referred to as <italic>he</italic> responses. Likewise, responses that used <italic>she/her/hers</italic> pronouns to refer to the named character are categorized as <italic>she</italic> responses. Responses that did not use a gendered pronoun were coded as <italic>other</italic>; these responses most commonly repeated the character&#8217;s name (e.g., <italic>After making coffee&#8230;Jordan sat down</italic>), but also included responses with no grammatical subject (e.g., <italic>&#8230;sat down</italic>) and responses not referring to the character (e.g., <italic>&#8230;it started raining</italic>). Uses of singular <italic>they</italic> were infrequent (Supplement &#167;2.3). For the First Name and Full Name conditions, the rates of <italic>he</italic> and <italic>she</italic> responses were roughly equal, following the balanced distribution of first names in our stimuli (<xref ref-type="table" rid="T1">Table 1</xref>). In the Last Name condition, responses overwhelmingly biased towards <italic>he</italic>. Notably, in the Last Name condition, <italic>she</italic> responses were slightly less common than other responses that did not gender the character.</p>
<table-wrap id="T1">
<caption>
<p><bold>Table 1:</bold> Experiment 1: Number of <italic>she, he</italic>, and <italic>other</italic> responses and the ratio of <italic>she</italic> responses to <italic>he</italic> and <italic>other</italic> responses for each condition.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experiment 1: Number of Responses by Condition</bold></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold><italic>She</italic></bold></td>
<td align="left" valign="top"><bold><italic>He</italic></bold></td>
<td align="left" valign="top"><bold><italic>Other</italic></bold></td>
<td align="left" valign="top"><bold>Ratio of <italic>She</italic> &#124;<italic>He</italic> + <italic>Other</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>First</bold></td>
<td align="left" valign="top">1395</td>
<td align="left" valign="top">1572</td>
<td align="left" valign="top">225</td>
<td align="left" valign="top">0.776</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Full</bold></td>
<td align="left" valign="top">1535</td>
<td align="left" valign="top">1514</td>
<td align="left" valign="top">131</td>
<td align="left" valign="top">0.933</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Last</bold></td>
<td align="left" valign="top">251</td>
<td align="left" valign="top">2616</td>
<td align="left" valign="top">325</td>
<td align="left" valign="top">0.085</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Responses were analyzed using logistic mixed-effect regression models with <italic>lme4</italic> in R (<xref ref-type="bibr" rid="B6">Bates et al., 2015</xref>), predicting the log odds of <italic>she</italic> responses (coded as 1) as opposed to <italic>he</italic> and <italic>other</italic> responses (coded as 0). <italic>Other</italic> responses were coded as 0 because they were not frequent enough to be placed in a third category. Participant and item were included as random intercepts, with items defined as the unique first, last, and first + last name combinations. Treating the names as the random items meant that the condition manipulations were fully between-participant and between-item, so fitting a random slope model was not possible. The fixed effect of Condition was coded with orthogonal Helmert contrasts, with the first contrast comparing the Last Name condition to the First and Full Name conditions, and the second contrast comparing the First Name condition to the Full Name condition. All models are reported with Bonferroni corrections for multiple comparisons. Overall, participants were less likely to respond <italic>she</italic> than <italic>he</italic> and <italic>other</italic> (<italic>&#946;</italic> = &#8211;1.43, <italic>z</italic> = &#8211;4.65, p &lt; .001). Participants in the First and Full Name conditions were more likely to produce <italic>she</italic> than participants in the Last Name condition (<italic>&#946;</italic> = 2.82, <italic>z</italic> = 4.03, p &lt; .001). The comparison between First and Full Name conditions was not significant (<xref ref-type="table" rid="T2">Table 2</xref>).</p>
<p>The second model included each first name&#8217;s normed Gender Rating as a covariate (<xref ref-type="table" rid="T3">Table 3</xref>). This analysis included the First and Full Name conditions only, as the Last Name condition did not contain first names. Condition was coded with mean-centered contrasts, comparing the First and Full Name conditions. The Gender Rating for each first name was mean centered, with positive numbers more feminine and negative numbers more masculine.</p>
<table-wrap id="T2">
<caption>
<p><bold>Table 2:</bold> Experiment 1: Model results for the effect of Condition on the likelihood of <italic>she</italic> responses (= 1) as opposed to <italic>he</italic> and <italic>other</italic> responses (= 0).</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experiment 1: Condition</bold></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top" colspan="4"><bold><italic>Refer to using</italic></bold> she</td>
</tr>
<tr>
<td align="left" valign="top"><bold><italic>Predictors</italic></bold></td>
<td align="left" valign="top"><bold><italic>Log-Odds</italic></bold></td>
<td align="left" valign="top"><bold><italic>SE</italic></bold></td>
<td align="left" valign="top"><bold><italic>z</italic></bold></td>
<td align="left" valign="top"><bold><italic>p</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>(Intercept)</bold></td>
<td align="left" valign="top">&#8211;1.428</td>
<td align="left" valign="top">0.308</td>
<td align="left" valign="top">&#8211;4.644</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Condition: Last</bold> (&#8211;.66) <bold>vs. First</bold> (+.33) <bold>+ Full</bold> (+.33)</td>
<td align="left" valign="top">2.824</td>
<td align="left" valign="top">0.702</td>
<td align="left" valign="top">4.026</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition: First (&#8211;.5) vs. Full (+.5)</td>
<td align="left" valign="top">0.620</td>
<td align="left" valign="top">0.700</td>
<td align="left" valign="top">0.886</td>
<td align="left" valign="top">0.376</td>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><italic>Random Effects</italic></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Participant</sub></td>
<td align="left" valign="top">1.029</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Item</sub></td>
<td align="left" valign="top">7.234</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Participant</sub></td>
<td align="left" valign="top">457</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Item</sub></td>
<td align="left" valign="top">104</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">Observations</td>
<td align="left" valign="top">9564</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><sup>&#8224;</sup>Bonferroni corrected &#945; = .0167.</p></fn>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="fig" rid="F1">Figure 1</xref> shows the proportions of <italic>he, she</italic>, and <italic>other</italic> responses for the First and Full Name conditions by the Gender Rating of the first name. As the rating of the name became more feminine, <italic>she</italic> responses increased (<italic>&#946;</italic> = 1.59, <italic>z</italic> = 21.97, p &lt; .001). However, inspection of the data shows that the increase in <italic>she</italic> responses was not symmetrical to the increase in <italic>he</italic> responses. In the mostly-feminine range of first names (1 to 2 on the X-axis), <italic>he</italic> responses outnumbered <italic>other</italic> responses. In the mostly-masculine range (&#8211;3 to &#8211;2), <italic>she</italic> responses occurred at similar rates as <italic>other</italic> responses. Particularly in the First Name condition, <italic>she</italic> responses did not surpass <italic>he</italic> responses until the first name in the prompt was biased somewhat feminine, rather than at the midpoint on the scale. With mean-centered fixed effects, the significant intercept term (<italic>&#946;</italic> = &#8211;0.51, <italic>z</italic> = &#8211;4.28, p &lt; .001) reflects overall fewer <italic>she</italic> than <italic>he</italic> and <italic>other</italic> responses, and the effect of Condition (<italic>&#946;</italic> = 0.53, <italic>z</italic> = 2.22, p &lt; .05) reflects more <italic>she</italic> responses in the Full Name than the First Name condition. The interaction between Condition and Gender Rating was not significant, indicating that the effect of Gender Rating was of a similar magnitude in the First and Full Name conditions.<xref ref-type="fn" rid="n3">3</xref></p>
<fig id="F1">
<caption>
<p><bold>Figure 1:</bold> Experiment 1: Proportions of <italic>he</italic> (blue), <italic>she</italic> (red), and <italic>other</italic> (gray) responses in the First and Full Name conditions by the mean-centered gender rating of the first name. Points indicate means for each of the 21 first names, and solid lines indicate a smooth function on the raw data. On the x-axis, 0 is the mean of the 21 names, and the dashed line indicates the center of the original response scale in the norming study.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-185-g1.png"/>
</fig>
<table-wrap id="T3">
<caption>
<p><bold>Table 3:</bold> Experiment 1: Model results for the effects of Condition and Gender Rating on the likelihood of <italic>she</italic> responses (= 1) as opposed to <italic>he</italic> and <italic>other</italic> responses (= 0) in the First and Full Name conditions.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experiment 1: Condition and Gender Rating</bold></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top" colspan="4"><bold><italic>Refer to using</italic> she</bold></td>
</tr>
<tr>
<td align="left" valign="top"><bold><italic>Predictors</italic></bold></td>
<td align="left" valign="top"><bold><italic>Log-Odds</italic></bold></td>
<td align="left" valign="top"><bold><italic>SE</italic></bold></td>
<td align="left" valign="top"><bold><italic>z</italic></bold></td>
<td align="left" valign="top"><bold><italic>p</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>(Intercept)</bold></td>
<td align="left" valign="top">&#8211;0.513</td>
<td align="left" valign="top">0.120</td>
<td align="left" valign="top">&#8211;4.282</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition (First = &#8211;.5, Full = +.5)</td>
<td align="left" valign="top">0.532</td>
<td align="left" valign="top">0.240</td>
<td align="left" valign="top">2.218</td>
<td align="left" valign="top">0.027<italic><sup>&#8224;</sup></italic></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Gender Rating</bold> (Centered, Masc &#8211;, Fem +)</td>
<td align="left" valign="top">1.593</td>
<td align="left" valign="top">0.073</td>
<td align="left" valign="top">21.967</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition &#215; Gender Rating</td>
<td align="left" valign="top">&#8211;0.175</td>
<td align="left" valign="top">0.139</td>
<td align="left" valign="top">&#8211;1.257</td>
<td align="left" valign="top">0.209</td>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><italic>Random Effects</italic></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Participant</sub></td>
<td align="left" valign="top">0.889</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Item</sub></td>
<td align="left" valign="top">0.501</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Participant</sub></td>
<td align="left" valign="top">305</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Item</sub></td>
<td align="left" valign="top">83</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">Observations</td>
<td align="left" valign="top">6372</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><sup>&#8224;</sup>Bonferroni corrected &#945; = .0125.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>An additional analysis examined if including <italic>other</italic> responses with <italic>he</italic> responses impacted our findings. The results revealed the same patterns as above, with the following exceptions: In the model testing the effects of Condition and Gender Rating, the intercept (<italic>&#946;</italic> = &#8211;0.22, <italic>z</italic> = &#8211;1.75, p = .08) and the difference between the First and Full Name conditions (<italic>&#946;</italic> = &#8211;0.25, <italic>z</italic> = &#8211;1.57, p = .12) were not significant. A second exploratory analysis added a quadratic effect of Gender Rating to evaluate whether the increase in <italic>she</italic> responses as a function of Gender Rating was nonlinear. For example, we might expect to see an effect of Gender Rating that is weaker at the endpoints (strongly gendered names) than at the midpoint (androgynous names), with a larger <italic>he</italic> response bias for androgynous names than for strongly gendered names. The quadratic effect of Gender Rating was not significant, nor did it significantly interact with Condition, inconsistent with this possibility. A final exploratory analysis included participant gender as a covariate, testing if male participants showed a larger <italic>he</italic> response bias; this analysis revealed no significant effects after Bonferroni corrections for multiple comparisons. These three analyses are discussed in more detail in Supplement &#167;2.3&#8211;2.5.</p>
</sec>
<sec>
<title>2.4 Discussion</title>
<p>We investigated whether the form of reference&#8212;first name, last name, or full name&#8212;affected people&#8217;s inferences about a character&#8217;s gender, measured through the pronouns they used to complete a sentence referring to the character. When participants were not given explicit cues to gender (Last Name condition), participants overwhelmingly used <italic>he</italic> to refer to the character. Moreover, in the Last Name condition, participants were approximately equally likely to not use a gendered pronoun at all (<italic>other</italic> responses) as they were to use <italic>she</italic>. Although probabilistic cues to the referent&#8217;s gender did shape inferences, with more <italic>she</italic> responses when a first name was given (First and Full Name conditions), inspection of the data indicates that the bias towards <italic>he</italic> responses persisted. A character&#8217;s name needed to be more strongly feminine for participants to preferentially refer to them with <italic>she</italic>. In addition, participants showed a pattern of asymmetry for mostly-masculine and mostly-feminine names. In the First and Full Name conditions, androgynous names that leaned feminine (e.g., <italic>Jackie</italic>) still elicited <italic>he</italic> responses, but androgynous names that leaned masculine (e.g., <italic>Chris</italic>) elicited <italic>other</italic> responses, rather than <italic>she</italic> responses. As the first names became more feminine, the rate of <italic>she</italic> responses remained flat and parallel to the rate of <italic>other</italic> responses, then increased more sharply, whereas the rate of <italic>he</italic> responses decreased more gradually. We also hypothesized that introducing a person with a first and last name would attenuate the gender cue from the first name, such that the preference for <italic>he</italic> responses would be greater in the Full Name condition as compared to the First Name condition. The data were not consistent with this prediction; instead, the preference for <italic>he</italic> responses was numerically larger in the First Name condition.</p>
<p>The fact that the gender ratings of the first names predicted <italic>she</italic> responses indicates that participants were willing to produce feminine language forms in this task. The observed bias towards masculine language forms instead points to biased inferences about gender. However, one potential concern with this interpretation is that the pronouns produced in reference to the characters may not entirely match participants&#8217; underlying inferences about the characters&#8217; genders. Instead, it is possible that some <italic>he</italic> responses in the Last Name condition come from generic masculine usage, with participants producing <italic>he</italic> in an ostensibly gender-unspecified manner. In the 19<sup>th</sup> century, the generic masculine was prescribed as correct, explicitly replacing alternatives like singular <italic>they</italic> and <italic>he or she</italic> that had been in use. This was contested by feminists in the 1970s and 1980s, who argued that this language was not inclusive and perpetuated biases of masculine as the default (<xref ref-type="bibr" rid="B7">Bodine, 1975</xref>). While this guidance has been replaced in formal language policies by <italic>he or she</italic> and occasionally singular <italic>they</italic> constructions (<xref ref-type="bibr" rid="B2">APA, 2019</xref>; <xref ref-type="bibr" rid="B3">APA Publication Manual Task Force, 1997</xref>; <xref ref-type="bibr" rid="B42">Robertson, 2021</xref>), some speakers retain the generic masculine usage. If so, some instances of <italic>he</italic> responses in the data &#8211; particularly those in the Last Name condition, where no direct information about the character&#8217;s gender is included &#8211; may reflect this generic use. To provide a more direct test of the influence of referential form on gender inferences per se, in Experiment 2 we ask participants to make explicit inferences about the referent&#8217;s gender.</p>
</sec>
</sec>
<sec>
<title>3. Experiment 2</title>
<p>The aim of Experiment 2 was to examine the relationship between how a character in a story is referenced (e.g., by their first, last, or full name) and later explicit judgments about that character&#8217;s gender. Participants read a series of seven short stories that introduced a human character with a name and described them completing an everyday action. After a brief delay task, participants were prompted with each character&#8217;s action and asked to indicate the character&#8217;s gender in a free-response box. Note that participants were only asked explicit questions about gender after having read all seven stories first, in contrast to Experiment 1, where participants generated a sentence completion after reading each story preamble. This design choice was used to avoid participants reading the stories with the expectation that they would be later asked about gender.</p>
<sec sec-type="methods">
<title>3.1 Methods</title>
<sec>
<title>3.1.1 Participants</title>
<p>1351 participants were included in the dataset, with each participant assigned to 1 of 3 conditions (First = 451, Last = 448, Full = 452). The sample size was determined a priori, based on Boyce et al. (<xref ref-type="bibr" rid="B8">2019</xref>). Participants were recruited on Amazon Mechanical Turk using the same inclusion criteria and payment as in Experiment 1. A total of 1534 responses were collected; exclusion rationales and participant demographics are reported in Supplement &#167;3.1. Unlike in Experiment 1, participants were not excluded for guessing the study was about gender bias, since this task explicitly asked about gender inferences.</p>
</sec>
<sec>
<title>3.1.2 Materials and procedure</title>
<p>The names were combined into 3 between-participants conditions as in Experiment 1 (<italic>First Name, Last Name, Full Name</italic>). Participants saw two-sentence stories that referred to a character by name twice and did not contain any gendered pronouns (<xref ref-type="fig" rid="F2">Figure 2</xref>). The stories described everyday actions selected to avoid strong gender stereotypes (e.g., making coffee, walking a dog). Because the task involved a memory component, participants completed 7 critical trials (as opposed to 21 in Experiment 1). The materials included a total of 7 stories, 21 first names, 21 last names, and 63 first + last name pairs. Within each of the 3 conditions, 9 lists counterbalanced which names were included (3 sets) and the combinations of names and stories (also 3 sets); participants were randomly assigned to 1 of these 27 lists. In the First Name condition, each list included 7 out of the 21 first names, distributed evenly across the gender ratings from masculine to feminine. In the Last Name condition, each list included 7 out of the 21 last names, distributed randomly. In the Full Name condition, we used the same 3 combinations of first and last names as in Experiment 1, with the exception that we corrected an error in the Experiment 1 lists where a duplicate name appeared. As in Experiment 1, the names of 26 US presidential 2020 candidates acted as filler items to pilot a separate study, with each participant seeing 1 of these items.</p>
<p>After reading each story, participants typed the name of the character as an attention check. Participants then completed 16 simple math questions as a distraction task. Next, participants were given a summary of the main action in each story and asked to type the gender of the character into a free response box (<xref ref-type="fig" rid="F2">Figure 2</xref>). The free response box allowed participants to express uncertainty (e.g., <italic>gender wasn&#8217;t specified</italic> or <italic>I can&#8217;t remember</italic>). Critically, the memory prompt referenced the action and not the name.</p>
</sec>
</sec>
<sec>
<title>3.2 Predictions</title>
<p>The results of Experiment 1 indicate that the bias to infer characters as male was not eliminated when probabilistic information about the gender of a character &#8211; given by their first name &#8211; was provided. Instead, the findings support a model of gender inference where probabilistic cues about gender are combined with a people=male bias. However, pronouns produced in reference to the character may not necessarily reflect underlying inferences about the character&#8217;s gender. In particular, some <italic>he</italic> responses in the Last Name condition may have been driven by a generic masculine usage, where participants produced <italic>he</italic> but did not necessarily infer the character as male.</p>
<fig id="F2">
<caption>
<p><bold>Figure 2:</bold> Experiment 2: Procedure and example stimuli.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-185-g2.png"/>
</fig>
<p>If the results of Experiment 1, where participants were less likely to produce <italic>she</italic> than predicted by the gender association of the first names, do reflect a bias to infer characters as male, the pattern of results should be the same when participants are asked directly about the characters&#8217; genders. Characters will be more likely to be recalled as female as the rating of the first name becomes more feminine (First and Full Name conditions). However, a bias to infer characters as male will be present in all three conditions and strongest when direct probabilistic information about gender is not provided (Last Name condition).</p>
<p>Alternatively, the bias to use the pronoun <italic>he</italic> when describing the characters in Experiment 1, particularly in the Last Name condition, may not have reflected a bias in underlying gender inferences about the character, and instead reflected the use of the generic masculine (<xref ref-type="bibr" rid="B7">Bodine, 1975</xref>). If so, in Experiment 2, we would expect to observe no bias to recall characters as male in the First and Full Name conditions, where probabilistic cues to gender are available. In the Last Name condition, we would expect characters to be more likely to be recalled as male than as female, due to the tendency to refer to men using last names more often than women.</p>
</sec>
<sec>
<title>3.3 Results</title>
<p>Responses were coded as <italic>male</italic> (e.g., &#8220;m,&#8221; &#8220;man,&#8221; &#8220;male&#8221;), <italic>female</italic> (e.g., &#8220;f,&#8221; &#8220;woman,&#8221; &#8220;female&#8221;), or <italic>other</italic> (e.g., &#8220;It wasn&#8217;t specified,&#8221; &#8220;I don&#8217;t remember&#8221;). As in Experiment 1, the rates of <italic>male</italic> and <italic>female</italic> responses were roughly equal in the First and Full Name conditions, following the balanced distribution of the first names, but participants overwhelmingly responded <italic>male</italic> in the Last Name condition (<xref ref-type="table" rid="T4">Table 4</xref>). Overall, participants were less likely to respond <italic>female</italic> than <italic>male</italic> or <italic>other</italic> (<italic>&#946;</italic> = &#8211;0.86, <italic>z</italic> = &#8211;5.71, p &lt; .001). Participants in the First and Full Name conditions were more likely to respond <italic>female</italic> than participants in the Last Name condition (<italic>&#946;</italic> = 2.00, <italic>z</italic> = 5.83, p &lt; .001). There was no difference between the First and Full Name conditions.</p>
<table-wrap id="T4">
<caption>
<p><bold>Table 4:</bold> Experiment 2: Number of <italic>female, male</italic>, and <italic>other</italic> responses and the ratio of <italic>female</italic> responses to <italic>male</italic> and <italic>other</italic> responses for each condition.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experiment 2: Number of Responses by Conditio</bold>n</td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold><italic>Female</italic></bold></td>
<td align="left" valign="top"><bold><italic>Male</italic></bold></td>
<td align="left" valign="top"><bold><italic>Other</italic></bold></td>
<td align="left" valign="top"><bold>Ratio of <italic>Female</italic> &#124;<italic>Male</italic> + <italic>Other</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>First</bold></td>
<td align="left" valign="top">1579</td>
<td align="left" valign="top">1543</td>
<td align="left" valign="top">35</td>
<td align="left" valign="top">1.001</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Full</bold></td>
<td align="left" valign="top">1446</td>
<td align="left" valign="top">1633</td>
<td align="left" valign="top">85</td>
<td align="left" valign="top">0.842</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Last</bold></td>
<td align="left" valign="top">406</td>
<td align="left" valign="top">2498</td>
<td align="left" valign="top">232</td>
<td align="left" valign="top">0.149</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="T5">
<caption>
<p><bold>Table 5:</bold> Experiment 2: Model results for the effect of Condition on the likelihood of <italic>female</italic> responses (= 1), as opposed to <italic>male</italic> and <italic>other</italic> responses (= 0).</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experiment 2: Conditi</bold>on</td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top" colspan="4"><bold><italic>Recall as female</italic></bold></td>
</tr>
<tr>
<td align="left" valign="top"><bold><italic>Predictors</italic></bold></td>
<td align="left" valign="top"><bold><italic>Log-Odds</italic></bold></td>
<td align="left" valign="top"><bold><italic>SE</italic></bold></td>
<td align="left" valign="top"><bold><italic>z</italic></bold></td>
<td align="left" valign="top"><bold><italic>p</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>(Intercept)</bold></td>
<td align="left" valign="top">&#8211;0.861</td>
<td align="left" valign="top">0.151</td>
<td align="left" valign="top">&#8211;5.710</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Condition: Last</bold> (&#8211;.66) <bold>vs. First</bold> (+.33) <bold>+ Full</bold> (+.33)</td>
<td align="left" valign="top">2.000</td>
<td align="left" valign="top">0.343</td>
<td align="left" valign="top">5.843</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition: First (&#8211;.5) vs. Full (+.5)</td>
<td align="left" valign="top">&#8211;0.231</td>
<td align="left" valign="top">0.345</td>
<td align="left" valign="top">&#8211;0.669</td>
<td align="left" valign="top">0.50</td>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><italic>Random Effects</italic></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Participant</sub></td>
<td align="left" valign="top">0.196</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Item</sub></td>
<td align="left" valign="top">1.782</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Participant</sub></td>
<td align="left" valign="top">1351</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Item</sub></td>
<td align="left" valign="top">105</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">Observations</td>
<td align="left" valign="top">9457</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><sup>&#8224;</sup>Bonferroni corrected &#945; = .0167.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>Next, the effect of Gender Rating was analyzed for the First and Full Name conditions (<xref ref-type="table" rid="T6">Table 6</xref>), again following the same model specifications as in Experiment 1. The intercept term was significant (<italic>&#946;</italic> = &#8211;0.18, <italic>z</italic> = &#8211;2.99, p &lt; .01), indicating that participants were less likely to respond <italic>female</italic> in the First and Full Name conditions overall. <italic>Female</italic> responses became more likely as the names became more feminine (<italic>&#946;</italic> = 0.78, <italic>z</italic> = 22.34, p &lt; .001), but did not surpass <italic>male</italic> responses until the first name in the prompt was biased somewhat feminine, rather than at the mean (<xref ref-type="fig" rid="F3">Figure 3</xref>). The interaction between Gender Rating and Condition was not significant, indicating that the linear effect of Gender Rating was similar in the First and Full Name conditions.</p>
<table-wrap id="T6">
<caption>
<p><bold>Table 6:</bold> Experiment 2: Model results for the effects of Condition and Gender Rating on the likelihood of <italic>female</italic> responses (= 1) as opposed to <italic>male</italic> and <italic>other</italic> responses (= 0) in the First and Full Name conditions.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experiment 2: Condition and Gender Rating</bold></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top" colspan="4"><bold><italic>Recall as female</italic></bold></td>
</tr>
<tr>
<td align="left" valign="top"><bold><italic>Predictors</italic></bold></td>
<td align="left" valign="top"><bold><italic>Log-Odds</italic></bold></td>
<td align="left" valign="top"><bold><italic>SE</italic></bold></td>
<td align="left" valign="top"><bold><italic>z</italic></bold></td>
<td align="left" valign="top"><bold><italic>p</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>(Intercept)</bold></td>
<td align="left" valign="top">&#8211;0.176</td>
<td align="left" valign="top">0.059</td>
<td align="left" valign="top">&#8211;2.999</td>
<td align="left" valign="top"><bold>0.003</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition (First = &#8211;.5, Full = +.5)</td>
<td align="left" valign="top">&#8211;0.223</td>
<td align="left" valign="top">0.117</td>
<td align="left" valign="top">&#8211;1.907</td>
<td align="left" valign="top">0.057</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Gender Rating</bold> (Centered, Masc &#8211;, Fem +)</td>
<td align="left" valign="top">0.783</td>
<td align="left" valign="top">0.035</td>
<td align="left" valign="top">22.338</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition &#215; Gender Rating</td>
<td align="left" valign="top">&#8211;0.066</td>
<td align="left" valign="top">0.069</td>
<td align="left" valign="top">&#8211;0.961</td>
<td align="left" valign="top">0.336</td>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><italic>Random Effects</italic></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Participant</sub></td>
<td align="left" valign="top">0.114</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Item</sub></td>
<td align="left" valign="top">0.141</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Participant</sub></td>
<td align="left" valign="top">903</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Item</sub></td>
<td align="left" valign="top">83</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">Observations</td>
<td align="left" valign="top">6321</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><sup>&#8224;</sup>Bonferroni corrected &#945; = .0125.</p></fn>
</table-wrap-foot>
</table-wrap>
<fig id="F3">
<caption>
<p><bold>Figure 3:</bold> Experiment 2: Proportions of <italic>male</italic> (blue), <italic>female</italic> (red), and <italic>other</italic> (gray) responses in the First Name and Full Name conditions by the mean-centered gender rating of the first name. Points indicate means for each of the 21 first names, and lines indicate a smooth function on the raw data. Here, 0 is the mean of the 21 names, and the dashed line indicates the center of the original response scale in the norming study.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-185-g3.png"/>
</fig>
<p>Next, we conducted the same three exploratory analyses as in Experiment 1: exclusion of the <italic>other</italic> responses, a quadratic effect of Gender Rating, and participant gender effects (Supplement &#167;3.3&#8211;3.5). The rate of <italic>other</italic> responses (3.72%) was lower than in Experiment 1, and excluding <italic>other</italic> responses did not affect the substantive pattern of results. When adding a quadratic effect of Gender Rating to the Condition and Gender Rating model, neither the quadratic effect nor its interaction with Condition were significant. This finding is inconsistent with the hypothesis that the Condition effect would be larger at the midpoint of Gender Rating compared to the endpoints. Adding participant gender as a covariate revealed a significant interaction between Participant Gender and the Last vs. First + Full contrast (<italic>&#946;</italic> = &#8211;.42, <italic>z</italic> = &#8211;2.93, p &lt; .01), such that male participants were less likely than non-male participants to respond <italic>female</italic> in the First and Full Name conditions (<italic>&#946;</italic> = &#8211;.26, <italic>z</italic> = &#8211;3.75, p &lt; .001), whereas there was no effect of participant gender in the Last Name condition (<italic>&#946;</italic> = .15, <italic>z</italic> = 1.23, p = .22). Across conditions, the effect of Gender Rating was smaller for male participants than for non-male participants (<italic>&#946;</italic> = &#8211;0.16, <italic>z</italic> = &#8211;2.64, p &lt; .01).</p>
</sec>
<sec>
<title>3.4 Discussion</title>
<p>Experiment 2 was designed to address the possibility that the preference for <italic>he</italic> responses in Experiment 1, especially in the Last Name condition, was due to participants using generic masculine forms, rather than a bias in the underlying gender inferences. Our findings were inconsistent with this as the primary explanation of the data. As in Experiment 1, participants&#8217; judgments about the genders of characters introduced in short narratives exhibited a bias to infer characters as male. This bias was strongest in the Last Name condition, compared to the conditions where the character was introduced including a first name (First and Full Name conditions). Participants did not recall the character as female 50% of the time at the midpoint on the name gender rating continuum, but instead when the names were somewhat feminine.</p>
<p>The bias to infer characters as male was present in Experiment 2, but smaller than in Experiment 1 (see comparison between experiments in <xref ref-type="fig" rid="F8">Figure 8</xref>). This difference is primarily due to an attenuated difference between the Last Name and First + Full Name conditions, where participants were 16.84 times more likely to produce a <italic>she</italic> response in the First + Full Name conditions in Experiment 1, but 7.39 times more likely to respond <italic>female</italic> in the First + Full Name conditions in Experiment 2. The mismatch between knowledge about the gender associations of first names and inferences about the genders of characters with those names &#8211; where characters only began being preferentially inferred as female when first names were somewhat feminine, not at the midpoint &#8211; was consistent across the two experiments. In the First and Full Name conditions, the odds ratios were 0.70 for a <italic>she</italic> response compared to a <italic>he</italic> or <italic>other</italic> response and 0.77 for a <italic>female</italic> response compared to a <italic>male</italic> or <italic>other</italic> response (<xref ref-type="fig" rid="F8">Figure 8</xref>).</p>
<p>One reason that the bias to infer characters as male was smaller in Experiment 2, aside from residual uses of generic <italic>he</italic> in the Last Name condition, is that people may be less likely to assume characters are male by default when the task requires them to think more directly about gender. This would be consistent with prior results, where after writing about a generic person, participants were two and a half times more likely to use masculine names to describe the character, as compared to two times more likely to explicitly label the character as male (<xref ref-type="bibr" rid="B26">Hamilton, 1988</xref>). Before considering this further, we first explore whether the people=male bias can be attenuated when people are provided with more information about, and repeated reference to, a character.</p>
</sec>
</sec>
<sec>
<title>4. Experiment 3</title>
<p>Experiments 1 and 2 examined gender inferences after brief introductions to characters, but in many settings, we receive significantly more individuating information about a person before needing to refer to them or reflect on their gender. If so, having more information about a person as an individual may reduce reliance on the people=male bias in typical settings. To address this question, Experiment 3 investigates whether the bias to infer characters as male persists after repeated reference to the character in a narrative that highlights an aspect of their life. In addition to providing individuating information, the use of repeated reference in the narrative provides multiple opportunities and more time to process an inference about gender. We also explore whether the form of reference shapes perceptions of character traits beyond gender.</p>
<p>Participants in Experiment 3 read a paragraph-length story about a character, written as a short news story highlighting an accomplishment. Characters were always introduced with a full name, then referred to 3 more times, which varied by the same conditions as in prior experiments (<italic>First Name, Last Name, Full Name</italic>). Participants continued the story by completing a sentence fragment, and we measured which pronouns, if any, were used to refer to the character. After the sentence completion task, participants rated the character in terms of likeability, accomplishment, and importance. This process was repeated for 7 stories and characters. Prior research demonstrates that professionals who are referred to by last name, a convention associated with masculinity in English (<xref ref-type="bibr" rid="B19">Files et al., 2017</xref>; <xref ref-type="bibr" rid="B56">Uscinski &amp; Goren, 2011</xref>), were judged as more accomplished and deserving of awards (<xref ref-type="bibr" rid="B5">Atir &amp; Ferguson, 2018</xref>). Given these findings, we hypothesized that characters who are rated as more accomplished and important may be more likely to be referred to with <italic>he</italic>. Judgments of status and likeability frequently trade off in women (<xref ref-type="bibr" rid="B50">Stewart et al., 2003</xref>; <xref ref-type="bibr" rid="B52">Takiff et al., 2001</xref>), and so characters who are rated as more likeable may be more likely to be referred to with <italic>she</italic>.</p>
<sec sec-type="methods">
<title>4.1 Methods</title>
<sec>
<title>4.1.1 Participants</title>
<p>Participants were recruited on Amazon Mechanical Turk, following the same criteria and procedures as in Experiments 1 and 2. The sample size (1350 planned) was chosen to generate the same number of data points as in Experiment 1 (150 participants per condition, completing 21 trials) and Experiment 2 (450 participants per condition, completing 7 trials). Because trials in Experiment 3 were longer than in Experiment 1, each participant completed only 7 trials. The final sample (N = 1272) included 405 in the First Name condition, 510 in the Last Name condition, and 357 in the Full Name condition, with conditions unbalanced due to variable rejection rates on MTurk. Participant exclusions and demographics are reported in Supplement &#167;4.1.</p>
</sec>
<sec>
<title>4.1.2 Materials and procedure</title>
<p>As in Experiments 1 and 2, participants were randomly assigned to 1 of 3 between-participants conditions: <italic>First Name, Last Name</italic>, and <italic>Full Name</italic>. Participants saw paragraph-length stories that included the character&#8217;s name 4 times, but did not use any gendered pronouns. The first reference to the character in the story always used a full name. The following 3 references to the character used either their first, last, or full name, according to the condition (<xref ref-type="fig" rid="F4">Figure 4</xref>). The materials included a total of 7 stories, each written as a short news article highlighting the character&#8217;s accomplishment: publishing a study, running a successful campaign event, having a bestseller, releasing a new album, breaking a running record, founding an animal rescue, and donating holiday meals. In addition to the 7 stories, the materials included 3 combinations of the 21 first names and 21 last names. Similar to Experiment 2, there were 9 lists within each condition, counterbalancing the combinations of names and stories and the combinations of first and last names; each participant was randomly assigned to 1 of the resulting 27 lists. Each list had first names evenly distributed across the gender ratings from masculine to feminine. After reading each of the 7 stories, participants were given a sentence fragment, which contained a 5<sup>th</sup> instance of the character&#8217;s name, varying by condition. Participants were asked to complete the sentence. Next, they were asked to rate the character on a 1&#8211;7 scale as Likeable, Accomplished, and Important. The names in these prompts again varied according to condition.</p>
</sec>
</sec>
<sec>
<title>4.2 Predictions</title>
<p>The results of Experiments 1 and 2 support a model where gender is inferred based on a combination of probabilistic cues to gender &#8211; here in the form of a person&#8217;s first name &#8211; and an overall people=male bias. However, the characters were only mentioned once in Experiment 1 and twice in Experiment 2, and the Last Name condition contained no cues about the characters&#8217; genders through the names themselves, only more indirect cues from the fact that men are more likely to be referred to by last name. The bias to infer people as male may only be present in the initial inferences established during these brief introductions. If so, additional information about the referent and time to process gender inferences may attenuate or eliminate this tendency. We would then expect to find no bias towards <italic>he</italic> responses when the probabilistic information about gender carried by the first name is repeatedly presented (First and Full Name conditions).</p>
<fig id="F4">
<caption>
<p><bold>Figure 4:</bold> Experiment 3: Procedure and example stimuli.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-185-g4.png"/>
</fig>
<p>Alternatively, if biased gender inferences persist after repeatedly encountering cues to a character&#8217;s gender, we would expect to observe a bias to infer characters as male in all conditions, with <italic>he</italic> responses occurring more frequently than predicted by the gender distributions of the first names. One reason to expect the bias to persist comes from work showing that revising initial inferences about a character&#8217;s gender incurs processing costs while reading (<xref ref-type="bibr" rid="B10">Carreiras et al., 1996</xref>; <xref ref-type="bibr" rid="B22">Garnham et al., 2002</xref>; <xref ref-type="bibr" rid="B30">Kennison &amp; Trofe, 2003</xref>; <xref ref-type="bibr" rid="B51">Sturt, 2003</xref>).</p>
</sec>
<sec>
<title>4.3 Results</title>
<p>Responses were categorized as <italic>he, she</italic>, and <italic>other</italic>. The sentence completion prompts were less constrained than Experiment 1, with only 53% of responses beginning with a pronoun (compared to 93% in Experiment 1). As a result, we analyzed pronouns used to refer to the character at any position in the response (69% of responses). <xref ref-type="table" rid="T7">Table 7</xref> shows the proportions of responses across conditions, with <italic>other</italic> responses occurring in about a third of trials, an increase compared to Experiment 1. Responses were analyzed using logistic mixed-effect regression models, as before. Unlike Experiments 1 and 2, the contrasts for Condition were weighted to account for uneven numbers of participants in each condition. Recall that in all conditions, the first of 4 repetitions of the name was always a full name. As a result, we now analyze Gender Rating in all 3 conditions (<xref ref-type="table" rid="T8">Table 8</xref>).</p>
<table-wrap id="T7">
<caption>
<p><bold>Table 7:</bold> Experiment 3: Number of <italic>she, he</italic>, and <italic>other</italic> responses, the ratio of <italic>she</italic> responses to <italic>he</italic> and <italic>other</italic> responses, and the ratio of <italic>she</italic> to <italic>he</italic> responses for each condition.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="6"><bold>Experiment 3: Number of Responses by Condition</bold></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold><italic>She</italic></bold></td>
<td align="left" valign="top"><bold><italic>He</italic></bold></td>
<td align="left" valign="top"><bold><italic>Other</italic></bold></td>
<td align="left" valign="top"><bold>Ratio of <italic>She</italic> &#124; <italic>He</italic> + <italic>Other</italic></bold></td>
<td align="left" valign="top"><bold>Ratio of <italic>She</italic> &#124; <italic>He</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>First</bold></td>
<td align="left" valign="top">941</td>
<td align="left" valign="top">992</td>
<td align="left" valign="top">902</td>
<td align="left" valign="top">0.497</td>
<td align="left" valign="top">0.949</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Full</bold></td>
<td align="left" valign="top">848</td>
<td align="left" valign="top">899</td>
<td align="left" valign="top">752</td>
<td align="left" valign="top">0.514</td>
<td align="left" valign="top">0.943</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Last</bold></td>
<td align="left" valign="top">1079</td>
<td align="left" valign="top">1378</td>
<td align="left" valign="top">1113</td>
<td align="left" valign="top">0.433</td>
<td align="left" valign="top">0.783</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F5">
<caption>
<p><bold>Figure 5:</bold> Experiment 3: Proportions of <italic>he</italic> (blue), <italic>she</italic> (red), and <italic>other</italic> (gray) responses in the First, Last, and Full Name conditions by the mean-centered gender rating of the first name. Points indicate means for each of the 21 first names, and lines indicate a smooth function on the raw data. Here, 0 is the mean of the 21 names, and the dashed line indicates the center of the original response scale in the norming study.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-185-g5.png"/>
</fig>
<p>Overall, participants were less likely to produce <italic>she</italic> responses than <italic>he</italic> and <italic>other</italic> responses (<italic>&#946;</italic> = &#8211;1.53, <italic>z</italic> = &#8211;15.09, p &lt; .001). While <italic>she</italic> responses increased as the first name became more feminine (<italic>&#946;</italic> = 1.15, <italic>z</italic> = 19.02, p &lt; .001), <italic>she</italic> responses only surpassed <italic>he</italic> responses when names were somewhat feminine, not at the mean (<xref ref-type="fig" rid="F5">Figure 5</xref>). Neither main effect of Condition was significant. The interaction between Gender Rating and the Last vs. First + Full condition contrast was significant (<italic>&#946;</italic> = 0.12, <italic>z</italic> = 2.15, p &lt; .05), such that the effect of Gender Rating was larger in the First + Full Name conditions compared to the Last Name condition. This interaction is likely due to the fact that the First + Full conditions had 4 repetitions of the gendered first name, whereas the Last Name condition only had 1 use of the first name. The interaction between Gender Rating and the First vs. Full Name condition contrast was not significant.</p>
<table-wrap id="T8">
<caption>
<p><bold>Table 8:</bold> Experiment 3: Model results for the effects of Condition and Gender Rating on the likelihood of <italic>she</italic> responses (= 1) as opposed to <italic>he</italic> and <italic>other</italic> responses (= 0).</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experiment 3: Condition and Gender Rating</bold></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top" colspan="4"><bold><italic>Refer to using</italic></bold> she</td>
</tr>
<tr>
<td align="left" valign="top"><bold><italic>Predictors</italic></bold></td>
<td align="left" valign="top"><bold><italic>Log-Odds</italic></bold></td>
<td align="left" valign="top"><bold><italic>SE</italic></bold></td>
<td align="left" valign="top"><bold><italic>z</italic></bold></td>
<td align="left" valign="top"><bold><italic>p</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>(Intercept)</bold></td>
<td align="left" valign="top">&#8211;1.524</td>
<td align="left" valign="top">0.101</td>
<td align="left" valign="top">&#8211;15.090</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition: Last (&#8211;.6) vs. First (+.4) + Full (+.4)</td>
<td align="left" valign="top">0.153</td>
<td align="left" valign="top">0.092</td>
<td align="left" valign="top">1.674</td>
<td align="left" valign="top">0.094</td>
</tr>
<tr>
<td align="left" valign="top">Condition: First (&#8211;.48) vs. Full (+.52); Last (.02)</td>
<td align="left" valign="top">0.091</td>
<td align="left" valign="top">0.116</td>
<td align="left" valign="top">0.786</td>
<td align="left" valign="top">0.432</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Gender Rating</bold> (Centered, Masc &#8211;, Fem +)</td>
<td align="left" valign="top">1.148</td>
<td align="left" valign="top">0.060</td>
<td align="left" valign="top">19.017</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition (Last vs. First + Full) &#215; Gender Rating</td>
<td align="left" valign="top">0.105</td>
<td align="left" valign="top">0.049</td>
<td align="left" valign="top">2.153</td>
<td align="left" valign="top">0.031<italic><sup>&#8224;</sup></italic></td>
</tr>
<tr>
<td align="left" valign="top">Condition (First vs. Full) &#215; Gender Rating</td>
<td align="left" valign="top">&#8211;0.056</td>
<td align="left" valign="top">0.063</td>
<td align="left" valign="top">&#8211;0.894</td>
<td align="left" valign="top">0.371</td>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><italic>Random Effects</italic></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Participant</sub></td>
<td align="left" valign="top">0.793</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Item</sub></td>
<td align="left" valign="top">0.421</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Participant</sub></td>
<td align="left" valign="top">1272</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Item</sub></td>
<td align="left" valign="top">63</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">Observations</td>
<td align="left" valign="top">8904</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><sup>&#8224;</sup>Bonferroni corrected &#945; = .0083.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>We conducted the same three exploratory analyses as in Experiments 1 and 2, following the same model specifications and applying Bonferroni corrections for multiple comparisons. Responses coded as <italic>other</italic> were similar to the types in Experiment 1 (e.g., repeating the character&#8217;s name), but represented a larger proportion of the data (31%). When excluding <italic>other</italic> responses (Supplement &#167;4.3), the Last vs. First + Full contrast was significant (<italic>&#946;</italic> = 0.26, <italic>z</italic> = 2.63, p &lt; .01), such that participants were less likely to produce <italic>she</italic> in the First and Full Name conditions than in the Last Name condition, similar to the results of the first two experiments. The intercept (<italic>&#946;</italic> = &#8211;0.42, <italic>z</italic> = &#8211;3.42, p &lt; .001) and the interaction between Condition and Gender Rating both remained significant (<italic>&#946;</italic> = 0.42, <italic>z</italic> = 5.46, p &lt; .001) in this subset of the data.</p>
<p>Adding a quadratic effect of Gender Rating to the primary model, which included <italic>other</italic> responses, revealed a significant quadratic effect of Gender Rating (<italic>&#946;</italic> = &#8211;0.11, <italic>z</italic> = &#8211;3.67, p &lt; .001). Inspection of the data suggests that this effect may be due to stronger effects of name rating towards the center of the gender rating scale (androgynous names) than at the end points (strongly-gendered names). The interaction between the quadratic effect of Gender Rating and the Last vs. First + Full contrast was significant (<italic>&#946;</italic> = &#8211;0.10, <italic>z</italic> = &#8211;3.24, p &lt; .001). Probing this interaction indicated that the quadratic effect of Gender Rating was significant in the First and Full Name conditions (<italic>&#946;</italic> = &#8211;0.15, <italic>z</italic> = &#8211;4.28, p &lt; .001), but not in the Last Name condition (<italic>&#946;</italic> = &#8211;0.06, <italic>z</italic> = &#8211;1.67, p = .09), which likely reflects that the Last Name condition only included the gendered first name in 1 out of the 4 repetitions. This analysis also indicated a significant Condition effect for the Last vs. First + Full contrast (<italic>&#946;</italic> = 0.24, <italic>z</italic> = 3.00, p &lt; .01), such that participants were more likely to produce <italic>she</italic> in the First and Full Name conditions compared to the Last Name condition (Supplement &#167;4.4).</p>
<p>Adding participant gender as a covariate revealed that male participants were less likely than non-male participants to produce <italic>she</italic> responses as compared to <italic>he</italic> and <italic>other</italic> responses across all three conditions (<italic>&#946;</italic> = &#8211;.33, <italic>z</italic> = &#8211;3.53, p &lt; .001). Participant Gender did not significantly interact with Condition or Gender Rating (Supplement &#167;4.5). Finally, we conducted exploratory analyses of the Accomplishment, Likeability, and Importance ratings (Supplement &#167;4.6). Because these ratings were near ceiling at the positive ends of the scales, the results were largely nonsignificant, with the exception that more likeable characters were more likely to be referred to with <italic>she</italic>.</p>
</sec>
<sec>
<title>4.4 Discussion</title>
<p>Experiment 3 examined whether the bias to produce <italic>he</italic> persists with more information about the character and more time to process an inference about their gender. Participants read paragraph-length news stories that mentioned a character four times and described their noteworthy accomplishment. All characters were introduced by their full name, and the following three references carried varying gender cues. In the First and Full Name conditions, the first name was repeated; in the Last Name condition, the first name was not repeated, but the choice of form of reference is an indirect cue to gender, given that men are more likely to be referred to by last name. Despite the fact that all characters were first introduced with their full name, the bias towards producing <italic>he</italic> persisted. <italic>She</italic> responses increased as the gender rating of the first names became more feminine, but only overtook <italic>he</italic> responses when the names were somewhat feminine, not at the midpoint. The effect of gender rating was stronger in the First and Full Name conditions, where the gendered first name was repeated four times, than in the Last Name condition, where the first name only appeared once.</p>
<p>Across conditions, the odds ratio of a <italic>she</italic> response (vs. a <italic>he</italic> or <italic>other</italic> response) was 0.24 in Experiment 1 and 0.22 in Experiment 3 (<xref ref-type="fig" rid="F8">Figure 8</xref>). Note, however, that there was a higher rate of <italic>other</italic> responses in Experiment 3. It is unclear how much this reflects a greater flexibility in the sentence completion prompts, with the stories in Experiment 3 allowing more felicitous continuations not using a third-person pronoun to refer to the character than the sentences in Experiment 1. Excluding <italic>other</italic> responses, the odds ratio of a <italic>she</italic> response was 0.32 in Experiment 1 and 0.65 in Experiment 3. This suggests that the <italic>he</italic> response bias was attenuated in comparison to Experiment 1, but still present. This difference across studies was most notable in the Last Name condition: In Experiment 1, where the Last Name condition did not contain direct cues to gender, the odds ratio of a <italic>she</italic> response (vs. a <italic>he</italic> response, <italic>other</italic> responses excluded) was 0.04. In Experiment 3, where the first mention was by full name, the odds ratio of a <italic>she</italic> response (vs. a <italic>he</italic> response, <italic>other</italic> excluded) was 0.51. Thus, providing the comprehender with probabilistic information about a character&#8217;s gender may attenuate, but cannot completely override, the bias to infer characters as male instilled by reference by a last name.</p>
</sec>
</sec>
<sec>
<title>5. Experiment 4</title>
<p>Experiment 4 investigated if the bias to infer characters as male after a short delay persists when participants have more information about the characters, see repeated cues about their gender, and are asked directly about their gender inferences later. Participants read a series of paragraph-length stories about a character, written as a short news story highlighting an accomplishment. Characters were introduced with a full name, then referred to three more times following the same conditions as prior experiments (<italic>First Name, Last Name, Full Name</italic>). After reading each story, participants rated the character on Likeability, Accomplishment, and Importance. After reading stories about 7 characters and rating each character, there was a brief delay during which participants completed simple math problems. Next, participants were cued with the activity described in each of the 7 stories, one at a time, and were asked to recall the gender of the character.</p>
<p>Experiments 2 and 4 differ from Experiments 1 and 3 in that these studies directly ask about gender inferences. In addition, a key feature of Experiments 2 and 4 is that participants in these studies read stories about all 7 characters and only then were asked about the gender of the 7 characters. While we can assume that participants inferred the gender of the characters as they were reading (<xref ref-type="bibr" rid="B18">Duffy &amp; Keir, 2004</xref>; <xref ref-type="bibr" rid="B22">Garnham et al., 2002</xref>; <xref ref-type="bibr" rid="B30">Kennison &amp; Trofe, 2003</xref>; <xref ref-type="bibr" rid="B39">Osterhout et al., 1997</xref>; <xref ref-type="bibr" rid="B41">Reynolds et al., 2006</xref>; <xref ref-type="bibr" rid="B51">Sturt, 2003</xref>), the instructions did not guide participants to read the stories with the intention of remembering the characters&#8217; genders, and the reading task did not prompt participants to read with the intention of designing a story continuation referring to the character. Thus, in both Experiments 2 and 4, while we expect that participants will make an inference about the gender of each character as they read, they are not aware that they will be later asked about this inference.</p>
<sec sec-type="methods">
<title>5.1 Methods</title>
<sec>
<title>5.1.1 Participants</title>
<p>Participants were recruited on Amazon Mechanical Turk, following the same criteria and procedures as in Experiments 1&#8211;3. The sample size (1350 planned) was chosen to generate the same number of data points as Experiments 1&#8211;3; here, 450 participants in each of the 3 conditions completed 7 trials each. A total of 1361 responses were recorded. The final sample (N = 1253) included 422 participants in the First Name condition, 415 in the Last Name condition, and 416 in the Full Name condition. Participant exclusions and demographics are shown in Supplement &#167;5.1.</p>
</sec>
<sec>
<title>5.1.2 Materials and procedure</title>
<p>Participants read the same stories as in Experiment 3, with characters that were introduced with a full name and subsequently referenced 3 more times, varying according to the 3 between-subjects conditions. After reading each story, participants rated the characters on Likeability, Accomplishment, and Importance. After a short delay, participants were asked to recall the gender of each character, as in Experiment 2. They were cued by the action in the story, without using the name or any gendered pronouns, and entered their answers in a free-response box. The 9 lists within each condition, counterbalancing names and prompts, were identical to Experiment 3; again participants were randomly assigned to 1 of 27 lists. <xref ref-type="fig" rid="F6">Figure 6</xref> shows the procedure and an example story.</p>
<fig id="F6">
<caption>
<p><bold>Figure 6:</bold> Experiment 4: Procedure and example stimuli.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-185-g6.png"/>
</fig>
</sec>
</sec>
<sec>
<title>5.2 Predictions</title>
<p>Thus far, our findings support a model of gender inference where probabilistic cues to gender are combined with a bias to infer that a character is male. In Experiment 4, we test the predictions of this model when the characters are presented in more detail and where the probabilistic cue to gender (in the form of the first name) is more strongly established, but when the sequence of the experiment does not prompt participants to be making inferences about gender when first reading the stories. While the results of Experiment 3 demonstrated a persistent bias to produce <italic>he</italic>, here we ask if the bias to infer characters to be male persists when directly asked about the character&#8217;s gender. If so, the bias towards recalling characters as male will be present even when probabilistic information about gender is repeatedly provided (First and Full Name conditions). Alternatively, if the bias to infer referents as male is attenuated after probabilistic cues about gender are well-established and when gender inferences are asked about directly, we would expect no bias towards recalling characters as male in the First and Full Name conditions.</p>
</sec>
<sec>
<title>5.3 Results</title>
<p>As in Experiment 2, responses were coded as <italic>male, female</italic>, or <italic>other</italic> (<xref ref-type="table" rid="T9">Table 9</xref>) and analyzed using logistic mixed-effect regression models predicting the log odds of a <italic>female</italic> response as opposed to <italic>male</italic> and <italic>other</italic> responses (<xref ref-type="table" rid="T10">Table 10</xref>). Characters were less likely to be recalled as female overall (<italic>&#946;</italic> = &#8211;0.26, <italic>z</italic> = &#8211;3.14, p &lt; .01), and somewhat more likely to be recalled as female in the First + Full Name conditions than in the Last Name condition (<italic>&#946;</italic> = 0.13, <italic>z</italic> = 0.94, p &lt; .05). The comparison between First and Full Name conditions was not significant. Participants responded <italic>female</italic> more frequently as the first names became more feminine (<italic>&#946;</italic> = 0.76, <italic>z</italic> = 16.65, p &lt; .001), but <italic>female</italic> responses did not overtake <italic>male</italic> responses until the first names were somewhat feminine (<xref ref-type="fig" rid="F7">Figure 7</xref>). The interaction between Gender Rating and Condition was significant for the Last vs. First + Full contrast (<italic>&#946;</italic> = 0.13, <italic>z</italic> = 3.81, p &lt; .001), due to a larger effect of Gender Rating in the First + Full Name conditions, where the first name was repeated four times, as compared to the Last Name condition, where the first name was only presented once. An interaction between Gender Rating and the First vs. Full contrast (<italic>&#946;</italic> = &#8211;0.10, <italic>z</italic> = &#8211;2.45, p &lt; .05) was due to a larger effect of Gender Rating in the First Name condition than in the Full Name condition.</p>
<table-wrap id="T9">
<caption>
<p><bold>Table 9:</bold> Experiment 4: Number of <italic>female, male</italic>, and <italic>other</italic> responses and the ratio of <italic>female</italic> responses to <italic>male</italic> and <italic>other</italic> responses for each condition.</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experiment 4: Number of Responses by Condition</bold></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top"><bold><italic>Female</italic></bold></td>
<td align="left" valign="top"><bold><italic>Male</italic></bold></td>
<td align="left" valign="top"><bold><italic>Other</italic></bold></td>
<td align="left" valign="top"><bold>Ratio of <italic>Female</italic> &#124;<italic>Male</italic> + <italic>Other</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>First</bold></td>
<td align="left" valign="top">1381</td>
<td align="left" valign="top">1511</td>
<td align="left" valign="top">62</td>
<td align="left" valign="top">0.878</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Full</bold></td>
<td align="left" valign="top">1380</td>
<td align="left" valign="top">1416</td>
<td align="left" valign="top">116</td>
<td align="left" valign="top">0.901</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Last</bold></td>
<td align="left" valign="top">1292</td>
<td align="left" valign="top">1529</td>
<td align="left" valign="top">84</td>
<td align="left" valign="top">0.801</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="T10">
<caption>
<p><bold>Table 10:</bold> Experiment 4: Model results for the effects of Condition and Gender Rating on the likelihood of <italic>female</italic> responses (= 1) as opposed to <italic>male</italic> and <italic>other</italic> responses (= 0).</p>
</caption>
<table>
<thead>
<tr>
<td align="left" valign="top" colspan="5"><bold>Experiment 4: Condition and Gender Rating</bold></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="left" valign="top" colspan="4"><bold><italic>Recall as female</italic></bold></td>
</tr>
<tr>
<td align="left" valign="top"><bold><italic>Predictors</italic></bold></td>
<td align="left" valign="top"><bold><italic>Log-Odds</italic></bold></td>
<td align="left" valign="top"><bold><italic>SE</italic></bold></td>
<td align="left" valign="top"><bold><italic>z</italic></bold></td>
<td align="left" valign="top"><bold><italic>p</italic></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>(Intercept)</bold></td>
<td align="left" valign="top">&#8211;0.256</td>
<td align="left" valign="top">0.082</td>
<td align="left" valign="top">&#8211;3.138</td>
<td align="left" valign="top"><bold>0.002</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition: Last (&#8211;.67) vs. First (+.33) + Full (+.33)</td>
<td align="left" valign="top">0.126</td>
<td align="left" valign="top">0.062</td>
<td align="left" valign="top">2.048</td>
<td align="left" valign="top">0.041<italic><sup>&#8224;</sup></italic></td>
</tr>
<tr>
<td align="left" valign="top">Condition: First (&#8211;.49) vs. Full (+.51)</td>
<td align="left" valign="top">0.068</td>
<td align="left" valign="top">0.072</td>
<td align="left" valign="top">0.944</td>
<td align="left" valign="top">0.345</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Gender Rating</bold> (Centered, Masc &#8211;, Fem +)</td>
<td align="left" valign="top">0.764</td>
<td align="left" valign="top">0.046</td>
<td align="left" valign="top">16.648</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Condition (Last vs. First + Full) &#215; Gender Rating</bold></td>
<td align="left" valign="top">0.131</td>
<td align="left" valign="top">0.035</td>
<td align="left" valign="top">3.809</td>
<td align="left" valign="top"><bold>&lt;0.001</bold></td>
</tr>
<tr>
<td align="left" valign="top">Condition (First vs. Full) &#215; Gender Rating</td>
<td align="left" valign="top">&#8211;0.103</td>
<td align="left" valign="top">0.042</td>
<td align="left" valign="top">&#8211;2.447</td>
<td align="left" valign="top">0.014<italic><sup>&#8224;</sup></italic></td>
</tr>
<tr>
<td align="left" valign="top" colspan="5"><italic>Random Effects</italic></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Participant</sub></td>
<td align="left" valign="top">0.201</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">&#964;<sub>00 Item</sub></td>
<td align="left" valign="top">0.360</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Participant</sub></td>
<td align="left" valign="top">1253</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">N <sub>Item</sub></td>
<td align="left" valign="top">63</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
<tr>
<td align="left" valign="top">Observations</td>
<td align="left" valign="top">8771</td>
<td align="left" valign="top" colspan="3"></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><sup>&#8224;</sup>Bonferroni corrected &#945; = .0083.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>Finally, we conducted the same set of supplementary analyses as in prior experiments (Supplement &#167;5.3&#8211;5.5). Excluding <italic>other</italic> responses (2.99% of total responses) revealed a similar pattern of results as the primary analysis. Adding a quadratic effect of Gender Rating revealed no new significant effects after Bonferroni correction for multiple comparisons. Adding Participant Gender as a covariate revealed that male participants were overall less likely than non-male participants to recall the character as female (<italic>&#946;</italic> = &#8211;.20, <italic>z</italic> = &#8211;3.27, p &lt; .001). As in Experiment 3, the Accomplishment, Likeability, and Importance ratings were near ceiling at the positive ends of the scales, and more likeable characters were more likely to be recalled as female. Additionally, interactions between each of the three character ratings and Gender Rating indicated that the effects of Likeability, Accomplishment, and Importance on gender inferences were stronger with more feminine names (Supplement &#167;5.6).</p>
<fig id="F7">
<caption>
<p><bold>Figure 7:</bold> Experiment 4: Proportions of <italic>male</italic> (blue), <italic>female</italic> (red), and <italic>other</italic> (gray) responses in the First, Last, and Full Name conditions by the mean-centered gender rating of the first name. Points indicate means for each of the 21 first names, and lines indicate a smooth function on the raw data. Here, 0 is the mean of the 21 names, and the dashed line indicates the center of the original response scale in the norming study.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-185-g7.png"/>
</fig>
</sec>
<sec>
<title>5.4 Discussion</title>
<p>Experiment 4 examined whether the bias to explicitly recall characters as male persists when participants see additional information about, and repeated reference to, the character in a narrative, and when they have more time to develop inferences about the character but are less prompted to do so intentionally by the structure of the experiment. Although all characters were first introduced with their full name, which provided probabilistic information about their gender, participants were overall less likely to infer the character as female than to infer them as male or not indicate a gender inference. Characters needed to have first names that were rated somewhat feminine before they were more likely to be recalled as female, while characters with androgynous first names were more likely to recalled as male. The effect of the first name&#8217;s gender rating was strongest in the First Name condition, where the first name appeared in all 4 references to the character, and weakest in the Last Name condition, where the first name only appeared once. The bias to infer characters as male was smaller in Experiment 4 as compared to Experiment 2, but still not eliminated: the odds ratio of a <italic>female</italic> (vs. <italic>male</italic> or <italic>other</italic> response) across conditions was 0.42 in Experiment 2, and 0.77 in Experiment 4 (<xref ref-type="fig" rid="F8">Figure 8</xref>).</p>
</sec>
</sec>
<sec>
<title>6. General discussion</title>
<sec>
<title>6.1 Overview of findings</title>
<p>The present studies investigated how choices in how we refer to a person affect inferences about that person&#8217;s gender. Specifically, we examined how referring to a character by first name, last name, or full name impacted two measures of the readers&#8217; gender inferences: pronoun use in a sentence completion task and responses to an explicit question about gender. We considered two competing hypotheses about the gender inference process. One hypothesis was that people only show a bias to assume referents are male when few cues about gender are available (e.g., you only know the person&#8217;s last name). Alternatively, we hypothesized that gender inferences might be shaped by a combination of the people=male bias (<xref ref-type="bibr" rid="B49">Silveira, 1980</xref>), along with other probabilistic cues to gender.</p>
<p>Across four experiments, using both short and long character introductions and two measures of gender inference, we observed that inferences about gender were shaped by a persistent people=male bias, along with clear use of probabilistic cues to gender. The results of Experiment 1 showed that characters who were referred to by last name only were overwhelmingly referred to with <italic>he</italic>. While providing more direct cues to gender through a first name attenuated this bias, it did not eliminate it. Instead, a character&#8217;s first name had to be at least somewhat feminine before <italic>she</italic> responses became more common than <italic>he</italic> responses. The increases in <italic>she</italic> and <italic>he</italic> responses were asymmetric, with androgynous names that leaned feminine still eliciting <italic>he</italic> responses, but androgynous names that leaned masculine eliciting responses that did not use a pronoun instead of <italic>she</italic> responses. An alternative interpretation of these findings, however, is that bias to use <italic>he</italic>, particularly in the Last Name condition, was due to participants using the generic masculine, and not due to biased inferences about gender. To address this possibility, Experiment 2 asked participants about their gender inferences directly. The results of Experiment 2 revealed a persistent, if smaller, overall bias to recall the characters as male. In addition, when cues to gender were provided through the use of the character&#8217;s first name, the name had to be somewhat feminine before the character was preferentially recalled as female. This mismatch between knowledge about the gender distributions of the first names and inferences about the characters from those names remained similar between Experiments 1 and 2.</p>
<p>In the first two experiments, participants read sentence-length descriptions of the characters. One question, then, is whether the observed bias in gender inferences would persist when a character is referred to multiple times and more is known about them. If the people=male bias attenuates as more information about the person has accrued and more time has been spent thinking about them, it may be attenuated in longer texts. To address this question, Experiments 3 &amp; 4 examined gender inferences in paragraph-length news stories, where characters were introduced by full name and subsequently referred to three more times (manipulated by condition). In comparison to Experiments 1 &amp; 2, these stories provided repeated cues to the character&#8217;s gender and more time for the reader to process inferences about the character, as well as more closely resembling a way we might read about a new person in everyday life. While the people=male bias was numerically smaller in Experiments 3 &amp; 4 compared to Experiments 1 &amp; 2, characters were still less likely to be referred to with <italic>she</italic> and less likely to be recalled as female than the distributions of the first names would predict. The strongest biases were observed when no direct information about gender was provided (Last Name condition in Experiments 1 &amp; 2), where about 80% of characters were referred to with <italic>he</italic> or recalled as male. It is worth noting the magnitude of the people=male bias is roughly the same as results from 20&#8211;30 years ago (<xref ref-type="bibr" rid="B12">Davis Merritt &amp; Kok, 1995</xref>; <xref ref-type="bibr" rid="B26">Hamilton, 1988</xref>), despite continuing social advances in gender equality.</p>
<fig id="F8">
<caption>
<p><bold>Figure 8:</bold> Comparing Experiments 1&#8211;4: Odds ratios of <italic>she</italic> vs. <italic>he</italic> + <italic>other</italic> and <italic>female</italic> vs. <italic>male</italic> + <italic>other</italic> responses averaged across all condititions, in the Last Name condition only, in the First + Full Name conditions only, and in the Last Name compared to the First + Full Name conditions. Values less than 1 indicate being less likely to produce a <italic>she/female</italic> response; values greater than 1 indicate being more likely to produce a <italic>she/female</italic> response, with a discontinuous X axis to include the 2 largest values. Odds ratios correspond to the exponentiated beta estimates reported in the models.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-185-g8.png"/>
</fig>
</sec>
<sec>
<title>6.2 Choosing referential forms</title>
<p>A primary focus of research on pronoun production in English has been on when speakers use pronouns instead of other referential expressions. A common assumption is that there is a causal link between a referent&#8217;s status in the discourse (e.g., whether it was the sentence topic, how often it was mentioned, and in what syntactic position, among others) and the form of reference the speaker selects. As a referent becomes more focused, salient, or prominent, the forms of reference used generally become more reduced, and pronouns are more likely (e.g., <xref ref-type="bibr" rid="B25">Gundel et al., 2012</xref>; <xref ref-type="bibr" rid="B43">Rodhe &amp; Kehler, 2014</xref>; see <xref ref-type="bibr" rid="B4">Arnold &amp; Zerkle, 2019</xref>, for discussion). For example, Schmitt et al.&#8217;s (<xref ref-type="bibr" rid="B47">1999</xref>) model of lexical access in pronoun production, tested in German, assumes that if a lexical concept is activated and sufficiently &#8220;in focus&#8221; in the discourse, the speaker will produce a pronoun instead of a full noun phrase. In this model, activating the lexical concept also activates the corresponding grammatical gender node (masculine, feminine, neuter), and if the speaker uses a pronoun, the gender node is selected in order to produce the correct pronoun (<xref ref-type="bibr" rid="B28">Jescheniak &amp; Levelt, 1994</xref>; <xref ref-type="bibr" rid="B31">Levelt et al., 1999</xref>; <xref ref-type="bibr" rid="B44">Roelofs, 1992</xref>). Lexical access models generally agree that grammatical gender is represented as a separate lexical-syntactic feature in the mental lexicon (<xref ref-type="bibr" rid="B59">Wang &amp; Schiller, 2019</xref>), but models differ in the structure of, and time course with which, this feature is connected to other linguistic representations (serial, unidirectional connections, e.g., <xref ref-type="bibr" rid="B28">Jescheniak &amp; Levelt, 1994</xref>; <xref ref-type="bibr" rid="B31">Levelt et al., 1999</xref>; <xref ref-type="bibr" rid="B44">Roelofs, 1992</xref>; or bidirectional connections, e.g., <xref ref-type="bibr" rid="B14">Dell, 1986</xref>, <xref ref-type="bibr" rid="B15">1988</xref>, <xref ref-type="bibr" rid="B16">1999</xref>; <xref ref-type="bibr" rid="B17">Dell &amp; O&#8217;Seaghdha, 1992</xref>). When producing gender-marked pronouns, as well as determiners, competition between different forms can arise from the grammatical gender features of <italic>other</italic> lexical concepts that are also activated (<xref ref-type="bibr" rid="B46">Schiller &amp; Caramazza, 2003</xref>). Generally, models of grammatical gender selection do not include competition between multiple gender features activated by the <italic>same</italic> lexical concept. The closest analogue is languages where the singular and plural forms of determiners vary for the same grammatical gender, and one approach argues that the singular form is activated by default, and can interfere with activating the plural form (<xref ref-type="bibr" rid="B29">Jescheniak et al., 2014</xref>; <xref ref-type="bibr" rid="B48">Schriefers et al., 2002</xref>).</p>
<p>Complexities arise, however, when we consider that gendered language talking about people reflects a social construct negotiated between speakers, not a discrete grammatical or semantic feature (<xref ref-type="bibr" rid="B1">Ackerman, 2019</xref>; <xref ref-type="bibr" rid="B11">Conrod, 2020</xref>; <xref ref-type="bibr" rid="B34">McConnell-Ginet, 2014</xref>). In contrast to grammatical gender, information about social gender carried by names is probabilistic. We know from experience that most people named Mary are referred to with <italic>she</italic>, most people named Brian are referred to with <italic>he</italic>, and people named Jordan are commonly referred to with <italic>he</italic> or <italic>she</italic>. But without knowing more about a particular person, speakers may be unsure of which pronouns are appropriate. Additionally, there are many contexts in which multiple choices of pronouns are available, such as using singular <italic>they</italic> instead of <italic>he</italic> or <italic>she</italic> to leave a referent&#8217;s gender unspecified (e.g., <italic>My friend<sub>i</sub> sent me a picture of their<sub>i</sub> cat</italic>) and using singular <italic>they</italic> instead of <italic>he</italic> or <italic>she</italic> for people who use <italic>they/them</italic> pronouns. This means that models of pronoun production that include reference to people need to account for speakers&#8217; decisions about <italic>which</italic> pronouns to produce, in addition to decisions about <italic>when</italic> to produce pronouns.</p>
<p>Our findings offer insights into the mechanisms guiding inferences about gender and the processes by which people choose pronouns to refer to a person. We propose that the ways in which we refer to people are influenced by multiple factors, including speaker knowledge of gender distributions, speaker inference about a referent&#8217;s gender, and speaker pronoun choice, in turn influencing the comprehender&#8217;s inference about referent gender (<xref ref-type="fig" rid="F9">Figure 9</xref>). To explain our findings, we discuss potential locations for bias to infer referents as male in this process. In particular, we focus on contexts like those in our experimental stimuli, where speakers have cues about gender in the form of names, but need to make an inference about which pronouns, if any, are appropriate to produce.</p>
<fig id="F9">
<caption>
<p><bold>Figure 9:</bold> Factors influencing personal pronoun choice.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossapx-3-1-185-g9.png"/>
</fig>
<p>First, we assume that, based on world experience, speakers form and store estimates of gender distributions across contexts, including, at the most basic level, the knowledge that close to half of people are women or girls ([A] in <xref ref-type="fig" rid="F9">Figure 9</xref>). Speakers also have information about the proportions of women in specific contexts: Estimates about the relative rates of women in various jobs showed a strong correlation with employment data, and when actual and estimated data diverged, participants were more likely to overestimate the proportion of men in a given occupation than to overestimate the proportion of women (<xref ref-type="bibr" rid="B21">Garnham et al., 2015</xref>; <xref ref-type="bibr" rid="B35">Misersky et al., 2014</xref>). Similarly, our norming study found a strong positive correlation between how feminine a first name was rated and the proportion at which it was given to children assigned female at birth. These findings indicate that people have reasonably well-calibrated knowledge of gender distributions in occupational contexts and based on first names, and that biases, when present, more frequently underestimate the proportion of women.</p>
<p>Speakers also have knowledge about a particular referent [B], and in many contexts, this includes information about what names, pronouns, titles, and other forms of reference are appropriate for them. In the present experiments, we focus on contexts where relatively little information is provided about the referent, requiring speakers to make inferences about their gender [C] and what language to use to refer to them [D]. Context-specific knowledge about gender distributions [A] contributes to the speaker&#8217;s inferences about a referent&#8217;s gender [C]. Previous findings show that this process [red] underestimates the prevalence of women, since people are biased to infer gender-unspecified referents as male (<xref ref-type="bibr" rid="B12">Davis Merritt &amp; Kok, 1995</xref>; <xref ref-type="bibr" rid="B13">Davis Merritt &amp; Wells Harrison, 2006</xref>; <xref ref-type="bibr" rid="B49">Silveira, 1980</xref>). In the present research, when no direct cues to gender were provided and participants were asked about their gender inferences (Last Name condition in Experiment 2), approximately 80% of responses inferred the referent to be male. This bias persisted, albeit attenuated, when some gender information was given (First and Full Name conditions) and after repeated reference (Experiments 3 &amp; 4).</p>
<p>The speaker&#8217;s knowledge of gender distributions in general [A] is one contributor to their choice of what referring language to use [D], though speakers tend to use feminine language forms less frequently than their probabilistic knowledge about gender distributions would predict [blue] (<xref ref-type="bibr" rid="B8">Boyce et al., 2019</xref>; <xref ref-type="bibr" rid="B26">Hamilton, 1988</xref>; <xref ref-type="bibr" rid="B58">von der Malsburg et al., 2020</xref>). In the present research, participants were less likely to use <italic>she</italic> to refer to characters than the gender distribution of the first names would predict, after only one reference to the character and after repeated reference. One explanation is that the criterion for inferring referents as male is lower than the criterion for inferring them as other genders. This is one implication of the people=male hypothesis (<xref ref-type="bibr" rid="B49">Silveira, 1980</xref>): if the generic person is a man, then producing <italic>he</italic> (the unmarked category) might require a lower threshold of evidence than producing <italic>she</italic> (the marked category). Since pronouns are typically reserved to refer to the most salient, accessible, or in focus character (<xref ref-type="bibr" rid="B4">Arnold &amp; Zerkle, 2019</xref>), a related possibility is that characters inferred as male are seen as more salient, and thus more likely to be referenced using a pronoun. This may explain why, in the First and Full Name conditions, <italic>he</italic> responses were dominant across the masculine half of the scale, whereas <italic>she</italic> and <italic>other</italic> responses (which typically did not use a pronoun) were both common in the feminine half of the scale.</p>
<p>Another factor in the choice of referring language [D] is the speaker&#8217;s inference about that specific referent&#8217;s gender [C]. A disconnect between the two is another potential source of bias [purple]: Recall that while around half of participants believed Hillary Clinton would win the 2016 US election, only 10% of responses used <italic>she</italic> to refer to the next president (<xref ref-type="bibr" rid="B58">von der Malsburg et al., 2020</xref>). When asked for both explicit and implicit measures about a character&#8217;s gender, participants described generic referents as female at higher rates than they chose feminine names to refer to them (<xref ref-type="bibr" rid="B26">Hamilton, 1988</xref>).</p>
<p>These findings suggest that inferences about referent gender and choices about gendered language are distinct processes. Mappings between gender inferences and language choice also vary by dialect, such as uses of generic <italic>he</italic> and singular <italic>they</italic>. Moreover, these mappings are not always symmetric. This was clearest in Experiment 1, where participants still used <italic>he</italic> when the character had a feminine-leaning androgynous name, but were equally likely to use <italic>she</italic> or no pronouns for characters with masculine-leaning androgynous names. This suggests that a speaker&#8217;s inference about a referent&#8217;s gender may need to be more certain to prompt the use of <italic>she</italic> than to prompt the use of <italic>he</italic>.</p>
<p>It is important to note that the discrete choices involved in gendered language production do not preclude the underlying inference about a referent&#8217;s gender being probabilistic. When participants in Experiments 1 &amp; 3 used <italic>he</italic> or <italic>she</italic>, this did not necessarily reflect certainty about a gender inference. This response pattern may be more common for speakers of dialects where forms that directly encode uncertainty, such as using singular <italic>they</italic> for a referent with an unknown or unspecified gender, are not available. Future work could explore how the same language produced may reflect underlying levels of confidence. The experiments here cannot distinguish between the contributions of distributional knowledge [blue] and inferences about a specific referent&#8217;s gender [purple], only conclude that the resulting choices show a bias against using feminine language forms.</p>
<p>Comprehenders use speakers&#8217; gendered language choices [D], as well as their knowledge of gender distributions [A], to form their own inference about a new referent&#8217;s gender [E]. One possibility is that comprehenders know that speaker pronoun choice is biased towards <italic>he</italic> and correct for this bias [weighting yellow over orange]. Although the experiments here do not address this question, Boyce et al. (<xref ref-type="bibr" rid="B8">2019</xref>) suggest that this is not the case. Participants read stories that included two repetitions of a role noun and one gendered pronoun. When asked to recall the character&#8217;s gender, participants did not correct for masculine bias in pronoun use, and instead continued to recall the referents as feminine at lower rates than the normed gender distribution of the role nouns.</p>
<p>Another aspect at play here is comprehenders&#8217; knowledge of who and what speakers discuss. While the present experiments have focused on the probabilistic information about gender carried by names, the contexts in which a person is mentioned can also provide gender cues. This is particularly relevant in Experiments 3 &amp; 4, where the stimuli were news stories highlighting a person&#8217;s accomplishment, instead of sentences describing a person performing everyday activities. While a comprehender may know that people named Jordan are about equally likely to be male or female, they may also know that a person mentioned for their recent career accomplishment is more likely to be male. Several of the stories described more stereotypically-feminine accomplishments (i.e., charity work), but most were stereotypically more masculine (i.e., politics, sports). The gender stereotypes of the stories were counter-balanced within the experiment by having each story paired with masculine, feminine, and androgynous first names across lists. However, the present experiments did not attempt to measure or experimentally manipulate the fact that people may be making additional inferences about gender based on the fact that the character accomplished something and that their accomplishment was judged as newsworthy. The results here show a bias to infer people are male and to use masculine language forms when making inferences about a character in a brief story, leaving open questions about how these biases interact with speakers&#8217; original choices of who to discuss.</p>
<p>Finally, it is likely that inferences about gender in comprehension [E] influence underlying beliefs about gender distributions [A]. As such, speaker&#8217;s choices about how to refer to entities in the world arguably drive patterns in language comprehension (<xref ref-type="bibr" rid="B33">MacDonald, 2013</xref>). Thus, if speakers consistently underuse feminine forms of reference and comprehenders do not correct for this bias, beliefs about gender distributions in general and in specific contexts may then become biased to underestimate women [green].</p>
</sec>
<sec>
<title>6.3 Implications for talk about women</title>
<p>These results have potential implications for how we talk about women. When we refer to people, we choose between different combinations of forms, including pronouns, first names, last names, gendered titles (<italic>Mr</italic>., <italic>Mrs</italic>., <italic>Ms</italic>.), and nominally ungendered titles (<italic>Doctor, Professor</italic>). If certain forms of reference make feminine referents less likely to be inferred as feminine, should this influence which forms we choose? On the one hand, prior work suggests that people who are referred to with masculine-coded terms are judged more competent and successful (<xref ref-type="bibr" rid="B5">Atir &amp; Ferguson, 2018</xref>; <xref ref-type="bibr" rid="B45">Rubin, 1981</xref>; <xref ref-type="bibr" rid="B50">Stewart et al., 2003</xref>; <xref ref-type="bibr" rid="B52">Takiff et al., 2001</xref>). Given these observations, a strategic speaker or writer could refer to a woman using masculine-coded forms to encourage a more masculine interpretation of the referent. This could mean reaping potential advantages (e.g., in perceived &#8220;eminence&#8221;), but potentially at the cost of having someone&#8217;s femininity be diminished or unacknowledged, and of perpetuating language production patterns that in turn may shape biases in comprehension. Alternatively, it may be preferable to work to change the underlying tendency to underestimate the presence of women, by choosing language forms that make it more difficult to assume a person is a man by default, especially in contexts where women are less visible. An open question, then, is if using gendered language to refer to women in contexts where their presence is systematically underestimated (e.g., doctors) would change perceptions about the contributions of women to that sphere of life.</p>
</sec>
<sec>
<title>6.4 Conclusion</title>
<p>Across a series of four experiments, we asked if alternative forms of reference to individuals guide inferences about the gender of a referent introduced in a sentence or brief story. Using measures of personal pronoun choice (Experiments 1 &amp; 3) and explicit queries about gender (Experiments 2 &amp; 4), a persistent bias to assume that the referent was male was observed across all four studies. This bias was strongest when the character was introduced by last name alone, but persisted even when the character was referred to by their first name. We argue that the observed male bias in gender inference likely results from multiple processes, including biases in knowledge of gender distributions, inferences about a referent&#8217;s gender, and pronoun choice.</p>
</sec>
</sec>
</body>
<back>
<fn-group>
<fn id="n1"><p><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://github.com/bethanyhgardner/gender-bias-names/blob/main/supplement.pdf">https://github.com/bethanyhgardner/gender-bias-names/blob/main/supplement.pdf</ext-link>.</p></fn>
<fn id="n2"><p>We use assigned male at birth (AMAB) and assigned female at birth (AFAB) to indicate that these datasets only have information about what sex children were assigned at birth, not their gender identities later. For more information about current best practices for talking about gender, see GLAAD (<xref ref-type="bibr" rid="B24">2020</xref>).</p></fn>
<fn id="n3"><p>A reviewer pointed out that the mean for the 21 first names (<italic>M</italic> = 4.21) is higher than the center of the scale (= 4). An alternative way of conducting this analysis would be to center Gender Rating at 4, the mean of the response scale (dashed line in <xref ref-type="fig" rid="F1">Figure 1</xref>), instead of at the item mean (0 in in <xref ref-type="fig" rid="F1">Figure 1</xref>). This alternative analysis yielded the same pattern of results in this and subsequent experiments, though the absolute value of the intercept was consistently larger (see details in Supplement &#167;2.2).</p></fn>
</fn-group>
<sec>
<title>Data accessibility statement</title>
<p>The study preregistrations, materials, de-identified data, analysis code, and supplementary analyses all 4 experiments are available on OSF (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.17605/OSF.IO/AYPU2">10.17605/OSF.IO/AYPU2</ext-link>) and this project&#8217;s Github repository (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.5281/zenodo.10293754">10.5281/zenodo.10293754</ext-link>).</p>
</sec>
<sec>
<title>Ethics and consent</title>
<p>The protocol for this study was reviewed by the Vanderbilt University IRB (160023), and all participants gave informed consent.</p>
</sec>
<ack>
<title>Acknowledgements</title>
<p>This work was supported in part by National Science Foundation 1556700 and 1921492 to Sarah Brown-Schmidt.</p>
</ack>
<sec>
<title>Competing interests</title>
<p>The authors have no competing interests to declare.</p>
</sec>
<sec>
<title>Authors&#8217; contributions</title>
<p><italic>Conceptualization</italic>: Bethany Gardner, Sarah Brown-Schmidt</p>
<p><italic>Data curation</italic>: Bethany Gardner</p>
<p><italic>Formal analysis</italic>: Bethany Gardner, Sarah Brown-Schmidt</p>
<p><italic>Funding acquisition</italic>: Sarah Brown-Schmidt</p>
<p><italic>Investigation</italic>: Bethany Gardner</p>
<p><italic>Methodology</italic>: Bethany Gardner, Sarah Brown-Schmidt</p>
<p><italic>Software</italic>: Bethany Gardner</p>
<p><italic>Supervision</italic>: Sarah Brown-Schmidt</p>
<p><italic>Visualization</italic>: Bethany Gardner</p>
<p><italic>Writing (initial draft)</italic>: Bethany Gardner</p>
<p><italic>Writing (review &amp; editing)</italic>: Sarah Brown-Schmidt</p>
</sec>
<ref-list>
<ref id="B1"><label>1</label><mixed-citation publication-type="journal"><string-name><surname>Ackerman</surname>, <given-names>L.</given-names></string-name> (<year>2019</year>). <article-title>Syntactic and cognitive issues in investigating gendered coreference</article-title>. <source>Glossa: A Journal of General Linguistics</source>, <volume>4</volume>(<issue>1</issue>), <fpage>117</fpage>. DOI: <pub-id pub-id-type="doi">10.5334/gjgl.721</pub-id></mixed-citation></ref>
<ref id="B2"><label>2</label><mixed-citation publication-type="webpage"><collab>American Psychological Association [APA]</collab>. (<year>2019</year>). <source>Singular</source> they. APA Style. <uri>https://web.archive.org/web/20211124212947/https://apastyle.apa.org/style-grammar-guidelines/grammar/singular-they</uri></mixed-citation></ref>
<ref id="B3"><label>3</label><mixed-citation publication-type="journal"><collab>APA Publication Manual Task Force</collab>. (<year>1997</year>). <article-title>Guidelines for nonsexist language in APA journals</article-title>. <source>American Psychologist</source>, <volume>32</volume>(<issue>6</issue>), <fpage>487</fpage>&#8211;<lpage>494</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0003-066X.32.6.487</pub-id></mixed-citation></ref>
<ref id="B4"><label>4</label><mixed-citation publication-type="journal"><string-name><surname>Arnold</surname>, <given-names>J. E.</given-names></string-name>, &amp; <string-name><surname>Zerkle</surname>, <given-names>S. A.</given-names></string-name> (<year>2019</year>). <article-title>Why do people produce pronouns? Pragmatic selection vs. Rational models</article-title>. <source>Language, Cognition and Neuroscience</source>, <volume>34</volume>(<issue>9</issue>), <fpage>1152</fpage>&#8211;<lpage>1175</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/23273798.2019.1636103</pub-id></mixed-citation></ref>
<ref id="B5"><label>5</label><mixed-citation publication-type="journal"><string-name><surname>Atir</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Ferguson</surname>, <given-names>M. J.</given-names></string-name> (<year>2018</year>). <article-title>How gender determines the way we speak about professionals</article-title>. <source>Proceedings of the National Academy of Sciences</source>, <volume>115</volume>(<issue>28</issue>), <fpage>7278</fpage>&#8211;<lpage>7283</lpage>. DOI: <pub-id pub-id-type="doi">10.1073/pnas.1805284115</pub-id></mixed-citation></ref>
<ref id="B6"><label>6</label><mixed-citation publication-type="journal"><string-name><surname>Bates</surname>, <given-names>D. M.</given-names></string-name>, <string-name><surname>M&#228;chler</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Bolker</surname>, <given-names>B. M.</given-names></string-name>, &amp; <string-name><surname>Walker</surname>, <given-names>S. C.</given-names></string-name> (<year>2015</year>). <article-title>Fitting linear mixed-effects models using lme4</article-title>. <source>Journal of Statistical Software</source>, <volume>67</volume>(<issue>1</issue>), <fpage>1</fpage>&#8211;<lpage>48</lpage>. DOI: <pub-id pub-id-type="doi">10.18637/jss.v067.i01</pub-id></mixed-citation></ref>
<ref id="B7"><label>7</label><mixed-citation publication-type="journal"><string-name><surname>Bodine</surname>, <given-names>A.</given-names></string-name> (<year>1975</year>). <article-title>Androcentrism in prescriptive grammar: Singular <italic>they</italic>, sex-indefinite <italic>he</italic>, and <italic>he or she</italic></article-title>. <source>Language in Society</source>, <volume>4</volume>(<issue>2</issue>), <fpage>129</fpage>&#8211;<lpage>146</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S0047404500004607</pub-id></mixed-citation></ref>
<ref id="B8"><label>8</label><mixed-citation publication-type="confproc"><string-name><surname>Boyce</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>von der Malsburg</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Poppels</surname>, <given-names>T.</given-names></string-name>, &amp; <string-name><surname>Levy</surname>, <given-names>R.</given-names></string-name> (<year>2019</year>). <source>Remember him, forget her: Gender bias in the comprehension of pronominal referents</source> [Conference Talk]. <conf-name>32nd Annual CUNY Conference on Human Sentence Processing</conf-name>. <uri>https://osf.io/c8b3f/</uri></mixed-citation></ref>
<ref id="B9"><label>9</label><mixed-citation publication-type="journal"><string-name><surname>Cameron</surname>, <given-names>J. J.</given-names></string-name>, &amp; <string-name><surname>Stinson</surname>, <given-names>D. A.</given-names></string-name> (<year>2019</year>). <article-title>Gender (mis)measurement: Guidelines for respecting gender diversity in psychological research</article-title>. <source>Social and Personality Psychology Compass</source>, <volume>13</volume>(<issue>11</issue>), <elocation-id>e12506</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1111/spc3.12506</pub-id></mixed-citation></ref>
<ref id="B10"><label>10</label><mixed-citation publication-type="journal"><string-name><surname>Carreiras</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Garnham</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Oakhill</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Cain</surname>, <given-names>K.</given-names></string-name> (<year>1996</year>). <article-title>The use of stereotypical gender information in constructing a mental model: Evidence from English and Spanish</article-title>. <source>The Quarterly Journal of Experimental Psychology</source>, <volume>49A</volume>(<issue>3</issue>), <fpage>639</fpage>&#8211;<lpage>664</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/713755647</pub-id></mixed-citation></ref>
<ref id="B11"><label>11</label><mixed-citation publication-type="journal"><string-name><surname>Conrod</surname>, <given-names>K.</given-names></string-name> (<year>2020</year>). <article-title>Pronouns and gender in language</article-title>. In <string-name><given-names>K.</given-names> <surname>Hall</surname></string-name> &amp; <string-name><given-names>R.</given-names> <surname>Barrett</surname></string-name> (Eds.), <source>Oxford handbook of language and sexuality</source>. DOI: <pub-id pub-id-type="doi">10.1093/oxfordhb/9780190212926.013.63</pub-id></mixed-citation></ref>
<ref id="B12"><label>12</label><mixed-citation publication-type="journal"><string-name><surname>Davis Merritt</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Kok</surname>, <given-names>C. J.</given-names></string-name> (<year>1995</year>). <article-title>Attribution of gender to a gender-unspecified individual: An evaluation of the people = male hypothesis</article-title>. <source>Sex Roles</source>, <volume>33</volume>(<issue>3&#8211;4</issue>), <fpage>145</fpage>&#8211;<lpage>157</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/BF01544608</pub-id></mixed-citation></ref>
<ref id="B13"><label>13</label><mixed-citation publication-type="journal"><string-name><surname>Davis Merritt</surname>, <given-names>R.</given-names></string-name>, &amp; <string-name><surname>Wells Harrison</surname>, <given-names>T.</given-names></string-name> (<year>2006</year>). <article-title>Gender and ethnicity attributions to a gender-and ethnicity-unspecified individual: Is there a people = white male bias?</article-title> <source>Sex Roles</source>, <volume>54</volume>, <fpage>787</fpage>&#8211;<lpage>797</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/s11199-006-9046-7</pub-id></mixed-citation></ref>
<ref id="B14"><label>14</label><mixed-citation publication-type="journal"><string-name><surname>Dell</surname>, <given-names>G. S.</given-names></string-name> (<year>1986</year>). <article-title>A spreading-activation theory of retrieval in sentence production</article-title>. <source>Psychological Review</source>, <volume>93</volume>(<issue>3</issue>), <fpage>283</fpage>&#8211;<lpage>321</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0033-295X.93.3.283</pub-id></mixed-citation></ref>
<ref id="B15"><label>15</label><mixed-citation publication-type="journal"><string-name><surname>Dell</surname>, <given-names>G. S.</given-names></string-name> (<year>1988</year>). <article-title>The retrieval of phonological forms in production: Tests of predictions from a connectionist model</article-title>. <source>Journal of Memory and Language</source>, <volume>27</volume>(<issue>2</issue>), <fpage>124</fpage>&#8211;<lpage>142</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/0749-596X(88)90070-8</pub-id></mixed-citation></ref>
<ref id="B16"><label>16</label><mixed-citation publication-type="journal"><string-name><surname>Dell</surname>, <given-names>G. S.</given-names></string-name> (<year>1999</year>). <article-title>Connectionist models of language production: Lexical access and grammatical encoding</article-title>. <source>Cognitive Science</source>, <volume>23</volume>(<issue>4</issue>), <fpage>517</fpage>&#8211;<lpage>542</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0364-0213(99)00014-2</pub-id></mixed-citation></ref>
<ref id="B17"><label>17</label><mixed-citation publication-type="journal"><string-name><surname>Dell</surname>, <given-names>G. S.</given-names></string-name>, &amp; <string-name><surname>O&#8217;Seaghdha</surname>, <given-names>P. G.</given-names></string-name> (<year>1992</year>). <article-title>Stages of lexical access in language production</article-title>. <source>Cognition</source>, <volume>42</volume>(<issue>1&#8211;3</issue>), <fpage>287</fpage>&#8211;<lpage>314</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/0010-0277(92)90046-K</pub-id></mixed-citation></ref>
<ref id="B18"><label>18</label><mixed-citation publication-type="journal"><string-name><surname>Duffy</surname>, <given-names>S. A.</given-names></string-name>, &amp; <string-name><surname>Keir</surname>, <given-names>J. A.</given-names></string-name> (<year>2004</year>). <article-title>Violating stereotypes: Eye movements and comprehension processes when text conflicts with world knowledge</article-title>. <source>Memory &amp; Cognition</source>, <volume>32</volume>(<issue>4</issue>), <fpage>551</fpage>&#8211;<lpage>559</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/BF03195846</pub-id></mixed-citation></ref>
<ref id="B19"><label>19</label><mixed-citation publication-type="journal"><string-name><surname>Files</surname>, <given-names>J. A.</given-names></string-name>, <string-name><surname>Mayer</surname>, <given-names>A. P.</given-names></string-name>, <string-name><surname>Ko</surname>, <given-names>M. G.</given-names></string-name>, <string-name><surname>Friedrich</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Jenkins</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Bryan</surname>, <given-names>M. J.</given-names></string-name>, <string-name><surname>Vegunta</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Wittich</surname>, <given-names>C. M.</given-names></string-name>, <string-name><surname>Lyle</surname>, <given-names>M. A.</given-names></string-name>, <string-name><surname>Melikian</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Duston</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Chang</surname>, <given-names>Y.-H. H.</given-names></string-name>, &amp; <string-name><surname>Hayes</surname>, <given-names>S. N.</given-names></string-name> (<year>2017</year>). <article-title>Speaker introductions at internal medicine grand rounds: Forms of address reveal gender bias</article-title>. <source>Journal of Women&#8217;s Health</source>, <volume>26</volume>(<issue>5</issue>). DOI: <pub-id pub-id-type="doi">10.1089/jwh.2016.6044</pub-id></mixed-citation></ref>
<ref id="B20"><label>20</label><mixed-citation publication-type="webpage"><string-name><surname>Flowers</surname>, <given-names>A.</given-names></string-name> (<year>2015</year>). <source>Unisex names data</source> [Data Set]. FiveThirtyEight. <uri>https://github.com/fivethirtyeight/data/tree/master/unisex-names</uri></mixed-citation></ref>
<ref id="B21"><label>21</label><mixed-citation publication-type="journal"><string-name><surname>Garnham</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Doehren</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Gygax</surname>, <given-names>P.</given-names></string-name> (<year>2015</year>). <article-title>True gender ratios and stereotype rating norms</article-title>. <source>Frontiers in Psychology</source>, <volume>6</volume>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2015.01023</pub-id></mixed-citation></ref>
<ref id="B22"><label>22</label><mixed-citation publication-type="journal"><string-name><surname>Garnham</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Oakhill</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Reynolds</surname>, <given-names>D.</given-names></string-name> (<year>2002</year>). <article-title>Are inferences from stereotyped role names to characters&#8217; gender made elaboratively?</article-title> <source>Memory &amp; Cognition</source>, <volume>30</volume>(<issue>3</issue>), <fpage>439</fpage>&#8211;<lpage>446</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/BF03194944</pub-id></mixed-citation></ref>
<ref id="B23"><label>23</label><mixed-citation publication-type="journal"><string-name><surname>Gastil</surname>, <given-names>J.</given-names></string-name> (<year>1990</year>). <article-title>Generic pronouns and sexist language: The oxymoronic character of masculine generics</article-title>. <source>Sex Roles</source>, <volume>23</volume>(<issue>11&#8211;12</issue>), <fpage>629</fpage>&#8211;<lpage>643</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/BF00289252</pub-id></mixed-citation></ref>
<ref id="B24"><label>24</label><mixed-citation publication-type="webpage"><collab>GLAAD</collab>. (<year>2020</year>). <source>GLAAD Media reference guide &#8211; transgender</source>. <publisher-name>GLAAD</publisher-name>. <uri>https://web.archive.org/web/20200522040917/https://www.glaad.org/reference/transgender</uri></mixed-citation></ref>
<ref id="B25"><label>25</label><mixed-citation publication-type="journal"><string-name><surname>Gundel</surname>, <given-names>J. K.</given-names></string-name>, <string-name><surname>Hedberg</surname>, <given-names>N.</given-names></string-name>, &amp; <string-name><surname>Zacharski</surname>, <given-names>R.</given-names></string-name> (<year>2012</year>). <article-title>Underspecification of cognitive status in reference production: Some empirical predictions</article-title>. <source>Topics in Cognitive Science</source>, <volume>4</volume>(<issue>2</issue>), <fpage>249</fpage>&#8211;<lpage>268</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/j.1756-8765.2012.01184.x</pub-id></mixed-citation></ref>
<ref id="B26"><label>26</label><mixed-citation publication-type="journal"><string-name><surname>Hamilton</surname>, <given-names>M. C.</given-names></string-name> (<year>1988</year>). <article-title>Using masculine generics: Does generic he increase male bias in the user&#8217;s imagery?</article-title> <source>Sex Roles</source>, <volume>19</volume>(<issue>11&#8211;12</issue>), <fpage>785</fpage>&#8211;<lpage>799</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/BF00288993</pub-id></mixed-citation></ref>
<ref id="B27"><label>27</label><mixed-citation publication-type="journal"><string-name><surname>Hamilton</surname>, <given-names>M. C.</given-names></string-name> (<year>1991</year>). <article-title>Masculine bias in the attribution of personhood: People = male, male = people</article-title>. <source>Psychology of Women Quarterly</source>, <volume>15</volume>(<issue>3</issue>), <fpage>393</fpage>&#8211;<lpage>402</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/j.1471-6402.1991.tb00415.x</pub-id></mixed-citation></ref>
<ref id="B28"><label>28</label><mixed-citation publication-type="journal"><string-name><surname>Jescheniak</surname>, <given-names>J. D.</given-names></string-name>, &amp; <string-name><surname>Levelt</surname>, <given-names>W. J. M.</given-names></string-name> (<year>1994</year>). <article-title>Word frequency effects in speech production: Retrieval of syntactic information and of phonological form</article-title>. <source>Journal of Experimental Psychology: Learning, Memory and Cognition</source>, <volume>20</volume>(<issue>4</issue>), <fpage>824</fpage>&#8211;<lpage>843</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0278-7393.20.4.824</pub-id></mixed-citation></ref>
<ref id="B29"><label>29</label><mixed-citation publication-type="journal"><string-name><surname>Jescheniak</surname>, <given-names>J. D.</given-names></string-name>, <string-name><surname>Schriefers</surname>, <given-names>H.</given-names></string-name>, &amp; <string-name><surname>Lemh&#246;fer</surname>, <given-names>K.</given-names></string-name> (<year>2014</year>). <article-title>Selection of freestanding and bound gender-marking morphemes in speech production: A review</article-title>. <source>Language, Cognition and Neuroscience</source>, <volume>29</volume>(<issue>6</issue>), <fpage>684</fpage>&#8211;<lpage>694</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/01690965.2012.654645</pub-id></mixed-citation></ref>
<ref id="B30"><label>30</label><mixed-citation publication-type="journal"><string-name><surname>Kennison</surname>, <given-names>S. M.</given-names></string-name>, &amp; <string-name><surname>Trofe</surname>, <given-names>J. L.</given-names></string-name> (<year>2003</year>). <article-title>Comprehending pronouns: A role for word-specific gender stereotype information</article-title>. <source>Journal of Psycholinguistic Research</source>, <volume>23</volume>(<issue>3</issue>), <fpage>355</fpage>&#8211;<lpage>378</lpage>. DOI: <pub-id pub-id-type="doi">10.1023/A:1023599719948</pub-id></mixed-citation></ref>
<ref id="B31"><label>31</label><mixed-citation publication-type="journal"><string-name><surname>Levelt</surname>, <given-names>W. J. M.</given-names></string-name>, <string-name><surname>Roelofs</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Meyer</surname>, <given-names>A. S.</given-names></string-name> (<year>1999</year>). <article-title>A theory of lexical access in speech production</article-title>. <source>Behavioral and Brain Sciences</source>, <volume>22</volume>, <fpage>1</fpage>&#8211;<lpage>75</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S0140525X99001776</pub-id></mixed-citation></ref>
<ref id="B32"><label>32</label><mixed-citation publication-type="journal"><string-name><surname>Lieberson</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Dumais</surname>, <given-names>S.</given-names></string-name>, &amp; <string-name><surname>Baumann</surname>, <given-names>S.</given-names></string-name> (<year>2000</year>). <article-title>The instability of androgynous names: The symbolic maintenance of gender boundaries</article-title>. <source>American Journal of Sociology</source>, <volume>105</volume>(<issue>5</issue>), <fpage>1249</fpage>&#8211;<lpage>1287</lpage>. DOI: <pub-id pub-id-type="doi">10.1086/210431</pub-id></mixed-citation></ref>
<ref id="B33"><label>33</label><mixed-citation publication-type="journal"><string-name><surname>MacDonald</surname>, <given-names>M. C.</given-names></string-name> (<year>2013</year>). <article-title>How language production shapes language form and comprehension</article-title>. <source>Frontiers in Psychology</source>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2013.00226</pub-id></mixed-citation></ref>
<ref id="B34"><label>34</label><mixed-citation publication-type="journal"><string-name><surname>McConnell-Ginet</surname>, <given-names>S.</given-names></string-name> (<year>2014</year>). <article-title>Gender and its relation to sex: The myth of &#8220;natural&#8221; gender</article-title>. In <string-name><given-names>G. G.</given-names> <surname>Corbett</surname></string-name> (Ed.), <source>The expression of gender</source> (pp. <fpage>3</fpage>&#8211;<lpage>38</lpage>). DOI: <pub-id pub-id-type="doi">10.1515/9783110307337.3</pub-id></mixed-citation></ref>
<ref id="B35"><label>35</label><mixed-citation publication-type="journal"><string-name><surname>Misersky</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Gygax</surname>, <given-names>P. M.</given-names></string-name>, <string-name><surname>Canal</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Gabriel</surname>, <given-names>U.</given-names></string-name>, <string-name><surname>Garnham</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Braun</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Chiarini</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Englund</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Hanulikova</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>&#214;ttl</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Valdrova</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Von Stockhausen</surname>, <given-names>L.</given-names></string-name>, &amp; <string-name><surname>Sczesny</surname>, <given-names>S.</given-names></string-name> (<year>2014</year>). <article-title>Norms on the gender perception of role nouns in Czech, English, French, German, Italian, Norwegian, and Slovak</article-title>. <source>Behavior Research Methods</source>, <volume>46</volume>(<issue>3</issue>), <fpage>841</fpage>&#8211;<lpage>871</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/s13428-013-0409-z</pub-id></mixed-citation></ref>
<ref id="B36"><label>36</label><mixed-citation publication-type="journal"><string-name><surname>Moulton</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Robinson</surname>, <given-names>G. M.</given-names></string-name>, &amp; <string-name><surname>Elias</surname>, <given-names>C.</given-names></string-name> (<year>1978</year>). <article-title>Sex bias in language use: &#8220;Neutral&#8221; pronouns that aren&#8217;t</article-title>. <source>American Psychologist</source>, <fpage>1032</fpage>&#8211;<lpage>1036</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0003-066X.33.11.1032</pub-id></mixed-citation></ref>
<ref id="B37"><label>37</label><mixed-citation publication-type="book"><collab>National Academies of Sciences, Engineering, and Medicine [NASEM]</collab>. (<year>2022</year>). <source>Measuring sex, gender identity, and sexual orientation</source>. <publisher-name>National Academies Press</publisher-name>. DOI: <pub-id pub-id-type="doi">10.17226/26424</pub-id></mixed-citation></ref>
<ref id="B38"><label>38</label><mixed-citation publication-type="journal"><string-name><surname>Oakhill</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Garnham</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Reynolds</surname>, <given-names>D.</given-names></string-name> (<year>2005</year>). <article-title>Immediate activation of stereotypical gender information</article-title>. <source>Memory &amp; Cognition</source>, <volume>33</volume>(<issue>6</issue>), <fpage>972</fpage>&#8211;<lpage>983</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/BF03193206</pub-id></mixed-citation></ref>
<ref id="B39"><label>39</label><mixed-citation publication-type="journal"><string-name><surname>Osterhout</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Bersick</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Mclaughlin</surname>, <given-names>J.</given-names></string-name> (<year>1997</year>). <article-title>Brain potentials reflect violations of gender stereotypes</article-title>. <source>Memory &amp; Cognition</source>, <volume>25</volume>(<issue>3</issue>), <fpage>273</fpage>&#8211;<lpage>285</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/BF03211283</pub-id></mixed-citation></ref>
<ref id="B40"><label>40</label><mixed-citation publication-type="journal"><string-name><surname>Pyykk&#246;nen</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Hy&#246;n&#228;</surname>, <given-names>J.</given-names></string-name>, &amp; <string-name><surname>Van Gompel</surname>, <given-names>R. P. G.</given-names></string-name> (<year>2010</year>). <article-title>Activating gender stereotypes during online spoken language processing: Evidence from visual world eye tracking</article-title>. <source>Experimental Psychology</source>, <volume>57</volume>(<issue>2</issue>), <fpage>126</fpage>&#8211;<lpage>133</lpage>. DOI: <pub-id pub-id-type="doi">10.1027/1618-3169/a000016</pub-id></mixed-citation></ref>
<ref id="B41"><label>41</label><mixed-citation publication-type="journal"><string-name><surname>Reynolds</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Garnham</surname>, <given-names>A.</given-names></string-name>, &amp; <string-name><surname>Oakhill</surname>, <given-names>J.</given-names></string-name> (<year>2006</year>). <article-title>Evidence of immediate activation of gender information from a social role name</article-title>. <source>The Quarterly Journal of Experimental Psychology</source>, <volume>59</volume>(<issue>5</issue>), <fpage>886</fpage>&#8211;<lpage>903</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/02724980543000088</pub-id></mixed-citation></ref>
<ref id="B42"><label>42</label><mixed-citation publication-type="confproc"><string-name><surname>Robertson</surname>, <given-names>M.</given-names></string-name> (<year>2021</year>). <source>Breaking, bending, and stretching the rules of singular</source> they [Conference talk]. <conf-name>27th Annual Lavender Languages and Linguistics Conference</conf-name>.</mixed-citation></ref>
<ref id="B43"><label>43</label><mixed-citation publication-type="journal"><string-name><surname>Rodhe</surname>, <given-names>H.</given-names></string-name>, &amp; <string-name><surname>Kehler</surname>, <given-names>A.</given-names></string-name> (<year>2014</year>). <article-title>Grammatical and information-structural influences on pronoun production</article-title>. <source>Language, Cognition and Neuroscience</source>, <volume>29</volume>(<issue>8</issue>), <fpage>912</fpage>&#8211;<lpage>927</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/01690965.2013.854918</pub-id></mixed-citation></ref>
<ref id="B44"><label>44</label><mixed-citation publication-type="journal"><string-name><surname>Roelofs</surname>, <given-names>A.</given-names></string-name> (<year>1992</year>). <article-title>A spreading-activation theory of lemma retrieval in speaking</article-title>. <source>Cognition</source>, <volume>42</volume>, <fpage>107</fpage>&#8211;<lpage>142</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/0010-0277(92)90041-F</pub-id></mixed-citation></ref>
<ref id="B45"><label>45</label><mixed-citation publication-type="journal"><string-name><surname>Rubin</surname>, <given-names>R. B.</given-names></string-name> (<year>1981</year>). <article-title>Ideal traits and terms of address for male and female college professors</article-title>. <source>Journal of Personality and Social Psychology</source>, <volume>41</volume>(<issue>5</issue>), <fpage>966</fpage>&#8211;<lpage>974</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0022-3514.41.5.966</pub-id></mixed-citation></ref>
<ref id="B46"><label>46</label><mixed-citation publication-type="journal"><string-name><surname>Schiller</surname>, <given-names>N. O.</given-names></string-name>, &amp; <string-name><surname>Caramazza</surname>, <given-names>A.</given-names></string-name> (<year>2003</year>). <article-title>Grammatical feature selection in noun phrase production: Evidence from German and Dutch</article-title>. <source>Journal of Memory and Language</source>, <volume>48</volume>(<issue>1</issue>), <fpage>169</fpage>&#8211;<lpage>194</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0749-596X(02)00508-9</pub-id></mixed-citation></ref>
<ref id="B47"><label>47</label><mixed-citation publication-type="journal"><string-name><surname>Schmitt</surname>, <given-names>B. M.</given-names></string-name>, <string-name><surname>Meyer</surname>, <given-names>A. S.</given-names></string-name>, &amp; <string-name><surname>Levelt</surname>, <given-names>W. J. M.</given-names></string-name> (<year>1999</year>). <article-title>Lexical access in the production of pronouns</article-title>. <source>Cognition</source>, <volume>69</volume>(<issue>3</issue>), <fpage>313</fpage>&#8211;<lpage>335</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0010-0277(98)00073-0</pub-id></mixed-citation></ref>
<ref id="B48"><label>48</label><mixed-citation publication-type="journal"><string-name><surname>Schriefers</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Jescheniak</surname>, <given-names>J. D.</given-names></string-name>, &amp; <string-name><surname>Hantsch</surname>, <given-names>A.</given-names></string-name> (<year>2002</year>). <article-title>Determiner selection in noun phrase production</article-title>. <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>, <volume>28</volume>(<issue>5</issue>), <fpage>941</fpage>&#8211;<lpage>950</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0278-7393.28.5.941</pub-id></mixed-citation></ref>
<ref id="B49"><label>49</label><mixed-citation publication-type="journal"><string-name><surname>Silveira</surname>, <given-names>J.</given-names></string-name> (<year>1980</year>). <article-title>Generic masculine words and thinking</article-title>. <source>Women&#8217;s Studies International Quarterly</source>, <volume>3</volume>(<issue>2&#8211;3</issue>), <fpage>165</fpage>&#8211;<lpage>178</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0148-0685(80)92113-2</pub-id></mixed-citation></ref>
<ref id="B50"><label>50</label><mixed-citation publication-type="journal"><string-name><surname>Stewart</surname>, <given-names>T. L.</given-names></string-name>, <string-name><surname>Berkvens</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Engels</surname>, <given-names>W. A. E. W.</given-names></string-name>, &amp; <string-name><surname>Pass</surname>, <given-names>J. A.</given-names></string-name> (<year>2003</year>). <article-title>Status and likability: Can the &#8220;mindful&#8221; woman have it all?</article-title> <source>Journal of Applied Social Psychology</source>, <volume>33</volume>(<issue>10</issue>), <fpage>2040</fpage>&#8211;<lpage>2059</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/j.1559-1816.2003.tb01874.x</pub-id></mixed-citation></ref>
<ref id="B51"><label>51</label><mixed-citation publication-type="journal"><string-name><surname>Sturt</surname>, <given-names>P.</given-names></string-name> (<year>2003</year>). <article-title>The time-course of the application of binding constraints in reference resolution</article-title>. <source>Journal of Memory and Language</source>, <volume>48</volume>(<issue>3</issue>), <fpage>542</fpage>&#8211;<lpage>562</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0749-596X(02)00536-3</pub-id></mixed-citation></ref>
<ref id="B52"><label>52</label><mixed-citation publication-type="journal"><string-name><surname>Takiff</surname>, <given-names>H. A.</given-names></string-name>, <string-name><surname>Sanchez</surname>, <given-names>D. T.</given-names></string-name>, &amp; <string-name><surname>Stewart</surname>, <given-names>T. L.</given-names></string-name> (<year>2001</year>). <article-title>What&#8217;s in a name? The status implications of students&#8217; terms of address for male and female professors</article-title>. <source>Psychology of Women Quarterly</source>, <volume>25</volume>, <fpage>134</fpage>&#8211;<lpage>144</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/1471-6402.00015</pub-id></mixed-citation></ref>
<ref id="B53"><label>53</label><mixed-citation publication-type="webpage"><collab>United States Social Security Administration [USSSA]</collab>. (<year>2019</year>). <source>Top names over the last 100 years</source> [Data Set]. <publisher-name>United States Social Security Administration</publisher-name>. <uri>https://www.ssa.gov/oact/babynames/decades/century.html</uri></mixed-citation></ref>
<ref id="B54"><label>54</label><mixed-citation publication-type="webpage"><collab>United States Social Security Administration [USSSA]</collab>. (<year>2020</year>). <source>Beyond the top 1000 names</source> [Data Set]. <publisher-name>United States Social Security Administration</publisher-name>. <uri>https://www.ssa.gov/oact/babynames/limits.html</uri></mixed-citation></ref>
<ref id="B55"><label>55</label><mixed-citation publication-type="webpage"><collab>US Census Bureau</collab>. (<year>2016</year>). <source>Frequently occurring surnames from the 2010 census</source> [Data Set]. <publisher-name>US Census Bureau</publisher-name>. <uri>https://www.census.gov/topics/population/genealogy/data/2010_surnames.html</uri></mixed-citation></ref>
<ref id="B56"><label>56</label><mixed-citation publication-type="journal"><string-name><surname>Uscinski</surname>, <given-names>J. E.</given-names></string-name>, &amp; <string-name><surname>Goren</surname>, <given-names>L. J.</given-names></string-name> (<year>2011</year>). <article-title>What&#8217;s in a name? Coverage of Senator Hillary Clinton during the 2008 Democratic primary</article-title>. <source>Political Research Quarterly</source>, <volume>64</volume>(<issue>4</issue>), <fpage>884</fpage>&#8211;<lpage>896</lpage>. DOI: <pub-id pub-id-type="doi">10.1177/1065912910382302</pub-id></mixed-citation></ref>
<ref id="B57"><label>57</label><mixed-citation publication-type="journal"><string-name><surname>Vincent</surname>, <given-names>B. W.</given-names></string-name> (<year>2018</year>). <article-title>Studying trans: Recommendations for ethical recruitment and collaboration with transgender participants in academic research</article-title>. <source>Psychology and Sexuality</source>, <volume>9</volume>(<issue>2</issue>), <fpage>102</fpage>&#8211;<lpage>116</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/19419899.2018.1434558</pub-id></mixed-citation></ref>
<ref id="B58"><label>58</label><mixed-citation publication-type="journal"><string-name><surname>von der Malsburg</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Poppels</surname>, <given-names>T.</given-names></string-name>, &amp; <string-name><surname>Levy</surname>, <given-names>R.</given-names></string-name> (<year>2020</year>). <article-title>Implicit gender bias in linguistic descriptions for expected events: The cases of the 2016 US and 2017 UK election</article-title>. <source>Psychological Science</source>, <volume>31</volume>(<issue>2</issue>), <fpage>115</fpage>&#8211;<lpage>128</lpage>. DOI: <pub-id pub-id-type="doi">10.1177/0956797619890619</pub-id></mixed-citation></ref>
<ref id="B59"><label>59</label><mixed-citation publication-type="journal"><string-name><surname>Wang</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Schiller</surname>, <given-names>N. O.</given-names></string-name> (<year>2019</year>). <article-title>A review on grammatical gender agreement in speech production</article-title>. <source>Frontiers in Psychology</source>, <volume>9</volume>, <fpage>2754</fpage>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2018.02754</pub-id></mixed-citation></ref>
</ref-list>
</back>
</article>