Alves-Soares Leonardo 2020 Thesis
Alves-Soares Leonardo 2020 Thesis
Leonardo Alves-Soares
University of Ottawa
Department of Linguistics
Faculty of Arts
University of Ottawa
Abstract
This dissertation investigates how cognates are organized in the bilingual mental
representations, influences the processing of cognates and false friends in the other
word recognition, the Bilingual Interactive Activation (BIA) and the Bilingual
Interactive Activation Plus (BIA+), the premise is that there is activation from
orthography to phonology across a bilingual’s two languages and that this activation
Portuguese-English cognates and false friends that were selected for a cross-language
lexical decision task with masked priming. Dynamic time warping (DTW), an
automatic speech recognition and to measure acoustic similarity between two time-
to change one word into another and normalizes the result by their lengths, was used
who acquired their second language after reaching puberty, and English functional
iii
phonological and semantic similarity and the role each of them, along with English
proficiency, word frequency and length play in the organization of the Portuguese-
Acknowledgements
If there is one thing 2020 will be remembered for is that it has been a very tough year
for everyone due to COVID-19. These are strange times. Besides social isolation,
physical distancing and having to wear masks in order to protect ourselves and
others, there is much uncertainty about the future all around. Finishing a dissertation
in 2020 makes acknowledging those that stuck by me and helped me through this
in my research and never gave up on me even at the times when I felt there was no
point moving ahead. Juana always encouraged me to keep my head up, shoulders
back and to move ahead at full throttle gracefully, never looking back. Your wit,
passion and tenacity, even when the politics of the world of academia get in the way,
are an inspiration to me. I am extremely proud of having had the opportunity to work
closely with you and consider it a privilege being among one of your doctoral
students.
I would like to thank professor Marc Brunelle for opening my mind to the
possibilities with mixed-effects models and helping me see a much brighter and more
exciting world beyond ANOVAs. I will be forever grateful for all the time you took to
get me started with R and mixed-effects models. It was not an easy road for me to
v
I also would like to thank all my committee members, Anahi Alba de la Fuente,
Elena Valenzuela, Nikolay Slavkov and Ricardo Augusto de Souza, not only for taking
the time to carefully read my work and provide me with invaluable feedback, but also
for having made my dissertation defense feel more like a conversation among
thank you for all her assistance with reviewing my data analysis chapter, and, in
particular, for going over my mixed-effects models with a fine-tooth comb. They say
there is no better way to learn than to teach; I can say I have learned a great deal by
I would like to recognize a few specific Faculty of Arts members who ensured
Official Languages and Bilingualism Institute at the University of Ottawa, who were
dreams in North America. And, finally, Pumba, thank you for always being there for
me unconditionally along the way, through thick and thin. You have always been my
biggest cheerleader, helping me keep my eyes on the prize and not lose track of a
much brighter future. I would have never been able to accomplish all of this without
you. And, to Caio Caiana, my little girl, thank you for being my furball, jumping on
my desk when I was writing this dissertation, keeping me calm, and for showering
Table of Contents
1. Introduction ................................................................................................................. 1
2.3. Testing the Organization of Bilingual Mental Lexicon: The Lexical Decision Task and
3.1. Participants.................................................................................................................................................. 41
4.2.1. Model 1 - Liner Mixed-Effects Model fitted with LMER ................................................................ 103
4.2.2. Model 2 - Liner Mixed-Effects Model fitted with GLMER ............................................................... 107
List of Tables
Table 10. ERS and SEMAC Independent samples t-test (Descriptive Statistics)..........77
Table 11. Pearson product-moment correlations between RT and ERS and SEMAC ... 78
Table 12. Point-biserial correlations between ERS and SEMAC with RAW_ACCURACY
............................................................................................................................... 79
Table 16. Original DTW crosslinguistic phonological similarity data vs. consolidated
data ........................................................................................................................ 88
List of Figures
Figure 1. (1984)’s Word Association Model as depicted in Kroll and Stewart (1994, p.
Figure 2. Potter et al. (1984)’s Concept Mediation Model as depicted in Kroll and
Figure 3. The Revised Hierarchical Model as depicted in Kroll and Stewart (1994, p.
158). ......................................................................................................................... 13
Figure 5. The Bilingual Interactive Activation Model (BIA) depicted in in Dijkstra &
Figure 6. The Bilingual Interactive Activation Model Plus as depicted in Dijkstra &
various network layers of lexical representation, from the input word (at the
grafite-graphite and the false friend pair smoking ‘tuxedo’-smoking. Areas that
are darker represent points where the accoustic distance between the waveforms
Figure 9. PCT screen capture illustrating the parameters users can set for DTW
Figure 10. Acoustic similarity results table. In the Result column, higher values
waveforms. ............................................................................................................. 65
Figure 11. Kernel density plots illustrating the distribution of RT (overall and by
LANGUAGE)........................................................................................................... 73
categories showing a higher mean NLD for COG and FF pairs. ......................... 85
Figure 13. Kernel density plots of the DTW crosslinguistic phonological similarity
semantic categories showing a lower mean for COG and FF pairs. Lower values
Figure 15: Differences between functional monolinguals and bilinguals with regards
List of Examples
List of Appendices
1. Introduction
For a Portuguese-English bilingual – or any other bilingual for that matter – riding
the subway in New York City can be a truly remarkable experience. Among all the
hustle and bustle, all the bright, flashing lights and all the announcements in
country can lead to moments of confusion and ambiguity. For example, seeing the
English word push printed on access doors to stations, emergency exits or on the
windows in the trains can automatically trigger the Portuguese word puxe, which
sounds similar to the English word but has a completely different meaning in the
to cause movement toward oneself (or itself), which in English is conveyed by the
word pull. In other words, the pronunciation similarities between puxe and push are
merely superficial. Similarly, the English word platform can automatically trigger the
Portuguese word plataforma, except that these two words have the same meaning
across both languages. Words like puxe and push, which share some phonemes,
graphemes and yet have completely different meanings crosslinguistically, are often
defined in the literature as false friends, or sometimes false cognates. Words like
platform and plataforma are defined as cognates. They share many graphemes,
As such, both languages also have considerable number of cognates, e.g., computador
and computer, piano and piano, televisão and television, robô and robot, zero and
zero, etc. English and Portuguese also possess a considerable number of false friends,
e.g., batom ‘lipstick’ and baton, chute ‘kick’ and chute, sapo ‘frog’ and sap, tampão
friends has helped elucidate the organization of the bilingual mental lexicon,
particularly the mechanisms that trigger that sense of ambiguity that bilinguals can
often experience when they read or hear a word in one of their languages that look
and sound like a word in the other language. For example, in translation recognition
and lexical decision tasks with masked priming, in which accuracy and reaction time
are the dependent variables, highly fluent bilinguals show a significantly greater
priming effect for cognates than for non-cognates in both L1-to-L2 and L2-to-L1
the mental lexicon. Thus, it is possible that crosslinguistically cognates may share a
single mental representation. There is evidence suggesting that bilinguals are able to
name cognates more accurately and faster than both non-cognates and false friends
(Sunderman & Schwartz, 2008). Additionally, Jared and Szucs (2002) observed that
Similarly, Schwartz, Kroll and Diaz (2007) showed highly proficient English-Spanish
bilinguals displayed greater naming latencies (or RTs) for cognates with high
were different crosslinguistically. This effect was even observed irrespective of the
overlap that sound rather different crosslinguistically are processed more slowly than
Despite all of the evidence supporting the special status of cognates, many
outstanding issues still remain. For example, most studies do not make a clear
distinction among the various kinds of cognates, viz., true vs. false friends, semi-false
vs. false friends, interlingual homophones vs. homographs, etc. Additionally, the
crosslinguistically) has not been yet established for many language pairs, in particular
their second language. The premise is that there is activation from orthography to
phonology across a bilingual’s two languages and that this activation is modulated by
this dissertation will further advance the understanding of the interplay between
monolinguals, who served as controls. The stimuli selected for the experiment were
objective metrics such as the Normalized Levenshtein Distance and dynamic time
1
While in the extant literature there are multiple definitions of bilingualism, this study is primarily
concerned with late sequential bilingualism, as defined by Montrul (n.d., p. 17). Specifically,
Portuguese speakers who acquired English as their second language (L2) during or after puberty.
5
mental lexicon, from its origins on spreading activation and connectionist models, to
the structural elements of five well-known models of bilingual word recognition, viz.,
the Revised Hierarchical Model (RHM), the Interactive Activation Model (IAM), the
Bilingual Interactive Activation Model (BIA) and its derivatives, the Bilingual
Chapter 3 explains the methodology employed in the study, from the process
of participant recruitment and English proficiency testing to how all the stimuli were
selected for the cross-language lexical decision task using objective metrics of
crosslinguistic code overlap as well as how the experimental word lists were
assembled.
divided into two sections, Preliminary Analyses and Mixed-Effects Models, with the
former serving as groundwork for the latter. In Preliminary Analyses, each of the
variables of interest are analyzed using conventional statistical analyses tools, such
model (lmer), with reaction time as the dependent variable, and a generalized linear
mixed-effect model (glmer), with accuracy as the dependent variable, are presented.
Both mixed-effects analyses allowed for precise quantification of the individual fixed
this study and formalizes the contribution to the existing research in light of the
framework of two important models of the bilingual mental lexicon, the BIA and the
BIA+.
Chapter 6 summarizes the study and offers the main conclusions as well as
pertaining to meaning, spelling and pronunciation of all the words a speaker has
acquired throughout their life. One of the greatest challenges when proposing a
model of the mental lexicon concerns the representation of concepts and form,
representations) and vice-versa. Collins and Quillian (1969), based upon spreading
the network nodes represent acquired linguistic knowledge and the pathways, or the
node in the network becomes activated, it automatically spreads (or percolates) its
nodes with which it is semantically associated. For example, when the word plant is
representation of the word in memory spread (or percolate) their activation to nodes
linked to the conceptual representation of the word flower, which, in turn, spread
their activation to nodes linked to the conceptual representation of the English word
rose. Words in the mental lexicon are also thought to have a resting-level (baseline
8
level) activation; words that are repeatedly activated together are posited to
collectively retain a portion of their activation; thus, they are posited to be at a higher
resting-level activation, which facilitates the speed with which activation spreads
between them and to other associated words (Anderson, 1983a, 1983b, 2015; Samani
Over the years the underpinnings of the mental lexicon were further
atop the hierarchy, an intermediate lexical layer, and a sublexical layer, which sits at
the bottom of the hierarchy. In this framework, during the course of visual word
which is responsible for ensuring that words in the mental lexicon that match a
and produced is also introduced (Caramazza, 1997; Costa, La Heij, & Navarrete, 2006;
Costa & Santesteban, 2004; Dell, 1986; Levelt, 1993, 2001; Levelt, Roelofs, & Meyer,
1999; Motley & Camden, 1985). Levelt (1993, 2001) argued that without a lexical
simultaneously active and compete with one another for production, which would
9
make human communication a rather slow and error-ridden process. Levelt states
that speakers err no more than once or twice every 1,000 words, in spite of the fact
that they amass a mental lexicon of 50,000-100,000 words throughout their lifetime.
Putting it simply, because human communication is both effective and fluid, a lexical
in a message. Levelt theorizes that the lexical selection system takes into account the
representations in order to ultimately select a lexical item with the highest level of
Highly proficient sequential bilinguals, without much apparent delay or effort, are
able to switch between their L1 and L2 as well as map a specific word they read or hear
to one of their two languages. This remarkable ability has been the subject of much
investigation. Based on the enormous body of research accumulated over the last four
decades, two theoretical issues have consistently remained under the microscope: (a)
whether bilinguals’ two languages are stored together in a single, integrated mental
lexicon, expressed in the literature as the shared-lexicon hypothesis, and (b) whether
the words from bilinguals’ two languages are accessed simultaneously (in parallel)
10
different core components, some leapfrogging earlier models, have been postulated
over the years and tackle the shared lexicon hypothesis and the language non-
models of bilingual word recognition, viz., the Revised Hierarchical Model (RHM),
the Bilingual Interactive Activation Model (BIA) and its precursor the Interactive
Activation Model (IAM), the Bilingual Interactive Activation Model Plus (BIA+), and
Multilink.
Kroll and Stewart (1994) – later revisited in Kroll and Tockowicz (2001) – proposed a
originally by Potter, So, von Eckhardt, and Feldman (1984) to test whether L2
and translation-into-L2 tasks. The first model, viz., Word Association Model,
purported that words in the bilingual mental lexicon are stored in two separate, yet
interconnected, lexical memory systems (one for each language), and that concepts
are stored in an abstract semantic system. As Figure 1 below shows, in the Word
Association Model L2 words are associated with L1 words (and vice-versa). However,
11
only through L1 mediation can L2 words gain access to the abstract semantic system
(concepts).
images
[_, ~-~ L~2
concepts
Figure 1. (1984)’s Word Association Model as depicted in Kroll and Stewart (1994, p. 150). The
arrows represent lexical links.
The second model, viz., Concept Mediation Model, also proposed that words
in the bilingual mental lexicon are stored in two separate lexical memory systems
(one for each language) and that concepts are stored in an abstract semantic system.
However, as Figure 2 shows, unlike in the Word Association Model, in the Concept
Mediation Model the two lexical stores are depicted as completely independent from
each other. Further, L1 and L2 words have direct access to concepts in the common
abstract semantic system (concepts). However, in order for words in either lexical
12
store to gain access to words in the other lexical store, this access would need to be
LI
concepts
Figure 2. Potter et al. (1984)’s Concept Mediation Model as depicted in Kroll and Stewart
(1994, p. 150). The arrows represent lexical links.
In a series of studies that put both models to the test, Porter et al. only found
support for the Concept Mediation Model. They concluded that the semantic store
must always mediate the connections between both lexical stores irrespective of
Similar to both the Concept Mediation Model and the Word Association
Model, the Revised Hierarchical Model (RHM) also envisaged the bilingual mental
bilinguals’ two languages are stored in their own respective lexical memory system,
and at the semantic level, concepts are stored in an abstract memory system.
However, unlike its two predecessor models, the RHM posits that as bilinguals
increases, L2 words can gain direct access to the abstract conceptual store instead of
via L1 mediation.
lexical
links
Ll L2
concepts
Figure 3. The Revised Hierarchical Model as depicted in Kroll and Stewart (1994, p. 158).
arrows in Figure 3 above). Specifically, for unbalanced bilinguals, those who are less
i.e., L1 words gain access to L2 words via the semantic store (concepts), thus,
the semantic store. Translating from L2 to L1, however, is not mediated via the
semantic store, i.e., L2 words have direct access to L1 words, thus, translating in this
direction is faster (illustrated by the solid arrows in Figure 3 above). This is so because
the connections the L2 lexical store develops to the L1 lexical store are much stronger
than the other way around, which occur through conceptual mediation.
It is impossible to examine the BIA (Dijkstra & van Heuven, 1998) without first
introducing the Interactive Activation Model (IAM), its precursor. The IAM is a
postulates that bottom-up and top-down processes occur in parallel during the
visual-acoustic feature level, a letter-phoneme level, a word level, and higher levels
of processing that exert top-down input to the word level (see Figure 3). Low level
visual and acoustic feature-based representations spread their activation to letter and
activation to word-based representations one level higher. As its name suggests, the
15
IAM posits that visual word recognition is a highly interactive process; thus, each
adjacent level of representation can interact with each other, either exciting or
within the word level is only inhibitory, since one word is posited to receive excitatory
11l
nr
VISUAL INPUT
rn
ACOUSTIC INPUT
Figure 4. The Interactive Activation Model (IAM) as depicted in McClelland and Rumelhart
(1981, p. 378).
16
The BIA is in its essence an offshoot of the IAM. Thus, its architecture is largely
the same, with the exception of the added language level containing two language
nodes to represent bilinguals’ two languages, since the BIA is a model of bilingual
visual word recognition. Activation at each level of representation in the BIA can
adjacent levels (see Figure 5 ). In this way, when a string of letters is presented to the
letter-based representations with features present in the original input string and
also inhibit other letter and phoneme-based representations lacking those features.
contain the letters in the same position of the input string and also inhibit words that
do not. Thus, regarding the language non-selective hypothesis, activation in the BIA
inhibitory feedback on the activated lexical items from both languages, inhibiting
the ones from the other language to ensure successful identification of the input
string.
17
Language
Dutch
Word
words
''
Letter
Feature
Visua l input
Figure 5. The Bilingual Interactive Activation Model (BIA) depicted in in Dijkstra & van
Heuven (2002, p. 177). Excitatory connections are represented by arrowheads. Inhibitory
connections are represented by filled circles.
18
representations in the BIA, the Bilingual Interactive Activation Model Plus (BIA+)
was conceived (Dijkstra & van Heuven, 2002). The BIA+, similarly to the BIA, is a
languages are integrated in the Word Identification System, much like in a network
stored.
According to the BIA+ the bilingual mental lexicon is integrated and lexical
presented to the BIA+, a number of word competitors from both languages are
semantic overlap. In this fashion, when a word is presented to the BIA+, its lexical
representations (in both languages), the greater the number of similar word
Task schema
Identification system
Languagenodes Semantics
Lexical Orthography
- Lexical Phonology
Figure 6. The Bilingual Interactive Activation Model Plus as depicted in Dijkstra & van Heuven
(2002, p. 182). Activation flow is represented by arrows. Inhibitory connections are not
depicted in the illustration.
The BIA+ also posits that other factors, such as word frequency, recency of use
(how recently a word was last used by the bilingual) and L2 proficiency modulate
20
activation spread crosslinguistically, because they can raise or lower the resting-level
activation of stored representations. In this fashion, compared to words that are not
used often, frequently used words are thought to have a higher resting-level of
activation. Consequently, not only less input activation is required to activate stored
spread much more quickly between frequently used words. Likewise, a delay in
group of speakers is not equally proficient and fluent in their L2, their L2
representations.
2.2.4. Multilink
predecessors, in particular the RHM, the BIA and the BIA+. Multilink is a
researchers to generate new hypotheses for empirical testing and better understand
21
how the mechanisms underlying bilingual (and monolingual) word recognition and
production interact and play out both qualitatively and quantitatively with an
cognates might be stored. Its task/decision system can replicate several experimental
tasks, such as word processing during lexical decision, orthographic and semantic
priming, word naming, and word translation (in both forward and backward
frequency of use, word length and crosslinguistic overlap. In fact, Multilink is such a
new model that its lexicon at the moment only comprises English-Dutch words.
Multilink’s lexicon, however, is integrated across both languages. Over time, as the
model becomes more mainstream with language researchers, its architects hope
across both languages of a bilingual. This is in line with the findings of many
existence of two separate lexicons, one for each language (Marc Brysbaert & Duyck,
2010; Van Heuven, Dijkstra, & Grainger, 1998). Multilink depicts the mental lexicon
as layered lexical network interconnected by links that vary in strength and in which
22
having a layered structure, unlike the BIA+, a sublexical level is not specified in
Multilink, thus, activation flow occurs lexically. Multilink posits each stored word as
having a resting level activation (RLA), which is a function of the frequency of use of
the word itself (measured in occurrences per million) and the highest and lowest
frequencies of all other words stored in the lexicon. In this framework, when a written
between the input word and the stored representations as well as their RLA. Multilink
utilizes the Levenshtein distance (Levenshtein, 1966) normalized for word length as
takes place in a number of time steps (or cycles). Thus, the degree of semantic and
phonological crosslinguistic similarity between the input word and each word
through semantics. The task/decision system lastly checks for language membership
of the input word and word candidates as well as their degree of orthographic,
phonological and semantic activation required for output in the experimental task at
hand.
23
Phonology
Semantics
Language
, X
Orthography
G
l Input
I
Figure 7. The network architecture of Multilink as depicted in Dijkstra et al. (2018, p. 662). EN
= English, NL = Dutch. Arrows represent activation flow through the various network layers
of lexical representation, from the input word (at the bottom) to output (at the top).
The masked priming paradigm is one of the most well-known techniques employed
computer screen a forward mask, a prime, sometimes a backward mask, and a target.
24
order to determine its effect on participants’ response latencies (or their response
accuracy) to the target, thus illuminating the processing of linguistic code (Tzur &
Frost, 2007). Priming occurs when the prime facilitates participants’ response to the
2003, Chapter 1). The most common explanation for the facilitation effect induced by
their identification (Hutchison, 2007). In order to measure the effect of priming, the
a button box or keyboard, to indicate whether or not the target is a real word in a
particular language, e.g. is this an English word? (see Table 1 below outlining the
procedure).
25
#####
forward mask
500ms
hash marks covering the
same visuospatial field of the
prime
2
prime fruta 30-70ms
SOA
in lowercase letters
backward mask
(optional)
##### 500ms
3
target FRUIT 500ms
in uppercase letters
4 3,000ms before
participant’s response the next stimulus
usually pressing a key on a
is presented
button box (yes/no)
considered prior the design stage and during the administration of an LDT. A
particular important one is the stimulus onset asynchrony (SOA), which is the
26
amount of time that will elapse between the presentation of the prime and
subsequent target. A shorter SOA invokes early word recognition processes and
ensures that participants are completely unaware of the prime, thus mitigating the
likelihood that any cognitive processes (linguistically related and potentially non-
linguistically as well) are triggered (Forster & Davis, 1984; Forster et al., 2003;
Grainger & Holcomb, 2009; Rastle, Davis, Marslen-Wilson, & Tyler, 2000; Sabourin,
Brien, & Burkholder, 2014). Long SOAs make the prime completely visible to
participants, i.e., they are able to completely read the prime through, and, since they
are conscious of it, postperceptual cognitive processes, not all connected to the
language system, are automatically induced. (Forster et al., 2003, Chapter 1).
rather limiting priming effects, because other cognitive operations interfere with the
time course of the processing of a linguistic code (Colombo, 1986; Martin & Jensen,
that occur in the following sequence: (1) a forward mask consisting of a sequence of
hash marks/tags (#####) covering the same visuospatial field as the prime is
presented for 500ms before the prime; (2) the prime in lowercase letters is presented
briefly (SOA 30-70 ms); (3) the target, usually in uppercase letters, is presented; and
(4) participants have a preset amount of time to record their response (usually by
27
the press of a button on a button box) before the next trial is displayed. It is
important to highlight the case change between the prime and target, for it ensures
that the two stimuli are visually distinct. Additionally, in order to avoid participants
perceiving the target as a continuation of the prime due to their form overlap, some
studies have opted to add a backward mask between the presentation of the prime
and the subsequent presentation of the target (Forster et al., 2003, Chapter 1).
Distinct types of priming effects are reported in the literature when the
researchers to postulate and advance models of visual word recognition and lexical
the prime differs from the target in all baseline conditions” (Forster et al., 2003, para.
6). Among the effects observed, the translation priming effect is of particular interest
translation equivalent target in the other language, e.g., the Portuguese prime cão
would facilitate the processing of its English translation equivalent dog. This effect
asymmetry has been reported across several different language pairs, and is
significantly greater in the forward priming direction, viz., from L1-to-L2, than in
Etxebarria, Laka, & Carreiras, 2010; Gollan, Forster, & Frost, 1997; Grainger & Frenck-
28
effect of 39ms. In contrast, in the backward-priming direction, the results are not as
consistent, with an average magnitude of the effect remarkably less robust at around
to be the result of uneven proficiency between the L1 and the L2, i.e., bilinguals who
al., 2010; Gollan et al., 1997; Grainger & Frenck-Mestre, 2002; Jiang, 1999).
Nevertheless, more recently Wen and van Heuven (2017) conducted a meta-analysis
the effect sizes of translation priming effects in both directions. The results revealed
significant priming effects in both priming directions, with the effect being
shared semantic store, i.e., the prime in L1 and the target in L2, or vice versa, share
versa (Dimitropoulou, Duñabeitia, & Carreiras, 2011a; Van Hell & Dijkstra, 2002). For
expected to activate English targets like milk or sugar, since coffee is usually served
priming effect reported by studies that have utilized the masked-priming paradigm.
The effect is reported when the processing of a target, e.g., faster reaction times, is
condition, when the orthographic overlap between the prime and target is high, i.e.,
when they share many letters, there is minimal competition between them, thus
The phonological priming effect is another priming effect that has helped
enhance several models of bilingual visual word recognition and is often reported in
the literature. This effect is of particular interest because it is often present for
bilingual participants but absent for monolingual participants. A few studies that
other language have suggested that during the initial stages of visual word
activated (Marc Brysbaert, Van Dyck, & Van De Poel, 1999; Dimitropoulou,
Duñabeitia, & Carreiras, 2011b; Perfetti & Bell, 1991; Pexman, Lupker, & Jared, 2001;
mediated to a large extent, not only in L1 but also in L2” (Marc Brysbaert, 2003, p.
While employing the masked priming technique has been proven useful at
disentangle orthographic similarity from phonological similarity due to the fact that
than not, words that are orthographically similar crosslinguistically (have large
pronunciation). Simply put, when a word in one language sounds similar to a word
bilingual visual word recognition, viz., BIA+ and Multilink, have incorporated
2
The word “homophonic” is used here with the meaning of sounding similar or having a similar
pronunciation.
31
bilinguals’ two languages share a single semantic store and that words are comprised
orthographic similarity) should also play a role. With this in mind, the masked-
priming paradigm has been extensively used to elucidate the organization of the
bilingual mental lexicon. To that end, two classes of words, viz., cognates and false
friends, have been under close inspection by language researchers that, for the past
few decades, have persistently tried to further their understanding of how L1 and L2
words are stored and organized in the bilingual mental lexicon. Cognates are words
whose meaning is largely the same crosslinguistically, i.e., have near complete
semantic overlap. Likewise, their orthographic overlap (form similarity) is also high,
crosslinguistically (da Luz Fontes & Schwartz, 2010; Dijkstra, Grainger, & Van
Heuven, 1999; Dijkstra & van Heuven, 2002; Schepens, Dijkstra, Grootjen, & van
Heuven, 2013). For example, the English word kit [kɪt] and the Portuguese word kit
Specifically, their meaning is largely the same and their orthographic form is identical
across both languages. However, there is variation in how speakers of Portuguese and
English pronounce both words. Like cognates, false friends (often characterized as
friends denote different concepts across languages, i.e., their semantic overlap is
rather low or null (da Luz Fontes & Schwartz, 2010; Dijkstra et al., 1999; Dijkstra &
van Heuven, 2002; Schepens et al., 2013). For example, while the English word rim
[ɹĩm] and the Portuguese word rim [χĩŋ] ‘kidney’ have complete orthographic overlap
and partial phonological overlap, their meanings are completely different across both
languages.
advantage. They generally yield significantly more robust and more consistent
priming effects (de Groot & Nas, 1991; Gollan et al., 1997; Kim & Davis, 2003,
Sánchez-Casas, Davis, & García-Albea, 1992). Cognates and false friends, either as
primes or targets in LDTs, have often been compared against controls and
Kiyonaga, & Holcomb, 2006; Grainger, Spinelli, Farioli, Diependaele, & Ferrand,
3
Interlingual homographs are sometimes referred to in the literature as interlingual
pseudohomophones (Dijkstra et al., 1999).
33
2003; Perfetti & Tan, 1998; Pollatsek, Perea, & Carreiras, 2005; Ziegler, Ferrand,
Jacobs, Rey, & Grainger, 2000). Pseudowords are words that have been manipulated
either via computer software or manually. They resemble real words in a specific
effects, viz., faster reaction times, have been reported for cognates with partial as
well as 100% orthographic overlap, e.g., the Dutch-English cognate pair tomaat-
Dijkstra et al., 1999; Lemhöfer & Dijkstra, 2004; Van Hell & Dijkstra, 2002). When
pronounced similarly across two languages, evidence from LDTs shows faster
activated during visual word recognition, and, consequently, might play a role in the
organization of the bilingual mental lexicon. This facilitatory effect is observed even
with languages pairs that do not share the same orthographic script, such as Greek
and Spanish, Japanese and English, as well as Greek and French (Dimitropoulou et
al., 2011a; Nakayama, Sears, Hino, & Lupker, 2012; Voga & Grainger, 2007).
semantic overlap between a prime in one language and a target in the other could
some of the facilitatory effects encountered in the literature. Nevertheless, Voga and
Grainger (2007) present a different theory. Specifically, they argue that the cognate
orthographic and phonological relationships that exist between primes and targets
across some language pairs. It is possible that words that share a common
valuable evidence for language non-selectivity, i.e., bilinguals’ two languages appear
in which not only semantic representations but also orthographic and phonological
from one of bilinguals’ two languages is presented (de Groot, 1992; J. F. Kroll &
Stewart, 1994; Perea, Duñabeitia, & Carreiras, 2008; Soares & Grosjean, 1984; van Hell
researchers.
35
In light of the framework provided primarily by the BIA and BIA+, and to some extent
Multilink, the primary goal of this study is to quantify the individual effects of
experimental stimuli for a cross-language LDT with masked priming, for which
participate.
and false friends in a cross-language LDT, despite the great deal of crosslinguistic
similarity between the two languages. As Brazil, the most populous Portuguese
speaking country, has become one of the world’s most powerful economies,
consistently ranking among the top 10, and experienced dramatic economic growth,
the country’s participation in the world’s economic stage has also become more
prominent (“The World’s Top 10 Largest Economies,” n.d.). As a result, the number of
bilinguals has also increased. Thus, a more scientific understanding of how words
from two languages are stored and interact is needed and it will certainly assist in the
Despite the fact that the likelihood that two words across two different
languages are cognates is highly correlated with how similarly they are pronounced
(Kondrak & Sherif, 2006), crosslinguistic phonological similarity has not received
much attention from language researchers; thus, its individual effect on bilingual
et al., 2011a; Lemhöfer & Dijkstra, 2004; Nakayama et al., 2012; Van Hell & Dijkstra,
metric readily available for language research, researchers have opted to reduce
& Halle, 1992; G. Clements & Hume, 1995; G. N. Clements, 1985; inter alia). In fact,
influential studies on the bilingual mental lexicon have, for the most part, either
cognates as stimuli for their experiments (Comesaña et al., 2012; Friesen & Jared, 2012;
Pallier, Colomé, & Sebastián-Gallés, 2001; Pexman, Lupker, & Jared, 2001; Schwartz &
similarity pose numerous challenges and limit the results of these studies, in
are far more multidimensional than what is postulated by phonological theory and
personal judgments are not only often biased but also based on the personal
Smiljanic, & Walter, 2010). These pitfalls, however, can be successfully overcome if an
objective metric is used instead. To that end, dynamic time warping (DTW), an
algorithm that was originally conceived for automatic speech recognition to measure
orthographic similarity also be used. To that end, in light of the work of Schepens,
Dijkstra and Grootjen (2013), the Normalized Levenshtein Distance (NLD) was
chosen.
Taking into account spreading activation as well as the framework of two well-
known models of the bilingual mental lexicon discussed in 2.2., specifically the BIA
and BIA+, this study hypothesizes that activation in the bilingual mental lexicon
spreads in a feed-forward fashion from the semantic store, which is the site where
representations from both bilinguals’ two languages during visual word recognition.
through the semantic store. In this framework, cognates will have a clear processing
The shared semantics coupled with orthographic and phonological code overlap will
phonological overlap will still facilitate the processing of the English target, but the
advantage will be less pronounced than for Portuguese cognate primes due to no
phonological overlap with the English target (control pairs) slower because the
(distractor pairs) the slowest due to the absence of facilitation provided by semantic,
below.
39
Portuguese word
matched with
minimal or minimal or
unrelated English absent slower
none none
target
(control pairs)
Portuguese word
matched with
absent none none slowest
pseudoword
(distractor pairs)
In the extant literature, ANOVAs and t-tests have been the primary statistical
LDTs, and the reliance on these tools has continued to this day. Because one of the
similarity and because many of the variables of interest are continuous predictors, or
a combination of categorical and continuous predictors, this study will instead utilize
linear mixed-effects models, a much more robust statistical analysis tool, to analyze
the experimental data collected. Besides more precisely estimating the individual
40
(reaction time and accuracy), mixed-effects models will enable participants and
prime-target pairs (items) to be treated as random effects, which will allow the
individual participant and each individual prime-target pair, which in the end will
yield a more realistic overall representation of the processes involved in visual word
methodology, from the creation of the stimuli lists for the LDT using objective
well as assessing their level of English proficiency. It also describes the experimental
3.1. Participants
3.1.1. Recruitment
Participants for the experiment were recruited primarily via a recruitment email,4
entail, and instructed them to contact the researcher via email to express their
sequential bilinguals, the researcher sent out the recruitment email (see Appendix B)
to the International Office at the University of Ottawa, and requested that they
whose native language is Portuguese. Since a large proportion of students with this
language profile conducts their studies at the University in English, they were
recruitment email to the teaching staff of the English Intensive Program (EIP), a non-
4
The text included in recruitment email was approved by the University of Ottawa Office of
Ethics and Research Integrity (file number 09-12-13). The approval notice is included in
Appendix A.
42
international students and francophone Canadians who do not meet the English
proficiency requirements of the University so that they might attend classes either at
the graduate or undergraduate level. Teachers in the EIP are highly trained
area are exposed to both official languages, English and French, on a daily basis.
Street signs, advertisements, pamphlets, bus schedules, routes, etc. are omnipresent
diverse and multicultural population, so many native speakers of English are also
heritage speakers of another language, i.e., they can converse with their parents and
virtually impossible to find bona-fide native speakers of English who do not know,
minimize the effects of exposure and having some knowledge of a second language,
they disclosed during a prescreening interview that they could not comfortably
43
function in an environment that would require them to use their second language in
knew other potential participants that met the language requirements of the
3.1.2. Composition
in the experiment. All of them were 18 years of age or older at the time of their
participation. All bilingual participants were born and grew up in country which
Portuguese is the official language and indicated they had learned English as a second
language in private language schools in their home country during or after reaching
at the University of Ottawa under the Science Without Boarders program, a large-
government. The program seeks to strengthen and expand the initiatives of science
undergraduate and graduate students and researchers. Many of the students enrolled
44
under this program at the University of Ottawa obtained scores in the Test of English
System (IELTS) that satisfied the English proficiency requirements of the University.
Students who did not take or obtained a low score either of these tests had to
complete a few sessions in the EIP in order to satisfy the English proficiency
requirements of the University before they could start taking classes at either the
Most participants from Brazil were from the State of São Paulo. All
monolingual participants were born and grew up in a country in which English is one
Assessment
Questionnaire (see Appendix D), which was originally developed for the
participants could fill it out online on a computer and to make it convenient for them
Brown’s (1980) cloze test (see Appendix E). This kind of test was selected for the
tests require that test takers make highly complex series of grammatical and lexical
decisions, specifically having to deduce the deleted words based upon all available
contextual clues, which are posited to reflect test takers’ acquired language skills (Litz
words in length and with 50 missing words or blanks entitled Man and His Progress
(see Appendix E). Participants were instructed to first read the passage as a whole to
get the general meaning and then fill in the blanks with the words they judged
appropriate for the context. They were given 30 minutes complete the test.
scoring procedure (SEMAC) and the exact replacement scoring method (ERS). In the
SEMAC procedure, the test taker’s answers are compared with the original deleted
words. If the supplied words preserve the original intended meaning, even if they are
not direct synonyms of the deleted words, they are deemed acceptable and marked
as correct answers. In the ERS method only supplied words that are exactly identical
to the deleted words are marked as correct answers. It is important to note that
46
SEMAC and ERS scores correlate very highly with one another (Litz & Smith, 2006).
3.2. Materials/Stimuli
The stimuli list for the experiment was created in six stages: (1) compiling an
controls; (3) compiling a list of pseudowords and matching each of them with a
for each Portuguese and English word; (6) obtaining an orthographic and an acoustic
This study is primarily interested in two classes of word with respect to their
form and meaning overlap across languages: cognates and false friends. As indicated
are also high, sometimes complete (da Luz Fontes & Schwartz, 2010; Dijkstra et al.,
1999; Dijkstra & van Heuven, 2002; Schepens et al., 2013). For example, the
Portuguese word prisão [pri.ˈzãũ] forms a cognate pair with the English word prison
stable across both languages. False friends, in contrast, are words that very much
resemble cognates in terms of form similarity, i.e., they also possess a high degree of
orthographic and phonological overlap, but they denote different concepts across
languages (da Luz Fontes & Schwartz, 2010; Dijkstra et al., 1999; Dijkstra & van
Heuven, 2002; Schepens et al., 2013). For example, the Portuguese noun sapo [ˈsa.pʊ]
‘frog’ shares many of its letters and phonemes with the English noun sap [sæp] ‘fluid
that circulates in the vascular system of a plant.’ However, their form similarities are
48
only superficial because the two words ultimately represent different concepts in
both languages.
English cognates and false friends was assembled using the official word list utilized
in Alves-Soares (2013) as its foundation. The experimental stimuli on that list were
words served as primes; English words served as targets. As expected, the selected
cognates and false friends on that earlier list had various degrees of crosslinguistic
(Portuguese-English sequential bilinguals). The pairs from both lists were then
consolidated into a master Excel spreadsheet and classified as either cognates or false
friends, which became the experimental items list for the current study. Table 4 below
provides a sample of the experimental items list. The full list is provided in Appendix
F.
49
bilinguals), each of the Portuguese words in the experimental items list was matched
with a semantically unrelated English word that did not orthographically resemble
the Portuguese word, i.e., had a different spelling. For example, the Portuguese word
rim ‘kidney’ in the false friend pair rim-rim was matched with the semantically and
orthographically unrelated English word sun, yielding the control pair rim-sun. The
possible. The pseudowords were then matched with Portuguese words that were not
in the experimental and control items lists. The total number of Portuguese prime –
pseudoword target pairs matched the total combined number of experimental and
control pairs together. The full list of distractor pairs is provided in Appendix F.
To familiarize participants with the experimental task and before each trial
began, practice items were used. To that end, 36 practice pairs were created. Thirty-
51
six words of Portuguese that had not been previously used as experimental, control
or distractor items served as primes. They were matched with 18 unrelated words of
English that had not been previously included in the preceding lists. The remaining
Once the creation of the experimental, control, distractor and practice items
lists was complete, two official word lists (A and B) were created for the experimental
task. One half of the experimental and control pairs as well as all of distractor and
practice pairs formed list A. Conversely, the second half of the experimental and
control pairs as well as all of the distractor and practice pairs formed list B. It is
important to note that while all distractor and practice pairs in both lists were exactly
the same, the experimental and control pairs in list A were different in list B. As Table
5 shows, each official word list contained a total of 624 Portuguese prime - English
target pairs, specifically, 147 experimental, 147 control, 294 distractor and 36 practice
pairs.
52
Cognates 83
Experimental
False friends 64
A
Control 147
Distractor 294
Practice 36
Total 624
Cognates 83
Experimental
False friends 64
B
Control 147
Distractor 294
Practice 36
Total 624
All the experimental and control pairs from both word lists as well as all the
distractor and practice pairs were consolidated into a master Excel spreadsheet. This
was done to further facilitate data entry, which will be explained in the following
A word frequency count was obtained for each Portuguese-English cognate and false
friend in the official word lists. This was done because word frequency has major
implications for this study. Specifically, the frequency in which speakers of a language
are exposed to words, particularly in reading, may shape the organization of their
mental lexicon. Words that are more frequently encountered are more salient to
speakers of that language, which probably increases their baseline level of activation.
In addition, since a vast number of words in Portuguese and English are of Latin
roots, it is possible the effects of word frequency interact with the degree of
occurs per million words in collected corpora (Schepens et al., 2013). To that end, to
obtain the frequency of occurrence of each word in the preliminary word list O corpus
do Português5 (Davies, 2017b), a large online corpus of Portuguese words, was used to
obtain word frequency count for the Portuguese words, while Corpus of
Contemporary American English (COCA6) (Davies, 2017a), was used to obtain word
5
O Corpus do Português is a one-billion-word corpus of Brazilian and European Portuguese
words. It was created by Mark Davies (Brigham Young University) and funded by the US
National Endowment for the Humanities (Davies, 2017b).
6
Corpus of Contemporary American English (COCA) is the largest freely available and balanced
corpus of American English. It contains over 560 million words from spoken, fiction, popular
magazines, newspapers, and academic texts (Davies, 2017a).
54
frequency count for the English words. The word frequency count for each of the
Portuguese-English (prime-target) word pairs was then entered into in the master
spreadsheet in two separate columns, one for Portuguese and one for English. It is
important to note that no word frequency count was obtained for the pseudowords
(distractor items) in this experiment, since the selected pseudowords are not actually
similarity, how many letters are shared between word pairs from different languages.
(OS), an algorithm that was adapted from the graphic similarity index (GS), which
was developed by Weber (1970). The original GS index, which was devised as an
which Weber assigned distinctive weights based upon her own intuitions about the
index is expressed by the following algorithm and comprises the following indices, as
F: number of pairs of adjacent letters in the same order shared by the two words.
T: ratio of number of letters in the shorter word to the number of letters in the
longer one.
V: number of pairs of adjacent letters in reverse order shared the two words.
The GS index between two words ranges from 0 [zero] (no common letters) to
1,400 (identical letters). However, the application of the formula by itself can yield
results that are difficult to interpret because the formula is sensitive to word length.
Specifically, not all word pairs comprised of identical words will have the same GS
index (Guasch, Boada, Ferré, & Sánchez-Casas, 2013). For example, the GS index of a
word like trim with itself is 975, whereas the GS index of a word like grime with itself
algorithm. Van Ordern’s algorithm (OS) yields similarity values that vary between 0
al., 2013). Van Orden’s algorithm is expressed below, where P is the first word of a pair
!"(9, ;)
7" =
!"(;, ;)
reversing the order in which they are input in the algorithm yields completely
different orthographic similarity indices. With this drawback in mind, more recent
metric of orthographic similarity. Simply put, the Levenshtein distance between two
substitutions required to change one word into the other. Still, the metric is highly
sensitive to word length. In order to neutralize those effects, Schepens et al. (2013)
devised the normalized Levenshtein distance (NLD), which normalizes the metric
57
and yields similarity scores that always range between 0 [zero] (when word pairs are
irrespective of word length (Guasch et al., 2013). The NLD is calculated according to
Since one of the major goals of the current study is to further understand how
mirror the organization of the bilingual mental lexicon, needless to say, choosing the
most reliable as well as the most up-to-date metric of orthographic similarity was
paramount. For this reason, the NLD seemed the most apposite orthographic
similarity metric to use. To that end, NIM (Guasch et al., 2013), a web-based software
Universitat Rovira iVirgili in Tarragona, Spain, was utilized to calculate the NLD
between the word pairs in the preliminary word list. One of the greatest features of
NIM is the fact that it calculates both Levenshtein distance and NLD as well as
Weber’s GS and van Orden’s OS indices. In addition, researchers can copy and paste
lists of word pairs directly into NIM’s interface, and the software will output a table
58
with all the orthographic similarity indices, which can also be downloaded in
Microsoft Excel format. Once the orthographic similarity values were obtained for all
the word pairs in both word lists, they were entered into the master Excel spreadsheet
terms of distinctive features (Chomsky & Halle, 1992; Clements & Hume, 1995; G. N.
Clements, 1985, inter alia) and other formal primitives of phonological theory.
multidimensional, taking into account not only segmental features but also prosodic
available, many influential studies on the bilingual mental lexicon have often relied
stimuli selection (cf., Comesaña et al., 2012; Friesen & Jared, 2012; Schwartz & Kroll,
2006; Schwartz, Kroll, & Diaz, 2007; inter alia). The present study avoided this
59
similarity using a technique previously employed by Mielke (2012), viz., dynamic time
warping (DTW), which was originally conceived to compare different speech patterns
in automatic speech recognition and measures acoustic similarity between two time-
that two words across different languages are cognates is highly correlated with their
DTW in its essence is a time sequence alignment algorithm that, for the
purposes of the current study, iteratively aligns two wave files in a distance matrix by
warping them at their beginning as well as at their end until an optimal match
between the two wave files is found (Shinde & Pawar, 2014). To that end, the optimal
path between two acoustically similar wave files is a diagonal line. The more the
optimal path deviates from a diagonal line, the less acoustically similar two wave files
are. Figure 8 below illustrates the dynamically time warped the waveforms of the
Portuguese-English cognate pair grafite-graphite and the false friend pair smoking
word grafite with itself and the English word smoking with itself are also provided.
Although the optimal path between the cognate pair and the false friend pair is not
a straight diagonal line, it deviates less from a diagonal line in the false friend pair
60
acoustically similar to the English pronunciation of smoking. The optimal path for
the cognate pair grafite-graphite starts rather jagged and completely deviates from
a diagonal to a horizontal line after the fricative [f], indicating that the
pronunciation the English word smoking. When Portuguese grafite and English
smoking are compared with themselves, the optimal path is a straight diagonal line.
61
DTW of the Portuguese word grafite DTW of the Portuguese word smoking
with the English word grafite with the English word smoking
grafite [ˈɡræ.faɪt]
smoking
0.732404209
DTW of the Portuguese word grafite DTW of the English word smoking with
with itself itself
grafite [ɡra.ˈfi.tʃi]
smoking
quantify the degree to which waveforms of linguistic objects (sounds or words) are
similar to each other. For the purposes of this study, Phonological Corpus Tools (PCT)
(Hall, Allen, Fry, Mackie, & McAuliffe, 2016) was chosen for its easy-to-use interface,
computations unaided once a tab-delimited file that lists all the pairwise
parameters is frequency limits (or range), where researchers can set the minimum
and the maximum frequency range (in Hz) for the acoustic analysis. According to
Thus, “frequency components above 10,000 Hz are not likely to be useful for speech
communication even if the speaker has perfect hearing” (p. 23). In light of this fact,
this study set the limits of the frequency range to 80Hz minimum (to filter out low
is frequency resolution, where researchers can set the number of filters to be used to
divide up the frequency range they previously specified. The default setting of 26
filters was chosen for this study. The third parameter is the number of Mel-frequency
cepstral coefficients (MFCCs), where researchers can set the number MFCCs they
desire, which are a type of representation of a sound that is based on the human ear
63
response to sounds and are extensively used in automatic speech and speaker
parameter is whether PCT should output the acoustic similarity results in a scale that
ranges between 0 [zero] and 1, or use the default inverse similarity scale, which was
chosen for this study. Finally, researchers can request that PCT generates a results
table in the form of a tab-delimited text file, which can be saved on to the computer
and later easily imported into Excel. Figure 9 below is a screen capture of the several
parameters users can set in PCT. Figure 10 illustrates the results table PCT generates
once it completes all the acoustic similarity computations in the pairwise comparison
list.
64
Acoustic Similarity
Compariso n type
Represena tion
Q ..'\nalyze sIrq e d irecto ry
MFCC
o .recto ry :
Q /\mpl nude envelopes
0 Ou tp u t as s11111I
a- ty \0 10 ·1)
/\cou st c sIir11ar ty bere t ts from M ult 1proces s,nq .
Me li voce ssIr g canoe enable<I In P-eferences .
Ca cu a te aCOL:SHC S,r'";)l
larn y Ca cu ate aco1.. st1c s:--11larny
(start ne·N resu ts tao le )
Cance /\bou t ac ou stic sir-1ilarny ..
(add to cu rr ent resu ts ta ole )
Figure 9. PCT screen capture illustrating the parameters users can set for DTW algorithm to be
taken into account during the acoustic analysis
65
l>•,m1Y1ty
Figure 10. Acoustic similarity results table. In the Result column, higher values represent greater
acoustic distances (less acoustic similarity) between the waveforms.
To obtain the waveforms for each Portuguese and English word in the official
word lists, two male native speakers of each language of interest were recorded. To
accomplish this, one native speaker of American English (midwestern accent, native
of Ohio) and one native speaker of Canadian English (southeastern Ontario accent,
native of Cornwall) were recorded reading aloud each English word. Conversely, two
native speakers of Brazilian Portuguese, one native of São Paulo and one native of Rio
de Janeiro, who were born and raised in the metro areas of both cities, were recorded
66
reading aloud each Portuguese word. For the recording session, two spreadsheets
were created (one for each language) and then converted to individual slides in a PDF
file. Each slide had a white background with one word in a legibly large font in its
center. Recordings took place in the Language Acquisition Research Lab at the
University of Ottawa. Readers sat at a desk in front of a computer screen and were
accompanied by the researcher. They were instructed to read aloud each of the words
that appeared in the center of the computer screen as naturally as possible. The
researcher controlled the pace in which each word was displayed. Each reader was
Recorder,” 2019) was selected for the recordings. Audio was captured using a
(.wav) at 44,100 Hz. Audacity® was also used to create a label track that contained the
orthographic representation of each word in the recording. Each labeled word was
then exported as a separate waveform file, with its orthographic representation for its
name, and saved according to reader’s name and recording session. Both official word
lists (A + B) were used to create tab-delimited text files that listed all the pairwise
comparisons in the list according to their specific file location on the computer,
which were then used in PCT to perform the required acoustic similarity
computations.
67
any pronunciation discrepancies that might have occurred between the first and
second recording (before and after the break). To that end, each recording from each
reader of Portuguese was compared with each recording from each reader of English.
In the end, eight sets acoustic similarity comparisons (see Table 6 below) were
performed using PCT. Once all the DTW acoustic similarity values were obtained,
they were entered into an Excel spreadsheet so that the mean crosslinguistic
The experimental task consisted of a lexical decision task with masked priming,
Stimulus Delivery,” 2017), a stimulus delivery and experiment control software widely
Ottawa. Participants sat at a large computer desk in front of a 23-inch LCD monitor.
Before participants were allowed to initiate the experimental task, they engaged in a
prescreening interview with the researcher, in which they were asked a few questions
regarding their personal background and level of education, as well as how they had
learned about the experiment and whether they had any particular questions or
English. The purpose of the prescreening interview was to ensure participants met
the language requirements of the study and to give them a chance to get themselves
provided with the consent form (see Appendix C) and were given a few minutes to
read and sign it. Participants were then assigned a pseudocode that consisted of a
sequence of random letters and numbers so that their data would remain anonymous
(introduced in 3.1., Google Forms, see Appendix D) on the computer as well as the
randomly assigned participants to one of the two word lists as well as instructed them
to enter their assigned pseudocode in a text box that appeared on the computer
screen. Presentation® would then prompt participants to press the blue button on a
Cedrus RB-730 Response Pad (button box) to indicate they were ready commence the
experimental task. The buttons on the button box had four distinct colors (blue,
green, red and white), and the ones participants would be pressing during the task
(blue, green and red) had a label in a legible font indicating their specific function in
the experimental task. Once participants pressed the blue button, they were
presented with the task instructions and instructed to press the blue button again to
begin the (20) practice trials. The purpose of the practice trials was to ensure
participants had understood the task instructions and become familiar with the
procedure. Once participants completed the practice trials, the researcher checked
whether they needed further clarification about the procedure. When participants
were cleared to continue, they were instructed to press the blue button one more time
mask consisting of ten hash marks (##########) was presented in the center of the
70
computer screen for 500ms. Second, the forward mask was followed by the
Portuguese prime in lowercase letters for 60ms. Third, the English target was
presented in uppercase letters for 500ms. Fourth, once the English target was
displayed, participants had 3,000ms (3s) to decide by the press of a button (green =
yes, red = no) whether or not the target was a real word of English. Fifth, as soon as
participants pressed the button to indicate their response, the next prime-target pair
was presented automatically. If participants did not press any button during the
The experimental task was comprised of a total of four blocks of 150 trials, of
which four practice pairs were always displayed at the beginning of each block. A
pause was added at the end of each block, and participants were encouraged to get
up, stretch, rehydrate or use the restroom during this break. On average, participants
completed the task within 45 minutes, including break time. Their reaction time and
In order to facilitate the presentation of the results of the study, this chapter was
divided into two sections, Preliminary Analyses and Mixed-Effects Models. In the
Preliminary Analyses section, both dependent variables, reaction time and accuracy,
and each of the independent variables that comprised the master dataset were
independently analyzed, and whenever plotting the variable enhanced the results
being discussed, plots were provided. The results provided in this section serve as
groundwork for the ensuing section, since only more conventional statistical analysis
tools, such as correlations, t-tests and ANOVAs, were used. In the Mixed-Effects
Models section, one linear mixed-effects analysis of the relationship between reaction
time and the independent variables (lmer model), as well as one logistic regression
analysis between accuracy and the independent variables (glmer model) are
individual effects of crosslinguistic similarity, which was one of the primary goals of
this study. Both SPSS (“IBM SPSS Statistics 25,” 2018) and RStudio (RStudio Team,
2018), were used extensively to carry out the analyses in this section. In addition, both
SPSS and ggplot2 (Wickham, 2016), a data visualization package developed for the
results.
Before proceeding with the data analysis, it is important to underscore the fact
that prime-target pairs that were originally coded as Practice did not enter the
72
practice the experimental task and ask the researcher any questions they might have
had about the procedure. Four practice pairs were also displayed at the beginning of
Stimulus Delivery,” 2017), a stimulus delivery and experiment control software, was
used to control the presentation of the stimuli and record participants’ reaction time
during the study. To that end, Presentation records reaction time in tenths of
which normally report reaction time in milliseconds, the variable RT was converted
from tens of milliseconds to milliseconds. The descriptive statistics for RT, including
7
A brief description of the variable in the master dataset is provided hereinafter between
parentheses.
73
LANGUAGE) are provided in Table 7 below. Kernel density plots (Figure 11) are also
0.0020
0.002
0.0015
Density
Density
0.001
0.0010
0.0005
0.000
0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000
0.0000 Mean Reaction Time
Language Groups
□ Bilinguals □ Functional Monolinguals
0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000
Mean Reaction Time
Figure 11. Kernel density plots illustrating the distribution of RT (overall and by LANGUAGE)
with LANGUAGE as the grouping factor. The t-test showed that functional
major role in mean reaction time when all other variables are jointly taken into
Chapter 3, participants had 3,000ms in each trial (prime-target pair) to decide by the
press of a button whether or not the target word presented was a genuine English
word. Accuracy was thus recorded as a hit (correct answer), incorrect, or miss (when
the allocated 3,000ms was exceeded) for each trial, making it a categorical variable.
the entire experiment, was higher than the number of incorrect answers. However,
when the data is broken down by language group, bilinguals had a higher number of
Functional
14,659 1,759 46
Monolinguals
better understand the relationship between the two dependent variables. Since this
correct and incorrect answers were considered. The correlation yielded a weak yet
.186, p < .001, suggesting that on average as participants’ accuracy increased, their
and to weed out participants with an unusually low percentage of correct answers, a
the total number of hits (correct answers) per participant was divided by the total
number of trials in the experiment. This result was then multiplied by 100%. Because
the percentage of correct answers for one of the bilingual participants was
76
significantly lower than bilingual participants’ mean, that specific participant had to
higher than bilinguals’ (M = 88.72, SD = 7.43), t(33,426) = 109.54, p < .001. Although
this result is only preliminary, it shows that functional monolinguals were on average
more accurate at deciding whether the presented word was a genuine word of English.
proficiency test. Participants’ answers on the test were scored using both the
77
scoring method (ERS), producing two language proficiency variables, SEMAC and
.839, p < .001. An independent samples t-test was performed on both ERS and SEMAC
with LANGUAGE as the grouping factor. With respect to SEMAC, the t-test revealed
bilinguals, t(33,514) = 119.97, p < .001. Similarly, with respect to ERS, functional
t(33,514) = 116.88, p < .001. These results show that irrespective of scoring method
English than bilinguals, which was expected since this group of participants was
comprised of native speakers of the language. Please refer to Table 10 below for the
Table 10. ERS and SEMAC Independent samples t-test (Descriptive Statistics)
Std. Mean
Language Group N Mean
Dev Diff.
Bilinguals 16,464 50.07 11.75
ERS 13.51*
Functional Monolinguals 17,052 63.59 9.31
Bilinguals 16,464 76.86 16.28
SEMAC 15.69*
Functional Monolinguals 17,052 92.55 5.09
*p < .001
78
In order to determine the relationship between both ERS and SEMAC with RT,
weak yet significant negative correlation was obtained for both ERS, r(33,516) = – .10,
p < .001, and SEMAC, r(33,516) = – .13, p < .001. These results indicate that on average
language group.
differently when the data is broken down by LANGUAGE. For bilinguals, a weak
negative correlation is maintained with both ERS and SEMAC (both ps <.001). For
functional monolinguals, a weak positive correlation was found with ERS (p <. 001).
However, no significant correlation was found with SEMAC. See Table 11 below for
Table 11. Pearson product-moment correlations between RT and ERS and SEMAC
* p < .001
79
correlation was found with both variables (both ps < .001). When the English
proficiency data was split by LANGUAGE, significant positive correlations were also
found, indicating that as mean English proficiency increased, so did mean accuracy.
Table 12. Point-biserial correlations between ERS and SEMAC with RAW_ACCURACY
were stronger, SEMAC scores were selected to enter the mixed-model analyses as an
crosslinguistic semantic overlap between a prime and its target: cognates (COG), false
friends (FF), controls (CTRL), and distractors (DISTR). Consistent with the results
discussed in Alves-Soares, (2013) and other previous linguistic experiments that have
utilized masked priming in their methodology, participants’ reaction time was highly
influenced by the degree of semantic relationship between prime and target. Mean
reaction time was lowest for targets that were matched to a prime with which they
share a high degree of crosslinguistic semantic overlap (cognate items), and highest
for pseudoword targets (DISTR items), i.e., when a random word of Portuguese was
matched with a pseudoword target. This pattern is also present when the data is split
similarity was present for both language groups. Splitting the data by LANGUAGE
whether or not the target was a word of English than bilinguals. Refer to Table 13 for
A two-way ANOVA was then performed with RT as the dependent variable and
F(3, 33,508) = 875.88, p < .001, a significant main effect of language group, F(1, 33,508)
81
= 333.88, p < .001, as well as a significant interaction between semantic similarity and
language group, F(3, 33,508) = 10.06, p < .001. Pairwise comparisons using the
faster than bilinguals (p < .001). Regarding reaction time differences due to semantic
distractors (both ps < .001). Participants were also on average 29ms faster to respond
to cognates than false friends (p < .001). No significant mean differences in reaction
Language Semantic
Group Similarity N Mean* Std. Dev.* Min.* Max.*
COG 5,489 657.99 261.42 291.20 3,000
FF 2,890 687.34 285.97 326.00 3,000
Combined
CTRL 8,379 703.50 280.18 339.70 3,000
DIST 16,758 852.22 331.32 313.4 3,000
COG 2,700 680.35 290.99 291.20 3,000
FF 1,416 733.33 332.78 341.00 3,000
Bilinguals
CTRL 4,116 737.77 308.32 345.80 3,000
DIST 8,232 899.03 362.28 332.90 3,000
COG 2,789 636.35 227.17 305.60 3,000
Functional FF 1,474 643.17 223.68 326.00 3,000
Monolinguals CTRL 4,263 670.42 245.58 339.70 3,000
DIST 8,526 807.02 291.37 313.40 3,000
* in milliseconds
language groups and semantic similarity was obtained (see Table 14). The percentage
of correct and incorrect answers was also entered into the breakdown so that any
Raw
Language Group Percentage (%)
Count
incorrect 79 2.9
COG hit 2,615 97.1
Total 2,694 100.0
incorrect 107 7.6
FF hit 1,305 92.4
Total 1,412 100.0
Bilinguals
incorrect 256 6.2
CTRL hit 3,849 93.8
Total 4,105 100.0
incorrect 1,317 16.0
DISTR hit 6,890 84.0
Total 8,207 100.0
incorrect 45 1.6
COG hit 2,739 98.4
Total 2,784 100.0
incorrect 32 2.2
FF hit 1,441 97.8
Functional Total 1,473 100.0
Monolinguals incorrect 111 2.6
CTRL hit 4,138 97.4
Total 4,249 100.0
incorrect 525 6.2
DISTR hit 7,979 93.8
Total 8,504 100.0
correct answers for both language groups while cognate pairs yielded the highest.
Together these two trends indicate a possible main effect of LANGUAGE and
SEMANTIC on RAW_ACCURACY.
(overlap) between primes and targets that were selected for the experiment, the
Normalized Levenshtein Distance (NLD) (Schepens et al., 2013) was selected. The
NLD is the most up-to-date and reliable metric for the purposes of this study. It yields
similarity values that range between 0 [zero] (when prime-target pairs are
identical), irrespective of word length (Guasch et al., 2013). Table 15 below provides
and FF is significantly higher than in pairs classified as CTRL and DISTR. The mean
differences can be more clearly seen in both Table 15 and Figure 12 below.
85
Std.
Semantic Similarity N Mean Min. Max.
Dev
1.00
Normalized Levenshtein Similarity
.80
-L-
*
- *
*
.60 -
0
.40
0
.20 '
0
0
8 '---
.00
Figure 12. Mean differences in crosslinguistic orthographic overlap across semantic categories
showing a higher mean NLD for COG and FF pairs.
86
by Mielke (2012), viz., dynamic time warping (DTW). To that end, recordings of two
native speakers of Portuguese and of two native speakers of English reading aloud the
selected experimental words in their native language were made and used to compute
One hurdle that became apparent during the preliminary stages of the data
similarity value for them could have been computed via the DTW algorithm. As
synthesizer and requesting Siri to speak the pseudowords to the make the recordings
were two options explored, the end result sounded quite artificial and robotic,
Moreover, leaving the cells pertaining to distractor pairs blank in the crosslinguistic
later stages of the data analysis, particularly for fitting linear mixed-effects models.
Thus, it was paramount that the distractor pairs cells be populated with data.
The most elegant solution reached was to populate the cells with data that
data. To achieve this, first the crosslinguistic phonological similarity data was plotted
statistics were obtained. With this information available, using the rnorm function
in R, random numbers that followed a normal distribution with the exact same means
were generated. Afterward, both the original data and the randomly generated data
correlation was carried out and descriptive statistics were obtained to ensure no
disparities had arisen from the procedure. Table 16 and Figure 13 below show the
descriptive statistics and the kernel density plots of the crosslinguistic phonological
similarity data before and after the procedure. To more easily illustrate the mean
88
differences across the four semantic categories, descriptive statistics and box plots of
the data were also obtained for the variable PHONETIC (see Table 17 and Figure 14).
Table 16. Original DTW crosslinguistic phonological similarity data vs. consolidated data
16,758 57.07 5.56 45.50 85.88 33,516 57.07 5.56 35.42 85.87
0.08
0.06
0.06
Density
Density
0.04
0.04
0.02
0.02
0.00 0.00
25 35 45 55 65 75 85 25 35 45 55 65 75 85
Mean Crosslinguistic Phonetic Overlap Mean Crosslinguistic Phonetic Overlap
Original Data Consolidated Data
Figure 13. Kernel density plots of the DTW crosslinguistic phonological similarity data before
(left) and after (right) consolidation.
89
Std.
Semantic Similarity N Mean Min. Max.
Dev
90.00
80.00
DTW Phonetic Similarity
70.00
60.00
50.00
40.00
30.00
Figure 14. Mean differences in DTW crosslinguistic phonological overlap across semantic
categories showing a lower mean for COG and FF pairs. Lower values represent a greater the
degree of crosslinguistic phonological overlap between a prime and its target.
Page 1
90
complete, the relationship between the PHONETIC with both the two dependent
PHONETIC and RT yielded a weak yet significant positive correlation between the
two variables, r(33,516) = .02, p = .002, suggesting that on average participants reacted
word occurs per million words in collected corpora was obtained for each Portuguese
and English word selected for the present study. To that end, O corpus do Português
(Davies, 2017b), a large online corpus of Portuguese words, was used to obtain word
frequency count for the Portuguese words, while Corpus of Contemporary American
English (COCA) (Davies, 2017a), was used to obtain word frequency count for the
English words.
Table 18 below shows the descriptive statistics for both English_Freq and
yielded a word frequency count of zero for all of the pseudoword targets in distractor
pairs selected for the experiment, since these targets are not words that exist in the
91
English language. Because of this, distractor pairs have zero mean English frequency
in Table 18 below.
variables and RT. The results of the correlation between English_Freq and
= .15, p < .001. The correlation with RT indicated a weak yet significant negative
relationship with both English_Freq, r(33,516) = – .11, p < .001, and Portuguese_Freq,
r(33,516) = – .04, p < .001. A point biserial correlation between both word frequency
for both English_Freq, r(33,428) = .06, p < .001, and Portuguese_Freq, r(33,428) = .02,
p < .001.
4.1.2.6. Time
tenths of milliseconds) that elapsed from the presentation of the very first prime-
target pair until the presentation of each of the subsequent pair until the very end of
the experiment. Similar to RT, Time was recorded in tenths of milliseconds, but it was
relationship between Time and RT. A weak yet significant negative correlation was
found between the two variables, specifically, r(33,516) = – .08, p < .001. The
correlation was also run with LANGUAGE as a split factor in order to verify whether
the relationship manifested differently for both language groups. The weak negative
correlation between Time and RT was maintained for both language groups,
specifically, r(16,464) = – .06, p < .001 for bilinguals, and r(17,052) = – .14, p < .001 for
With respect to accuracy, a point biserial correlation was run between Time
and RAW_ACCURACY. The result of this correlation was not significant, r(33,428) =
significant result only for functional monolinguals, specifically, r(17,010) = .03, p <
.001, suggesting, despite being weak, that functional monolinguals were improving
in the experiment. Likewise, TARGET, comprises each of the English targets and
pseudowords included in the experiment. Refer to Appendix F for the complete list
random intercepts.
4.1.2.8. PAIR
number code, which was then encoded by the variable PAIR. This allowed PAIR to be
for the unique number code assigned to each of the prime-target pairs.
8
The point biserial correlation between Time and RAW_ACCURACY was not significant for
bilinguals, r(16,418) = .010, p = .179.
94
4.1.2.9. PARTICIPANT
In order to account for the great deal of individual variation resulting from
analysis as a random intercept. To that end, as each participant in the experiment was
4.1.2.10. OTHER_LANGUAGES
Background Questionnaire whether they had been exposed to (i.e., had received
encodes the participants’ responses to this particular question as either YES or NO.
This reflects the fact that most functional monolingual participants in the
experiment were from the National Capital Region in Canada, an area where English
speakers are exposed to French on a daily basis and often study the language in school
semantic and phonological overlap between French and Portuguese, both Romance
95
language to further understand its role in response latency. Thus, the variable
100.0%
Language Groups
Functional Monolinguals
Bilinguals
80.0%
60.0%
Percent
89.3%
40.0%
62.1%
20.0% 37.9%
10.7%
.0%
NO YES
OTHER_LANGUAGES
Figure 15: Differences between functional monolinguals and bilinguals with regards to
whether or not they have the knowledge of another European language besides English and
Portuguese.
ensuing section, an interim summary of the most import findings is provided below.
96
F(3, 33,508) = 875.88, p < .001, a significant main effect of language group, F(1,
similarity and language group, F(3, 33,508) = 10.06, p < .001. Functional
controls (both ps < .001). Participants were also on average 29ms faster to
bilinguals’ across all semantic categories. Distractor pairs yielded the lowest
percentage of correct answers for both language groups while cognate pairs
yielded the highest. These two findings indicate a possible main effect of
target pairs classified as cognates and false friends is significantly higher than
effect of NLD.
(p = .31).
r(33,516) = .15, p < .001. The correlation between RT and English_Freq was
r(33,516) = – .04, p < .001. The correlation between both types of word
98
r(33,428) = .06, p < .001, and Portuguese_Freq, r(33,428) = .02, p < .001.
< .001. The correlation was maintained for both language groups, specifically,
r(16,464) = – .06, p < .001 for bilinguals, and r(17,052) = – .14, p < .001 for
functional monolinguals. Despite the low rs, these correlations show that
between this dependent variable with Time was not significant (p = 401).
monolinguals, r(17,010) = .03, p < .001. Again, despite the weak r, this
experiment progressed.
besides English.
99
variable, was modeled using a linear mixed-effects model fitted with lmer.
regression mixed-effects model with glmer. Both lmer and glmer are part of the
lme4 package (Bates, Mächler, Bolker, & Walker, 2015) in R (R Core Team, 2019). Both
mixed-effects models were fit in RStudio running remotely on a virtual Linux virtual
instance using Amazon Elastic Compute Cloud (Amazon EC2)9. The results of each
9
Amazon Elastic Compute Cloud (Amazon EC2) is a subscription-based web service that is part
of Amazon Web Services (AWS), a collection of remote computing services that together
make up a cloud computing platform offered over the Internet by Amazon (“Amazon Elastic
Compute Cloud (Amazon EC2),” 2018). Amazon EC2 provides secure, resizable compute
capacity in the cloud, and offers a large number of instances, i.e., virtual computer
computational and data processing needs. For the purposes of this study, Amazon EC2 was
extremely useful during data analysis due to its computation power and processing speed,
specified in both models. Their random intercepts were always included. Random
slopes were included whenever possible; they were excluded if they correlated too
highly (r ≥ .90) with their random intercept or other random slopes. SEMANTIC,
A maximal mixed-effects structure was always sought for both models (Barr,
Lev, Scheepers, & Tily, 2013). Iterations of the maximal model were successively
simplified until the best fit was achieved. During the model simplification process,
the most non-significant effect (highest p-value), starting with any of the specified
interactions, was always removed first. This new iteration of the model was then
refitted. If this iteration of the model successfully converged, subsequently, the anova
function in R was called to compare the simplified (new iteration) model against its
originator (previous iteration) model. The iteration that yielded the lowest Akaike
in consecutive steps. The best model fit was deemed to have been reached when any
converging, produced a singularity fit, yielded a higher AIC value or was statistically
the same its previous iteration (Chi-Square results provided in the anova).
fitting both models, a few data cleaning techniques had to be applied. First, all
101
nloptwrap2, an optimizer function suggested by Bolker (2014) that calls for the
optimx package (Nash, 2014) in R, was used to fit every iteration of both models.
Finally, an inverse transformation (-1000/RT) was applied to the reaction time data
in order to reduce its positive skew. Reaction time data, more often than not,
skewed continuous dependent variable, as it is often the case with reaction time,
violates that assumption. One of the suggested ways to mitigate this is to use invRT,
Refer to Brysbaert & Stevens (2018) for further details on this technique.
PRIME, TARGET, and PAIR. Their random intercepts were always included. Random
slopes with PARTICIPANT were included whenever possible. They were excluded if
they correlated too highly (r ³ .90) with their random intercept or other random
slopes.
The following fixed effects, as well as their interactions with one another, were
specified in the lmer and glmer maximal models: LANGUAGE, SEMANTIC, NLD,
102
Portuguese). As presented below, a number of fixed effects and control variables did
not end up making to the best fit of both models, because they simply did not
The coefficients of effect size (R2 marginal and R2 conditional) were computed
for both models using the r.squaredGLM function included in the MuMIn package
(Bartoń, 2019). R2 marginal estimates the amount of variance accounted for by all the
fixed effects included in the model (i.e., fixed effects, control variables, and
the random effects and fixed effects included in the model (P. C. D. Johnson, 2014;
Nakagawa & Schielzeth, 2013). Additionally, the variance inflation factor (VIF) and
the VIF tolerances were computed for both models in order to assess multicollinearity
between predictors (Field, Miles, & Field, 2012, p. 293). Both models yielded VIF
values well below 10 and VIF tolerance values above .2. Based on these values, it can
be safely concluded that there is no multicollinearity within the data (VIF values for
4.2.1. Model
LMER.BEST.FIT 1 - Liner Mixed-Effects
<- lmer(invRT Model+fitted
~ LANGUAGE*SEMANTIC with LMER
ZNLD_gmc*ZPHONETIC_gmc +
(0+ZEnglish_Freq_gmc|PARTICIPANT) + (0+ZTime_ms_gmc|PARTICIPANT) +
was also found for cognates (p < .001), false friends (p < .01) and distractors (p < .001).
category), cognate pairs yielded on average 22ms faster reaction times, false friend
pairs yielded 14ms faster reaction times, and distractor pairs yielded 97ms slower
reaction times. These findings are in-line with what was initially predicted;
participants’ RT.
104
count increased. A significant main effect of target_length was also found (p < .001).
Longer targets, those comprised of longer letter strings, slowed participants’ reaction
times 11ms on average. In addition, a main effect of Time was encountered (p < .01).
Later trials yielded significantly faster reaction times than earlier ones, i.e.,
participants got on average 19ms faster as they progressed through the experiment.
and Portuguese were on average 45ms slower than participants who only spoke
4.2.1.3. Interactions.
< .01). Specifically, bilingual participants were on average 14ms faster on cognate pairs
interaction was also significant (p < .01). Specifically, prime-target pairs with a high
The model’s marginal effect size estimate was 16.14%. The primary and
secondary fixed effects plus their interactions did not account for much of the
size estimate was 61.43%. Adding the variance accounted for by PARTICIPANT, PAIR,
PRIME and TARGET random effects explained over half of the observed variability.
10
The raw RStudio output of the lmer model is provided in Appendix G.
Appendix G
106
Fixed effects:
invRT RT ms
Effect in ms Std. Error df t value F Pr(>|t|) ,---- d
Estimate Estimate
(Intercept) -1.713 584 0.064 70.91 -26.818 719.205 < 2E-16 ***
LANGUAGEBilinguals 0.199 660 77 0.076 64.02 2.601 6.765 0.011542 * 0.44
SEMANTICCOG -0.068 562 -22 0.020 287.80 -3.451 11.909 0.000643 *** -0.15
SEMANTICDISTR 0.244 681 97 0.027 79.54 8.997 80.946 9.31E-14 *** 0.54
SEMANTICFF -0.041 570 -14 0.020 442.00 -2.044 4.178 0.041527 * -0.09
ZNLD_gmc -0.008 581 -3 0.006 1987.00 -1.295 1.677 0.195404 -0.02
ZPHONETIC_gmc 0.003 585 1 0.003 5485.00 1.267 1.605 0.205051 0.01
ZEnglish_Freq_gmc -0.019 577 -6 0.005 461.70 -3.527 12.440 0.000462 *** -0.04
Zlength_TARGET_raw_gmc 0.030 594 11 0.006 366.70 5.121 26.225 4.92E-07 *** 0.07
ZTime_ms_gmc -0.057 565 -19 0.017 59.40 -3.389 11.485 0.00125 ** -0.13
OTHER_LANGUAGESYES 0.123 629 45 0.067 56.91 1.826 3.334 0.07305 . 0.27
LANGUAGEBilinguals:SEMANTICCOG -0.042 570 -14 0.018 55.20 -2.32 5.382 0.023631 * -0.09
LANGUAGEBilinguals:SEMANTICDISTR -0.005 582 -2 0.035 56.56 -0.128 0.016 0.898777 -0.01
LANGUAGEBilinguals:SEMANTICFF 0.018 590 6 0.015 53.07 1.157 1.339 0.252305 0.04
ZNLD_gmc:ZPHONETIC_gmc 0.008 586 3 0.003 1671.00 2.299 5.285 0.021632 * 0.02
Random effects:
family=binomial, control=glmerControl(optimizer="nloptwrap2"))
SEMANTIC was also found (p < .001). Specifically, irrespective of their language
judging whether or not a target was a genuine English word when they were shown
targets that occur more frequently in the language. A significant main effect of Time
was also found (p < .001). Specifically, irrespective of their language group
i.e., they were more accurate at judging a target as a genuine English word when it
they were more accurate in longer targets. Finally, a significant main effect of English
proficiency (SEMAC) was found (p < .001). Specifically, irrespective of language group
membership, participants who were on average more proficient in English, i.e., who
had scored higher in the cloze proficiency test, were also more accurate in the
experimental task.
4.2.2.3. Interactions.
controls in trials where the target were matched with a Portuguese prime cognate.
The model’s marginal effect size estimate, which takes into account the
response accuracy variability associated with all of the primary and secondary fixed
effects plus their interactions, was 19.39%. The model’s conditional effect size
estimate was 56.25%. Adding the variance accounted for by PARTICIPANT, PAIR,
PRIME and TARGET random effects explained over half of the observed variability.
109
11
The raw RStudio output of the glmer model is provided in Appendix H.
110
Estimates
Fixed effects: Log Odds (Logit) Odds Ratio ...__
Std. Error z value Pr(>|z|)
Log Odds (Logit) CI Lower Limit CI Upper Limit Odds Ratio CI Lower Limit CI Upper Limit
(Intercept) 4.877 -5.337 -4.418 131.248 0.005 0.012 0.234 20.817 < 2e-16 ***
LANGUAGEBilinguals -0.613 0.033 1.194 0.542 1.033 3.299 0.296 -2.073 0.038353 *
SEMANTICCOG 0.284 -0.740 0.172 1.329 0.477 1.187 0.233 1.222 0.221837
SEMANTICDISTR -1.372 0.997 1.748 0.253 2.710 5.744 0.192 -7.166 7.94E-13 ***
SEMANTICFF 0.185 -0.713 0.343 1.203 0.490 1.410 0.269 0.686 0.492957
ZEnglish_Freq_gmc 0.264 -0.431 -0.097 1.303 0.650 0.907 0.085 3.099 0.001942 **
ZTime_ms_gmc 0.174 -0.241 -0.108 1.191 0.786 0.898 0.034 5.136 2.80E-07 ***
Zlength_TARGET_raw_gmc 0.220 -0.363 -0.077 1.246 0.695 0.926 0.073 3.017 0.002549 **
ZSEMAC_gmc 0.496 -0.753 -0.238 1.642 0.471 0.788 0.131 3.773 0.000162 ***
LANGUAGEBilinguals:SEMANTICCOG 0.540 -1.022 -0.058 1.716 0.360 0.944 0.246 2.196 0.028062 *
LANGUAGEBilinguals:SEMANTICDISTR -0.136 -0.161 0.433 0.873 0.851 1.543 0.152 -0.900 0.368654
LANGUAGEBilinguals:SEMANTICFF -0.348 -0.170 0.867 0.706 0.843 2.381 0.265 -1.317 0.18815
Random effects:
Groups Name Variance Std.Dev.
PRIME (Intercept) 0.2766 0.5259
PAIR (Intercept) 0.3275 0.5723
TARGET (Intercept) 1.5102 1.2289
PARTICIPANT (Intercept) 0.6580 0.8112
111
(Anderson, 1983a, 1983b; Collins & Loftus, 1975; Collins & Quillian, 1969; Samani &
nodes; the associations between and among the countless types of linguistic
knowledge are depicted as the connection links. Thus, when a network node is
linked/associated nodes. Network nodes that are repeatedly activated together are
represented closer to one another, with shorter connection links, in order to indicate
lexicon, viz., RHM (J. F. Kroll & Stewart, 1994), BIA (Dijkstra & van Heuven, 1998;
Van Heuven et al., 1998), BIA+ (Dijkstra & van Heuven, 2002) and Multilink (Dijkstra
bilingual visual word recognition, lexical candidates from both L1 and L2 are
lexicon being integrated, i.e., lexical representations from both languages are stored
together.
The current study was conceived upon the principles set forth by spreading
112
activation and concerned with Portuguese-English cognates and false friends with
(semantic) overlap. From a theoretical perspective, its chief goal was to examine how
lexicon as well as to elucidate the lexical selection process. In addition, this study
sought to quantify the specific effects that crosslinguistic form and meaning overlap,
and any of their interactions, as well as word frequency, length, second language
proficiency as control variables exert in this organization and during lexical selection.
To that end, a lexical decision task with masked priming in the forward L1-to-L2
as experimental stimuli were also matched with English targets without form or
items were comprised of Portuguese primes that had not been used as experimental
English monolinguals served as controls. Two mixed-effects analyses were carried out
to analyze the data, and participants’ response latency and accuracy served as
dependent variables. The findings provide empirical support for many of the well-
to the BIA+. Thus, the following discussion was structured primarily in regard to the
113
BIA+’s framework.
5.1. Findings
A main effect of LANGUAGE was observed for response latencies (p < .05). When
other variables are not considered, bilingual participants’ reaction times were
indicating that lexical decision happens in a slower fashion overall for bilinguals. The
main effect of LANGUAGE was also observed for response accuracy (p < .o5).
Bilingual participants’ accuracy, i.e., deciding whether or not a target was an English
word, was lower than functional monolingual participants’. These two results point
to a language switching cost; when bilinguals are primed with an L1 word and the
lexical decision is carried out in their L2, their reaction times are significantly slower
from their L1 to their L2, bilinguals’ lexical decisions were less accurate compared to
English competitors. This reasoning is in line with the findings reported by von
Studnitz & Green (1997), which also employed a masked priming lexical decision
task.
latencies, but not on response accuracy. Both cognates (SEMANTICCOG) and false
114
controls (both ps < .001). Cognates, however, had a significant 8ms speed-advantage
over false friends. It follows then that crosslinguistic semantic overlap between a L1
greater the degree of semantic overlap between an L1 prime and L2 target, as it was
the case with cognates, the faster participants’ reaction times were on average. To
further support this assertion, two other findings regarding crosslinguistic semantic
overlap need also to be taken into consideration. Participants’ reaction times in false
friend trials (SEMANTICFF), i.e., word pairs in which the crosslinguistic form
(orthographic and phonological) overlap between the Portuguese prime and English
target was high but semantic overlap was low (or non-existent), despite being on
average significantly faster than control trials (p < .05), were still less robust relative
(SEMANTICDISTR), i.e., pairs in which a bona fine Portuguese prime was matched
whatsoever), were the slowest of all (97ms on average) in comparison to control trials
(p < .001). Soares and Grosjean (1984), report similar findings with nonwords, with
functional monolinguals’ reaction times were significantly faster in cognate and false
friend trials. This begs the question how functional monolingual participants could
cognates, even when they expressed not having any knowledge of Portuguese during
the intake assessment interview. Cognate and false friend trials have a common
the masked Portuguese prime, its lexical and sublexical orthographic (and
activation. Further, since the effect was also significant in false friend trials
effect) was observed on bilinguals’ response latencies and response accuracy (both ps
< .05). Bilinguals were on average 14ms faster to respond to an English target that was
116
participants were more accurate at deciding whether a target as a bona fide English
participants were not only significantly faster to react but also significantly more
These findings align well with the results of several noteworthy studies on the
bilingual mental lexicon (Altarriba & Basnight-Brown, 2007; Caramazza & Brones,
1979; Cristoffanini et al., 1986; Dijkstra et al., 1999; Dijkstra, Miwa, Brummelhuis,
Sappelli, & Baayen, 2010; Duñabeitia, Perea, & Carreiras, 2010; Duyck, Van Assche,
Drieghe, & Hartsuiker, 2007; Forster et al., 2003; Gerard, Linda, & Scarborough, 1989;
Guasch et al., 2013; Kim & Davis, 2003; Lemhöfer & Dijkstra, 2004; Nakayama et al.,
2012; Peeters, Dijkstra, & Grainger, 2013; Sanchez-Casas, Davis, & Garcia-Albea, 1992;
Schwartz et al., 2007). Taking into account the framework proposed by the BIA+ and
its updated version named the Multilink (both discussed in Chapter 2), they also have
major implications for the inner workings as well as the organization of the
crosslinguistic priming effects are automatic and not under direct control of the
and target. Third, when the degree of crosslinguistic form and semantic overlap
between an L1 prime and an L2 target is high, i.e., when they are cognates, their
accelerate bilinguals’ responses for this group of words. In addition, in the absence of
crosslinguistic priming effects can still occur due to crosslinguistic form overlap.
Finally, these findings support for the language non-selective access hypothesis, i.e.,
lexical candidates from bilinguals’ both languages are simultaneously activated upon
presentation of word in either language during the first stages of visual word
latencies. This interaction, however, did not enter the second mixed-effects model
recollected (see chapter 3, Methodology for further detail on stimuli selection). The
when orthographic overlap between a prime and its matched target is nonexistent,
and 1, when the prime and target are orthographically identical. Dynamic time
and Phonological Corpus Tools (Hall et al., 2016) was the software package used to
implement the DTW algorithm. These two objective metrics allowed this study to
more elegantly intuit how these two variables interact in the Portuguese-English
analysis with this language pair had not been yet attempted.
phonological overlap suggests that a certain level of integration exists between both
the L1 and the L2 and that orthographic representations spread their activation to
the prime become activated and spread their activation in a feed-forward fashion to
both languages are also activated subsequently based on their degree of phonological
“temporal delay assumption” (Dijkstra & van Heuven, 2002, p. 183) deserves merit.
The model proposes that there are time course differences in the contribution of the
two codes in lexical decision, with lexical and sublexical orthographic code being
activated first and L2 lexical and sublexical phonological code being activated slightly
activation spread, particularly in the forward L1-to-L2 direction. It follows then the
significant 3ms inhibitory effect observed with this interaction may be a consequence
of the temporal delay. Alternatively, it is possible that this 3ms delay is a result of the
code (and vice-versa). To that end, a certain grapheme (or its combinations) in
Portuguese may be consistently pronounced the same way in the language, whereas
may induce lateral inhibition, since the non-overlapping phonological code would
120
also activate lexical representations that are less phonologically similar to both prime
English (as well as French, Spanish and Italian), which despite sharing the same
roman script and having a great number of words with 100% orthographic overlap,
One of the control variables that was of particular interest in this study was
on average 6ms faster (p < .001) to select targets that occur more frequently in English,
i.e., have a higher frequency count per million words in the language. In addition,
irrespective of their language group membership, i.e., whether they were classified
high frequency count targets as genuine words of the English language, compared to
121
These two findings provide further insight of the inner workings of the
facilitates lexical decision in terms of speed and accuracy, but this facilitatory effect
is not substantial (only a 6ms facilitation in response times, d= .03). Both groups of
English targets that occur more frequently in the language. This finding is in line with
the BIA+ proposal. The model purports to say that word frequency affects how
activation spreads in the Word Identification System, which, in turn, affects how the
Task Schema selects a specific target. Specifically, frequency of use modulates how
activation spreads from the input string to linked representations in the Word
words have sufficiently higher resting-level activation and, consequently, spread their
activation to other linked representations much faster, which, during the course of
targets have a higher baseline (resting) level of activation, i.e., they are more salient,
candidates and the spread of activation between prime and target occurs at a
The BIA+ accounts for how representations from one language can have
122
language. In respect to the current study, because English was the target language of
the lexical decision task and was also the primary language of communication for all
of the participants, it would be safe to assume that English representations had ample
time to establish themselves in the mental lexical, i.e., increase their resting-level
representations might have been available for the bilingual participants. In this
as their primary language of communication, study and work on a daily basis, and the
lexical decision task was carried out in English, it is possible that “recency of use of
the target language” not only accelerated lexical selection in English but also exerted
top-down inhibition that made lexical selection in the language significantly more
accurate. Had the experimental task been conducted in the backwards priming
frequency effects on both response latency and accuracy would have been detected,
but only for bilingual participants. In order to more precisely quantify the effects of
recency of use, it would be important for future studies to compare bilinguals living
Since the effect of L2 word frequency was shown not to be substantial, instead
123
it was done in the current study, future studies should assemble an experimental
stimuli list in which the frequency of the readings of cognates and false friends is
be important verify whether switching the direction of the lexical decision task, viz.,
differences.
language group membership, processed longer targets, i.e., were comprised of more
letters, on average 11ms slower (p < .001). Besides potentially having been influenced
by the priming direction of the experimental task, this finding suggests a temporal
potentially due to greater memory (cognitive) load requirements. In this way, upon
prime during the first states of visual word recognition. As prime-target pairs in the
experimental task were matched for length, and both languages of the experiment
share the same alphabet, it is possible that the number of lexical competitors (or
neighbors) activated by the L1 prime is a function of the total number of letters that
124
recognizing a target as a genuine English word when they were longer in length (p <
.01). This shows that despite the cost associated with the simultaneous activation of
selection still occurred in a more accurate fashion. This finding provides support for
the existence a task schema, an idea first purported in the BIA+. According to Dijkstra
& van Heuven (2002), the Task Schema, which is in constant communication with the
Word Identification System, is responsible for ensuring proper “safe execution” of the
specific language. Because the effect of target word length affected both bilinguals
properly verify the membership of all the active lexical competitors in the target
language, in this case English, in order to accurately narrow down a specific target.
greater number of lexical competitors allowed the task schema to more accurately
125
select an English target, thereby yielding the significant effect of response accuracy
observed.
task, yielded a significant effect on both response latency (p < .01) as well as response
accuracy (p < .001). Bilinguals’ and functional monolinguals’ lexical decisions were
on average faster (19ms) and more accurate on later trials compared to earlier ones.
Two scenarios may explain these findings. (a) A facilitatory effect of practice:
as participants got more familiar with the lexical decision task, their reaction times
and accuracy in later trials significantly improved, probably also due to the rather
large total number of trials that comprised the experiment. In fact, in the extant
literature the effect of practice has been observed in crosslinguistic lexical decision
tasks with respect to response latency and accuracy when data was analyzed using
despite the highly significant (p < .001) effect of practice, in practical terms it was
response accuracy) and deemed to be a result of the large number of trials. In the
current study, the effect of practice may have been partly an upshot of the large
number of trials as well. In addition, the observed effect was also rather small (19ms
difference in response latency by the end of the task). (b) A facilitatory recency effect:
with English targets that were unrelated in both form and meaning, it is reasonable
could have retained a proportion of their activation and facilitated lexical selection
of targets in later trials. In addition, it is plausible that this recency effect was further
accentuated (additive effect) by the fact that all participants were using English as
their primary language of communication and that the experimental task carried out
in the Portuguese-English direction, with English being the target language. In other
English proficiency had no impact on response latency, and, in fact, it did not
main effect that was highly significant for response accuracy (p < .001), irrespective
of language group membership, and thus, entered the glmer model. Specifically,
participants who obtained a higher score on the cloze proficiency test were on average
language.
Duñabeitia, & Carreiras, 2011c; Finkbeiner, Forster, Nicol, & Nakamura, 2004; Jiang,
1999; Nakayama, Ida, & Lupker, 2016; Nakayama, Sears, Hino, & Lupker, 2013; Wang,
2013; Wen & van Heuven, 2017; Xia & Andrews, 2015). The robust translation
easily in the forward L1-to-L2 masked priming direction than the other way around.
In addition, it has been reported that in order to obtain similar effects in the
presented for longer, greater than the standard Stimuli Onset Asynchrony (SOA) of
Nevertheless, it has also been reported in the literature that the translation
(Duñabeitia, Dimitropoulou, et al., 2010; Duñabeitia, Perea, et al., 2010; Perea et al.,
2008). Additionally, the BIA+ model (Dijkstra & van Heuven, 2002), proposes that as
as well as highly proficient bilinguals would not be significant or, if significant, their
effect size (d) would be small (Wen & van Heuven, 2017).
the current study may have been more proficient in English than the cloze proficiency
test was able to detect. In addition, it is likely that using their L2 (English) as their
between the L1 and the L2, placing English in a more equal position to their L1
(Portuguese). These two factors explain why English proficiency had no effect on
response latency for both bilinguals and functional monolinguals. Regarding the
standpoint, this finding is compatible with the BIA+ framework. Specifically, daily
exposure to their L2 not only helped bilinguals to become more proficient in English
but also allowed them to develop more stable lexical representations in the language
non-target language leading to better overall accuracy (Dijkstra & van Heuven, 2002,
p. 189).
spoke another European language besides English and Portuguese, which for the
purposes of this study was important to account for. Many models of the bilingual
mental lexicon (in particular the BIA and its updated version BIA+) propose that the
129
and that during the first stages of visual word recognition a number of orthographic
orthographic similarity with the input string (Van Heuven et al., 1998). This
form overlap (Cristoffanini et al., 1986). Therefore, participants that speak languages
with a high degree of crosslinguistic form overlap, such as French and Portuguese,
may manifest slower reaction times on a crosslinguistic lexical decision task. To that
and French, both Romance Languages, and between both of these languages and
English. In the current study, all of the selected primes were in Portuguese. Upon
Portuguese prime activated English and potentially French candidates for functional
Ottawa, an institution where students are exposed to both English and French
regularly, in Canada’s National Capital Region, an area where residents are also
exposed to both English and French regularly, it is likely that lexical decision was
slowed down by their knowledge of French. Despite not being the target language,
great degree of orthographic code. Thus, having knowledge of French, or having been
may have learned other Romance Languages, such as Spanish and Italian, whose
this result indicates that more research is required in order to cognize how
In sum, the aforementioned findings provide strong empirical support for the BIA+.
First, the significant condition effects encountered suggest some level of integration
(orthographic and phonological) overlap between them was high (cognates and false
friends). Because the effect was significant irrespective of language group and
crosslinguistic form overlap in cognates and false friends is high, it follows that
crosslinguistic form overlap was likely the responsible factor. Second, crosslinguistic
semantic overlap plays a leading role in the organization of the bilingual mental
lexicon. When the Portuguese prime and the English target shared the same
reaction times were significantly faster than when the Portuguese prime and the
English target did not share the same conceptual representation (false friends). This
effect suggests that cognates may have a special representation in the bilingual
the findings of this study also support the BIA+’s depiction of the bilingual mental
Identification System, which sits atop the hierarchy, and a Task Decision/Schema
participants can be accounted for as being the result of these two complex, yet highly
interactive, cognitive processes. To that end, during the course of visual word
frequency, recency of use as well as proficiency level in the language had increased
languages, which competed against one another. Recognition of the target ultimately
activation spread in the Word Identification System and taking into account the
lexical decision task’s instructions, determined that the target had met recognition
threshold.
Despite all of the evidence gathered in support of the BIA+, the findings above
133
warrant future study in other to address the limitations encountered. First, the pool
was comprised of international undergraduate students from Brazil who had come
program in their respective fields of study. Only a few of the bilingual participants
while all bilingual participants indicated during the intake assessment interview that
English had become their primary language since arriving in Canada, most of them
had learned the language after starting high school in Brazil, which is usually the
norm in the country. Moreover, many of the bilingual participants admitted to having
studied French; some even mentioned they had attended L’Alliance Française in
Brazil for many years while they were also learning English. Adding to this, many
reported they were living in Gatineau, Québec, and had francophone roommates,
i.e., were exposed to French on a daily basis. Although one of the original goals of this
study was to quantify the effect of language dominance, age of English acquisition
Similarly, despite the fact that our functional monolingual participants were
native speakers of English and did not speak or know Portuguese, living in Ottawa,
an officially bilingual city, and working at University of Ottawa, the largest bilingual
university in Canada, meant that this group of participants was still exposed to
134
reported having studied Spanish and Italian. Since Portuguese, Spanish, Italian and
French are Romance Languages and share a great deal of morphological and
phonological similarities, it is possible these very facts skewed the results by making
for the control group, i.e., separating native speakers of English who speak only
English from native speakers of English who speak other romance languages, and not
Second, few studies have shed light on the dynamics of the Task Decision
Schema and the Word Identification System, in particular with respect to how the
types pseudowords (or nonwords) can sway how activation threshold is crossed for
recognition of a word to take place during a lexical decision task (Grainger & Jacobs,
1996; Lima & Huntsman, 1997; Lupker, Brown, & Colombo, 1997; Taylor & Lupker,
2001). To that end, after decision parameters are set during practice trials for an
English-only lexical decision task, the Task Decision Schema can be adapted or fine-
tuned over the course of the experiment, which can significantly skew target
pseudoword targets, if, for instance, all of the pseudowords are orthographically or
even phonologically unpronounceable in the target language. Despite the fact the
135
results of this study demonstrated that both group of participants were slowest on
order to minimize potential stimuli composition effects, it is vital for future studies
it is vital that pseudowords match Portuguese primes in length and that their
orthographic and phonological overlap is low, i.e., they must not be interlingual
homographs or homophones.
should also factor in subjective metrics in mixed-effects model analyses. For instance,
effect size between objective and subjective metrics, or even whether it would be
In conclusion, this dissertation found empirical support for the BIA+ model,
that linguistic representations from bilinguals’ both languages were activated during
orthographic and phonological overlap alone did not facilitate bilinguals’ recognition
136
respond and more accurate to identify English targets that were matched with
effect was further facilitated when the Portuguese prime and the English target
shared the same semantic representation, i.e., when they were cognates. These
findings suggest that cognates may share a single morphemic representation for in
the Portuguese-English bilingual mental lexicon. Moreover, it was found that both
with this group of participants and cognates showed that crosslinguistic overlap
participants were not only significantly faster to react but also significantly more
appears that semantic similarity in fact plays a major role in how activation spreads
crosslinguistically, since in its absence, as it was the case with false friends, no
137
crosslinguistic priming effect was observed. The BIA+ is able to accommodate the
significant effects of target word length on both response times and accuracy, as well
psycholinguistic fields in three major ways. First, despite the great deal of
between Portuguese and English, little research has been directed at expounding how
languages. In fact, the role that both crosslinguistic similarity and English proficiency
play in the organization of the Portuguese-English bilingual mental lexicon has been
addressing these issues in a more quantitative fashion and also provides undeniable
evidence in support of the BIA and BIA+. Second, from a methodological standpoint,
similarity such as dynamic time warping (DTW) and the normalized Levenshtein
distance (NLD); thus, the hurdles and limitations many second language researchers
successfully overcome. Third, although ANOVAs and t-tests have been the primary
studies to analyze bilingual data collected in LDTs, liner mixed-effects models were
138
chosen instead for this dissertation. Besides being more robust and more precise at
variables (reaction time and accuracy), mixed-effects models yielded a more realistic
applied level, the fields of translation and language pedagogy could undoubtedly
7. References
Hirotugu Akaike (pp. 199–213). New York, NY: Springer New York.
https://doi.org/10.1007/978-1-4612-1694-0_15
Amazon Elastic Compute Cloud (Amazon EC2). (2018). Amazon Web Services, Inc.
7.50016-9
Anderson, J. R. (2015). Cognitive psychology and its implications (8th ed.). New York,
Audacity®: Free Audio Editor and Recorder. (2019). Audacity Team. Retrieved from
https://audacityteam.org/
Barr, D. J., Lev, R., Scheepers, C., & Tily, H. J. (2013). Keep it maximal Appendix.
140
https://doi.org/10.1016/j.jml.2012.11.001.Random
9101.2010.02210.x
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting Linear Mixed-Effects
https://doi.org/10.18637/jss.v067.i01
https://stackoverflow.com/questions/21344555/convergence-error-for-
development-version-of-lme4
Bradlow, A., Clopper, C., Smiljanic, R., & Walter, M. A. (2010). A perceptual phonetic
similarity space for languages: Evidence from five native language listener
https://doi.org/10.1016/j.specom.2010.06.003
Brown, J. D. (1980). Relative Merits of Four Methods for Scoring Cloze Tests. The
4781.1980.tb05198.x
Brysbaert, M., & Stevens, M. (2018). Power Analysis and Effect Size in Mixed Effects
https://doi.org/10.5334/joc.10
141
Brysbaert, Marc. (2003). Bilingual visual word recognition: Evidence from masked
phonological priming. In Masked Priming: The State of the Art (pp. 178–188).
https://doi.org/10.4324/9780203502846
Brysbaert, Marc, & Duyck, W. (2010). Is it time to leave behind the Revised
Brysbaert, Marc, Van Dyck, G., & Van De Poel, M. (1999). Visual word recognition in
1523.25.1.137
Caramazza, A. (1997). How Many Levels of Processing Are There in Lexical Access?
https://doi.org/10.1080/026432997381664
Caramazza, A., & Brones, I. (1979). Lexical access in bilinguals. Bulletin of the
Cognition. https://doi.org/10.1037/0278-7393.15.2.316
Chomsky, N., & Halle, M. (1992). The Sound Pattern of English. PsycCRITIQUES.
Clements, G., & Hume, E. V. (1995). The internal organization of speech sounds. In J.
142
Cambridge, MA.
225. https://doi.org/10.1017/S0952675700000440
295X.82.6.407
Collins, A. M., & Quillian, M. R. (1969). Retrieval time from semantic memory.
https://doi.org/10.1016/S0022-5371(69)80069-1
https://doi.org/10.1037/0096-1523.12.2.226
Comesaña, M., Sánchez-Casas, R., Soares, A. P., Pinheiro, A. P., Rauber, A., Frade, S.,
& Fraga, I. (2012). The interplay of phonology and orthography in visual cognate
https://doi.org/10.1016/j.neulet.2012.09.010
Costa, A., Caramazza, A., & Sebastian-Galles, N. (2000). The Cognate Facilitation
7393.26.5.1283
143
Costa, A., La Heij, W., & Navarrete, E. (2006). The dynamics of bilingual lexical
Costa, A., & Santesteban, M. (2004). Lexical access in bilingual speech production:
https://doi.org/10.1016/j.jml.2004.02.002
Cristoffanini, P., Kirsner, K., & Milech, D. (1986). Bilingual lexical representation: The
https://doi.org/10.1080/14640748608401604
https://doi.org/10.1080/01690960903285797
https://corpus.byu.edu/coca/
http://www.corpusdoportugues.org/x.asp
7393.18.5.1001
144
295X.93.3.283
Dijkstra, T., Grainger, J., & Van Heuven, W. J. B. (1999). Recognition of Cognates and
Dijkstra, T., Miwa, K., Brummelhuis, B., Sappelli, M., & Baayen, H. (2010). How cross-
Dijkstra, T., & van Heuven, W. J. B. (1998). The BIA model and bilingual word
Dijkstra, T., & van Heuven, W. J. B. (2002). The architecture of the bilingual word
Dijkstra, T., Wah, BUYTENHUIJS, F., VAN HALEM, N., AL-JIBOURI, Z., DE KORTE,
M., & REKKÉ, S. (2018). Multilink: a computational model for bilingual word
657–679. https://doi.org/10.1017/S1366728918000287
priming effects with low proficient bilinguals. Memory and Cognition, 39(2),
260–275. https://doi.org/10.3758/s13421-010-0004-9
145
https://doi.org/10.1080/20445911.2011.477811
Dimitropoulou, M., Duñabeitia, J. A., & Carreiras, M. (2011c). Two words, one
Duñabeitia, J. A., Dimitropoulou, M., Uribe-Etxebarria, O., Laka, I., & Carreiras, M.
https://doi.org/10.1016/j.brainres.2010.08.066
Duñabeitia, J. A., Perea, M., & Carreiras, M. (2010). Masked translation priming
Dunabeitia, J. A., Perea, M., Carreiras, M., Duñabeitia, J. A., Perea, M., & Carreiras, M.
3169/a000013
Duyck, W., Van Assche, E., Drieghe, D., & Hartsuiker, R. J. (2007). Visual Word
Ferrand, L., & Grainger, J. (1993). The time course of orthographic and phonological
code activation in the early phases of visual word recognition. Bulletin of the
Field, A., Miles, J., & Field, Z. (2012). Discovering Statistics Using R. London, England:
Finkbeiner, M., Forster, K., Nicol, J., & Nakamura, K. (2004). The role of polysemy in
Forster, K. I., & Davis, C. (1984). Repetition priming and frequency attenuation in
Cognition. https://doi.org/10.1037/0278-7393.10.4.680
Forster, K. I., Mohan, K., & Hector, J. (2003). The mechanics of masked priming. In
Masked Priming: The State of the Art (pp. 2–20). Psychology Press.
https://doi.org/10.4324/9780203502846
https://doi.org/10.1017/S1366728910000489
specific-lexical-access-homographs-by-bilinguals
Gollan, T. H., Forster, K. I., & Frost, R. (1997). crucial for Translation Priming With
http://content.ebscohost.com/ContentServer.asp?T=P&P=AN&K=1997-06939-
004&S=L&D=pdh&EbscoContent=dGJyMMvl7ESeqa44v%2BbwOLCmr1Cep7NS
sKu4TLCWxWXS&ContentCustomer=dGJyMPGot1CzrrZLuePfgeyx44Dt6fIA
https://doi.org/10.1080/016909698386393
Grainger, J., & Holcomb, P. J. (2009). Watching the word go by: On the time-course of
Compass. https://doi.org/10.1111/j.1749-818X.2008.00121.x
Grainger, J., Kiyonaga, K., & Holcomb, P. J. (2006). The time course of orthographic
https://doi.org/10.1111/j.1467-9280.2006.01821.x
Grainger, J., Spinelli, E., Farioli, F., Diependaele, K., & Ferrand, L. (2003). Masked
https://doi.org/10.1037/0278-7393.29.6.1256
Guasch, M., Boada, R., Ferré, P., & Sánchez-Casas, R. (2013). NIM: A Web-based Swiss
Hall, K. C., Allen, B., Fry, M., Mackie, S., & McAuliffe, M. (2016). Phonological
http://corpustools.readthedocs.io/en/latest/index.html
Cognition. https://doi.org/10.1037/0278-7393.33.4.645
Jared, D., & Szucs, C. (2002). Phonological activation in bilinguals: Evidence from
https://doi.org/10.1017/S1366728902003024
Jiang, N. (1999). Testing processing explanations for the asymmetry in masked cross-
https://doi.org/10.1017/s1366728999000152
Johnson, K. (2011). Acoustic and Auditory Phonetics (3rd ed.). Malden, MA: Wiley-
Blackwell.
https://doi.org/10.1111/2041-210X.12225
https://doi.org/10.3758/BRM.42.3.627
Keuleers, E., Diependaele, K., & Brysbaert, M. (2010). Practice effects in large-scale
visual word recognition studies: A lexical decision study on 14,000 dutch mono-
https://doi.org/10.3389/fpsyg.2010.00174
Kim, J., & Davis, C. (2003). Task effects in masked cross-script translation and
https://doi.org/10.1016/S0749-596X(03)00093-7
Kondrak, G., & Sherif, T. (2006). Evaluation of several phonetic similarity algorithms
Kroll, J., & Curley, J. (1988). Lexical memory in novice bilinguals: The role of concepts
Kroll, J. F., & Stewart, E. (1994). Category Interference in Translation and Picture
https://doi.org/10.1006/jmla.1994.1008
Kroll, J. F., & Tokowicz, N. (2001). The development of conceptual representation for
words in a second language. One Mind, Two Lang. Biling. Lang. Process., 49–71.
https://doi.org/10.3758/BF03195845
Levelt, W. J. M., Roelofs, A., & Meyer, A. S. (1999). A theory of lexical access in speech
https://doi.org/10.1017/S0140525X99001776
https://doi.org/10.1007/BF00419412
Linck, J. A., & Cunnings, I. (2015). The Utility and Application of Mixed-Effects
https://doi.org/10.1111/lang.12117
SEMAC ) Versus Exact Replacement Scoring Methods ( ERS ) For ’ Cloze ’ Tests :
A Case Study. Asian EFL Journal Quaterly, 8(1), 1–28. Retrieved from
https://www.asian-efl-journal.com/1133/quarterly-journal/semantically-
acceptable-scoring-procedures-semac-versus-exact-replacement-scoring-
methods-ers-for-cloze-tests-a-case-study/#squelch-taas-tab-content-0-0
Lupker, S. J., Brown, P., & Colombo, L. (1997). Strategic Control in a Naming Task.
570–590.
Martin, R. C., & Jensen, C. R. (1988). Phonological priming in the lexical decision
https://doi.org/10.3758/BF03197052
Review. https://doi.org/10.1037/0033-295X.88.5.375
Mielke, J. (2012). A phonetically based metric of sound similarity. Lingua, 122(2), 145–
152
163. https://doi.org/10.1016/j.lingua.2011.04.006
https://doi.org/10.1080/03637758509376100
Nakagawa, S., & Schielzeth, H. (2013). A general and simple method for obtaining R2
Nakayama, M., Ida, K., & Lupker, S. J. (2016). Cross-script L2-L1 noncognate
https://doi.org/10.1017/S1366728915000462
Nakayama, M., Sears, C. R., Hino, Y., & Lupker, S. J. (2012). Cross-script phonological
https://doi.org/10.1080/01690965.2011.606669
Nakayama, M., Sears, C. R., Hino, Y., & Lupker, S. J. (2013). Masked translation
http://www.jstatsoft.org/v60/i02/
Pallier, C., Colomé, A., & Sebastián-Gallés, N. (2001). The Influence of Native-
9280.00383
Peeters, D., Dijkstra, T., & Grainger, J. (2013). The representation and processing of
identical cognates by late bilinguals: RT and ERP effects. Journal of Memory and
Perfetti, C. A., & Bell, L. (1991). Phonemic activation during the first 40 ms of word
Perfetti, C. A., & Tan, L. H. (1998). The time course of graphic, phonological, and
https://doi.org/10.1037/0278-7393.24.1.101
Pexman, P. M., Lupker, S. J., & Jared, D. (2001). Homophone Effects in Lexical
154
Pollatsek, A., Perea, M., & Carreiras, M. (2005). Does conal prime CANAL more than
cinal? Masked phonological priming effects in Spanish with the lexical decision
Potter, M. C., So, K. F., Eckardt, B. Von, & Feldman, L. B. (1984). Lexical and
5371(84)90489-4
Rastle, K., Davis, M. H., Marslen-Wilson, W. D., & Tyler, L. K. (2000). Morphological
context effects in letter perception: II. The contextual enhancement effect and
https://doi.org/10.1037/0033-295X.89.1.60
Sabourin, L., Brien, C., & Burkholder, M. (2014). The effect of age of L2 acquisition on
https://doi.org/10.1017/S1366728913000643
http://cogprints.org/1663/00/act2.htm
https://doi.org/10.1080/09541449208406189
Schepens, J., Dijkstra, T., Grootjen, F., & van Heuven, W. J. B. (2013). Cross-Language
8(5). https://doi.org/10.1371/journal.pone.0063006
Schwartz, A. I., & Kroll, J. F. (2006). Bilingual lexical activation in sentence context.
https://doi.org/10.1016/j.jml.2006.03.004
Schwartz, A. I., Kroll, J. F., & Diaz, M. (2007). Reading words in Spanish and English:
Shinde, R. B., & Pawar, V. P. (2014). Dynamic time Warping using MATLAB & PRAAT.
Soares, C., & Grosjean, F. (1984). Bilinguals in a monolingual and a bilingual speech
mode: The effect on lexical access. Memory & Cognition, 12(4), 380–386.
https://doi.org/10.3758/BF03198298
527–536. https://doi.org/10.1002/j.1545-7249.2008.tb00145.x
27(I), 117–138.
The World’s Top 10 Largest Economies. (n.d.). Retrieved October 1, 2019, from
https://www.focus-economics.com/blog/the-largest-economies-in-the-world
Tzur, B., & Frost, R. (2007). SOA does not Reveal the Absolute Time Course of
https://doi.org/10.1017/s1366728998000352
Van Hell, J. G., & Dijkstra, T. (2002). Foreign language knowledge can influence
157
Van Heuven, W. J. B., Dijkstra, T., & Grainger, J. (1998). Orthographic Neighborhood
458–483. https://doi.org/10.1006/jmla.1998.2584
van Orden, G. C. (1987). A ROWS is a ROSE: Spelling, sound, and reading. Memory &
Van Wijnendaele, I., & Brysbaert, M. (2002). Visual word recognition in bilinguals:
https://doi.org/10.1037/0096-1523.28.3.616
Voga, M., & Grainger, J. (2007). Cognate status and cross-script translation priming.
von Studnitz, R. E., & Green, D. W. (1997). Lexical decision and language switching.
https://doi.org/10.1177/136700699700100102
https://doi.org/10.1080/17470218.2012.716072
http://www.jstor.org/stable/747079
Wen, Y., & van Heuven, W. J. B. (2017). Non-cognate translation priming in masked
Xia, V., & Andrews, S. (2015). Masked translation priming asymmetry in Chinese-
https://doi.org/10.1080/17470218.2014.944195
Ziegler, J. C., Ferrand, L., Jacobs, A. M., Rey, A., & Grainger, J. (2000). Visual and
8. Appendix
FileNun-1>•:09-12-13
~~,I;.
~
...
~~
••
~;:.
,I
Dale <,nniddiY»')'):02/I0/JJl5
Universite d 'Ottawa
Bureau d'ethique et d'integrite de la recherche
University of Ottawa
Off,ce of ResearchEthics and Integrity
FileNumber: 09-12-13
i .-·,,.,.,.,
;,;in.,,.
... t~ .,,_
~~
Date (mm dd/yyyy) : 02/10/JJI 5
Universite d'Ottawa
Bureau d'ethique et d'integrite de Ia recherthe
University of Ottawa
Offa e R.esearthEthics
of Integrity
and
This is to confirm that the University of Ottawa Research Ethics Board identified above, which operates in
accordance with the Tri-Council Policy Statement (2010) and other applicable laws and regul aliens in Ontario,
has examined and approved the ethics application for the ab ave named research project . Ethics approval is
valid for the period indicated above and subject to the conditions listed in the section entitled "Special
Conditions/ Comments ".
During the course of the project, the protocol may not be modified without prior written approval from the
REB except when necessary to remove participants from immediate endangerment or when the modifica!ion(s)
pertain to only aclministra!ive or logistical components of the project (e.g., change of telephone number) .
Investigators must also promptly alert the REB of any changes which increase the risk to participant(s). any
changes which considerably affect the conduct of the project, all unanticipated and harmful events that occur,
and new information that may negatively affect the conduct of the project and safety of the participant(s)
Mo difica!ions to the project, including consent and recruitment documenta!ion, should be submitted to the
Ethics Office for approval using the "Modi fication to research project" form available at:
http :// re search .uo ttawa er/ethics/ submi s si on s-and-revi ews.
Please submit an annual rep art to the Ethics Office four weeks before the ab ave-referenced expiry date to
request a renewal of this ethics approval . To close the file, a final report must be submitted . The se documents
can be found at: http://re search .uottawa ca/ethics/submissions-and-reviews .
If you have any questions, please do not hesitate to contact the Ethics Office at extension 5387 or by e-mail at:
ethics@uOttawa .ca .
Si~ature:
Melanie Rioux
Ethics Coordinator
For Catherine Paquet, Director of the Office of Research Ethics and Integrity
2
550,rue Cwnberland,piece 154 550 CwnberlandStreet, JOom154
Ottawa(Ontario)KIN 6N5 Canada Ottawa,Ontario KIN 6N5 Canade
(613) 562-5387 • Telec./Fax (613) 562-5338
wwwu:cb1urbe
nnttawca/cleontnloeie/
WYCVrereamb
unttawca/ethics!
161
FileNuni>..-:09-12-13
"'
~1$,,,
~ ;i,.
Dale 4)nnidil/Y»')'): 04124/JJl 7
®~
Universite d 'Ottawa
Bureau d'ethique et d'integrite de la recherche
University of Ottawa
Off,ce of Research Ethics and Integrity
550, rue Cumberland, piece 154 550 Cumberland Street, room 154
Ottawa (Ontario) Kl N 6N5 Canada Ottawa, Ontario KIN 6N5 Canada
(613) 562-5387 • Telec./Fax (613) 562-5338
www .recherche.uottawa.caideontologie/www.reseaxch.uottawa.cafethicsl
162
FileNuni>..-:09-12-13
"'
~1$,,,
~ ;i,.
Dale 4)nnidil/Y»')'): 04124/JJl 7
®~
Universite d 'Ottawa
Bureau d'ethique et d'integrite de la recherche
University of Ottawa
Off,ce of Research Ethics and Integrity
550, rue Cumberland, piece 154 550 Cumberland Street, room 154
Ottawa (Ontario) Kl N 6N5 Canada Ottawa, Ontario KIN 6N5 Canada
(613) 562-5387 • Telec./Fax (613) 562-5338
www .recherche.uottawa.caideontologie/www.reseaxch.uottawa.cafethicsl
163
i .-·,,.,.,.,
;,;in.,,.
... t~ .,,_
~~
Date (mm dd/yyyy) : 04f.l4/JJI 7
Universite d'Ottawa
Bureau d'ethique et d'integrite de Ia recherthe
University of Ottawa
Offa e of R.esearthEthics Integrity
and
This is to confirm that the University of Ottawa Research Ethics Board identi lied above, which operates in
accordance with the Tri-Council Policy Statement (2010) and other applicable laws and regul aliens in Ontario,
has examined and approved the ethics application for the ab ave named research project . Ethics approval is
valid for the period indicated above and subject to the conditions listed in the section entitled "Special
Conditions/ Comments ".
During the course of the project, the protocol may not be modified without prior written approval from the
REB except when necessary to remove participants from immediate endangerment or when the modifica!ion(s)
pertain to only adrninistra!ive or logistical components of the project (e.g., change of telephone number) .
Investigators must also promptly alert the REB of any changes which increase the risk to participant(s). any
changes which considerably affect the conduct of the project, all unanticipated and harmful events that occur,
and new information that may negatively affect the conduct of the project and safety of the participant(s)
Mo difica!ions to the project, including consent and recruitment documenta!ion, should be submitted to the
Ethics Office for approval using the "Modi fication to research project" form available at:
https.//research.uottawa.ca /ethics/farms .
Please submit an annual rep art to the Ethics Office four weeks before the ab ave-referenced expiry date to
request a renewal of this ethics approval . To close the file, a final report must be submitted. These documents
can be found at: https://research.uottawa.ca/ethics/forms .
If you have any questions, please do not hesitate to contact the Ethics Office at extension 5387 or by e-mail at:
ethics@uOttawa .ca.
Signature:
Melanie Rioux
Ethics Coordinator
For Catherine Paquet, Director of the Office of Research Ethics and Integrity
2
550,rue Gwnberland,piece 154 550 GwnberlandStreet, room 154
Ottawa(Ontario)KIN 6N5 Canada Ottawa,OntarioKIN 6N5Ganade
(613) 562-5387• Telec.iFax (613) 562-5338
www.recherthe.uottaIW.caideon1ologie/www.researth .uotta"" ·ca/e!hies/
164
i .-·,,.,.,.,
;,;in.,,.
... t~ .,,_
~~
Date (mm dd/yyyy) : 04f.l4/JJI 7
Universite d'Ottawa
Bureau d'ethique et d'integrite de Ia recherthe
University of Ottawa
Offa e of R.esearthEthics Integrity
and
This is to confirm that the University of Ottawa Research Ethics Board identi lied above, which operates in
accordance with the Tri-Council Policy Statement (2010) and other applicable laws and regul aliens in Ontario,
has examined and approved the ethics application for the ab ave named research project . Ethics approval is
valid for the period indicated above and subject to the conditions listed in the section entitled "Special
Conditions/ Comments ".
During the course of the project, the protocol may not be modified without prior written approval from the
REB except when necessary to remove participants from immediate endangerment or when the modifica!ion(s)
pertain to only adrninistra!ive or logistical components of the project (e.g., change of telephone number) .
Investigators must also promptly alert the REB of any changes which increase the risk to participant(s). any
changes which considerably affect the conduct of the project, all unanticipated and harmful events that occur,
and new information that may negatively affect the conduct of the project and safety of the participant(s)
Mo difica!ions to the project, including consent and recruitment documenta!ion, should be submitted to the
Ethics Office for approval using the "Modi fication to research project" form available at:
https.//research.uottawa.ca /ethics/farms .
Please submit an annual rep art to the Ethics Office four weeks before the ab ave-referenced expiry date to
request a renewal of this ethics approval . To close the file, a final report must be submitted. These documents
can be found at: https://research.uottawa.ca/ethics/forms .
If you have any questions, please do not hesitate to contact the Ethics Office at extension 5387 or by e-mail at:
ethics@uOttawa .ca.
Signature:
Melanie Rioux
Ethics Coordinator
For Catherine Paquet, Director of the Office of Research Ethics and Integrity
2
550,rue Gwnberland,piece 154 550 GwnberlandStreet, room 154
Ottawa(Ontario)KIN 6N5 Canada Ottawa,OntarioKIN 6N5Ganade
(613) 562-5387• Telec.iFax (613) 562-5338
www.recherthe.uottaIW.caideon1ologie/www.researth .uotta"" ·ca/e!hies/
165
Target Participants
Description
You are being invited to participate in a Cross -Modal Language Processing study , which is
being conducted by L eonardo Alves -Soares, a Ph .D. student in the Department of Linguistics
at the Un iversity of Ottawa and sup ervised by professor Juana Mu.ii.oz·Liceras. The purpose of
the study is to furth er und erstand cross -language activation during the processing of words
with varying degrees of orthographic and phonetic overlap across the two languag es of a
bilingual. Th e study will take a approximate ly 90 minut es of your time , and it will take place
eith er in the Sound Patterns Laboratory , Simard Hall , room 333 (3 rd floor) , or in the Second
La nguag e Acquisition Research Laboratory , Arts Hall , room 253 , which are locat ed at the
Unive rsity of Ottawa main campus.
You will not r ece ive compensation for your participation in the study . However , if you wish , we
can provid e you with a cert ificat e of participation , which you can include with your CV.
Contact
If you are interested in participating in the study , or if you h ave any questions b efore deciding
whether or not to participate , pleas e contact the exp erim enter , Leonardo Alves -Soares
166
iJ
uOttawa
Exper imenter: Leonardo Alves-Soares
Universit d'Onaw a
Facu l!desans
Unguisrique
Supervisors: Professor Juana Muiioz·Liceras
University of Ottaw a Department of Modern Languages and
F u~yolArt>
Department of Linguistics
Ungulst,cs University of Ottawa
Arts Hall, 70 Laurier Ave. East , room 217
Ottawa , ON K l N 6N5
Parti cipation
The following is the sequence of events that will take place during
the stud y.
[i] Risks
This study does not involve risk or deception.
uOttawa Confidentiality and Anonymity
Your name and contact information will remain strictly
confidential. Your data will be identified by code only. All data
linked to you will match the code previously assigned to you. The
data will not contain your name or any other biographical
Universit~ d'Ottawa information.
Faculte de> arts
Linguisrique Conservation of Data
University of Ottaw a
The data will be kept forever. All (digital ) data collected for the
facultyof Art~ study will be stored in the investigator's main computer, which is
UnguistJCS password ·protected.
Voluntary Participation
You are under no obligation to participate in the study. If you
choose to participate , you may withdraw at any t ime . If you
choose to withdraw, all data gathered up to the time of
withdrawal will be discarded. No negative consequences will
result from either
your refusal to participate or withdraw.
'U 6 B ·S61·5i86
Q 613---561·5 14 1
t
101.am<!r
Ottiwa ON KlN 6N S Clr'lada
WW\v .u()Uow11.u
2
168
[i] Acceptance
By signing below, you acknowledge that you understand the
terms, parameters and conditions of this research study
uOttawa conducted by Leonardo Alves-Soares, Ph.D. student (the
"Experimenter") in the Department of Linguistics at the
University of Ottawa. This study is under supervision of
Professor Juana Muiioz· Liceras and Professor Mar c Brunelle (the
((Supervisors").
Universit~ d'Ottawa If you have any questions about the study , you may contact
Faculte de> arts
either the Experimenter or the Supervisor using the contact
Linguisrique information provided in the beginning of this form.
University of Ottaw a Alternatively, you may also contact the Protocol Officer for
facultyof Art~ Ethics in Research, University of Ottawa, Tabaret Hall, 550
UnguistJCS Cumberland Street, Room 154, Ottawa, Ontario, KlN 6N 5, 613·
562·5387, or at ethics@uottawa. ca.
There are two copies of this consent form, one of which is yours
to keep.
Participant
Experimenter
'U 6 B ·S61·5i86
Q 613---561·5 14 1
t
101.am<!r
Ottiwa ON KlN 6N S Clr'lada Print Name Signature Date
WW\v .u()Uow11.u
3
169
FORMULARIO DE CONSENTIMENTO
Professores Supervisores:
Professora Juana Muiioz·Liceras
Departmento de Linguas Modernas e
Departmento de Linguistica
Universidade de Ottawa
Pavilhao Artes
Avenida Laurier Leste n° 70, sala 2 17
Ottawa , ON KlN 6N5
Proposito do Estudo
0 prop6sito do estudo einv esti gar a ativru;ao simultan ea das duas
lin gu as de um bilingue durante processarnento de palavras
'D 61l-S6Hl86
W 61l-S6H14 1 cognatas .
70 l.awierE.
Ottawa ON Kl N 6NS Canada Parti cipa ,;;iio
www.u0ttow•.ca Durante o estudo, a seguinte cadeia de eventos ira acontecer.
(1) Voce preenchera urn questionario corn questoes sabre o seu background
linguistico.
170
(3) Assim que voce tenninar o teste de proficiencia, voce ira sentar-se em
u Ottawa ftente a uma workstation e fara uma tarefa sabre decisoes lexicas. A sua
acuidade e tempos de respostas serao medidos.
Conserva()iio de Dados
Os dados coletados durante este estudo jamais serao destruidos.
Todos dados (digitais) coletados seriio arquivados no computador
pessoal do pesquisador principal, que e protegido par senha, e uma
c6pia sera armazenada num pen drive criptografado, que ser a
guardado no escrit6rio dos superv1sores no campus da
universidade.
2
171
Participaf,iio Voluntaria
[i] Voce nao estii sob nenhwna obriga<;ao sequer de participar neste
estudo. Se voce decidir em participar, voce pode rescindir sua
participa<;ao em qualquer momenta. Se voce decidir rescindir sua
uOttawa participa9ao, todos os seus dados coletados serao descartados.
Nenhuma consequencia negativa sequer ocorrerii coma resultado
de voce recusar·se de parti cipar neste estudo ou rescindir sua
partici pa9ao
Universit~d'Onawa, Aceita
Facultede, arts Assinando abaixo, voce concorda que voce entende os termos,
Linguisciqu
e param.etros e condi96es deste estudo conduzido par Leonardo
Alves-Soares, estudante de doutorado (o "Investigador Principal")
University of Ottawa
Facultycl Art~ no Departamento de Linguistica na Universidade de Ottawa,
Unguisucs estudo supervisionado pelos pro fessores Juana Munoz·Li ceras e
Marc Brunelle (os "Pro fessores Supervisores").
Participante
Investigador Principal
'U 6 U·S61·5i&6
W 613---561·5141
70lawfert
Ottiwa ON K1N6NS Canada
WMv.uOl:Ulw•.ca
3
172
f- LanguageBackgro • *
• • I 61
Section 1 of 6
LanguageBackgroundQuestionnaire
Form description
Section 2 of 6 ,..
V
ParticipantInformation
Description (optional)
Participant Code *
Section 3 of 6 ,..
V
173
PersonalInformation
Description (optional)
Please enter tl1e month and the year you were born MM-YYYY
Please enter tl1e city and the country in which you were born.
Gender *
1. Female
2. Male
3. other
Sect10n 4 of 6
174
Native LanguageInformation
Description (optional)
0 Englisl1
Q Portuguese
Q otl1er...
0 YES
0 NO
y
Section 5 of 6 A
Other LanguageInformation
Description (optional)
y
Section 6 of 6 A
Q Englisl1
Q Portuguese
0 Otl1er...
Q elementary school
Q l1igl1school
Q college/university
Q Janguage scl10ol
Q otl1er...
Q everyday
Q veryoften
Q often
Q sometimes
Q rarely
Q never
On the scale below, what ls your overall level of proflcfency fn your second *
language?
2 3 4 5 6 7 8 9 10
On the scale below, please rate your level of wrltlng proflclency ln your *
second language?
2 3 4 5 6 7 8 9 10
On the scale below, please rate your level of readlng proflclency ln your *
second language?
2 3 4 5 6 7 8 9 10
On the scale below, please rate your level of oral proflclency ln your second *
language?
2 3 4 5 6 7 8 9 10
DIRECTIONS
5. Spelling will not count against you as long as the words in the blanks are legible.
EXAMPLE
The boy walked up the street. He stepped on a piece of ice. He fell (1) down, but
h e didn't hurt himself.
Man is th e only living creature that can mak e and us e tools. He is th e most
teachable of living b eings, earni ng the name of Homo sapiens. (l) ________ eve r
restless brain has used the 2)________ and the wisdom of his ancestors
run (5)_______ his feet, his hands h ave always (6)_______ free to carry
and to use (7)________ . Man's h a nds have served him well (8) ______ _
his life on earth . His development, (9)_______ can be divided into three major
Up to 10 ,000 years ago, (12) _______ human beings lived by huntin g and
(13) _______ . Th ey also pick e d be rri e s a nd fruits, (14 ) _______ dug for
(16) ________ wom e n act e d as food ga th erers. Since (17) ________ wom e n
p.1
179
to throw a t a n a nimal. (26 ) _______ ston e wa s chipped aga inst a noth er until
(27) _______ had a sharp edg e. Th e primitiv e (28) _______ who first
(32) _______ us eful tool, the spear. Flint, found (33 ) _______ man y
rocks, became a common cutting (34) _______ in the Paleolithic period of man's
know of this man (37 )_______ his stone implements, with whi ch he
(38 ) _______ kill animals, cut up the meat, (39 ) _______ scra pe the
st epp es of Europ e with ou t a fixed (44) _______ , always for ag ing for food . Perha p s
the (45 )_______ carried nuts an d berries in shells (46 ) _____ skins or even
in light , woven (47 ) _______ . Wh erever they campe d, the primitive peop le
(48 ) _______ fir es by strik in g flin t for sparks (49 ) _______ using dri ed
seeds, moss , and rotten (50 ) _______ for tinder . With fir es that h e kin d led
hims elf, man could keep wild an imals away and co uld cook thos e that h e k illed, as well
p.2
180
Word List 1
Page 1 of 16
181
Word List 1
prime TARGET Exp Status Item Nbr
cochilo FLEXIBLE CTRL 66
congelador SYMPATHETIC CTRL 68
corneta FRUIT CTRL 71
couve GEL CTRL 73
criarn;:a GENIAL CTRL 75
cumulo HOUR CTRL 78
curral INGENUITY CTRL 79
dan o SORT CTRL 81
da rdo TAMP O N CTRL 82
debit INTERPR ETATION CTRL 83
delito LICENSE CTRL 86
demao LANGUAGE CTR L 87
demasia LINE CTRL 88
dente MANNE Q UIN CTRL 89
destrrn;:o ULTIM ATELY CTRL 92
eg ua RANGER CTRL 94
elo MINERAL CTRL 96
fantoche MOUSE CTRL 102
far da HASTE CTRL 103
fenda NAT UR E CTRL 105
frei SUM CTRL 110
freio TAX CTRL lll
Page 2 ofi6
182
Word List 1
prime TARGET Exp Status Item Nbr
limo RUMOR CTRL 158
linho SAMBA CTRL 159
liquidificador SENTIMENT CTRL 160
lixo SIGNIFICANCE CTRL 162
loc;ao SIMPLE CTRL 164
Iona TEMPERATURE CTRL 167
louco TOMATO CTRL 170
manchete TRANSFORMATION CTRL 172
manicure TREPIDATION CTRL 174
manobra TRIUMPH CTRL 175
marimbondo ULTIMATUM CTRL 177
meado VEGETABLE CTRL 179
meia VEIN CTRL 180
melancia VEHICLE CTRL 181
metade RECIPIENT CTRL 183
m e tido VINEGAR CTRL 184
multa VENG EANCE CTRL 186
musculac;ao VOWEL CTRL 188
nada VOTE CTRL 189
nau ZEBRA CTRL 190
obra ARM CTRL 197
olaria ASSIST CTRL 198
ombro ACTUAL CTRL 199
orn;a AUDIENCE CTRL 200
paiol BANK CTRL 203
eaeo BEEF CTRL 205
paquiderme BOND CTR L 206
pipa COLLAR CTRL 217
prata COMPROMISE CTRL 221
preguic;a CONCOURSE CTRL 223
prenda CONCEAL CTRL 224
propina COSTUME CTRL 226
punho CUSP CTRL 229
eure DATA CTRL 230
quadra DESSERT CTR L 231
raio ENJOY CTRL 237
ralo EXPERT CTRL 238
ramo EXQUISITE CTRL 239
rapaz OR DINARY CTRL 292
rena EXCITING CTRL 242
Page 3 ofi6
183
Word List 1
prime TARGET Exp Status Item Nbr
renda EXIT CTRL 2 43
reu FACILITY CTRL 246
roda GRIP CTRL 249
ruga HOME CTRL 251
saguao HOSPICE CTRL 252
saia IDIOM CTRL 2 53
saldo INSCRIPTION CTRL 256
sarda INGENIOUS CTRL 254
sinal INSTANCE CTRL 257
sobra JAR CTRL 258
sobrado JOURNAL CTRL 259
solteiro LACE CTRL 260
solrn;:o LAMP CTRL 261
sortudo RECOURSE CTRL 263
sujeira LASER CTRL 264
toldo LENS CTRL 267
trilho LOCATION CTRL 269
tristeza LUNCH CTRL 270
unha MAGAZINE CTRL 271
urubu MASCARA CTRL 273
velho MALL CTRL 275
velocimetro MASTER CTRL 276
verruga NOTICE CTRL 278
veu OUTDOOR CTRL 281
visom PAVEMENT CTRL 287
vovo POST CTRL 289
voz PREJUDICE CTRL 291
aba HEVA D
abestalhado CHECKLEBURST D 2
a<;ucar HARETS D 3
adorno NULIPS D 4
agriao QUELTER D 5
agu a CAHED D 6
alavanca BETTULO D 7
alcatra VALLUME D 227
alergia OFFEN D 8
alfazema HOOPLES D 9
aluguel SUNTELO D 10
amaciante FRONG D 11
amendoim BELISM D 12
Page 4 ofi6
184
Word List 1
prime TARGET Exp Status Item Nbr
andaime SEGRETS D 13
arco-iris OUTBRAW D 14
asno OLEALD D 266
ata KIFF D 15
aterrisagem PAMA D 16
azeitona CUNWREMY D 17
babosa WINOOZE D 275
bafo VEAST D 18
bagulho REOCCUS D 19
bandeja PUNOUN D 20
banguela MAIATROSE D 21
banheira INWORM D 22
baranga MUNPY D 23
barco OVENCY D 24
barro ESUYP D 284
barulho RERARD D 25
baunilha RERATE D 26
berimbau BESLOOR D 2 53
bina TOOGIT D 282
biscate CHOR ENN D 270
bocejo VOQUEV D 267
bofe MICRONY D 27
bofetada TRONSHAT D 28
borra ch eiro MOGININ D 29
brega BETIPE D 30
bru xa HOLS D 31
brn;o voov D 226
cac;amba CHEAHOOF D 242
cachac;:a NUCK E D 32
cafetao SWOPOM D 241
cafofo PLOCK D 33
cafun e HUNM D 34
calc;a da CHEIR D 35
camada MUEBA D 36
cambio MERIP D 37
caneca UN ELIND D 235
canet a IADLOP D 38
can gote MAIATED D 39
capim BELYCH D 40
capo CHEESTNAIR D 41
Page 5 ofi6
185
Word List 1
prime TARGET Exp Status Item Nbr
caraca JIOFRAX D 246
carango TINEREW D 276
careca MODGONE D 237
carioca WERRADITH D 271
caroc,:o RODRUP D 278
carrasco KARPLE D 42
carreta TELCI D 43
carroc,:a MORATH D 279
casa PIFFURE D 44
cascavel INTIENT D 45
caxumba DRIRATHIEL D 245
cenoura BEORARNI D 46
cera KREEPEE D 47
chacara UNDERDOUG D 248
chacrete TOXIMBLE D 264
chafariz POLLNEST D 48
chao SOTTA D 49
chicoria ODRANOEL D 50
chinelo OPERTOT D 51
chuchu SHABI D 52
cidadao ENFLEP D 53
cinto REVENTIAL D 54
coisa TEBS D 55
col eira JOHA CKLE D 236
colete ADNANREF D 56
comicio JOVAPHILE D 225
comid a PROFINROR D 57
coqueluche KULATOR D 58
correio HARP ENITAGE D 59
couro FOMMER D 60
coxinha PLOOSNAR D 230
cozinha CHOOR D 61
dende SUMNIKA D 6z
desemerego FUTUKE D 63
deus WOTNOT D 64
dragao TESBAR D 65
erva ZESHO D 66
esfre gao DARNET D 67
esgoto LEPSET D 68
estrum e CHACAKA D 247
Page 6 ofi6
186
Word List 1
prime TARGET Exp Status Item Nbr
faro! RESAIX D 281
fechadura DINNY D 69
feijoada PROTHORACLED D 70
ferro }UNSET D 71
fiel SERAOS D 72
foca WENSH D 73
fogo BIEDE D 74
fogueira MBSON D 75
framboesa RELONE D 76
frigideira KECHIN D 77
fronha CHURRUSTUL D 78
frouxo WALNERT D 79
furacao TRINETTE D So
furo TWM D 294
gaita NIETWE D 277
garfo PLAMPLE D 81
gentalha BROXNER D 82
geringorn;a MONTAQUEST D 83
girico GROJET D 84
goela MIRESA D 251
goma NEKMIT D 283
graveto OBBISN D 85
gravura NOURINE D 86
gula BUENO D 293
hene JOURC H D 87
hieoteca JOENTELINITY D 88
h6spede FEA D 89
im6vel CAKESKI D 90
impressora DOCTURE D 91
inhame HEMICAL D 92
invalidez NEXTING D 93
jacare PROUSE D 94
janela PUBIEN D 95
jeca TROSP D 96
jejum NONVICT D 97
joia CHIER D 98
jumento ADEMPT D 99
lagar ta WITHNEN D 100
lagar tixa MINSE D 101
lap ela PORTHOLOBATE D 102
Page 7ofi6
187
Word List 1
prime TARGET Exp Status Item Nbr
laranja FROMMIN D 103
lareira ANUNAT D 104
leite MEGTA D 105
lerdo IBBLY D 106
lesma QUONBI D 107
liminar KROWT D 108
lula RACHLER D 109
lustre CUTION D 110
luz RELAND D 111
macaco CLORE D 112
ma<;arico LAWMER D 113
macete NUSHTRA D 114
ma<;o PLINGT D 115
macumba PEEVISITION D 116
madeixa RERATE D 117
mala PARIDIK D 118
malha,;ao JAYL D 119
mamadeira EDADIVITAN D 120
mangueira TRIME D 121
manicomio YONNAL D 122
manifestante LOGITUDE D 123
maremoto YENBATUR D 124
marimba SAFOME D 239
matra ca CH EILIT H D 252
me leca AFFER D 125
merenda GLARETRAM D 262
mes DURETS D 126
mesa GLANGING D 127
meta LACTER D 128
mexerica BUMMOXA D 240
milho JUESMA D 129
minh oca WANTEMER D 130
mo<;oila SASAROO D 2 43
mocreia NETTE D 131
moeda SUBAVA D 132
moela TITUI D 133
mofo ZOIT D 134
moita RUCTULE D 135
molamb ento MUFTA D 139
m olh o NEVESY D 136
Page 8 ofi6
188
Word List 1
prime TARGET Exp Status Item Nbr
monge RACA D 137
moqueca CHERSHOEE D 255
morango CLICEZO D 138
mudo ECKLE D 261
muleta TUPACASE D 254
mulher APINGO D 140
mundo NYNDRA D 141
muquifo CUTLUS D 142
mutreta CATACELLA D 143
nabo RUITA D 144
namoro BETTARAN D 145
nanico ANIRAM D 146
nariz FUDE D 147
nenem COMED D 148
ninho ZIBLA D 149
6culos SIEDCASE D 150
olheira SANNY D 151
olho DRAKET D 152
ouvinte RUXTA D 153
earceiro FLIPKNIPS D 154
easta EVENUL E D 155
eata TUPIB D 288
pauta MULTIVE D 156
paviu CHILLAID D 263
pei xe FUEVE D 157
een a WASMAC D 158
perici a VIOTTIS D 244
periguete AFFERARON D 268
peruca BIASDO D 257
picanha PELOO ZOID D 228
pichai;:ao OHCAMARG D 159
pn eu HAPPER D 160
po chete NAPERONE D 286
eocilg a LALONTENT D 161
pomar NECCA D 162
praia HYDRING D 163
presunto PADA D 164
erimavera INHOFT D 165
queda LAUDEITNESS D 166
quib e BUMOLA D 232
Page 9 ofi6
189
Word List 1
prime TARGET Exp Status Item Nbr
quinze PHICOLY D 167
rabisco GROLE D 168
rabo SCHNOL D 169
rachadura SPIPSCOPS D 170
ram;o KERBAL D 171
raeosa POMODO D 172
reda<;ao GULLSTONT D 173
reduto KUCALIR D 174
regua SADOKIN D 175
relogio ACAVO D 176
remo LANKERTORT D 177
revoga<;ao KROSHIST D 178
rissole PRUVIA D 231
rodovia KATASTRY D 179
roleta BOCILILE D 249
rolha OVERSAL D 180
rotulo PORIC D 181
sabor SETENFI D 182
sacana VASAGLE D 292
saci OSHUN D 183
sacole TESHEKUR D 184
saleicao DIGISOL D 234
salsinha HIABBYA D 185
sap eca OLIELLE D 238
sarro BISTUP D 287
sebo JEREN D 273
sede YEAVE D 186
serelepe LOOPLAB D 229
seta ROINAD D 233
sinuca SEILIU D 256
sol ENENTINESS D 187
surdo YOFFA D 260
sutia COSOMOQ D 188
tabefe SPUSIOUS D 291
tamandua MARESOM D 189
tapete FROWING D 190
tarado QIAMETH D 285
tatu ERAOW D 259
teclado MO VINY D 191
terr em oto AVILI D 192
Page 10 ofi6
190
Word List 1
prime TARGET Exp Status Item Nbr
tes ta OPONT D 193
tigela ARIGAIT D 194
tormenta IMPLOCE D 195
torresmo FLIO NDESO D 250
toucinho RANTOSTN ER D 196
trai c;:ao KONKERS D 197
tranco ACAER D 272
trema ALQUE D 198
tribufu HENDASSA D 274
trinco REILTAS D 289
troco SATEIL D 290
troc;:o YABATA D 199
tromba MURER D 200
trono QUINK D 201
trote YIMELLO D 280
tr uta FRUGTER D 202
tubarao CO NT ERAB D 203
tutu UMPRE D 204
vagao O NAMA D 269
varal ELOCI N D 205
var eio COPIC D 206
vazamento ZUZUUT D 207
vazao SOAXCAS E D 208
vela PUSOON D 209
ven cimento CINT RERY D 210
ver ao YULANAR D 211
vereda DORITY D 212
ve rniz TAMTOP D 213
vesgo GORMET D 214
vista TELVIT D 215
vitrin e HIPPLED D 216
vit ro BETIFT D 217
vitro la NOVALY D 258
viveiro ZEANDROT D 218
xav eco NOCOBOT D 265
xilindr6 GAULIN D 219
xinxim PHREST D 220
xod6 ATINTE D 221
zabumba SHINESH D 222
zagu eiro BRUXET D 223
Page 11ofi6
191
Word List 1
prime TARGET Exp Status Item Nbr
zarolh o NENURIA D 224
acento ACCENT F
adepto ADEPT F 2
adjunto ADJUNCT F 3
agenda AGENDA F 4
ae licac;:ao APPLICATION F 5
assist e ASSIST F 7
au dienc ia AUDIENCE F 9
banco BANK F 12
bife BEEF F 14
brando BRAND F 17
carta CART F 19
cartao CARTON F 20
ch oe e CHOP F 23
chute CHUTE F 24
co lar CO LLAR F 26
co nce ito CO NCEIT F 31
cova COVE F 36
cus e CUSP F 38
data DATA F 39
deserto DESSERT F 40
diversao DIVERSION F 42
dra ga DRAG F 43
ed itor EDITOR F 44
esperto EXPERT F 46
estr angeiro ESTRANGED F 48
estr anho STRANGER F 49
exito EXIT F 51
fabrica FABRIC F 53
facilidade FACILITY F 54
gradu ac;:ao GRADUATI O N F 55
gratuito GRATUITY F 56
inscric;:ao INSCRIPTION F 65
lanche LUN CH F 77
legend a LEGEND F 72
locac;:ao LOCATION F 76
maior MAYOR F 79
ma scara MASCARA F 80
mau MALL F 82
noticia NOTI CE F 85
Page 12 ofi6
192
Word List 1
prime TARGET Exp Status Item Nbr
novela NOVEL F 86
oficio OFFICE F 87
ordinario ORDINARY F 127
outdoor OUTDOOR F 88
pao PAN F 89
earente PARENT F 90
eastel PASTEL F 91
pavimento PAVEMENT F 93
pote POT F 96
erejuizo PREJUDICE F 97
preservativo PRESERVATIVE F 98
pressa PRESS F 99
pretende PRETEND F 100
recurso RECOURSE F 106
resumo RESUME F 107
simpatico SYMPATHETIC F 112
smoking SMOKING F 113
sorte SORT F 115
sueorte SUPPORT F 117
tameao TAMPON F 118
taxa TAX F 119
tenente TENANT F 120
turno TURN F 123
vento VENT F 125
vila VILE F 126
abajur BANDOVA p
acre DRIM p 2
apito WHISTL E p 3
aranha SHENDON p 4
aumento RAISE p 5
balde WOMT p 6
batata DONCE p 7
bebida BEVERAGE p 8
broca YOENK p 9
caixote KAHOE p 10
campo DIELB p 11
carretel REEL p 12
cogumelo MUSHROOM p 13
crern;a BELIEF p 14
descarado BLATANT p 15
Page 13 ofi6
193
Word List 1
prime TARGET Exp Status Item Nbr
desejo URGE p 16
en uia EEL p 17
ferrugem RUST p 18
gema YOLK p 19
irritado LASATY p 20
isoeor STYROFOAM p 21
lingui<;a SAUSAGE p 22
luva GLOVE p 23
magricela LANKY p 24
manivela CRANK p 25
marinheiro SITUYA p 26
mobilia FURNITURE p 27
6bito PRINKT p 28
oficina HENTIOM p 29
recife REEF p 30
s6tao ATTIC p 31
tesoura SCISSORS p 32
tesoura SCISSORS p 32
touro BULL p 33
valsa WALTZ p 34
vagueiro COWBOY p 35
abrueto ABRUPT C
adotivo ADOPTIVE C 2
alcool ALCOHOL C 4
angulo ANGLE C 6
base BASE C 13
bateria BATTERY C 14
botao BUTTON C 17
capacidade CAPACITY C 21
caso CASE C 24
cheque CHEQUE C 25
coluna COLUM N C 28
comatoso COMAT OSE C 30
comed ia COM EDY C 31
complicado COMPLICATED C 32
computador COMPUTER C 33
condicional CONDITIONAL C 34
confidente CONFIDENT C 35
conforto CO MFORT C 36
cortina CURTAIN C 39
Page 14 ofi6
194
Word List 1
prime TARGET Exp Status Item Nbr
criticismo CRITICISM C 41
cubo CUBE C 42
dama DAME C 43
detergente DETERGENT C 47
diferente DIFFERENT C 48
distribuic;:ao DISTRIBUTION C 49
elastico ELASTIC C 50
elevado r ELEVATOR C 51
entrada ENTRANCE C 52
estueido STUPID C 53
excelente EXCELLENT C 54
fantasia FANTASY C 56
febre FEVER C 58
fezes FECES C 59
flexivel FLEXIBLE C 60
frustrado FRUSTRATED C 63
fruta FRUIT C 64
gel GEL C 66
genial GENIAL C 68
girafa GIRAFFE C 69
maneguim MANNEQUIN C 79
maguina MACHINE C 80
mineral MINERAL C 84
m osaico MOSAIC C 86
mosquito MOSQUITO C 87
mostarda MUSTARD C 88
museu MUSEUM C 91
natureza NATURE C 92
ocasiao OCCASION C 96
outro OTHER C 100
paciencia PATIENCE C 101
pagam ento PAYMENT C 102
pape l PAPER C 103
eargue PARK C 104
piano PIANO C 107
planta PLANT C 108
prisao PRISON C 111
eroficiencia PROFICIEN CY C 112
projetor PROJECTOR C 113
pr 6pri o PROPER C 114
Page 15 ofi6
195
Word List 1
prime TARGET Exp Status Item Nbr
rato RAT C 120
referencia REFERENCE C 123
reparo REPAIR C 126
resposta RESPONSE C 128
restaurante RESTAURANT C 129
ritmo RHYTHM C 131
rosa ROSE C 133
shampoo SHAMPOO C 138
sofa SOFA C 142
sucesso SUCCESS C 143
testamento TESTAMENT C 145
texto TEXT C 146
transformac;:ao TRANSFORMATION C 149
trepidac;:ao TREPIDATION C 151
ultimato ULTIMATUM C 154
vagabundo VAGABOND C 155
vegetal VEGETABLE C 156
veia VEIN C 157
verme VERMIN C 159
vinagre VINEGAR C 160
vinganc;:a VENG EANC E C 161
vocac;:ao VOCATION C 162
zebra ZEBRA C 165
zero ZERO C 166
Page 16 ofi6
196
Word List 2
Pag e 1 of 16
197
Word List 2
Page 2 of 16
198
Word List 2
Page 3 of 16
199
Word List 2
Page 4 of 16
200
Word List 2
amaciante FRONG D 11
amendoim BELISM D 12
andaime SEGRETS D 13
arco-iris OUTBRAW D 14
asno OLEALD D 266
ata KIFF D 15
aterrisagem PAMA D 16
azeitona CUNWREMY D 17
babosa WINOOZE D 275
bafo VEAST D 18
bagulho REOCCUS D 19
band eja PUNOUN D 20
banguel a MALATROSE D 21
banheira INWORM D 22
baran!la MUNPY D 23
barco OVENCY D 24
barro ESUYP D 284
barulho RERARD D 25
baunilha RERATE D 26
berimbau BESLOOR D 253
bina TOOGIT D 282
biscate CHORENN D 270
bocejo VOQUEV D 267
bofe MICRONY D 27
bofetada TRONSHAT D 28
borracheiro MOGININ D 29
bre ga BETIPE D 30
bruxa HOLS D 31
buc;o voov D 226
cac;amba CHEAHOOF D 242
cachac;a NUCKE D 32
cafetao SWOPOM D 241
cafofo PLOCK D 33
cafune HUNM D 34
calc;ada CHEIR D 35
cam ada MUEBA D 36
cambio MERIP D 37
Pag e 5 of 16
201
Word List 2
Page 6 of 16
202
Word List 2
Page 7 of 16
203
Word List 2
jeium NONVIC T D 97
joia CHIER D 98
jumento ADEMPT D 99
lagarta WITHNEN D 100
m exerica BUMMOXA D 2 40
milho JUESMA D 12 9
minhoca WANTEMER D 13 0
Page 8 of 16
204
Word List 2
Page 9 of 16
205
Word List 2
Page 10 of 16
206
Word List 2
Page 11of 16
207
Word List 2
balcao BALCONY F 11
batom BATON F 13
bonde BOND F 15
brac;:o BRASS F 16
bussola COMPASS F 18
chape u CHAP EL F 21
ch ato CHAT F 22
cliq ue CLIQUE F 25
co legio COLLEGE F 27
competic;:ao COMPETITION F 28
comereensivel COMPREHENSIBLE F 29
comeromisso COMPROMISE F 30
conc ur so CONCOU RSE F 32
co ns elho CON CEAL F 33
copo CUP F 34
costume COSTUME F 35
curto CURT F 37
discus sao DISCUSSION F 41
duelo DUEL F 128
enjoo ENJOY F 45
esq uis ito EXQUISITE F 47
exc itant e EXCITING F 50
fa FAN F 52
grip e GRIP F 57
Page 12 of 16
208
Word List 2
Page 13 of 16
209
Word List 2
campo DIELB p 11
carretel REEL p 12
cogumelo MUSHROOM p 13
crem,:a BELIEF p 14
descarado BLATANT p 15
des ejo URGE p 16
engui a EEL p 17
ferrugem RUST p 18
ema YOLK p 19
irritado LASATY p 20
isopor STYROFOAM p 21
lingui<,:a SAUSAGE p 22
luva GLOVE p 23
magri cela LANKY p 24
m anivel a CRANK p 25
marinheiro SITUYA p 26
mobilia FURNITURE p 27
6bito PRINKT p 28
oficina HENTIOM p 29
recife REEF p 30
s6tao ATTIC p 31
tesoura SCISSORS p 32
tesour a SCISSORS p 32
touro BULL p 33
valsa WALTZ p 34
vaqueiro COWBO Y p 35
alarme ALARM C 3
alinhamento ALIGNMENT C 5
assinatura SIGNATURE C 7
assistenci a ASSISTANCE C 8
astuto ASTUTE C 9
Page 14 of 16
210
Word List 2
banda BAND C 11
bar BAR C 12
bee BEIGE C 15
bomba BOMB C 16
bravo BRAVE C 18
cafe COFFEE C 19
camelo CAMEL C 20
cargo CARGO C 22
carro CAR C 23
chocolate CHOCOLATE C 26
cirurgia SURGERY C 27
coma COMA C 29
constipai;:ao CONSTIPATION C 37
convenien te CONVENIENT C 38
creme CREAM C 40
demanda DEMAND C 44
desgosto DISGUST C 45
desgrai;:a DISGRACE C 46
face FACE C 55
fatalidade FATALITY C 57
formato FORMAT C 61
freezer FREEZER C 62
fundamental FUNDAMENTAL C 65
gelatina GELATIN C 67
!:lrafite GRAPHITE C 70
hora HOUR C 71
in!:lenuidade INGENUITY C 72
interpretai;:ao INTERPRETATION C 73
irado IRATE C 74
liberd ade LIBERTY C 75
liceni;:a LICENSE C 76
lingua LANGUAGE C 77
linha LINE C 78
medalha MEDAL C 81
memoria MEMORY C 82
m eta l METAL C 83
monstro MONSTER C 85
motocicleta MOTOR CYCLE C 89
Page 15 of 16
211
Word List 2
Page 16 of 16
212
LMER Model
##
## Attaching package: 'lmerTest'
library(optimx)
library(nloptr)
213
library(MuMIn)
library(r2glmm)
library(car)
#Data optimizer
defaultControl <- list(algorithm="NLOPT_LN_BOBYQA",xtol_rel=1e-6,maxeval=1e5)
## Scaled residuals:
## Min 1Q Median 3Q Max
## -7.4402 -0.6202 -0.0386 0.5881 6.1591
##
## Random effects:
## Groups Name Variance Std.Dev. Corr
## PRIME (Intercept) 0.0043107 0.06566
## PAIR (Intercept) 0.0022357 0.04728
## TARGET (Intercept) 0.0092168 0.09600
## PARTICIPANT Zlength_TARGET_raw_gmc 0.0004832 0.02198
## PARTICIPANT.1 ZTime_ms_gmc 0.0152537 0.12351
## PARTICIPANT.2 ZEnglish_Freq_gmc 0.0001005 0.01003
## PARTICIPANT.3 (Intercept) 0.0655086 0.25595
## SEMANTICCOG 0.0031251 0.05590 0.46
## SEMANTICDISTR 0.0170481 0.13057 -0.36 0.19
## SEMANTICFF 0.0010710 0.03273 0.54 0.63 0.22
## Residual 0.0828356 0.28781
## Number of obs: 33516, groups:
## PRIME, 901; PAIR, 852; TARGET, 591; PARTICIPANT, 57
##
## Fixed effects:
## Estimate Std. Error df t value
## (Intercept) -1.713e+00 6.388e-02 7.091e+01 -26.818
## LANGUAGEBilinguals 1.986e-01 7.636e-02 6.402e+01 2.601
## SEMANTICCOG -6.777e-02 1.964e-02 2.878e+02 -3.451
## SEMANTICDISTR 2.437e-01 2.709e-02 7.954e+01 8.997
## SEMANTICFF -4.078e-02 1.995e-02 4.420e+02 -2.044
## ZNLD_gmc -8.221e-03 6.348e-03 1.987e+03 -1.295
## ZPHONETIC_gmc 3.245e-03 2.560e-03 5.485e+03 1.267
218
##
## Correlation matrix not shown by default, as p = 15 > 12.
219
#Obtaining R2 values
r.squaredGLMM(LMER_best_fit_model)
## Warning: 'r.squaredGLMM' now calculates a revised statistic. See the help page.
## R2m R2c
## [1,] 0.1614303 0.614307
## GVIF Df GVIF^(1/(2*Df))
## LANGUAGE 1.940343 1 1.392962
## SEMANTIC 7.556333 3 1.400829
## ZNLD_gmc 2.780622 1 1.667520
## ZPHONETIC_gmc 1.540855 1 1.241312
## ZEnglish_Freq_gmc 1.050145 1 1.024766
## Zlength_TARGET_raw_gmc 1.034999 1 1.017349
## ZTime_ms_gmc 1.000053 1 1.000027
## OTHER_LANGUAGES 1.395945 1 1.181501
## LANGUAGE:SEMANTIC 4.502283 3 1.285007
## ZNLD_gmc:ZPHONETIC_gmc 1.519332 1 1.232612
222
1/vif(LMER_best_fit_model)
## GVIF Df GVIF^(1/(2*Df))
## LANGUAGE 0.5153727 1.0000000 0.7178946
## SEMANTIC 0.1323393 0.3333333 0.7138629
## ZNLD_gmc 0.3596318 1.0000000 0.5996931
## ZPHONETIC_gmc 0.6489905 1.0000000 0.8055995
## ZEnglish_Freq_gmc 0.9522495 1.0000000 0.9758327
## Zlength_TARGET_raw_gmc 0.9661841 1.0000000 0.9829466
## ZTime_ms_gmc 0.9999467 1.0000000 0.9999734
## OTHER_LANGUAGES 0.7163604 1.0000000 0.8463807
## LANGUAGE:SEMANTIC 0.2221095 0.3333333 0.7782059
## ZNLD_gmc:ZPHONETIC_gmc 0.6581842 1.0000000 0.8112855
223
GLMER Model
##
## Attaching package: 'lmerTest'
library(optimx)
library(nloptr)
library(MuMIn)
library(r2glmm)
library(car)
#Data optimizer
defaultControl <- list(algorithm="NLOPT_LN_BOBYQA",xtol_rel=1e-6,maxeval=1e5)
## Random effects:
## Groups Name Variance Std.Dev.
## PRIME (Intercept) 0.2766 0.5259
## PAIR (Intercept) 0.3275 0.5723
## TARGET (Intercept) 1.5102 1.2289
## PARTICIPANT (Intercept) 0.6580 0.8112
## Number of obs: 33428, groups:
## PRIME, 901; PAIR, 852; TARGET, 591; PARTICIPANT, 57
##
## Fixed effects:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) 4.87710 0.23429 20.817 < 2e-16 ***
## LANGUAGEBilinguals -0.61333 0.29588 -2.073 0.038183 *
## SEMANTICCOG 0.28415 0.23245 1.222 0.221563
## SEMANTICDISTR -1.37249 0.19154 -7.166 7.74e-13 ***
## SEMANTICFF 0.18464 0.26908 0.686 0.492601
## ZEnglish_Freq_gmc 0.26429 0.08528 3.099 0.001941 **
## ZTime_ms_gmc 0.17440 0.03395 5.136 2.80e-07 ***
## Zlength_TARGET_raw_gmc 0.22030 0.07301 3.017 0.002550 **
## ZSEMAC_gmc 0.49578 0.13141 3.773 0.000161 ***
## LANGUAGEBilinguals:SEMANTICCOG 0.53983 0.24560 2.198 0.027950 *
## LANGUAGEBilinguals:SEMANTICDISTR -0.13628 0.15150 -0.900 0.368368
## LANGUAGEBilinguals:SEMANTICFF -0.34842 0.26453 -1.317 0.187791
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
##
229
## SEMANTICDIS
## SEMANTICFF
## ZEnglsh_Fr_
## ZTim_ms_gmc
## Zl_TARGET__
## ZSEMAC_gmc
## LANGUAGEB:SEMANTICC
## LANGUAGEB:SEMANTICD
## LANGUAGEB:SEMANTICF 0.474
#Obtaining R2
r.squaredGLMM(GLMER_best_fit_model)
## Warning: 'r.squaredGLMM' now calculates a revised statistic. See the help page.
## Warning: The null model is correct only if all variables used by the original
## model remain unchanged.
## R2m R2c
## theoretical 0.1938626 0.5625173
## delta 0.0744979 0.2161653
232
## GVIF Df GVIF^(1/(2*Df))
## LANGUAGE 1.756417 1 1.325299
## SEMANTIC 5.929369 3 1.345348
## ZEnglish_Freq_gmc 1.087451 1 1.042809
## ZTime_ms_gmc 1.001749 1 1.000874
## Zlength_TARGET_raw_gmc 1.036538 1 1.018105
## ZSEMAC_gmc 1.425754 1 1.194050
## LANGUAGE:SEMANTIC 6.720202 3 1.373716
1/vif(GLMER_best_fit_model)
## GVIF Df GVIF^(1/(2*Df))
## LANGUAGE 0.5693409 1.0000000 0.7545468
## SEMANTIC 0.1686520 0.3333333 0.7433019
## ZEnglish_Freq_gmc 0.9195813 1.0000000 0.9589480
## ZTime_ms_gmc 0.9982537 1.0000000 0.9991265
## Zlength_TARGET_raw_gmc 0.9647503 1.0000000 0.9822170
## ZSEMAC_gmc 0.7013831 1.0000000 0.8374862
## LANGUAGE:SEMANTIC 0.1488050 0.3333333 0.7279523