The Analysis of Verbal Behavior
1990,8, 1-11
An Instance of Spurious Equivalence Relations
Gregory Stikeleather
Palo Alto, California
AND
Murray Sidman
New England Center for Autism
Four normal children learned conditional discriminations that had upper-case or lower-case
Greek letters as comparison stimuli, and dictated letter names as samples. Experimental stimuli were three pairs of letters; within each pair, an upper- and a lower-case letter were conditionally related to the same dictated sample. Four control stimuli, also upper- and lower-case
letters, were each conditionally related to a different dictated sample. Conditional-discrimination tests for equivalence used the upper- and lower-case letters both as samples and comparisons. Untaught conditional relations between the upper- and lower-case members of each
experimental stimulus pair were expected to emerge on the basis of their previously established relations to a common sample. The emergence of conditional relations between control
stimuli, however, would have suggested an artifact. In test trials with the experimental stimuli
as samples and comparisons, new conditional discriminations emerged as expected with all
four children. With two of the children, however, consistent discriminations also emerged
between control stimuli. Evidence suggested that uncontrolled features of the program for
teaching the children the baseline conditional discriminations might have been responsible for
the emergence of untaught conditional relations.
When citing data to support or challenge
a descriptive or theoretical account, one
can often prevent fruitless controversy by
first making sure that the subjects' behavior was relevant to the phenomenon under
discussion. Before attempting to use particular data to support or refute an account of
equivalence relations, for example, one
should be reasonably certain that equivalence, or the absence of equivalence, was
really an outcome of the experimental
operations. The apparent emergence of
equivalence relations from conditional discriminations, and apparent failures of
equivalence to emerge, can sometimes be
explained by procedural rather than theoretical considerations.
For example, a subject's seeming failure
to show equivalence between sample and
comparison stimuli in a two-comparison
conditional discrimination can arise from
an experimenter's mistaken assumption
about the nature of the conditional control.
The usual assumption is that the subject's
choice on each trial is controlled by a relation between the sample and what the
experimenter has designated as the positive comparison, a relation that is often
called "S-plus control" (Carter & Werner,
1978). S-plus control leads to equivalence
between the sample and its positive comparison. The procedure may, however,
have generated "S-minus control," a conditional relation that can bring about equivalence between the sample and the negative
comparison. Because the actual controlling
relation goes unrecognized, the subject is
judged to have selected the "wrong" stim-
This reserach, carried out at Northeastern
University, was part of a study conducted by the first
author in partial fulfillment of requirements for a
Master's degree in Psychology. Results were presented at the 1979 meeting of the Association of
Behavior Analysis, Dearborn. We thank Leanna
Standish and Charles Hamad for their comments on
an earlier version of the manuscript, and to F. Garth
Fletcher for equipment construction and maintenance.
Reprint requests should be sent to Murray Sidman,
New England Center for Autism, 33 Turnpike Road,
Southborough, Massachusetts 01772.
1
2
GREGORY STIKELEATHER and MURRAY SIDMAN
uli on the equivalence test; S-minus control
generates a score of zero in equivalence
tests (Carrigan, 1986; Johnson & Sidman,
1990). If the theory at issue required that
equivalence fail to develop in that particular experiment, the investigator would mistakenly take this result as a confirmation of
the theory.
Another potential pitfall in interpretation can be brought about by contextual
control (Bush, Sidman, & de Rose, 1989;
Sidman, 1986; Wulfert & Hayes, 1988). Any
stimulus can be a member of more than
one class, and features of the environment
can determine which equivalence class a
stimulus belongs to at the moment. An
investigator who overlooks the possibility
of contextual control may conclude mistakenly that equivalence relations between
particular stimuli do not exist.
A subject's failure to show equivalence
can also arise from unintended identity
relations-physical resemblances between
stimuli-that can override other equivalence relations. In an equivalence test, stimuli that look alike may be classed together
in spite of the subjects having been taught
conditional discriminations that were
designed to place the stimuli in separate
classes (Barnes, 1990).
Faulty specification of the controlling
stimuli will inevitably cause negative
results in subsequent tests of transfer of
control to other stimuli (e.g., Constantine,
1981; Iversen, Sidman, & Carrigan, 1986).
And even before one tests for equivalence,
insufficiently rigorous criteria for the
acquisition of the basic conditional discriminations can cause a failure to establish
the necessary prerequisites-a failure that
is likely to go unrecognized. In a two-comparison conditional discrimination, for
example, a subject can achieve an accuracy
score of 75 percent-seemingly significantly different from chance -ven though
learning completely different conditional
discriminations than those the experimenter has designated and measured
(Sidman, 1980).
Procedural artifacts can also generate
"false positives," tests that seem to demonstrate equivalence even though equiva-
lence does not exist. Perhaps even less
likely to be detected are instances in which
equivalence does exist, but for reasons
unrelated to the experimental variables.
Again, such data may be taken to support
a particular descriptive or theoretical
account when, in fact, the equivalence test
was positive for reasons extraneous to the
account.
For example, when procedures designed
to establish equivalence relations have
failed to do so, false positives may be generated if the tests alone can teach the subject the relations being tested for. It is now
well established that learning does take
place even during tests that omit reinforcement (e.g., Lazar, Davis-Lang, & Sanchez,
1984; Saunders, Wachter, & Spradlin, 1988;
Sidman, Kirk, & Willson-Morris, 1985;
Sidman, Willson-Morris, & Kirk, 1986;
Sigurdardottir, Green, & Saunders, 1990;
Spradlin, Cotter, & Baxley, 1973). Furthermore, subjects can learn new conditional
discriminations in the absence of reinforcement without having previously learned
related discriminations (Harrison & Green,
1990; Saunders, Saunders, Kirby, &
Spradlin, 1988). (For more complete discussions of the conditions under which learning may take place during unreinforced
conditional-discrimination tests, see
Sidman, in press; Sidman, Kirk, & WilsonMorris, 1985.) If an equivalence test can by
itself teach a subject the very conditional
discriminations that are needed to demonstrate equivalence, then any use of that test
to support an account of equivalence that
is based on previous learning will be
invalid.
Also, even when experimental procedures that are designed to generate equivalence fail to do so, tests might yield seemingly positive results simply because the
tested stimuli are physically or conceptually similar. Such similarities are not
always apparent to the experimenter, and
not all subjects will react to them.
To determine whether evidence for
equivalence is spurious, the ideal control
would include all features present in the
experimental condition, while excluding
the independent variable and any features
SPURIOUS EQUIVALENCE RELATIONS
not present in the experimental condition.
We report here an attempt to set up just
such a control in the context of a procedure
used by Sidman (1971) to demonstrate the
emergence of equivalence relations from
conditional discriminations. Although we
did replicate the original demonstration,
we also discovered possible sources of artifacts. As the literature on stimulus equivalence expands, the importance of such discoveries grows; making them known will
help increase the likelihood that data cited
in support of a theoretical position are first
examined closely for their validity.
METHOD
The general plan of the experiment was
the following: Children were taught conditional discriminations with dictated letter
names as samples, and with five uppercase or five lower-case Greek letters as
comparisons. As Figure 1 indicates, the
children were to relate three of the dictated-name samples ("xi," "gamma," and
"lambda") both to an upper- and a lowercase comparison letter. These letters were
Greek Letters
Upper-case Letters
oQ4
|
V
-
>' Z53 fr
X
°'
U
the experimental stimuli. It was expected
that the children would, when tested,
match the upper- to the lower-case form of
each letter without having been directly
taught to do so (e.g., Sidman, Cresson, &
Willson-Morris, 1974). For example, with
upper-case xi as the sample and five lowercase letters as comparisons, lower-case xi
was the child's predicted choice; with
lower-case xi as the sample and five uppercase letters as comparisons, upper-case xi
was the predicted choice.
With the dictated samples 'omega" and
"phi," the children were taught to choose
upper-case omega and phi, respectively,
and with the samples "sigma" and "delta,"
to select lower-case sigma and delta,
respectively. Upper-case omega and phi,
and lower-case sigma and delta were the
control stimuli. Since each was related to a
different sample, it was expected that later
testing would yield unpredictable conditional relations, if any, between these stimuli.
Having learned the baseline conditional
discriminations, the children were tested
for the emergence of new conditional dis-
Dictated
Samples
L1Q
-
3
"'OMEGA
'PHI-
ACAMMACH
F4-
A
r
LAMBDM
a-
SIGMAv
|-4
'DELTA
"
EST
TEST
EACHl
3
-X
U
a
Lower-case Letters
Fig. 1. Subjects were first taught baseline conditional discriminations: Each of three dictated Greek letter name samples ("xi," "gamma," and "lambda") was related to an upper-case (UC) and a lower-case (LC) comparison. Two samples ("omega" and "phi") were related only to UC, and two ("sigma" and "delta") only to LC comparisons. Subjects
were then tested for the emergence of new conditional discriminations with UC samples and LC comparisons, and
vice versa.
4
GREGORY STIKELEATHER and MURRAY SIDMAN
criminations in which the stimuli were
upper- and lower-case letters. One set of
tests used each upper-case letter as a sample, with all five lower-case letters as comparisons. The other set of tests used lowercase samples and the upper-case letters as
comparisons.
The children received the same kinds of
teaching experiences with all stimuli, with
one exception: The two members of each
pair of experimental stimuli had a history
of being related to a common dictated
name; the control stimuli lacked this history. It was expected, therefore, that in the
tests the children would match the upperand lower-case member of each experimental stimulus pair to each other, but
would not do so with the control stimuli.
Apparatus and basic procedures have
been reported elsewhere (Sidman & Tailby,
1982), and will only be summarized here.
Features most relevant to the purposes of
this report will be described in detail.
Subjects
Four normal children participated: two
girls, Subjects S.D. and G.H., and two boys,
Subjects J.P. and B.N. Their ages (yearsmonths) at the time of their final tests were
5-5 (J.P.), 5-6 (S.D.), 6-4 (B.N.), and 6-5
(G.H.). Experimental sessions, held several
days a week at the same time each day,
lasted between 30 and 60 minutes.
Apparatus and Conditionaldiscrimination Procedures
The subject sat in front of a vertical panel
that held a circle of eight translucent keys,
with a ninth key in the center. Visual stimuli were projected onto the back of the
keys, and auditory stimuli were dictated
via a tape-and-speaker system. Pennies
were dropped into a tray beside the keys,
and doorbell chimes rang each time a
penny was delivered. Subjects' comments
were recorded on tape.
Visual sample stimuli were presented on
the center key. When the sample was auditory, the center key was illuminated with
white light. Comparison stimuli, always
visual, were presented on the outer keys,
but only the lower five of those keys were
used. The maximum number of comparison stimuli presented was five, the minimum, two (see below). Comparison keys
not illuminated with visual stimuli
remained dark.
A trial always began with the presentation of a sample stimulus. When the subject pressed the sample key, comparisons
appeared on the outer keys while the sample remained present (auditory samples
continued to be repeated). The trial ended
after the subject pressed a comparison key.
Except when the probability of reinforcement was less than 1.00 (see below), chimes
and a penny followed if the subject pressed
the comparison that was programmed as
correct for that sample. If the subject
pressed any other comparison key, an incorrect choice was recorded and no chimes or
pennies were delivered. When the subject
pressed a comparison key, all stimuli disappeared. The next sample was presented
after a 1.5 to 2.0-s intertrial interval.
Pre-Experimental Teaching
Subjects received no oral or written
instructions. To acquaint them with the
apparatus and procedures, they were
given four- or five-choice conditional discriminations with familiar colors and color
names. After delivering two or three pennies to accomplish magazine training, the
experimenter manually demonstrated the
first few conditional-discrimination trials
in which the sample and correct comparison were the same color. All subjects
proved able to select the appropriate comparison in the presence of a color or dictated color-name sample.
To ensure that the subjects could discriminate the stimuli to be used later, they
were then given five-choice conditional
discriminations in which the correct comparison was an upper-case or lower-case
Greek letter identical to the sample. All
subjects proved able to do this, and completed the pre-experimental teaching in
one to five sessions.
Teaching
With each of five dictated Greek letter
names as samples, the children learned to
SPURIOUS EQUIVALENCE RELATIONS
select the corresponding upper-case letter,
and with each of five letter-name samples,
they learned to select the corresponding
lower-case letter. Procedurally, as shown
in Figure 1, three dictated names were conditionally related both to upper- and
lower-case letters, two were related only to
upper-case letters, and two only to lowercase letters. The ten conditional discriminations were taught in a series of steps.
First, each subject was taught a twocomparison conditional discrimination. For
example, with dictated "gamma" or "xi" as
the sample, and the corresponding uppercase letters as comparisons, subjects
quickly learned to select upper-case
gamma when "gamma" was dictated, and
upper-case xi when "xi" was dictated.
Then, as the subject mastered each conditional discrimination, a new sample and
comparison stimulus were added until the
subject had learned all ten conditional discriminations. The comparison stimuli on
5
any trial were either upper or lower case,
never both.
The teaching sequence varied among
subjects. Table 1 shows how new conditional discriminations were introduced for
each subject. In sequences I and II, the subject was not given the three-comparison
task (gamma, xi, lambda) until each of the
three two-comparison components (gamma, xi; gamma, lambda; xi, lambda) had
been learned. In sequences III and IV,
lambda was added immediately after the
subject had learned the first two-comparison (gamma, xi) conditional discrimination. Subjects G.H. and B.N. differed from
the others in having one or both of the
abbreviated teaching sequences. The learning criterion at each teaching step was no
more than one error in a set of trials that
included at least five and sometimes ten
trials with each sample.
After a subject had learned each set of
five-comparison conditional discrimina-
Table 1
Sequences used to teach subjects conditional discriminations with upper-and lower-case Greek-letter comparisons
and dictated letter name samples. Upper- and lower-case Greek letters are denoted by the first letters of their
English names. Experimental stimuli: G, g, X, x, L, 1; Control stimuli: P, 0, s, d.
Subject
Teaching
Sequence
J. P.
S.D.
G.H.
B.N.
I, II
I,II
I,IV
III, IV
Teaching Sequences (Dictated Letter-Name Samples)
Sequence
I
Upper-case
II
Lower-case
III
Upper-case
IV
Lower-case
2-choice
G X
g x
G X
g x
2-choice
G L
g 1
2-choice
X L
x 1
3-choice
G X L
g x I
GX L
g x I
4-choice
GX L P
gx I s
GX L P
g x I s
5-choice
G X L P O
gx I s d
GX L PO
g x I s d
6
GREGORY STIKELEATHER and MURRAY SIDMAN
tions, one set with upper- and the other with
lower-case comparisons, both sets were
mixed within 20-trial blocks. A subject's
training was complete when at least 19 trials out of a 20-trial block were correct. The
total number of teaching sessions varied
from 7 for Subject G.H. to 21 for Subject J.P.
Testing
The subjects' final performance, ten conditional discriminations with dictated letter
names as samples and five upper- or lowercase letters as comparisons, constituted the
baseline trials in the tests. On test trials,
upper-case samples were presented with
lower-case comparisons, and lower-case
samples with upper-case comparisons.
These new five-comparison test tasks,
which the subjects had never seen before,
were inserted as probe trials among the
baseline trials.
Probe trials were never followed by
chimes or pennies. To help prevent discriminated extinction on test trials, the
probability of reinforcement after correct
baseline trials was reduced over several
sessions to 0.20 before testing began.
Subject J.P. differed from the others in having his baseline reinforcement probability
reduced to 0.00.
Each 75-trial test contained 50 baseline
trials and 25 probes. To maintain the overall reinforcement probability at 0.20 (no
reinforcement followed probe trials), the
baseline probability was increased to 0.30
during tests (for all but Subject J.P.). At the
end of test sessions (and the earlier sessions in which the reinforcement probability was reduced), the children were given
"identity matching" trials with colors or
Greek letters to make up for reinforcements they had missed.
The complete test battery contained four
75-trial tests, two with upper-case samples
and lower-case comparisons, and two with
lower-case samples and upper-case comparisons. All children had the test battery
at least once, starting with upper-case samples and lower-case comparisons.
RESULTS
The matrices in Table 2 summarize the
test results for each subject. Within each
matrix, rows designate sample stimuli, and
columns designate comparison stimuli.
The left-hand matrices show the percentage of probe trials on which subjects
selected each lower-case comparison when
each upper-case letter was a sample; the
right-hand matrices show selections of
upper-case comparisons when the samples
were lower-case letters.
Each matrix is divided into quadrants:
The upper left quadrants show the percentage of trials on which subjects selected
experimental comparison stimuli when the
samples were also experimental stimuli.
High frequencies in the three outlined cells
of each upper left quadrant would indicate
the emergence of conditional discriminations that were indicative of equivalence
relations between the upper- and lowercase member of each experimental stimulus
pair.
The lower right quadrants show the percentage of trials on which subjects selected
control comparison stimuli when the samples were also control stimuli. The lower
left and upper right quadrants show how
likely the subjects were to select a comparison stimulus from one set (experimental or
control) when the sample was a stimulus
from the other set.
The two uppermost matrices summarize
Subject J.P.'s performance. When an experimental stimulus, upper-case xi, gamma, or
lambda, was the sample (left-hand matrix),
the child most often chose lower-case xi,
gamma, or lambda, respectively. And with
lower-case xi, gamma, or lambda as the
sample (right-hand matrix), he most often
chose the corresponding upper-case letter.
These emergent conditional discriminations were symmetric; within each pair of
experimental stimuli it mattered little
which was the sample and which the comparison.
The child also showed comparison preferences when the sample was one of the
control stimuli, upper-case phi or omega
(left-hand matrix), or lower-case sigma or
delta (right-hand matrix), but these preferences were not as pronounced as they were
when the samples were experimental stim-
SPURIOUS EQUIVALENCE RELATIONS
7
pairs, symmetric conditional discriminations emerged. When a control stimulus
was the sample, however, the child displayed no clear-cut comparison preferences. (Instead, she exhibited a position
preference, pressing the bottom comparison key on almost 75% of the probe trials
that had a control stimulus as the sample.)
Even more strongly than the first two
children, Subjects G.H. and B.N. showed
the emergent symmetric conditional discriminations that were required to document equivalence relations between upperand lower-case experimental stimuli. These
subjects, however, also demonstrated consistent and symmetric conditional discriminations on trials that had control stimuli as
samples. With control stimuli, both children showed the same conditional discrim-
uli. Also, emergent conditional discriminations involving control stimuli were not
symmetric. Although the subject most
often selected lower-case sigma when
upper-case omega was the sample, his
most likely choice with lower-case sigma
as the sample was not upper-case omega
but upper-case phi. He also selected uppercase phi frequently when lower-case delta
was the sample. With upper-case phi as the
sample, however, he almost never chose
lower-case sigma or delta, most often
selecting lower-case gamma instead.
Subject S.D.'s tests are summarized
below those of Subject J.P. Again, the
results with the three pairs of experimental
stimuli were consistent with the formation
of equivalence relations; within each of the
experimental upper- and lower-case letter
Table 2
Percentage of test trials on which subjects related each sample to each comparison. Subjects are identified in the
leftmost column. Greek-letter stimuli are identified by their abbreviated names.
UPPER-CASE L
SAMPLES
ER-CASE COMPARISONS
xi
XI
93
GAM
J. P.
PHI
33
OME
13
XI
100
xi
S. D.
LAM
7
PHI
OME
27
13
XI
100
xi
G. H.
GAM
LAM
PHI
7
80
20
LAM
GAM
gam lam Sig del
10
B. N.
GAM
LAM
PHI
OME
7
gami
7
53
lam
sig
del
gam lam
7
27
sig del
100
7
13
33
20
gam lam
90
gam
lam
87
1 100
10
am lam
100
20
33
7
33
20
10
70
sig del
100
10
10
xi
100
gam
lam
7
7
20
87
13
20
20
60
20
100
sig
53
67
13
13
10
70
10
20
30
30
20
100
10
20
30
100
del
xi
100
XI GAM LAM PHI OME
100
10
sig
80
7
93
XI GAM LAM PHI OME
lam
90
20
80
30
10
20
gam
100
7
sig
del
sig del
80
93
XI GAM LAM PHI OME
xi
10
xi
XI GAM LAM PHT OME
xi
13
80
60
OME
XI
|
LOWER-CASE
SAMPLES UPPER-CASE COMPARISONS
del
90
10
I__
100
90
100
8
GREGORY STIKELEATHER and MURRAY SIDMAN
inations: When upper-case phi was the
sample, they most often selected lowercase sigma, and with lower-case sigma as
the sample, they almost always chose
upper-case phi. Symmetric conditional discriminations also emerged when uppercase omega or lower-case delta was the
sample.
When the sample was an experimental
stimulus, the children rarely chose a control comparison. When the sample was a
control stimulus, however, Subjects J.P.
and S.D. selected an experimental comparison about as often as they selected a control comparison. In contrast, when the
sample was a control stimulus, Subjects
G.H. and B.N. almost always selected a
control comparison.
DISCUSSION
Subjects J.P. and S.D. showed consistent
and symmetric conditional discriminations, not directly taught, when the samples in test trials were upper- or lower-case
experimental stimuli, but not when the
samples were control stimuli. The difference between experimental and control
stimuli indicated that the positive equivalence tests were not spurious. The children
had learned to relate each member of an
upper- and lower-case pair of experimental
stimuli to the same dictated letter name,
but to relate every control stimulus to a
different name.
This different treatment-some members of one set of comparison stimuli being
related to a common sample, and each
member of the other set being related to a
different sample-was the experimental
variable. If some feature of the teaching or
testing situation other than the experimental variable were sufficient for the emergence of consistent relations between
upper- and lower-case experimental stimuli, similar relations would have been
expected to emerge between control stimuli also.
The test performances of Subjects G.H.
and B.N., however, challenged these conclusions. Both children showed symmetric
relations emerging not only between
experimental stimuli, but between upperand lower-case control stimuli also,
thereby calling into question the nature of
the relations between experimental stimuli.
For these two subjects, some artifact had to
be responsible for the seeming emergence
of equivalence relations.
A post-experimental interview with
Subject B.N. provided clues to one possible
variable. The child was given ten probe trials, with the experimenter manually controlling the intertrial intervals so that questions could be asked and answered after
each trial. The following are some of the
more significant excerpts from the inter-
view:
E: How did you know to pick that one?
S: I don't kn-, well, I used my brain.
Everybody has one.
E: Well, what made you pick that one
instead of the others?
S: I thought which one is right.
E: How did you know it was right?
S: Well, because I both learned them at
the same time. I mean, like let's say it was
third. Well, it had one in the middle and it
was the third-let's say it was in the middle and it was the third one I learned. So I
look at the other one that I learned third on
the other team.
E: O.K., let's try some more... .What
made you pick that one?
S: Every time? Well, because it was
before the last one, and the other-and the
one on the other team was also before the
last one.
E: What is the last one?
S: The last one, there's two last ones,
there's two of it. The two last ones are
omega and delta. Want to hear the two
first ones?
E: Yeah.
S: Uh, two gammas, but they both look
different. One looks like a Y and one looks
like an upside-down L.
E: O.K., let's try some more....Now how
did you know to do that one?
S: I know it, but I told you before. Ilook, they both go first.
E: Go first?
S: Right. I both learned them first.
E: And what did you learn second?
SPURIOUS EQUIVALENCE RELATIONS
S: Second I learned the two xis. And
third, I learned the two lambdas. Now it's
not two of the same thing anymore.
E: So what is it?
S: Well, ther- laa-. Well, I don't
remember. I think it was, let me see, one,
two, three, oh, so there's two more.
Another one is, uh-uh-umm, sigma and
phi. And the very, very, very last one is
delta and omega. And that's how I know it
was that one, and don't ask me it anymore,
please.
E:....So how do you know which ones to
put together?
S: Well, whichever ones-like if it, if it
flashed the first one I learned, I mean like
gamma, then I press the other kind of
gamma. The same with all of them. I told
you those.
E: And the same with all of them?
S: Yeah. Yeah. Xi goes to the other xi,
and lambda goes to the other lambda. And
all the rest goes to-the one I learned first
goes to the one I learned first. The one I
learned second is the one, and then the
answer is the one I learned second. And it
goes on and on and on and until it gets up
to one, two, three, four, five times. And if
you count the both teams together, it's one,
two, three, four, five, six, seven, eight, nine,
ten.
Subject B.N.'s remarks called attention to
the teaching program. In all teaching
sequences (Table 1), conditional discriminations involving auditory samples and
upper-case comparisons were taught
before those involving lower-case comparisons. This could have been the basis for
the subject's classification of the stimuli
into two "teams."
Also, omega was always introduced as
the fifth upper-case letter, and delta as the
fifth lower-case letter. Subject B.N.
reported matching upper-case omega to
lower-case delta because both stimuli had
been taught last in their respective teams.
Upper-case phi and lower-case sigma were
the fourth letters to be introduced, and the
subject gave that reason for matching
them.
The teaching sequences that Subjects
9
G.H. and B.N. underwent differed from
those of the other subjects. Subject G.H.
was not questioned about her performance. Perhaps, however, the direct
change from a two-comparison to a threecomparison conditional discrimination in
sequences III and/or IV highlighted
lambda's status as the third stimulus introduced, and directed the subjects' attention
to the sequence of stimulus introduction.
This variable might account for the differences between the two pairs of subjects:
Subject B.N.'s comments about the
experimental stimuli suggested multiple
control. He reported matching the experimental stimuli not only because, within
their teams, xi, gamma, and lambda occupied corresponding positions in the teaching sequence, but also because each member of an upper- and lower-case pair had
been related to the same dictated sample.
Thus, equivalence relations between experimental stimuli might have emerged anyway, even without the common locations
of upper- and lower-case experimental
stimuli within the teaching sequence.
It is noteworthy that the experimental
variable, the conditional discriminations
that were taught explicitly, may in principle be no different than the confounding
variable, the sequence in which new letters
were introduced. With respect to the
experimental stimuli, dictated samples
were related in common to two comparison stimuli; the same was true of stimulus
locations within the teaching sequence.
The dictated sample, the position within
the teaching sequence, or both could have
become equivalent to the pair of experimental stimuli to which they were related.
Therefore, even though Subject B.N.'s
testimony pointed to a confounding variable, that variable would have produced
equivalence relations no less genuine than
those the experimental variable produced.
With respect to upper- and lower-case control stimuli, the equivalence relations were
not spurious, but the source of those relations was. They were produced by the
"wrong" variable.
Although several writers (Dugdale &
Lowe, 1990; Lowe, 1986; Stoddard &
10
GREGORY STIKELEATHER and MURRAY SIDMAN
McIlvane, 1986) have pointed out the need
for caution in interpreting a subject's verbal reports in experiments on equivalence
relations (an extreme example was Subject
B.N.'s explanation, "I used my brain"), the
accuracy of the subject's report in this
instance is not really an issue. That report
did identify a variable which, if not controlled, could influence experimental
results. Subsequent experiments in our laboratory have therefore avoided consistency
in the sequence of introduction of new
stimuli when teaching subjects conditional
discriminations as prerequisites for equivalence relations.
Other variables than those suggested by
Subject B.N.'s remarks might also have
produced the emergent relations between
control stimuli. Let us suppose, for example, that upper- and lower-case experimental stimuli did become related because they
had both become equivalent to the same
dictated sample. Then, when faced for the
first time (in a test) with a control stimulus
as the sample and the five letters of the
"other team" as comparisons, the subject
might first have eliminated the three
experimental comparison stimuli by exclusion (Dixon, 1977; McIlvane & Stoddard,
1981, 1985). The subject might then have
arbitrarily selected one of the control stimuli just to get on with the next trial.
Support for this possibility comes from
Table 2; all subjects distributed their
choices of comparisons differentially,
depending on whether the samples were
experimental or control stimuli. These differential distributions of choices suggest
the formation of two stimulus classes,
experimental and control.
For example, suppose the sample was
the control stimulus, upper-case omega.
Because some of the comparisons, the
lower-case experimental stimuli xi,
gamma, and lambda, already "went to"
specified members of the other team, the
child might have eliminated them as possible choices and arbitrarily selected one of
the uncommitted control stimuli, say
lower-case sigma. On a subsequent trial,
the other upper-case control stimulus, phi,
would have been presented as the sample.
This time, the child might have excluded
not only the experimental stimuli as possible choices, but lower-case sigma also,
because that letter had already been
selected in relation to upper-case omega.
Having learned to remain consistent, the
child might have continued to select sigma
or delta whenever omega or phi, respectively, appeared as the sample. These arbitrary relations between control stimuli
would then have needed only symmetry to
make them look like indicators of equivalence. If the subject's comparison selections
were actually controlled by a larger pattern-a compound consisting of the two
related stimuli-even symmetry need not
be assumed.
Other factors, too, can contribute to the
emergence of untaught conditional discriminations that seem to constitute positive evidence of equivalence, but do so
only through experimental artifact
(Harrison & Green,1990; Sidman, 1987).
New kinds of procedural artifacts are constantly being discovered. Particularly
deceptive are instances in which equivalence arises from sources other than those
the experimenter or theorist supposes to
have been causal. When known or suspected, such factors can be controlled.
Where they have not been controlled, a
detailed account of procedures can permit
others to evaluate their possible influence
upon experimental results, thereby increasing the validity and utility of descriptive
and theoretical accounts of stimulus equivalence that are held to be supported by or
derived from those results.
REFERENCES
Barnes, T. (1990). Equivalence without symmetry? A
stimulus artifact. Unpublished M.A. thesis,
Northeastern University, Boston.
Bush, K. M., Sidman, M., & de Rose, T. (1989).
Contextual control of emergent equivalence relations. Journal of the Experimental Analysis of Behavior,
51,2945.
Carrigan, P. F. (1986). Conditional discrimination and
transitive relations: A theoretical and experimental analysis. Unpublished Ph.D. dissertation, Northeastern
University, Boston.
Carter, D. E., & Werner, T. J. (1978). Complex learning
and information processing by pigeons: A critical
analysis. Journal of the Experimental Analysis of
Behavior, 29, 565-601.
SPURIOUS EQUIVALENCE RELATIONS
Constantine, B. (1981). An experimental analysis of stimulus control in simple conditional discriminations.
Unpublished Ph.D. dissertation, Northeastern
University, Boston.
Dixon, L. S. (1977). The nature of control by spoken
words over visual stimulus selection. Journal of the
Experimental Analysis of Behavior, 27,433-442.
Dugdale, N., & Lowe, C. F. (1990). Naming and stimulus equivalence. In D. E. Blackman & H. Lejeune
(Eds.), Behaviour analysis in theory and practice:
Contributions and controversies (pp. 115-138).
Hillsdale, NJ: Erlbaum.
Harrison, R.J., & Green, G. (1990). Development of
conditional and equivalence relations without differential consequences. Journal of the Experimental
Analysis of Behavior, 54, 225-237.
Iversen, I. H., Sidman, M., & Carrigan, P. (1986).
Stimulus definition in conditional discriminations.
Journal of the Experimental Analysis of Behavior, 45,
297-304.
Johnson, C., & Sidman, M. (1990, May). Stimulus
classes established by sample-and-S- conditioninal-discrimination training. Poster presented at the 16th
annual meeting of the Association for Behavior
Analysis, Nashville, TN.
Lazar, R. M., Davis-Lang, D., & Sanchez L. (1984). The
formation of visual stimulus equivalences in children. Journal of the Experimental Analysis of Behavior,
41,251-266.
Lowe, C. F. (1986, May). The role of verbal behavior in
the emergence of equivalence relations. Paper presented
at the 12th annual meeting of the Association for
Behavior Analysis, Milwaukee, WI.
McIlvane, W. J., & Stoddard, L. T. (1981). Acquisition
of matching-to-sample performances in severe
retardation: Learning by exclusion. Journal of Mental
Deficiency Research, 25, 33-48.
McIlvane, W. J., & Stoddard, L. T. (1985). Complex
stimulus relations and exdusion in mental retardation. Analysis and Intervention in Developmental
Disabilities, 5, 307-321.
Saunders, R. R., Saunders, K. J., Kirby, K. C., &
Spradlin, J. E. (1988). The merger and development
of equivalence classes by unreinforced conditional
selection of comparison stimuli. Journal of the
Experimental Analysis of Behavior, 29, 145-162.
Saunders, R. R., Wachter, J., & Spradlin, J. E. (1988).
Establishing auditory stimulus control over an
eight-member equivalence class via conditional discrimination procedures. Journal of the Experimental
Analysis of Behavior, 49, 95-115.
11
Sidman, M. (1971). Reading and auditory-visual
equivalences. Journal of Speech and Hearing Research,
14, 5-13.
Sidman, M. (1980). A note on the measurement of
conditional discrimination. Journal of the
Experimental Analysis of Behavior, 33, 285-289.
Sidman, M. (1986). Functional analysis of emergent
verbal classes. In T. Thompson & M. D. Zeiler
(Eds.), Analysis and integration of behavioral units (pp.
213-245). Hillsdale, NJ: Erlbaum.
Sidman, M. (1987). Two choices are not enough.
Behavior Analysis, 22, 11-18.
Sidman, M. (in press). Equivalence relations: Some
basic considerations. In L.J. Hayes & S.C. Hayes
(Eds.), Dialogues on verbal behavior: Proceedings of the
third International Institute on Verbal Relations. Reno,
NV: Context Press.
Sidman, M., Cresson, O., Jr., & Willson-Morris, M.
(1974). Acquisition of matching to sample via mediated transfer. Journal of the Experimental Analysis of
Behavior, 22, 261-273.
Sidman, M., Kirk, B., & Wilson-Morris, M. (1985). Sixmember stimulus classes generated by conditionaldiscrimination procedures. Journal of the Experimental Analysis of Behavior, 43, 21-42.
Sidman, M., & Tailby, W. (1982). Conditional discrimination vs. matching to sample: An expansion of the
testing paradigm. Journal of the Experimental
Analysis of Behavior, 37, 5-22.
Sidman, M., Willson-Morris, M., & Kirk, B. (1986).
Matching-to-sample procedures and the development of equivalence relations: The role of naming.
Analysis and Intervention in Developmental Disabilities, 6,1-19.
Sigurdardottir, Z. G., Green, G., & Saunders, R. R.
(1990). Equivalence classes generated by sequence
training. Journal of the Experimental Analysis of
Behavior, 53, 47-63.
Spradlin, J. E., Cotter, V. W., & Baxley, N. (1973).
Establishing a conditional discrimination without
direct training: A study of transfer with retarded
adolescents. American Journal of Mental Deficiency,
77,556-566.
Stoddard, L. T., & McIlvane, W. J. (1986). Stimulus
control research and developmentally disabled
individuals. Analysis and Intervention in Developmental Disabilities, 6,155-178.
Wulfert, E., & Hayes, S. C. (1988). Transfer of a conditional ordering response through conditional equiv-
alence dasses. Journal of the Experimental Analysis of
Behavior, 50, 125-144.