ABSTRACT OF THESIS
THE EFFICIACY OF THE MMPI-2 LEES-HALEY FAKE BAD SCALE (FBS) FOR
DIFFERENTIATING NEUROCOGNITIVE AND PSYCHIATRIC FEIGNERS
The FBS (Lees-Haley, 1992) is a relatively new validity scale for the MMPI-2 designed
specifically to detect feigned neurocognitive deficit. The aim of the present study was to
examine the FBS’s efficacy in differentiating psychiatric and neurocognitive feigners using a
known-groups design. Malingering tests were administered to 180 individuals undergoing
forensic neuropsychiatric evaluations. Based on the malingering test results, participants were
classified as honest responders, psychiatric feigners, neurocognitive feigners, or feigning both
psychiatric and neurocognitive deficits. The FBS significantly differentiated the 3 feigning
groups from the honest group, but it did not discriminate effectively between neurocognitive and
psychiatric feigners.
KEYWORDS: Malingering, MMPI-2, Fake Bad Scale (FBS), Neurocognitive Feigning,
Psychiatric Feigning
Victoria Louise Vagnini
August 6, 2003
Copyright © 2003 by Victoria Louise Vagnini. All rights reserved.
THE EFFICIACY OF THE MMPI-2 LEES-HALEY FAKE BAD SCALE (FBS) FOR
DIFFERENTIATING NEUROCOGNITIVE AND PSYCHIATRIC FEIGNERS
By
Victoria Louise Vagnini
David T. R. Berry, Ph.D.
Director of Thesis
David T. R. Berry, Ph.D.
Director of Graduate Studies
RULES FOR THE USE OF THESES
Unpublished theses submitted for the Master’s degree and deposited in the University of
Kentucky Library are as a rule open for inspection, but are to be used only with due regard to the
rights of the authors. Bibliographical references may be noted, but quotations or summaries of
parts may be published only with the permission of the author, and with the usual scholarly
acknowledgements.
Extensive copying or publication of the thesis in whole or in part also require the consent of the
dean of the Graduate School of the University of Kentucky.
THESIS
Victoria Louise Vagnini
The Graduate School
University of Kentucky
2003
THE EFFICIACY OF THE MMPI-2 LEES-HALEY FAKE BAD SCALE (FBS) FOR
DIFFERENTIATING NEUROCOGNITIVE AND PSYCHIATRIC FEIGNERS
____________________________________________
THESIS
____________________________________________
A thesis submitted in partial fulfillment of the
requirements for the degree of Master of Science in the
College of Arts and Sciences
at the University of Kentucky
By
Victoria Louise Vagnini
Lexington, Kentucky
Director: Dr. David T. R. Berry, Professor of Psychology
Lexington, Kentucky
2003
Copyright © 2003 by Victoria Louise Vagnini. All rights reserved.
To my mother, who supports all of my endeavors and
without whom the completion of this thesis may have been impossible.
ACKNOWLEDGEMENTS
The following thesis, while an individual work, benefited from the insights and direction
of several people. Without the help of my Thesis Chair, David T. R. Berry, this project would
likely have been done much faster but would not have instilled nearly the pride and
understanding. In addition, the members of my lab also helped in collecting and coding the data
used in this project.
This work would have been difficult to accomplish without the support of my family and
friends. Without the support from my mother, Nettie, and her endless encouragement, this
project would have been more unpleasant and difficult. I would also like to thank my father,
John, for his humor, support, and bat. In addition, I wish to acknowledge the ample support
received from other graduate students in the department of psychology.
iii
TABLE OF CONTENTS
Acknowledgements ................................................................................................................ iii
List of Tables .......................................................................................................................... vi
List of Files............................................................................................................................. vii
Chapter One: Introduction
Malingering in Psychological Settings ................................................................................1
Methodology of Malingering Research ...............................................................................2
Neuropsychometrics ............................................................................................................4
Detection of Malingering in Clinical and Forensic Settings................................................5
Detecting Feigned Neurocognitive/Somatic Symptoms......................................................6
The Letter Memory Test (LMT) ....................................................................................7
Test of Memory Malingering (TOMM).........................................................................8
Victoria Symptom Validity Test (VSVT)......................................................................9
Detecting Feigned Psychiatric Symptoms .........................................................................10
Structured Interview of Reported Symptoms (SIRS) ..................................................11
Screening Index of Malingered Symptoms (SIMS).....................................................12
Minnesota Multiphasic Personality Inventory-2 (MMPI-2) ........................................13
The Fake Bad Scale (FBS) for the MMPI-2 ......................................................................15
Purpose of the Present Study .............................................................................................18
Chapter Two: Method
Participants.........................................................................................................................21
Materials ............................................................................................................................23
Procedure ...........................................................................................................................23
Chapter Three: Results
Descriptives........................................................................................................................25
iv
Main Analyses..................................................................................................................25
Group Differences for MMPI-2 Clinical Scales ......................................................... 25
Group Differences for MMPI-2 Validity Scales..........................................................26
Classification Rates for MMPI-2 Validity Scales........................................................27
Classification Rates for FBS at Different Cutting Scores............................................28
Chapter Four: Discussion
How effective are the traditional MMPI-2 "fake bad" scales in detecting feigning? ........30
The relative success of the Fb scale in detecting feigning in this study ............................31
How effective was the FBS in differentiating psychiatric and neurocognitive feigners?..31
Study Limitations...............................................................................................................32
Conclusions........................................................................................................................32
Appendix
Tables.................................................................................................................................34
References...............................................................................................................................45
Vita ..........................................................................................................................................50
v
LIST OF TABLES
Table 1, Demographic Information..........................................................................................34
Table 2, MMPI-2 Clinical Scale T-score Data ........................................................................35
Table 3, MMPI-2 Clinical Scale Effect Size Data...................................................................36
Table 4, MMPI-2 Validity Scale T-score Data........................................................................37
Table 5, MMPI-2 Validity Scale Effect Size Data ..................................................................39
Table 6, Classification Rates for MMPI-2 Validity Scales......................................................40
Table 7, Classification Rates for the FBS at Different Cutting Scores....................................42
vi
LIST OF FILES
FILENAME
Torythesis................................................................................................................... 259KB
vii
Chapter One
Introduction
Malingering in Psychological Settings
Most individuals misrepresent themselves at some point in their lives. For example,
lying to friends about whether they look bad in a new outfit seems harmless, and in fact may
simply be part of human nature and a desire to avoid hurting a friend’s feelings. However, by
giving another person a planned but incorrect impression, people can also gain many other
valued outcomes. For example, a child may feign illness to avoid school, a person who is
unemployed may misrepresent his or her job search efforts in order to continue to receive
unemployment benefits, and a plaintiff in a lawsuit for psychological damages may exaggerate
his or her complaints to obtain a larger settlement. Thus, there is probably a continuum of
deliberate impression management ranging from “white lies” to outright fraud.
At times, detection of misrepresentation or lying about psychological symptoms can
literally be a life or death issue. Most people are familiar with publicized court cases in which
the accused have pled insanity as a defense in a capital crime. The professionals involved
(judges, lawyers, psychologists, etc.) may have to determine whether the defendant was actually
“insane” at the time of the crime or simply feigning insanity to escape consequences for his or
her actions. Personal injury litigation is another area in which psychologists must determine the
honesty of a plaintiff’s claim of head or bodily injury. Given the issues involved in personal
injury cases after head trauma, individuals may potentially be compensated with large sums of
money if the level of dysfunction they experience is great. The prospect of a large cash
settlement may well be a strong motivation to present one’s case in the worst light. Thus, it is
important to develop and refine methods to identify correctly individuals who are overreporting
their symptoms in compensation-seeking circumstances, as well as those who are honestly
responding.
In mental health evaluations, the most egregious type of impression management is
known as “malingering”. More technically, malingering is the intentional feigning or gross
exaggeration of a condition or symptoms for an external incentive (DSM-IV-TR, APA, 2000).
Malingering is listed in the DSM-IV-TR as a “V-code”, meaning it may be a condition that
warrants clinical attention, but is not a mental disorder. Two factors must be present for
malingering to be established. First, the individual must consciously control the production of
1
the false symptoms (distinguishing malingering from Conversion Disorder), and second, there
must be an external incentive for the production of the false symptoms (distinguishing
malingering from Factitious Disorder).
Several authors have suggested that in clinical settings there may be two distinct types of
malingering: psychiatric and neurocognitive/somatic (Larrabee, 1998; Iverson, Henrichs, Barton,
& Allen, 2002). Perhaps the most well-known type of feigning involves individuals malingering
psychiatric symptoms in order to portray themselves as having a significant condition such as a
psychotic disorder, which might exculpate them from a criminal penalty. Malingered
neurocognitive deficits may be less familiar. This involves individuals trying to portray
themselves as having primarily cognitive impairments such as memory problems. The research
literatures on these two types of malingered symptoms are usually separate, as are the assessment
techniques used to identify the feigning. The present study will bring these two literatures
together in an investigation of whether the MMPI-2 (which has traditionally been used to detect
psychiatric malingering) can contribute to identification of individuals malingering
neurocognitive/somatic deficit. More specifically, this study will evaluate a relatively new
MMPI-2 validity index, the Fake Bad Scale (FBS; Lees-Haley, English, & Glenn, 1991; LeesHaley, 1992), which has shown some promise in recent literature as a method for differentiating
neurocognitive malingerers from psychiatric malingerers. Before explaining this study in detail,
it is necessary to review several issues related to conducting malingering research, particularly
the methodology and relevant statistics for evaluating the classification accuracy of validity
scales or tests.
Methodology for Malingering Research
Two important issues in malingering research are the designs employed to validate
malingering tests and the techniques used to document the response set. Three main research
designs have been utilized to validate malingering tests and determine their accuracy in
classifying responders as honest or malingering. A simulation study design typically uses
“unimpaired” or nonclinical populations such as Psychology 100 students and assigns the
participants to respond to a test honestly or with instructions to “fake bad” (in other words, to
look cognitively or psychiatrically impaired on the tests). The accuracy of the test is estimated
with reference to the number of people in each group correctly classified on the basis of their test
results. While the simulation study design is appropriate for an initial validation study for a
2
malingering test due to its high internal validity, it has little generalizability to relevant clinical
populations unless a “clinical honest” group is included. A major drawback of this design is its
lack of external validity in that it does not examine the test’s utility in an actual forensic
population (real-world malingerers).
Another frequently used research design in the area of malingering is called a
differential-prevalence design. This design compares malingering test scores of a group at high
risk for feigning (compensation seekers) to a group with low risk for feigning (non-compensation
seekers). The malingering test’s positive classification rate should be higher in the compensation
seeking group, because there are thought to be more individuals exaggerating in this group. If no
difference is found in the rates of positive (malingering) scores in the two groups, serious
questions about the validity of the malingering measure should arise. The external validity of
this method is higher than in the simulation study, because individuals actually undergoing
clinical and forensic evaluations are used as participants. However, a major drawback to this
research design is that it cannot be used to assess the accuracy of test classifications, as the exact
criterion status of each subject is unknown.
Due to its logistical difficulty, a known-groups design is generally the last methodology
used in malingering test validation. This approach assesses the accuracy of a test’s classification
rate in actual clinical or forensic samples. A “gold standard” malingering measure is used to
assign the evaluees to honest or malingering groups. The accuracy of the new test is established
by determining its classification rate in the two groups identified by the criterion measure. This
research design has higher external validity than the simulation study because actual clinical
populations undergoing evaluations are evaluated. However, a drawback for this design is that
the validity of the gold standard test determines the ceiling of the estimated accuracy of the new
test. In addition, it is possible that only blatant malingerers may be identified by the external test
and thus, the results may be somewhat limited in terms of generalizability. Finally, because true
experimental control over the variable of malingering is not achieved, the issue of causality
cannot be addressed.
Rogers (1997) advocates requiring multiple supportive findings across the different
designs before accepting a malingering test as valid. Simulation studies are viewed as good
initial designs for evaluating new malingering tests because of the high level of control over the
subject’s response style (assigned malingerers vs. honest responders). Differential-prevalence
3
designs give information about the construct validity of the new procedure in clinical groups
thought to have different base rates of feigning. Known-group designs also use actual clinical
populations to raise external validity, and this design addresses the problem of internal validity
by classifying individual evaluees as honest or malingering based on the test sign from a gold
standard malingering measure. Minimally, validation using both simulation studies and knowngroups designs is thought to be necessary for establishing the efficacy of a new malingering
measure (Berry, Baer, Rinaldo, & Wetter, 2002).
Neuropsychometrics
The classification accuracy of all tests is traditionally expressed using specific statistics
and terminology. Sensitivity and specificity are parameters describing a test’s classification rate.
In the case of malingering tests, sensitivity is the percentage of participants who are malingering
who have a positive test sign. Specificity is the percentage of participants who are not
malingering who have a negative test sign. Sensitivity and specificity describe the accuracy of a
test sign, given that the criterion status of an individual is known, which unfortunately is rarely
the case in clinical practice. Two more clinically relevant statistics are Positive Predictive Power
(PPP) and Negative Predictive Power (NPP). PPP is the percentage of positive test signs that
correctly identify individuals who are malingering. NPP is the percentage of negative test signs
that correctly identify honest responders. Although PPP and NPP are partly dependant on the
base rate of malingering in a population, they are generally accepted as the most clinically
relevant classification parameters.
Unfortunately, the actual base rates of malingering which must contribute to determining
PPP and NPP are not known. However Strong, Greene, and Schinka (2000) used taxometric
analysis of items from the MMPI-2 to estimate the base rate of overreporting of symptoms on the
MMPI-2 in psychiatric inpatients and at a variety of VA medical units. They found that
approximately 27% of the psychiatric inpatients and 19% of the VA patients were overreporting
problems. More recently, the base rate for malingering after mild head injury was estimated to
be almost 40% based on a survey of forensic neuropsychologists (Mittenberg, Patton, Canyock,
& Condit, 2002). These base rate estimates for psychiatric and neurocognitive deficit
malingerers are important, because as noted above, along with sensitivity and specificity values,
base rates are used to determine PPP and NPP. While sensitivity and specificity are usually
assumed to remain constant as a property of the test and cutting score, PPP and NPP will vary
4
with base rates of malingering in various settings.
Detection of Malingering in Clinical and Forensic Settings
Clinical frameworks for identifying malingering vary. Although no formal criteria are
given for arriving at a determination of malingering, guidelines in the DSM-IV-TR (2000)
instruct the clinician to suspect malingering if two or more of the following conditions are
present: a medicolegal context, discrepancy between subjective and objective information, lack
of cooperation with the assessment or treatment, or the presence of Antisocial Personality
Disorder. However these criteria have been severely criticized (Rogers, 1997), and are probably
applicable only to psychiatric malingering. For diagnosis of cognitive/somatic malingering,
Slick, Sherman, and Iverson (1999) developed a set of criteria for identifying Definite
malingering of neurocognitive deficits (MNCD) that is more specific than the general guidelines
outlined in the DSM-IV-TR. Those criteria are 1) the presence of a substantial external
incentive, 2) definite negative response bias, as demonstrated by significantly worse-than-chance
performance on a well-validated measure of malingering, and 3) the behaviors meeting the
necessary criteria from the test data are not fully accounted for by psychiatric, neurological, or
developmental factors.
Slick and colleagues (1999) also propose definitions and criteria for Probable and
Possible malingered neurocognitive dysfunction. Probable MNCD is documented by the
presence of evidence strongly suggesting intentional exaggeration or fabrication of cognitive
dysfunction without plausible alternative explanations. The criteria for Probable MND are 1)
the presence of a substantial external incentive, 2) two or more types of evidence of fabrication
or exaggeration from neuropsychological testing, excluding definite negative response bias or
one type of evidence from neuropsychological testing (excluding definite negative response bias)
and one or more types of evidence of fabrication or exaggeration from self-report, and 3) the
behaviors meeting the necessary criteria from the test data or self-report are not fully accounted
for by psychiatric, neurological, or developmental factors. Possible MNCD is indicated by
evidence for intentional exaggeration or fabrication of cognitive dysfunction without plausible
alternatives. The criteria for Possible MNCD are 1) the presence of a substantial incentive, 2)
evidence of fabrication from self-report, and 3) behaviors meeting the criteria from self-report
are not fully accounted for by psychiatric, neurological, or developmental factors. Recent
published research has begun to apply the Slick et al. (1999) criteria to detection of
5
cognitive/somatic malingering (Greve, Bianchini, Mathias, Houston, & Crouch, 2003).
Detecting Feigned Neurocognitive/Somatic Symptoms
Neuropsychological tests assess cognitive functions that may be depressed as a result of
brain damage or dysfunction. However, in addition to integrity of brain functioning, these tests
are also sensitive to a variety of potentially confounding factors, such as medication effects,
psychiatric conditions and adequate effort from the test taker. Thus, among other concerns, the
validity of these tests depends on obtaining optimal effort from the test taker. In compensationseeking circumstances, individuals may deliberately underperform on tests designed to measure
their current cognitive ability. As previously noted, this phenomenon is generally known as
malingering neurocognitive deficits. Individuals involved in circumstances in which there is
potential monetary incentive for being impaired, such as in litigation or worker’s compensation,
are thought to be more likely to feign neuropsychological impairment (Pankratz & Binder,
1997). While results from standard neuropsychological tests are sometimes used to draw
inferences about motivation, previous research suggests that performances on these tests alone
may be inadequate indicators of malingering (Heaton, Smith, Lehman, & Vogt, 1978). This
concern prompted the creation of separate, objective indices specifically intended to detect
feigned cognitive deficits. These tests were designed to be relatively insensitive to brain
dysfunction but broadly reflective of the level of effort given, hence the common name
“motivational tests”.
Neuropsychological tests are designed to detect and measure significant brain
impairment, so they tend to be relatively easy for the unimpaired individual. Therefore, in order
to feign cognitive deficits, the typical intact individual must either deliberately try to answer the
test items incorrectly or exert little effort to perform adequately. While intentionally faking
answers or expending inadequate effort on motivational tests, malingerers may actually go too
far and perform more poorly than an individual with significant brain impairment. In extreme
cases, malingerers may perform significantly below chance on dichotomous, forced-choice
recognition memory tasks, providing evidence for intentional feigning. However, Guilmette et
al. (1994) and others have reported that use of a strict significantly below chance performance is
insufficiently sensitive to more subtle forms of malingering. This led to the current practice of
comparing results on motivational tests from compensation-seeking individuals with
questionable evidence of brain damage to non-compensation seeking groups with objective
6
evidence from neuroimaging of moderate to severe head injury. Performances on motivational
tests falling below cutoff scores established in the non-compensation-seeking, but significantly
cognitively impaired group raise the possibility of malingering in individuals with little or no
documented brain damage. Three well-validated tests used to detect neurocognitive malingering
are the LMT, the TOMM, and the VSVT.
Letter Memory Test (LMT).
The Letter Memory Test (LMT; Inman, Vickery, Berry, Lamb, Edwards, & Smith, 1998)
is a computer-administered, 45-item, forced-choice recognition task that uses consonant letters as
stimuli. It is entirely computer based and has constant delay periods of 5 seconds across all
trials. In an effort to increase chances of detecting deliberately poor effort from a test taker, the
face difficulty of the test is manipulated in two ways: the number of letters to be remembered
and the number of choices amongst which the target stimulus is identified. The manipulation
was based on the hypothesis that malingerers “titrate” their performances to “pass” obviously
easy items, but “fail” presumably difficult tasks.
The LMT (Inman et al., 1998) was validated following many of the methodological
suggestions by Rogers (1997) for malingering research. Participant groups included patients
with moderate to severe head injury, college students, community volunteers, depressed
psychiatric patients, non-compensation-seeking neurological patients, and compensation seeking
individuals with MHI. Inman et al. (1998) found that the LMT discriminated honest responders
from those putting forth poor effort with moderately high levels of accuracy. In the initial
validation study combined patient groups were contrasted with combined analogue malingerers
and a strong effect size (Cohen’s d = 2.00) was found. Using a cutting score of less than 93%
correct, the authors found a specificity of 100%, sensitivity of 84%, and an overall hit rate of
92%. A high internal consistency reliability coefficient alpha was found at .944. A strong effect
was also found for distinguishing known groups of independently identified malingerers from
honest neurological patients and the sensitivity and specificity were very good at 95% and 100%
respectively.
In a study by Inman and Berry (2002), the LMT attained a high hit rate for the detection
of malingering in an “enhanced” simulation study of college students with a history of a head
injury. The LMT again had good sensitivity and specificity (73% and 100%, respectively).
Orey, Cragar, and Berry (2000) found lower sensitivity for the LMT at 58% and excellent
7
specificity of 100% for head injured college students in a simulation study. Vickery, Berry,
Dearth, Vagnini, Baser, Cragar, & Orey (in press) found an even higher sensitivity (84.8%) and
slightly lower specificity (93.5%) for head injured participants and community volunteers in a
simulation design. Thus, the LMT appears to have adequate sensitivity and good specificity as a
measure for detecting feigned neurocognitive impairment.
Test of Memory Malingering (TOMM).
The Test of Memory Malingering (TOMM; Tombaugh, 1996) consists of 50 line
drawings of common objects. There are three trials. In Trial 1, the 50 pictures are presented one
at a time for 3 seconds each. After presentation of the 50 drawings, recognition testing is
administered, consisting of 50 trials, each with one of the previously shown target drawing
paired with a novel foil. The test taker chooses the pictures he or she remembers having been
shown previously. In Trial 2, the same format as Trial 1 is used and the 50 old drawings from
Trial 1 are presented again, followed by a test involving 50 pairs of pictures (target and novel foil
pictures) in which the previously seen item must be chosen. A retention trial is also
administered after a delay of approximately 20 minutes. The retention trial involves testing only
with the 50 target drawings and 50 new foils; the test taker chooses the one he or she remembers
being shown before the delay. Development of this test was based on evidence that recognition
memory (as opposed to free recall memory) is typically left intact even after severe neurological
impairment. Thus, the TOMM implicitly manipulates face difficulty level by presenting a task
that is erroneously thought by many test-takers to be difficult.
The development of the TOMM was conducted in two phases using nonclinical subjects
and a clinical sample with diverse cognitive impairments. The nonclinical population was found
to be extremely accurate at identifying targets. In the first trial 94% of the targets were correctly
identified as were over 99% of the targets for the remaining two trials. Individuals with various
cognitive impairments also identified targets at a rate almost as high as the nonclinical
population, establishing the test’s insensitivity to significant brain impairment. Based on these
results, a cutoff of 45/50 (90%) was established for identifying poor effort.
In the second series of studies, the test was first validated on a group of “at-risk”
malingerers in a differential-prevalence design. The scores for the compensation seeking group
were significantly lower than for the previous two groups of honest responders. Additionally, in
a simulation study, a sensitivity and specificity of 100% for individuals instructed to fake or
8
respond honestly on the TOMM were obtained with the suggested cutoff score. Finally using a
criterion of 45/50 (90% correct) on the second trial correctly classified 95% of all non-demented
patients and 91% of all patients (including patients with Alzheimer’s Disease) as not malingering
in a validation sample of 475 community volunteers and 161 neurologically impaired patients
(Tombaugh, 1997). A later study by Rees, Tombaugh, Gansler, and Moczynski (1998)
demonstrated converging validity using simulators, high-risk populations for malingering to
occur, and a computer form of the test. Reliability coefficients were not reported for the TOMM.
Based on this evidence, the TOMM appears to be one of the best validated motivational tests
available.
Victoria Symptom Validity Test (VSVT).
The Victoria Symptom Validity Test (VSVT; Slick, Hopp, Strauss, & Thompson, 1996)
is a computerized, forced-choice digit recognition test that includes two manipulations of face
difficulty level. It consists of 48 trials, with 24 “easy” items and 24 “difficult” items with equal
numbers of both types of items randomly presented in 3 blocks of 16 trials each. The “easy”
items are so labeled because the target five digit numbers and foils share no common digits (i.e.
46923 and 50187), and thus the correct answer is extremely easy to identify. The “difficult”
items are those for which the target numbers and foils are identical with the exception of
reversing the middle numbers (i.e. 46923 and 46293). Test-takers see the target number for 5
seconds (s) on the screen followed by a delay of 5s (for block 1), 10s (for block 2), or 15s (for
block 3), and then the target number and a foil are presented. The increasing delays before recall
across time are thought to increase face difficulty level. As the test-taker is told to choose the
target number as quickly and accurately as possible, responses are scored for errors and response
latency.
The VSVT was initially validated using a differential-prevalence design which compared
a sample of compensation-seeking (CS) patients and a group of patients not seeking
compensation (NCS). The majority of the NCS group was epilepsy surgery patients. After
analyzing the test scores using a traditional symptom validity testing significantly “below
chance” cutoff and obtaining excellent specificity but modesty sensitivity, Slick et al. (1996)
proposed a three-level classification system (valid, questionable, and invalid) in order to increase
the VSVT’s sensitivity in classifying malingerers. Significantly below chance performance falls
in the “invalid” category (e.g., scores < 9 on the easy or difficult items), performance
9
significantly above chance falls within the “valid” category (e.g., scores > 15 on the easy or
hard items), and the additional third category (questionable) applies to scores that fall within the
remaining 90% confidence interval around chance performance (e.g., scores between 9 and 15
inclusive on easy or difficult items).
Classification rates based on this three level system suggest that it is moderately effective
in classifying malingerers and non-malingerers on the basis of VSVT test scores. Slick et al.
(1996) gave percentages of correct classification for each of four groups using the VSVT. One
hundred percent of the controls and non-compensation seekers, 85% of the compensation
seekers, and 19% of the feigning group were classified as “valid”. Fifteen percent of the
compensation-seeking group and 81% of the feigning group were classified as either
“questionable” or “malingering/invalid”. In an independent cross-validation study, Doss,
Chelune, and Naugle (1999) found similar classification rates using the VSVT in a general
clinical setting with CS and NCS. They found that for the “easy” items, 98% of the NCS
patients and 95% of the CS patients had scores in the “valid” range. Conversely, 89% of the
NCS group and only 52% of the CS group had “valid” results for the “difficult” items. Eleven
percent of the NCS group was classified as either “questionable” or “malingering/invalid”.
Grote, Kooker, Garron, Nyenhuis, Smith, & Mattingly (2000) proposed that a 90%
correct cutoff was more effective than Slick’s original criteria for discriminating CS from NCS
groups using the VSVT. VSVT “difficult” memory scores of 16-20 were uncommon among the
NCS group in their study, but scores in this range are above chance and considered to be “valid”.
Their results indicate that the previously suggested cutoff score based on “above chance”
performance was too lenient and not adequately sensitive for detecting individuals who were not
putting forth their best effort. Grote et al. (2000) proposed that scores in the 16-20 range should
not be classified as “valid”, as they are rarely seen in patients with brain disease. Applying this
revised criteria, two-thirds of the CS group scored below the cutoff, suggesting greater
sensitivity. Additionally, using a cutoff of < 21 difficult items correct as indicative of feigning
retained adequate specificity in the NCS sample. Thus, with the revised cutting score
recommended by Grote et al. (2000), the VSVT appears to be one of the best validated
procedures for detecting malingered neurocognitive deficit.
Detecting Feigned Psychiatric Symptoms
In order to diagnose mental disorders, psychologists and other clinicians rely heavily on a
10
patient’s self report of symptoms. Most individuals seeking psychological treatment are
thought to try to represent themselves honestly to their clinician. However, if receiving a
psychiatric diagnosis potentially leads to monetary compensation, it may increase the likelihood
that individuals will fabricate or exaggerate their symptoms. The existence of external incentives
for a psychiatric diagnosis is in fact relatively common. For instance, patients can obtain
monetary compensation for a psychiatric illness (i.e., SSI) or avoid prison after committing a
crime by successfully pleading insanity. The frequency of malingered psychological disorders is
not clear. However, previous research suggests that it may occur in as many as 30% to 50% of
forensic and compensation-seeking cases (Berry et al., 2002). Given these estimated base rates,
it is clear that the honesty of the compensation-seeking test taker must be directly assessed in
order to verify reported psychiatric symptoms as authentic.
Malingered psychiatric complaints can be detected in several ways: through behavioral
observations, review of available records, or grossly inaccurate self report on standard
psychiatric measures. Additional objective data regarding the validity of the test taker’s
responses are available in two major forms: dedicated malingering measures such as the SIRS
and SIMS (see below) and validity scales from multi-scale inventories such as the MMPI-2 (see
below). Results from these procedures can increase a clinician’s confidence about the validity of
the person’s self-report, and well-validated measures for detecting malingered psychopathology
will be presented next.
Structured Interview of Reported Symptoms (SIRS).
The Structured Interview of Reported Symptoms (SIRS; Rogers, Bagby, & Dickens,
1992) is a structured interview designed to detect malingered psychopathology and other
response sets. It has 172 items that contribute to 8 primary scales and 5 supplementary scales.
Three of the primary scales focus on very unusual symptom presentations (Rare Symptoms,
Symptom Combinations, Improbable and Absurd Symptoms), four assess the range and severity
of symptoms (Blatant Symptoms, Subtle Symptoms, Selectivity of Symptoms, Severity of
Symptoms) and the last compares Reported versus Observed Symptoms. The questions cover a
wide range of psychopathology and symptoms that are unlikely to be true. Scoring is based on a
format common for structured interviews, “0” for no endorsement of the symptom, “1” for
partial endorsement, and “2” for full endorsement. There are 4 possible global classifications for
each primary scale based on a patient’s answers; honest, indeterminate, probable [feigning], or
11
definite [feigning]. The number of primary scales falling in the probable and definite feigning
range is the major interpretive index for the SIRS.
The SIRS is a psychometrically strong instrument. The reliability of the SIRS scales
appears to be good with mean alpha coefficients of .86 for primary scales and .75 for
supplementary scales (Rogers, Gillis, Dickens, & Bagby, 1991; Rogers, Gillis, & Bagby, 1990).
In addition, inter-rater reliability for the SIRS is high with a mean of .96 and .98 in two reported
studies (Rogers et al., 1991; Rogers, Kropp, Bagby, & Dickens, 1993).
Regarding validity, the overall accuracy of classification for probable feigners based on
single primary scale scores is moderately high (between 43.6 - 92.7%) with Symptom
Combinations and Improbable and Absurd scales having the lowest accuracy at 70.4% and
43.6% respectively (Rogers et al., 1992). Given the variable classification rates with individual
primary scales, Rogers et al. (1992) recommend using a stringent criterion of 3 out of the 8
primary scales in the “probable feigning” or “definite feigning” range as an indication of
feigning in order to minimize false positive classifications. Rogers et al. (1992) indicate that the
SIRS classification accuracy using this criterion has moderate sensitivity (48.5%) and excellent
specificity (99.5%). The very high specificity rate insures high PPP, which is reassuring to
clinicians with reservations about labeling a patient as a malingerer.
The SIRS has been shown to be effective in discriminating between feigners and honest
responders in both simulation and known-groups designs. Significant differences in the
predicted direction (i.e., feigners had higher scores) were found between simulators and honest
responders on all of the primary scales designed to detect feigning (Rogers et al., 1991). The
authors also found similar results using independently identified suspected malingerers. Given
the high accuracy of the SIRS, it is often used as a “gold standard” for detecting malingering of
psychiatric symptoms.
Screening Index of Malingered Symptoms (SIMS).
The SIMS (Smith and Burger, 1997) is a 75-item, true-false screen for feigned
psychopathology and cognitive impairment. The SIMS consists of 5 non-overlapping scales that
cover different types of feigning; Af (affective disorders), P (psychosis), N (neurological
impairment), Li (low intelligence), and Am (amnesia). A total score is also calculated from the
raw scores of the 5 scales. Smith and Burger (1997) report a hit rate of 94.5% for the SIMS in
discriminating between analog malingerers and honest responders. Rogers, Hinds, and Sewell
12
(1996) found the SIMS to be moderately effective in identifying feigned protocols. In their
study, using a cutoff of total score >16 as an indicator of feigning, the SIMS had a good PPP
(.87) and moderate NPP (.62). The authors also provided classification rates using a total score >
40 which increased the NPP to .94 while PPP dropped to .49. In a known-groups design with
individuals undergoing pretrial psychological evaluations for competency to stand trial or
criminal responsibility, Lewis, Simcox, and Berry (2002) found relatively high NPP (100%)
using a SIMS total score cutoff of > 16. In light of the high NPP value, these authors
recommend use of the SIMS total score (with a cutoff score larger than 16) as a screening
malingering measure.
Minnesota Multiphasic Personality Inventory-2.
The Minnesota Multiphasic Personality Inventory (MMPI; Hathaway and McKinley,
1943) was developed to serve as a self report inventory of psychopathology and was one of the
first personality measures to include multiple validity scales, and became one of the most widely
used and validated psychological tests. The Minnesota Multiphasic Personality Inventory-2
(MMPI-2; Butcher, Dahlstrom, Graham, Tellegen, & Kaemmer, 1989) improved upon the
original version of the test in several ways such as increasing the representativeness of the
standardization sample and changing some of the language in the test that had become obsolete
or was sexist.
The MMPI-2 retained four validity scales carried over from the original version: CS, L, F
and K. The Cannot Say (CS) scale represents the total number of items omitted or responded to
as both true and false. The Infrequency (F) scale consists of items that were infrequently
endorsed by the normative sample (< 10% of the sample endorsed the item). Typically,
individuals who endorse a high number of these items are attempting to present themselves in an
unrealistically negative light, are randomly responding, or are severely disturbed. The Lie (L)
scale consists of items denying common flaws or behaviors that most people have engaged in at
some point such as telling a white lie or failing to brush their teeth twice a day every day.
Individuals elevate this scale when they are attempting to portray themselves in an unrealistically
positive light or “faking good”. The Correction (K) scale was developed to compensate for
individuals trying to mask their level of disturbance. In addition to identifying subtle denial, the
correction score derived from this scale is added to several clinical scales in an attempt to
compensate for underreporting of symptoms. A fifth validity scale, the F-K scale was created by
13
taking the raw F scale score (tapping over-reporting symptoms) and subtracting the K scale
score (tapping defensiveness). The more positive the resulting score, the more likely the
individual is over-reporting his or her symptoms.
In addition to updating language and reducing bias in the item wording, the MMPI-2 also
added new validity scales in an attempt to improve the detection of different response styles.
The Backpage Infrequency (Fb) scale was developed to address the concern that all of the
original F scale items appeared in the first half of the test. The Fb items consist of questions on
the last half of the MMPI-2 that were rarely endorsed by the standardization sample. The newer
Infrequency Psychopathology (F(p); Arbisi & Ben-Porath, 1995) scale consists of items rarely
endorsed by a large sample of psychiatric patients as well as the MMPI-2 standardization
sample. Again, individuals who endorse a high number of these items are thought to be feigning
their symptoms since even severely psychologically disturbed individuals rarely endorsed these
items. The Superlative (S) scale is a new “fake-good” scale which is elevated when the
respondent is trying to give an even stronger positive impression compared to the previous “fakegood” scales. The S scale consists of items indicating unrealistically positive presentation of
oneself.
In addition to the more traditional validity scales designed to detect overreporting and
underreporting of psychiatric symptoms, the MMPI-2 added scales designed to detect
inconsistent responding, nay-saying, and yea-saying (Butcher et al., 1989). The Variable
Response Inconsistency (VRIN) scale provides an index of how inconsistently an individual
responds on the MMPI-2, and it consists of 67 pairs of items with similar or opposite content.
Each time a pair is answered inconsistently, a point is added to the VRIN raw score. The True
Response Inconsistency (TRIN) Scale provides an indication of the individual's yea-saying
(indiscriminant true responses) and nay-saying (indiscriminant false responses). This scale
consists of 23 item pairs that have opposite item content. If the individual answers the same for
both questions in the pair a point is added or subtracted from the raw score, with higher TRIN
raw scores indicating a yea-saying response style, and lower TRIN raw scores indicating
indiscriminant nay-saying. VRIN and TRIN are helpful for identifying profiles which are clearly
a result of inconsistent or random response styles before examining the traditional validity scales
which consist of infrequently endorsed items that may be elevated by this type of responding.
The MMPI-2 has a great deal of research supporting its use as a screening measure for
14
malingering. Four scales which have been shown to detect over-reporting of symptoms on the
MMPI-2 are the F, Fb, F-K, and F(p) scales. Berry and colleagues (2002) summarize
information about the 4 feigning indices from multiple studies which suggests that these scales
have moderately high sensitivity and specificity. The F-K and F scales had the highest average
classification rates in individuals who were malingering (sensitivity values of .84 and .83,
respectively). The scale with the highest classification rate in honest responders was F(p)
(specificity of .92, with the remaining 3 scales approximately 4-8 percentage points lower).
However, given limited PPP values using these scales, Berry et al. (2002) suggested that their
use be limited to a screening function.
The MMPI-2 traditional validity scales have been well-established as fairly effective
ways to detect psychiatric malingering. While these scales have demonstrated the ability to
differentiate individuals malingering psychiatric symptoms from honest patients, they have not
been successful at detecting individuals malingering somatic or neurocognitive deficits
(Larrabee, 1998).
The Fake Bad Scale for the MMPI-2
As noted in the above sections, the literature and tests used to detect psychiatric and
neurocognitive malingering are relatively separate. While the traditional MMPI-2 validity scales
(i.e., F, F(p), Fb, and F-K) have been used to detect over-reporting of psychiatric symptoms,
they may not have adequate sensitivity for identifying individuals who are feigning in a
neuropsychological context.
Larrabee (1998; in press) reports only borderline significant group differences on scales F
and Fb between neurocognitive feigners meeting the Slick et al. (1999) criteria and individuals
with actual CHI. In addition, F(p) and F-K were not significantly different for the two groups.
Greiffenstein, Gola, and Baker (1995) found that F and F-K did not differentiate individuals with
traumatic brain injury (TBI) from probable malingerers despite significant group differences on
neuropsychological and cognitive malingering measures. Further, in a follow-up study,
Greiffenstein, Baker, Gola, Donders, and Miller (2002) found that the traditional MMPI-2
validity scales were insensitive to large group differences in symptom-history conformity. In
other words, the traditional fake bad scales do not appear as effective in detecting the feigning
that occurs in a neuropsychological context as compared to detecting feigning seen in psychiatric
settings.
15
These results suggest that malingering can occur in two ways during neuropsychological
evaluations: as feigned memory impairment and/or exaggerated somatic complaints (Larrabee,
1998). Feigned memory impairment can be identified by using separate, objective measures of
neurocognitive malingering which were developed to detect feigned memory deficits by
documenting below chance performance or scores below normative groups of non-compensation
seeking. Meta-analysis suggests that these tests have empirical support for effectiveness at
detecting feigned memory deficit (Vickery, Berry, Inman, Harris, & Orey, 2001). However, in
addition to feigned memory deficits, exaggerated somatic complaints can also occur in a
neuropsychological assessment. Complaints such as pain, parathesias, and malaise are common.
An individual may exaggerate these complaints in order to strengthen his or her claim of
impairment. Individuals undergoing personal injury litigation may also be tempted to exaggerate
somatic complaints that occur as a result of their emotional distress. Research reviewed above
suggests that individuals feigning neurocognitive deficit or somatic complaints answer the
MMPI-2 in a manner that goes largely undetected by the “traditional” fake-bad scales (F, Fb,
F(p), & F-K).
Fortunately, Lees-Haley and colleagues (Lees-Haley et al., 1991; Lees-Haley, 1992) have
developed a validity scale intended to detect malingered somatic symptoms and malingered
neurocognitive deficit most often seen in individuals undergoing personal injury litigation or
neuropsychological assessment, the Lees-Haley Fake Bad Scale (FBS; Lees-Haley et al., 1991;
Lees-Haley, 1992). The FBS was constructed on a rational content basis by reviewing previous
unpublished MMPI test data from malingerers (Lees-Haley, 1991). The conceptualization of
individuals malingering somatic symptoms is primarily that they portray themselves as (overly)
honest people who have suffered an (overly) terrible injury. Lees-Haley chose MMPI items that
fit this type of goal-directed behavior thought to be associated with individuals feigning
emotional distress, such as trying to: 1) appear honest; 2) appear psychologically normal except
for the influence of the alleged cause of injury; 3) avoid admitting pre-existing psychopathology;
4) minimize the impact of previously disclosed pre-existing complaints; 5) hide or minimize preinjury antisocial or illegal behavior; and 6) present a degree of injury or disability within
plausible limits. In other words, this scale was developed to identify individuals who were trying
to appear honest and well functioning before an injury, with all problems the person is
experiencing solely the result of an injury. In its initial stages of validation, the FBS has some
16
support in its ability to detect both somatic and malingered neurocognitive deficit with
reasonable accuracy.
Multiple lines of evidence suggest that the FBS does not function in the same ways as the
traditional MMPI-2 validity scales, in that it is not thought to be sensitive to feigned psychiatric
symptoms. In fact the FBS does not have a significant item overlap with the traditional validity
scales (only 7 out of 42 items on the FBS overlap with the traditional validity scales). Thus, it
would be expected that it would measure something other than psychiatric overreporting and
would have lower correlations with the validity scales designed to detect psychiatric feigning.
Additionally, the average correlations between the FBS and the traditional fake-bad scales were
lower (average r = .23) than the average correlations within the traditional fake bad scales
(Larrabee, in press). Moreover, in another study, the classification accuracies of F, Fb, F(p), and
F-K were superior to the FBS when identifying individuals instructed to respond honestly or to
malinger mental illness (Rogers, Sewell, & Salekin, 1994). Thus, the FBS appears to tap a
different construct than the traditional fake-bad scales on the MMPI-2 (F, Fb, F(p), & F-K).
While the FBS is thought to be insensitive to feigned psychiatric symptoms, current
research suggests that it is more accurate than the traditional validity scales in the detection of
malingered neurocognitive deficit (MNCD) or feigned somatic symptoms. For example, the
classification accuracy of the FBS has reportedly been high when applied to forensic
populations. Lees-Haley et al. (1991) reported sensitivity values of 93% for identified personal
injury feigners, 88% for simulators of emotional distress, and specificity values of 83% using an
FBS cutoff of > 20. Iverson, Henrichs, Barton, and Allen (2002) also found high sensitivity in
identifying prison inmates coached to malinger using a > 20 cutoff, however, the resulting
specificity was of concern (up to 30% of the nonmalingering groups were misclassified as
malingering). The authors recommended using a cutoff of FBS > 26 in order to reduce the
number of false positive errors. The FBS also appears to distinguish feigning of certain specific
types of symptoms more effectively than the traditional validity scales. The FBS distinguished
simulators feigning neurotic vs. psychotic symptoms with higher accuracy than the traditional
MMPI-2 validity scales (Cramer, 1995).
Comparisons of suspected feigning and honestly responding groups also suggest the FBS
scores were higher in somatic feigning groups. In a differential-prevalence design, litigating
patients and individuals with mild head injuries seeking financial compensation have higher FBS
17
raw score means than clinical patient groups and groups with documented head injury
(Tsushima and Tsushima, 2001). These authors reported that litigating patients had a
significantly higher mean (FBS = 20.7) than clinical patients (FBS = 17) and controls (FBS =
11). Additionally, patients with mild head injury seeking financial compensation have higher
FBS means and more individuals identified as feigning (using cutoffs of FBS > 20 and 21,
respectively) compared to nonlitigating head-injured patients (Greiffenstein, Baker, Gola,
Donders, & Miller, 2002; Ross, Millis, Krukowski, Putnam, & Adams, in press).
FBS scores have also been shown to be higher in groups identified as feigning based on
independent criteria suggestive of feigned neurocognitive deficits or somatic complaints.
Larrabee (in press) reports classification rates for evaluees with documented closed head injury
(specificity = 80.77%) compared to individuals identified as definite malingerers of
neurocognitive deficit (sensitivity = 86.21%) with the FBS. In addition, a logistic regression
indicated that the predictive accuracy of the FBS was beyond that accounted for by other MMPI2 validity and clinical scales for detecting malingered neurocognitive deficits. Larrabee (in
press) found the best overall classification accuracy was achieved using a cutoff score of FBS >
22.
Despite the positive findings which support the ability of the FBS to detect feigning seen
in personal injury or neuropsychological settings, there has been some concern about its utility.
Recently, Butcher and colleagues have raised questions regarding the construct validity of the
FBS and its efficacy in detecting feigning (Butcher, Arbisi, Atlis, & McNulty, 2003). In a
review of the current studies on the efficacy of the FBS they conclude that the number of false
positives yielded by this scale is unacceptably high. In addition, they believe that the focus of
the scale on somatic complaints instead of rare symptoms may not differentiate malingerers from
bona fide patients. Butcher et al. (2003) suggest future studies examine the efficacy of the FBS
using known groups where there is more confidence in the classification of the patient as
malingering or honest. Thus although, the FBS appears to be the most promising MMPI-2
validity scale for detecting MND, and other types of malingering such as somatic malingering
most often seen in neuropsychological and personal injury settings, concerns have been raised by
some authors about its specificity rates.
Purpose of the Present Study
Past research using the FBS suggests that it shows promise as a way to detect malingered
18
neurocognitive deficit using the MMPI-2, a commonly used measure of psychopathology,
whereas the traditional MMPI-2 validity scales are well-supported for the detection of
malingered psychiatric symptoms. However, thus far, there has not been a study examining the
FBS’s accuracy when directly comparing individuals who are malingering somatic or
neurocognitive deficits (such as those seen in personal injury or head injury litigation) with
individuals who are malingering psychiatric symptoms (such as in workers’ compensation claims
of psychological disability). Thus, given the relatively separate nature of research on psychiatric
and cognitive malingering, the purpose of the present study is to evaluate the classification
accuracy of the FBS using a known-groups design. Specifically, a group of identified psychiatric
malingerers will be compared with groups of identified malingering neurocognitive deficits and
identified honest responders.
A forensic sample of 180 personal injury and head injury claimants and workers’
compensation cases was divided into 5 groups on the basis of scores on objective psychiatric and
neurocognitive malingering measures: (1) feigning psychiatric symptoms only, 2) feigning
neurocognitive/somatic symptoms only, 3) feigning both psychiatric and neurocognitive
symptoms, 4) honest responders and 5) indeterminate responders. Cases with indeterminate
classification on one or both types of malingering were excluded from analysis as the nature of
their response style was considered uncertain.
After the four clinical groups of interest were identified, the MMPI-2 validity scale
results were compared in order to determine which scales best differentiated the groups. The
traditional validity scales (F, Fb, F(p), and F-K) were expected to be most effective at identifying
the psychiatric feigning group. The FBS was predicted to be most effective at identifying the
NCF and PSY/NCF groups.
19
Copyright © 2003 by Victoria Louise Vagnini. All rights reserved.
20
Chapter Two
Method
Participants
The participants initially included 180 consecutive patients evaluated at the Lexington
Forensic Institute who signed an IRB approved consent form for the anonymous use of their
data. They included 94 workers’ compensation cases and 86 head injury cases. The former
group was undergoing psychological assessments in order to determine level of psychological
damage from an alleged injury experienced at work. The latter group was undergoing
neuropsychiatric assessment to determine level of brain damage and/or psychological damage as
a result of an alleged head injury.
Of the initial 180 cases, 8 did not receive the MMPI-2. Remaining MMPI-2 results were
screened for the Cannot Say raw score exceeding 29 (1 participant excluded) and VRIN T-score
>80 (1 participant excluded). No cases were excluded on the basis of TRIN > 100. The 170
remaining participants were classified as a psychiatric feigner (PSY), neurocognitive feigner
(NCF), feigning both psychiatric and neurocognitive symptoms (PSY/NCF), honest (HON), or
indeterminate (I) based on their responses on the previously mentioned objective malingering
tests: SIRS, LMT, VSVT, TOMM, and in some cases, the SIMS. Systematic rules were
developed to allow initial classification of each participant as feigning, honest, or indeterminate
for each type of malingering (e.g., psychiatric or neurocognitive). In a final stage, the possibility
of combined types of feigning was addressed.
The psychiatric feigning group (PSY) was determined primarily using the SIRS.
Individuals were classified as feigning psychological symptoms if at least 3 out of the 8 primary
scales were in the “probable feigning” or “definite feigning” range. An individual was classified
as psychiatric honest (or not feigning psychiatric symptoms) if at least 6 out of the 8 primary
scales were identified as being in the “honest” range, and no scales were in the “probable
feigning” or “definite feigning” range. All other SIRS results were classified as indeterminate
on psychiatric feigning status. Forty of the evaluees classified as indeterminate by the SIRS had
been administered the SIMS in addition to the other malingering measures. In these cases, if the
individual failed the SIMS (total score > 40), he or she was classified as feigning psychiatric
symptoms. If the individual who was indeterminate passed the SIMS, then he or she remained
classified as indeterminate. If an individual was classified as a psychiatric feigner and was not
21
classified as a neurocognitive feigner (see below), he or she became part of the psychiatric
feigning only group (PSY).
The neurocognitive feigning group (NCF) was determined using the three tests of
neurocognitive malingering mentioned above; the LMT, the TOMM, and the VSVT.
Individuals were classified as feigning if at least two out of the three test results were indicative
of feigned neurocognitive deficits (LMT < 93%, TOMM < 90% on Trial 2, VSVT < 21
“difficult” items correct). Individuals who were identified as responding honestly on all 3 (the
LMT, TOMM, and VSVT) were considered to be honest responders in regards to neurocognitive
feigning status. Individuals who were classified as feigning on only one of the three
motivational tests were classified as indeterminate for neurocognitive feigning status.
Individuals who were classified as feigning on at least two out of the three tests (LMT, TOMM,
and/or VSVT) and identified as honest or indeterminate on the SIRS (and SIMS where
applicable) were classified as neurocognitive feigners only (NCF).
The psychiatric and neurocognitive feigning group (PSY/NCF) included those
individuals who were classified as feigning on the psychiatric malingering measures (SIRS, and
SIMS when applicable) and identified as feigning on at least two out of the three neurocognitive
malingering tests (LMT, TOMM, & VSVT).
Individuals were identified as Indeterminate (I) if they were classified as indeterminate
on the SIRS (and subsequently passed the SIMS), identified as feigning on only one of the
neurocognitive malingering tests (LMT, TOMM, & VSVT), or identified as indeterminate by the
SIRS and identified as feigning on only one of the neurocognitive malingering tests (LMT,
TOMM, VSVT). Honest responders (HON) were those who were classified as honest on the
SIRS, LMT, TOMM, and VSVT.
Of the 170 participants with valid MMPI-2 data, 54 passed the SIRS, LMT, VSVT, &
TOMM and were classified as honest responders (HON). Nine participants failed the SIRS or
were classified as indeterminate on the SIRS but failed the SIMS, and passed at least 2
neurocognitive malingering tests. These 9 individuals were identified as psychiatric feigners
(PSY). Forty participants failed at least 2 of the neurocognitive malingering tests, and passed or
were classified as indeterminate by the SIRS (and then passed the SIMS). These individuals
were identified as neurocognitive feigners (NCF). Seventeen participants failed the SIRS and
failed at least 2/3 neurocognitive malingering tests. These individuals formed the psychiatric and
22
neurocognitive feigning group (PSY/NCF). Fifty participants were classified as indeterminate
on the SIRS (and passed the SIMS if applicable) and failed only one neurocognitive malingering
test. These individuals were classified as Indeterminate (I), and excluded from further analysis.
Materials
The short, standard psychological battery of tests that was administered to individuals
undergoing a worker’s compensation evaluation included the MMPI-2, Behavioral Health
Inventory, and Kaufman Brief Intelligence Test.
The standard neuropsychological battery of tests given to individuals undergoing a head
injury evaluation included the Wechsler Adult Intelligence Scale-III, Wechsler Memory ScaleIII, Boston Naming Test, Controlled Word Association Test (COWA), Reitan-Klove Sensory
Perception Test, Finger Tapping Test, Grip Strength Test, Grooved Pegboard, Wisconsin Card
Sorting Test (WCST), Brief Test of Attention (BTA), and the MMPI-2.
Malingering tests given to all evaluees were the SIRS, TOMM, LMT, and VSVT. A
subset also completed the SIMS.
Procedure
Each evaluee was interviewed by the evaluating psychiatrist and received a psychological
or neuropsychological battery of tests administered by 1 of 4 certified Master’s level clinicians
(supervised by a licensed clinical psychologist). Each individual was also given psychiatric
(SIRS, and sometimes the SIMS) and neurocognitive malingering tests (LMT, TOMM, and
VST) using standard procedures as part of the test battery.
After the reports were written, files were reviewed to identify evaluees who had given
consent for archival use of their data. RA1 deleted identifying information and printed out
copies of reports from the consenting participants. RA2 coded demographic and test
information. Honest, indeterminate, and feigning groups were determined by review of results
from the psychiatric malingering test and the neurocognitive malingering tests as described
earlier. Results from participants classified as indeterminate were removed from further
consideration.
23
Copyright © 2003 by Victoria Louise Vagnini. All rights reserved.
24
Chapter Three
Results
Descriptives
As previously noted, the number of participants classified in each study group was HON
= 54, PSY = 9, NCF = 40, and PSY/NCF = 17. Table 1 presents demographic characteristics for
the 4 study groups. One-way ANOVA or Π2 analyses, as appropriate, indicated that there were
no significant differences between the groups on the demographic variables of age (M = 42.3, SD
= 10.9), F(3, 119) = 1.589, p = .196; education (M = 12.0, SD = 2.6), F(3, 119) = .333, p = .802;
gender (ranged from 66.7% male in the PSY group to 77.8% in the HON group, Π2 = .909),
ethnicity (percentage of white participants ranged from 96.3% - 100%, Π2 = .932), and
handedness (most of the participants were right handed, ranging from 83.3% - 90.0%, Π2 = .839)
between the groups. A further analysis, including data from the I group (not shown in Table 1),
showed no significant difference on any demographic variables.
Main Analyses
Group Differences for MMPI-2 Clinical Scales
To determine whether the 4 study groups presented with different self reports of
psychopathology, an initial multivariate analysis of variance (MANOVA) was performed on the
10 clinical scales from the MMPI-2 by group. Results indicated a significant overall effect
(Wilks’ 7 = .280, F(30, 314.74) = 5.70, p < .01).
Table 2 presents the mean MMPI-2 clinical scale results for the honest and feigning
groups as well as findings from univariate ANOVA tests. One-way ANOVAs were statistically
significant for all 10 clinical scales. Therefore, Tukey’s HSD post-hoc comparisons with ∀ set
at .05 were undertaken. Not surprisingly, results from the follow-up tests indicated that the HON
group was significantly lower than all 3 feigning groups on every clinical scale. The lone
exception was scale 9 where both the HON and NCF groups had statistically significantly lower
mean T-scores than the PSY/NCF group. Additionally, clinical scale T-scores for the PSY &
PSY/NCF groups were generally higher than for the NCF group, with the NCF group typically
intermediate between the HON group and the PSY & PSY/NCF feigning groups. However, only
on scales 4, 6, & 8 were the PSY & PSY/NCF groups statistically significantly higher than the
NCF group.
Table 3 presents the differences between each of the feigning groups and the HON group
25
for all 10 of the MMPI-2 clinical scales using Cohen’s d (Cohen, 1977), which expresses the
differences between groups standardized in pooled standard deviation units. The PSY group had
the largest average d at 1.55 and the PSY/NCF group was a close second with d = 1.48. The
NCF group had a moderate average effect size at .86. Thus overall, all 3 feigning groups
endorsed more psychopathology than the HON group. However, the two groups identified as
feigning psychopathology on the SIRS and/or SIMS endorsed more symptoms than the NCF
group. These results suggest that the group selection criteria resulted in roughly the expected
stratification in level of reported psychopathology.
Group Differences for MMPI-2 Validity Scales
To determine whether the 3 groups of feigning participants presented with different
response sets compared to the HON group and each other, an initial multivariate analysis of
variance (MANOVA) was run on the validity scale data from the MMPI-2 by group. Results
indicated a significant overall effect, Wilks’s 7 = .317, F(24, 313.83) = 6.36, p < .01.
Table 4 presents the mean MMPI-2 validity scale results for the honest and 3 feigning
groups as well as findings from univariate ANOVA tests. Considering the “traditional” fake bad
validity scales first, the one-way ANOVAs were statistically significant for all 4. Thus, Tukey’s
HSD post-hoc comparisons were undertaken with ∀ set at .05. As expected, the HON group had
significantly lower scores on the traditional “fake-bad” validity scales (F, Fb, F(p), and F-K) than
all 3 feigning groups. In addition, the NCF group had significantly lower scores than the
PSY/NCF group on all 4 traditional fake bad scales and significantly lower scores than the PSY
group on the validity scales F and F-K. In a pattern similar to that seen in the clinical scale
results, on the traditional “fake bad” validity scales the HON group scores are the lowest, with
the NCF group generally scoring intermediate between the HON group and the PSY &
PSY/NCF groups. Overall, the two groups identified as feigning psychopathology on the SIRS
and /or SIMS have the highest MMPI-2 validity scale scores.
Regarding the performance of the four groups on the FBS, a one-way ANOVA was also
statistically significant. Based on Tukey’s HSD test, the HON group’s score was significantly
lower on the FBS than all 3 feigning groups, which were not significantly different from one
another. These results suggest that the FBS is sensitive to feigning, but not specific to
neurocognitive feigning. The PSY/NCF group had the highest FBS score, the PSY had the next
highest, and the NCF group had the lowest FBS score of the 3 feigning groups, although the
26
mean FBS scores for all these scores were rather close.
Because Lees-Haley (1992) suggested that certain types of faking good were common in
TBI feigning, it may be also worth reviewing the “fake good” scales in Table 4. The scores for
the four groups on the MMPI-2 “fake good” validity scales were markedly different than the
previous pattern of scores seen on the clinical and “fake bad” validity scales. One-way
ANOVAs were statistically significant for scales K and S, but not for L. Tukey’s HSD post-hoc
comparisons with ∀ set at .05 were undertaken for scales K and S. The HON and NCF groups
obtained significantly higher scores than the PSY and PSY/NCF groups for scales K and S. This
supports Lees-Haley’s suggestion that NCF may involve elements of faking good.
Table 5 presents the differences between each of the feigning groups and the HON group
for all of the MMPI-2 validity scales using Cohen’s d. For the traditional “fake bad” scales, the
PSY/NCF group had the largest mean effect size of 2.59 and the PSY group was next with a
mean effect size of 2.07 (means not shown in Table 5). Although the NCF group has lower
average effect sizes than the PSY and PSY/NCF groups, it did have higher scores on the fakebad validity scales than the HON group with a mean effect size of 1.02. For the traditional
MMPI-2 fake bad scales, the group selection criteria again produced roughly the expected
stratification. Somewhat unexpectedly, the PSY/NCF group had the largest effect size for the
FBS (d = 1.46), the PSY group had then next largest effect size (d = 1.36), and the NCF group
had the lowest effect size (d = 1.23).
Additionally, Table 5 presents the differences between the feigning groups and the HON
group for the K & S scales using Cohen’s d. NCF results for K and S were much closer to the
HON group, with the PSY and PSY/NCF groups most discrepant. These results suggest that the
HON and NCF groups appear to have been engaging in more positive impression management
than the PSY and PSY/NCF groups.
Classification Rates for MMPI-2 Validity Scales
Table 6 presents the classification parameters for the traditional MMPI-2 “fake bad”
validity scales and the FBS for the honest and 3 feigning groups using standard cutting scores
recommended in the literature and reviewed earlier. With the exception of the Fb scale, the
sensitivity values for the traditional validity scales were generally unacceptably low for all 3
types of feigning. The Fb scale had the only acceptable sensitivity values at .75 for both the PSY
& PSY/NCF groups and .67 for the NCF group. In contrast, specificities for all 3 feigning
27
groups using the traditional “fake-bad” validity scales were extremely high with an average
value of 98.5%. Overall, the only traditional MMPI-2 faking index supported here is Fb (median
sensitivity = .71, median specificity = 1.00).
Classification Rates for FBS at Different Cutting Scores
Regarding the FBS, it is also noteworthy that its sensitivity for identifying individuals
feigning neurocognitive and/or psychiatric deficits (using a raw score of 22 as a cutoff) is
generally much higher than for the traditional “fake-bad” scales. The FBS had the highest rate of
classifying individuals in the PSY group with a sensitivity of 100%. The FBS had slightly less
but still very good accuracy at classifying individuals in the NCF and PSY/NCF groups
(sensitivity = 95% and 88%, respectively). In contrast to the high sensitivity values, the
specificity of the FBS was disappointingly low at 50% across all 3 types of feigning, which will
sharply limit its PPP in most settings, as shown in Table 6. Consistent with prior research using
the FBS, this scale does appear to be the most sensitive scale for detecting neurocognitive
feigning (sensitivity for FBS scale = 95%, whereas the largest sensitivity value for this group
with the traditional “fake-bad” scales was 67% for Fb). However, the sensitivity rates for the
FBS are as high if not higher in the PSY and PSY/NCF groups (sensitivity = 100% and 88%,
respectively). Thus, although the FBS is more sensitive than the traditional “fake-bad” scales
for detecting neurocognitive feigning, it also appears to be more sensitive to psychiatric and
psychiatric and neurocognitive feigning as well, suggesting that it is not specifically sensitive to
NCF. In fact, the Fb scale slightly outperformed the FBS scale in overall hit rated for
discriminating NCF from HON participants. It is also worth noting again that the specificity
rates for FBS were quite modest across all types of feigning.
In order to explore the potential utility of the FBS further, Table 7 presents the
classification rates for the FBS at a variety of alternative cutoff scores suggested in the literature.
Generally, the sensitivity and specificity values for the 3 feigning groups are similar for each
cutoff. The cutting score that yielded the highest sensitivity for the 3 feigning groups was an
FBS raw score > 20 (average sensitivity value at this cutting score was 99%). However, the
specificities at this cutting score were unacceptably low at 32%. The highest specificity occurred
using a cutoff of > 30 at 96%. However, at this cutoff score the sensitivity values for the
feigning groups were unacceptably low with an average sensitivity of 34%. The cutoff score of >
24 for men and 26 for women appears to have the most accurate classification rates overall. For
28
men, this cutoff yielded an average sensitivity of 84% for the 3 groups and a specificity of 67%.
The hit rate was .77 for the PSY group, .73 for the PSY/NCF group, and .47 for the NCF group.
For women, the cutoff produced sensitivity values averaging 85% and specificity values of 75%.
The hit rates for the 3 feigning groups were .84 for the PSY/NCF group, .67 for the PSY group,
and .62 for the NCF group.
Copyright © 2003 by Victoria Louise Vagnini. All rights reserved.
29
Chapter Four
Discussion
Using a known-groups design, this study examined the efficacy of the Lees-Haley FBS
and traditional MMPI-2 fake-bad scales for identifying psychiatric, neurocognitive, and
combined psychiatric and neurocognitive feigners. Criterion status (presence or absence of
feigned psychiatric symptoms and/or cognitive deficits) was established through the use of
multiple well-validated procedures, suggesting that the study groups were very likely to have the
target response sets (or be free of them). The major findings of this study can be summed up in 3
main points. First, most of the MMPI-2 traditional validity scales were not particularly effective
in detecting malingered symptoms (psychiatric, neurocognitive, or both) in this setting. Second,
of the traditional validity scales, the Fb scale had the highest overall classification accuracy for
all three types of feigning. Finally, the FBS, which was the primary focus of this study, had the
highest sensitivity of all scales, but its specificity was unacceptably low. Each of these points
will now be addressed in more detail.
How effective are the traditional MMPI-2 “fake bad” scales in detecting feigning?
The F, F(p), and F-K traditional fake-bad scales were not shown to have adequate
classification accuracy for detecting psychiatric, neurocognitive, or combined psychiatric and
neurocognitive feigning in this setting. For the F scale, using an established T-score cutoff of >
107, the highest hit rate was found for the PSY/NCF group at .86, the hit rate for the PSY group
was .74, and the lowest hit rate occurred in the NCF group at .65. Using a cutoff of T > 100, the
F(p) scale resulted in the highest hit rate for the PSY/NCF group at .82, the PSY group had the
next highest hit rate of .70, and the NCF group had the lowest hit rate at .64. For the F-K scale, a
raw score cutoff of > 10 obtained the highest hit rate in the PSY/NCF group (.86), the next
highest was for the PSY group (.76), and the lowest hit rate was obtained in the NCF group (.64).
Generally the hit rates for each of the 3 types of feigning groups were consistent across F, F(p),
and F-K. Specifically, the PSY/NCF group was most accurately identified by these validity
scales, the PSY group the next most accurately identified, and the NCF classification rate was
the lowest. The mediocre performance of the F scale is contrary to previous literature on the
effectiveness of the traditional MMPI-2 validity scales. The F scale has been reported in a recent
meta-analysis of the MMPI-2 to be the most effective indicator of feigning psychological
symptoms (mean d = 4.05) (Rogers et al., 2003). Also, although the PSY/NCF and PSY groups
30
were consistently detected with moderate accuracy by the traditional validity scales, the
difference in detection accuracy for psychiatric vs. neurocognitive feigning by the traditional
MMPI-2 validity scales here does not appear to be a large as previous research has reported
(Larrabee, in press; Greiffenstein et al., 1997; Greiffenstein et al., 1995).
The relative success of the MMPI-2 Fb scale in detecting feigning in this study.
For all 3 feigning groups, the highest relative hit rates were obtained using the Fb scale.
At a T-score cutoff of > 108, Fb had the highest hit rate for the PSY/NCF group at .97, a hit rate
of .87 for the PSY group, and .69 for the NCF group. Fb’s superiority here is inconsistent with
much research on the relative effectiveness of each the traditional MMPI-2 validity scales. In
fact, the latest meta-analysis of MMPI-2 validity scales questioned the effectiveness of the Fb
scale for detecting feigned psychological deficits, stating that bona fide patients often elevate this
scale (M = 71.34) and show considerable variation in Fb scores (Rogers, et al. 2003). In
addition, the NCF group was classified more accurately by the Fb scale than the FBS which is
directly contradictory of past reports (Greiffenstein et al., 1997). However, Lewis et al. (2002)
found Fb to be the best predictor of feigning in forensic evaluees studied in a known-groups
design.
How effective was the FBS in differentiating psychiatric and neurocognitive feigners?
Finally, the FBS unexpectedly appears to significantly differentiate honest responders
from all 3 types of feigners. The HON group had the lowest mean raw FBS score (22.3), while
the 3 feigning groups’ scores were considerably higher, although not significantly different from
each other (average FBS score = 28.8). Thus, although the FBS was designed to detect somatic
& neurocognitive feigning, it did not distinguish between psychiatric and neurocognitive feigners
in the present sample. In addition, the mean raw score for a carefully defined honest group
(passing both psychiatric and neurocognitive malingering measures) is higher than previously
reported means from students and clinical samples. The cause of this discrepancy is unclear. It
may be due to higher symptom reporting in a litigation context, and thus, may not be exclusively
indicative of feigning. Although classification rate data indicated that the FBS has higher
sensitivity to neurocognitive and/or psychiatric feigners than most of the traditional “fake-bad”
validity scales, the specificity of the FBS was much lower (50%) than that for the traditional
fake-bad scales (average specificity = 99%), raising concerns about false positive results.
Overall, there does not appear to be a significant difference in the detection of neurocognitive
31
feigning compared to psychiatric feigning using the FBS. This is contrary to previous literature
which suggested that the FBS scores was particularly effective for identifying individuals
feigning neurocognitive deficits while the traditional validity scales were not (Larrabee, in press;
Greiffenstein et al., 1997; Greiffenstein et al., 1995). However, it is important to note that this is
the first study that examined different types of feigning in an actual forensic setting and
compared their scores to an honest group using strict objective measures to determine feigning
status as well as honest responding. Previous research has generally examined mean FBS
difference between one feigning group compared to an honest group and reported differences
(which the present findings also support).
Study Limitations
Limitations of the present study include the ceiling placed on the validity of all the
MMPI-2 scales by the accuracy of the criterion malingering measures (SIRS, SIMS, LMT,
TOMM, and VSVT) used to establish psychiatric and neurocognitive feigners. Future studies
should use a multimethod approach to define criterion status. In addition, a general limitation of
using a known-groups design is the lack of information regarding whether the individuals
detected as feigning are so identified because they are blatant feigners. In addition, there are no
well-validated measures of somatic malingering to identify this group in the known-groups
design. Perhaps an examination of the existing measures which have been used to identify
Pseudo-PTSD and other forms of exaggerated somatic complaints or emotional distress might
provide a method to examine purely somatic feigners' responses on the FBS.
Conclusions
Overall, the present results do not support the efficacy of the FBS for differentiating
between individuals feigning psychological symptoms and neurocognitive deficits. Future
research on the FBS may include an item analysis of MMPI-2 to identify items that distinguish
the NCF group from the other groups (HON, PSY, & PSY/NCF). The trend of the NCF group to
endorse more items than the HON group on traditional “fake-bad” scales and to endorse more
items than the PSY and PSY/NCF group on the “fake good” scales K and S may be a starting
point for a set of MMPI-2 items that would differentiate the neurocognitive (or somatic
symptom) feigners from honest responders and psychiatric feigners. For instance, a scale with
both types of items (fake good and fake bad) might have individuals feigning neurocognitive
deficits/somatic symptoms endorsing almost two times as many items.
32
This is the first known-groups design used to validate the FBS scale with multiple types
of malingering, and it illustrates the importance of using this type of design in several ways.
First, the validity scale scores for the objectively identified malingering groups were lower than
proposed cutoffs for the F, Fb, F(p), and F-K scales as is also evident in the low sensitivity rates
for the 3 feigning groups. One possible explanation is that some of the cutoff scores were
developed based on simulation study results which typically use college students as participants.
Another issue is that since all of the participants were involved in a forensic evaluation which
could lead to compensation, they may have been coached or instructed by their attorneys not to
“overdo it” on the MMPI-2, thus lowering the sensitivity of the scales. Finally, although this
study has supported the existence of 3 different feigning groups with different response styles;
more research is needed regarding the different types of feigning and optimum detection
strategies. Further research using known-groups designs may be helpful to address these issues.
Copyright © 2003 by Victoria Louise Vagnini. All rights reserved.
33
Appendix
Table 1
Demographic Information
Group
HON
PSY
NCF
PSY/NCF
(N = 54)
(N = 9)
(N = 40)
(N = 17)
M
42.3
38.9
42.9
42.7
SD
12.6
9.1
10.2
7.4
M
12.6
11.7
11.7
11.3
SD
3.2
2.2
2.1
1.4
Male (%)
77.8
66.7
75.0
76.5
White (%)
96.3
100.0
97.5
100.0
Right (%)
83.3
88.9
90.0
88.2
Variable
Age
Education
Note. HON = Honest Responders; PSY Psychiatric Feigners; NCF = Neurocognitive Feigners;
PSY/NCF = Psychiatric and Neurocognitive Feigners. Age and education given in years. Male =
percentage of group of male gender; White = percentage of group of Caucasian ethnic
background; Right = percentage of group reporting right hand dominance; There were no
significant differences between the groups for any demographic variable.
34
Table 2
MMPI-2 Clinical Scale T-score Data
Clinical Scale
T-score
HON
PSY
NCF
PSY/NCF
(N = 54)
(N = 9)
(N = 40)
(N = 17)
P
1
2
3
4
5
6
7
8
9
0
M
74.3
(SD)
abc
a
b
c
90.2
87.1
(13.8)
(8.2)
(9.5)
(7.9)
M
75.2abc
97.3a
91.0b
90.5c
(SD)
(13.5)
(7.0)
(13.0)
(9.7)
M
73.8ab
84.0
89.8a
89.3b
(SD)
(16.4)
(10.0)
(13.7)
(13.2)
abc
ad
54.1
(SD)
(10.4)
(8.5)
(9.6)
(10.5)
M
46.4a
53.3
48.6
54.6a
(SD)
(10.2)
(9.5)
(9.0)
(9.2)
M
54.1abc
95.1ad
68.6bde
84.8ce
(SD)
(14.8)
(12.9)
(16.4)
(18.7)
93.8
a
b
86.0
95.7c
(4.1)
(14.5)
(8.4)
M
69.2
(SD)
(14.6)
M
63.9abc
(SD)
(15.2)
M
63.6
74.5ce
M
abc
73.9
bde
91.4
100.0ad
86.0bde
102.5ce
(6.7)
(14.5)
(10.4)
47.6a
55.2
49.1b
60.5ab
(SD)
(10.9)
(5.8)
(11.3)
(14.2)
M
60.0abc
80.3a
71.5b
80.1c
(SD)
(11.8)
(7.3)
(11.8)
(7.5)
.000
.000
.000
.000
.019
.000
.000
.000
.015
.000
______
Note. P = probability for ANOVA term; a-e means with the same trailing letters differ
significantly from each other using Tukey’s HSD test.; HON = Honest Responders; PSY
Psychiatric Feigners; NCF = Neurocognitive Feigners; PSY/NCF = Psychiatric and
Neurocognitive Feigners; MMPI-2 = Minnesota Multiphasic Personality Inventory-2.
35
Table 3
MMPI-2 Clinical Scale Effect Size Data
Clinical Scale
PSY
NCF
PSY/NCF
(N = 9)
(N = 40)
(N = 17)
Effect Sizes Compared to Honest Group in Cohen’s d
1
1.20
1.05
1.20
2
1.72
1.19
1.20
3
.65
1.04
.99
4
1.95
.94
1.96
5
.34
-.28
-.06
6
2.81
.94
1.94
7
1.80
1.15
1.97
8
2.51
1.48
2.71
9
.73
.14
1.10
0
1.79
.97
1.83
mean d for clinical scales
1.55
.86
1.48
Note. d = Cohen’s d ; PSY = Psychiatric Feigners; NCF = Neurocognitive Feigners; PSY/NCF
= Psychiatric and Neurocognitive Feigners; MMPI-2 = Minnesota Multiphasic Personality
Inventory-2.
36
Table 4
MMPI-2 Validity Scale T-score Data
Validity Scale
HON
PSY
NCF
PSY/NCF
(N = 54)
(N = 9)
(N = 40)
(N = 17)
abc
22.3
29.1
a
28.0
b
29.2
(raw) (SD)
(4.9)
(5.6)
(4.2)
(4.5)
F-K
-7.5abc
T-score
P
FBS
M
M
9.0ad
-.9bde
c
11.7ce
(raw) (SD)
(9.5)
(11.5)
(8.8)
(6.9)
F
M
54.6abc
90.2ad
76.5bde
99.5ce
(SD)
(14.1)
(12.7)
(16.1)
(15.1)
M
58.6abc
108.9a
83.6bd
113.4cd
(SD)
(18.9)
(12.9)
(22.0)
(13.8)
M
49.0abc
61.9a
56.7bd
72.5cd
(SD)
( 9.4)
(14.0)
(13.5)
(17.4)
Fb
F(p)
37
.000
.000
.000
.000
.000
Table 4 continued: MMPI-2 Validity Scale T-score Data
Validity Scale
T-score
HON
PSY
NCF
PSY/NCF
(N = 54)
(N = 9)
(N = 40)
(N = 17)
P
L
K
S
M
59.6
52.9
59.2
56.2
(SD)
(11.1)
(19.9)
(11.2)
(9.9)
M
49.4ab
34.3ac
46.9cd
36.7bd
(SD)
(10.9)
(11.3)
(11.6)
(6.4)
M
49.7ab
32.9ac
47.7cd
36.1bd
(SD)
(10.8)
(16.9)
(12.4)
(8.3)
.358
.000
.000
_______________________________________________________________________
Note. P = probability for ANOVA term; a-e indicate group means that are significantly different
using Tukey’s HSD test; HON = Honest Responders; PSY Psychiatric Feigners; NCF =
Neurocognitive Feigners; PSY/NCF = Psychiatric and Neurocognitive Feigners; MMPI-2 =
Minnesota Multiphasic Personality Inventory-2; F = Infrequency; Fb = Backpage Infrequency;
F(p) = Infrequency Pathology; F-K = Infrequency minus Correction; L = Lie; K = Correction; S
= Superlative.
38
Table 5
MMPI-2 Validity Scale Effect Size Data
Validity Scale
PSY
NCF
PSY/NCF
(N = 9)
(N = 40)
(N = 17)
Effect Sizes Compared to Honest Group in Cohen’s d
FBS
1.36
1.23
1.46
F-K
1.69
.72
2.14
F
2.56
1.46
3.13
Fb
2.76
1.23
3.07
F(p)
1.27
.68
2.00
L
-.53
-.04
-.31
K
-1.38
-.22
-1.27
S
-1.43
-.17
-1.32
______
Note. d = Cohen’s d ; PSY = Psychiatric Feigners; NCF = Neurocognitive Feigners; PSY/NCF
= Psychiatric and Neurocognitive Feigners; MMPI-2 = Minnesota Multiphasic Personality
Inventory-2; F = Infrequency; Fb = Backpage Infrequency; F(p) = Infrequency Pathology; F-K =
Infrequency minus Correction; L = Lie; K = Correction; S = Superlative.
39
Table 6
Classification Rates for MMPI-2 Validity Scales
Scale and
Sensitivity
Specificity
HR
PPP
NPP
BR
cutting score
Group
F
>107
PSY
.11
1.00
.74
1.00
.73
.30
NCF
.05
1.00
.65
1.00
.64
.37
PSY/NCF
.29
1.00
.86
1.00
.86
.19
PSY
.75
1.00
.87
1.00
.84
.30
NCF
.67
1.00
.69
1.00
.67
.37
PSY/NCF
.75
1.00
.97
1.00
.96
.19
PSY
.00
1.00
.70
.00
.70
.30
NCF
.03
1.00
.64
1.00
.63
.37
PSY/NCF
.06
1.00
.82
1.00
.82
.19
Fb
>108
F(p)
>100
40
Table 6 continued: Classification Rates for MMPI-2 Validity Scales
Scale and
Sensitivity
Specificity
HR
PPP
NPP
BR
cutting score
Group
F-K
>10
PSY
.33
.94
.76
.72
.77
.30
(raw)
NCF
.13
.94
.64
.57
.64
.37
PSY/NCF
.53
.94
.86
.69
.89
.19
FBS
>22
PSY
1.00
.50
.65
.46
1.00
.30
(raw)
NCF
.95
.50
.67
.53
.94
.37
PSY/NCF
.88
.50
.57
.30
.95
.19
Note. BR = base rate, percentage of individuals in sample who were malingering; Sensitivity =
percentage of individuals in group falling below cutting score; Specificity = percentage of
individuals in honest group falling above cutting score; HR = hit rate, percentage of both groups
correctly classified by cutting score; PPP = positive predictive power, percentage of those with
positive test sign who were malingering; NPP = negative predictive power, percentage of those
with negative test sign who were not malingering; PSY = Psychiatric Feigners; NCF =
Neurocognitive Feigners; PSY/NCF = Psychiatric and Neurocognitive Feigners; MMPI-2 =
Minnesota Multiphasic Personality Inventory-2; F = Infrequency; Fb = Backpage Infrequency;
F(p) = Infrequency Pathology; F-K = Infrequency minus Correction; L = Lie; K = Correction; S
= Superlative; FBS = Fake Bad Scale.
41
Table 7
Classification Rates for FBS at Different Cutting Scores
FBS (raw)
Sensitivity
Specificity
HR
PPP
NPP
BR
Citation and
cutting score
Group
(Lees-Haley et al., 1991)
FBS > 20
PSY
1.00
.32
.72
.62
.81
.40
NCF
.98
.32
.57
.44
.77
.36
1.00
.32
.77
.47
.92
.21
PSY
1.00
.50
.65
.46
1.00
.30
NCF
.95
.50
.67
.53
.94
.37
PSY/NCF
.88
.50
.57
.30
.95
.19
PSY
.88
.61
.69
.49
.92
.30
NCF
.80
.61
.68
.55
.84
.37
PSY/NCF
.88
.61
.66
.35
.96
.19
PSY/NCF
(Larrabee, in press)
FBS > 22
(Miller and Donders, 2001)
FBS > 23
42
Table 7 continued: Classification Rates for FBS at Different Cutting Scores
FBS (raw)
Sensitivity
Specificity
HR
PPP
NPP
BR
Citation and
cutting score
Group
(Lees-Haley, 1992)
FBS > 24 (all) PSY
.88
.61
.69
.49
.92
.30
NCF
.80
.61
.68
.55
.84
.37
PSY/NCF
.88
.61
.66
.35
.96
.19
FBS > 26 (all) PSY
.63
.74
.71
.51
.82
.30
NCF
.63
.74
.70
.59
.77
.37
PSY/NCF
.77
.74
.75
.41
.93
.19
PSY
.25
.96
.60
.49
.67
.40
NCF
.35
.96
.47
.34
.62
.36
PSY/NCF
.41
.96
.56
.00
.73
.21
(Iverson, et al., 2002)
(Larrabee, in press)
FBS > 30
43
Table 7 continued: Classification Rates for FBS at Different Cutting Scores
FBS (raw)
Sensitivity
Specificity
HR
PPP
NPP
BR
Citation and
cutting score
Group
(Lees-Haley, 1991)
FBS > 24
PSY
.83
.67
.77
.67
.87
.40
(for men)
NCF
.77
.67
.47
.34
.62
.36
PSY/NCF
.92
.67
.73
.42
.90
.21
PSY
1.00
.75
.67
.67
.70
.40
(for women) NCF
.80
.75
.62
.48
.76
.36
.75
.75
.84
.68
.87
.21
(Lees-Haley, 1991)
FBS > 26
PSY/NCF
Note. BR = base rate, percentage of individuals in sample who were malingering; Sensitivity =
percentage of individuals in group falling below cutting score; Specificity = percentage of
individuals in honest group falling above cutting score; HR = hit rate, percentage of both groups
correctly classified by cutting score; PPP = positive predictive power, percentage of those with
positive test sign who were malingering; NPP = negative predictive power, percentage of those
with negative test sign who were not malingering; PSY = Psychiatric Feigners; NCF =
Neurocognitive Feigners; PSY/NCF = Psychiatric and Neurocognitive Feigners; MMPI-2 =
Minnesota Multiphasic Personality Inventory-2; FBS = Lees-Haley Fake Bad Scale.
44
References
American Psychiatric Association. (2000). Diagnostic and statistical manual of mental
disorders (4th ed., Text Revision). Washington, DC: Author.
Arbisi, P. A., & Ben-Porath, Y. S. (1995). An MMPI-2 infrequent response scale for use
with psychopathological populations: The Infrequency-Psychopathology scale, (F(p)).
Psychological Assessment, 7, 424-431.
Berry, D. T. R., Baer, R. A., Rinaldo, J. C., & Wetter, M. W. (2002). Assessment of
Malingering. To appear in Clinical personality assessment: Practical approaches. J.
Butcher (Ed.). New York: Oxford University Press.
Butcher, J. N., Arbisi, P. A., Atlis, M. M., & McNulty, J. L. (2003). The construct
validity of the Lees-Haley Fake Bad Scale: Does this scale measure somatic
malingering and feigned emotional distress? Archives of Clinical Neuropsychology, 18,
473-485.
Butcher, J. N., Dahlstrom, W. G., Graham, J. G., Tellegen, A., & Kaemmer, B. (1989).
MMPI-2: Manual for administration and scoring. Minneapolis: University of Minnesota
Press.
Cohen, J. (1997). Statistical power analysis for the behavioral sciences. New York:
Academic Press.
Cramer, K. M. (1995). The effects of description clarity and disorder type on MMPI-2
fake-bad validity indices. Journal of Clinical Psychology, 51(6), 831-840.
Doss, R. C., Chelune, G. J., & Naugle, R. I. (1999). Victoria Symptom Validity Test:
Compensation-seeking vs. non-compensation-seeking patients in a general
clinical setting. Journal of Forensic Neuropsychology, 1(4), 5-20.
Greene, R. L. (2000). The MMPI-2: An interpretive manual (2nd ed.). Boston: Allyn &
Bacon.
Greiffenstein, M. F., Baker, W. J., Gola, T., Donders, J., & Miller, L. (2002). The Fake
Bad Scale in atypical and severe closed head injury litigants. Journal of Clinical
Psychology, 58(12), 1591-1600.
Greiffenstein, M. F., Gola, T., & Baker, W. J. (1995). MMPI-2 validity scales versus
domain specific measures in detection of factitious traumatic brain injury. The
Clinical Neuropsychologist, 9(3), 230-240.
45
Greve, K. W., Bianchini, K. J., Mathias, C. W., Houston, R. J., & Crouch, J. A. (2003).
Detecting malingered performance on the Wechsler Adult Intelligence Scale:
Validation of Mittenberg’s approach in traumatic brain injury. Archives of Clinical
Neuropsychology, 18, 245-260.
Grote, C. L., Kooker, E. K., Garron, D. C., Nyenhuis, D. L., Smith, C. A., & Mattingly,
M. L. (2000). Performance of compensation seeking and non-compensation
seeking samples on the Victoria Symptom Validity Test: Cross-validation and
extension of a standardization study. Journal of Clinical and Experimental
Neuropsychology, 22(6), 709-719.
Hathaway, S. R., & McKinley, J. C. (1943). A multiphasic personality schedule
(Minnesota): I. Construction of the schedule. Journal of Personality Assessment, 10, 249254.
Heaton, R. K., Smith, H. H., Lehman, R. A., & Vogt, A. T. (1978). Prospects for faking
believable deficits on neuropsychological testing. Journal of Consulting and
Clinical Psychology, 46, 892-900.
Inman, T. H., & Berry, D. T. R. (2002). Cross-validation of indicators of malingering: A
comparison of nine neuropsychological tests, four tests of malingering, and behavioral
observations. Archives of Clinical Neuropsychology, 17, 1-23.
Inman, T. H., Vickery, C. D., Berry, D. T. R., Lamb, D. G., Edwards, C. L., & Smith, G.
T. (1998). Development and initial validation of a new procedure for evaluating adequacy
of effort given during neuropsychological testing: The Letter Memory Test.
Psychological Assessment, 10, 128-139.
Iverson, G. L., Henrichs, T. F., Barton, E. A., & Allen, S. (2002). Specificity of the
MMPI-2 Fake Bad Scale as a marker for personal injury malingering. Psychological
Reports, 90, 131-136.
Larrabee, G. J. (1998). Somatic malingering on the MMPI and the MMPI-2 in personal
injury litigants. The Clinical Neuropsychologist, 12(2), 179-188.
Larrabee, G. J. (in press). Detection of symptom exaggeration with the MMPI-2 in
litigants with malingered neurocognitive deficit. The Clinical Neuropsychologist.
Larrabee, G. J. (in press). Exaggerated MMPI-2 symptom report in personal injury
litigants with malingered neurocognitive deficit. Archives of Clinical
46
Neuropsychology.
Lees-Haley, P. R. (1992). Efficacy of MMPI-2 validity scales and MCMI-II modifier
scales for detecting spurious PTSD claims: F, F-K, fake bad scale, ego strength, subtleobvious subscales, DIS, and DEB. Journal of Clinical Psychology, 48(5), 681-688.
Lees-Haley, P. R. (1997). MMPI-2 base rates for 492 personal injury plaintiffs: Implications and
challenges for forensic assessment. Journal of Clinical Psychology, 53(7), 745-755.
Lees-Haley, P. R., English, L. T., & Glenn, W. J. (1991). A fake bad scale on the
MMPI-2 for personal injury claimants. Psychological Reports, 68, 203-210.
Lewis, J. L., Simcox, A. M., & Berry, D. T. R. (2002). Screening for feigned psychiatric
symptoms in a forensic sample by using the MMPI-2 and the Structured Inventory of
Malingered Symptomatology. Psychological Assessment, 14(2), 170-176.
Mittenberg, W., Patton, C., Canyock, E. M., & Condit, D. C. (2002). Base rates of malingering
and symptom exaggeration. Journal of Clinical and Experimental Neuropsychology,
24(8), 1094-1102.
Orey, S. A., Cragar, D. E., & Berry, D. T. R. (2000). The effects of two motivational
manipulations on the neuropsychological performance of mildly head-injured college
students. Archives of Clinical Neuropsychology, 15(4), 33-348.
Pankrantz, L., & Binder, L. M. (1997). Malingering on intellectual and
neuropsychological measures. In Rogers (Ed.) Clinical assessment of
malingering and deception (2nd ed.). (pp. 223-236). New York, NY: The
Guilford Press.
Rees, L. M., Tombaugh, T. N., Gansler, D. A., & Moczynski, N. P. (1998). Five
validation experiments of the Test of Memory Malingering (TOMM).
Psychological Assessment 10 (1), 10-20.
Rogers, R. (1988). Clinical assessment of malingering and deception. New York:
Guilford Press.
Rogers, R. (1997). Clinical assessment of malingering and deception (2nd ed.). New
York: Guilford Press.
Rogers, R., Bagby, R. M., & Dickens, S. E. (1992). SIRS Structured Interview of
Reported Symptoms: A professional manual. Odessa, FL: Psychological Assessment
Resources.
47
Rogers, R., Gillis, J. R., & Bagby, R. M. (1990). Cross-validation of the SIRS with a
correctional sample. Behavioral Sciences and the Law, 8, 85-92.
Rogers, R., Gillis, J. R., Dickens, S. E., & Bagby, R. M. (1991). Standardized
assessment of malingering: Validation of the SIRS. Psychological Assessment: A
Journal of Clinical and Consulting Psychology,3, 89-62.
Rogers, R., Harrell, E. H., & Liff, C. D. (1993). Feigning neuropsychological
impairment. Clinical Psychology Review, 13 (3), 255-274.
Rogers, R., Hinds, J. D., & Sewell, K. W. (1996). Feigning psychopathology among
adolescent offenders: Validation of the SIRS, MMPI-A, and SIMS. Journal of
Personality Assessment, 1, 227-237.
Rogers, R., Kropp, R., Bagby, R. M., & Dickens, S. E. (1993). Faking specific disorders:
A study of the Structured Interview of Reported Symptoms (SIRS). Journal of Clinical
Psychology, 48, 643-648.
Rogers, R. Sewell, K. W., & Salekin, R. T. (1994). A meta-analysis of malingering on the
MMPI-2. Assessment, 1(3), 227-237.
Rogers, R. Sewell, K. W., & Ustad, K. L. (1995). Feigning among chronic outpatients
on the MMPI-2: A systematic examination of Fake-Bad indicators. Assessment,
2, 81-89.
Rogers, R., Sewell, K. W., Martin, M. A., & Vitacco, M. J. (2003). Detection of feigned mental
disorders: A meta-analysis of the MMPI-2 and malingering. Assessment 10(2), 160-177.
Ross, S. R., Millis, S. R., Kruckowski, R. A., Putnam, S. HON., & Adams, K. M. (in
press). Detecting incomplete effort on the MMPI-2: An examination of the Fake-Bad
Scale in mild head injury. Journal of Clinical and Experimental Neuropsychology.
Slick, D. J., Hopp, G., Strauss, E., & Spellacy, F. J. (1996). Victoria Symptom Validity
Test: Efficiency for detecting feigned memory impairment and relationship to
neuropsychological tests and MMPI-2 validity scales. Journal of Clinical and
Experimental Neuropsychology, 18(6), 911-922.
Slick, D. J., Sherman, E. M. S., & Iverson, G. L. (1999). Diagnostic criteria for
malingered neurocognitive dysfunction: Proposed standards for clinical practice and
research. The Clinical Neuropsychologist, 13(4), 545-561.
Smith, G. P., & Burger, G. K. (1997). Detection of malingering : Validation of the
48
Structured Inventory of Malingered Psychopathology (SIMS). Journal of the Academy
of Psychiatry and the Law, 25, 180-183.
Strong, D. R., Greene, R. L., & Schinka, J. A. (2002). A taxometric analysis of MMPI-2
infrequency [F and F(p)]. Psychological Assessment, 12, 166-173.
Tombaugh, T. N. (1996). The Test of Memory Malingering. New York: MHS.
Tombaugh, T. N. (1997). The Test of Memory Malingering (TOMM): Normative data
from cognitively intact and cognitively impaired individuals. Psychological
Assessment, 9, 260-268.
Tsushima, W. T., & Tsushima, V. G. (2001). Comparison of the Fake Bad Scale and
other MMPI-2 validity scales with personal injury litigants. Assessment, 8, 205212.
Vickery, C. D., Berry, D. T. R., Dearth, C. S., Vagnini, V. L., Baser, R. E., Cragar, D. E.,
& Orey, S. A. (in press). Head injury and the ability to feign neuropsychological
deficits. Archives of Clinical Neuropsychology, 599, 1-12.
Vickery, C. D., Berry, D. T. R., Inman, T. H., Harris, M. J., Orey, S. A. (2001). Detection
of inadequate effort on neuropsychological testing: A meta-analytic review of
selected procedures. Archives of Clinical Neuropsychology, 16, 45-73.
49
VITA
AUGUST 2003
PERSONAL INFORMATION
Name:
Date of Birth:
Place of Birth:
Victoria Louise Vagnini
March 12, 1976
Hartford, CT
EDUCATION
University of Kentucky. Lexington, Kentucky
M. S.
Major: Counseling Psychology.
Graduation Date: May 2000
Centre College. Danville, Kentucky
B. S.
Major: Psychology.
Graduation Date: May 1998
PROFESSIONAL EXPERIENCE
August 2000 Present
University of Kentucky
Department of Psychology
July 2003 Present
Psychology Student Intern/Psychometrician
Department of Neurology-UK
June 2003August 2003
Instructor
(PSY 312, Brain and Behavior)
August 2002May 2003
Teaching Assistant
(PSY 215, Experimental Psychology lab)
September 2002- Group leader, Dialectical Behavior Therapy group
August 2003
June 2002August 2002
Instructor
(PSY 312, Brain and Behavior)
July 2001June 2002
Psychology Student Affiliate, Eastern State Hospital
50
June 2000June 2002
Psychology Intern, Cardinal Hill Rehabilitation Hospital
August 2001Present
Therapist, Jesse G. Harris Psychological Services Center
August 2000May 2001
Teaching Assistant
(PSY 215, Experimental Psychology)
August 2000July2001
Psychometrician
(C. Christopher Allen, Ph.D., PSC-private practice)
May 1999May 2000
Research Assistant
Department of Psychiatry-UK
July 1999February 2000
Research Assistant
Department of Behavioral Science and Internal Medicine-UK
SELECTED PAPERS AND PRESENTATIONS
Vickery, C.D., Berry, D.T.R., Dearth, C.M.S., Vagnini, V.L., Baser, R. E., Crager, D.E., & Orey,
S.A. (2002). Head Injury and the Ability to Feign Neuropsychological Deficits.
Archives of Clinical Neuropsychology, 599, 1-12.
Dearth, C. M. S., Vickery, C.D., Berry, D.T.R., Orey, S.A., Vagnini, V.L., Crager, D.E.,
& Baser, R. (2001). Detection of Malingered Head Injury with the MMPI-2. Poster
presented at the 29th Annual Meeting of the International Neuropsychological Society,
Illinois.
Vagnini, T. L., & Thompson, J. (1998, March). Mood shifts during the menstrual
cycle. Poster presented at the meeting of the Southeastern Psychological Association,
Alabama.
Vagnini, T. L., & Thompson, J. (1997, November). Mood shifts during the menstrual
cycle. Poster presented at the meeting of the Kentucky Academy of Science, Kentucky.
51