PSYC 162 Week 2 (lec. 4-8)

Lie Detection

People vs. Instruments

Can people tell when others are lying?
- The average person is not very good at detecting lies
- Certain professional groups are noticeably better (e.g. Secret Service agents)
Can instruments detect lies?
- Polygraph (i.e., lie-detector test)
- fMRI
People, in general, are poor lie detectors
- People fare only slightly better than a coin toss at detecting lied
- A meta-analysis of 253 studies showing that overall accuracy was approx. only 53%
- Dr. Paul Ekman, one of the foremost experts on deception detection
  - Emotions show on our faces

Paul Ekman’s Approach to Studying Lie Detection

His theory is universal, all facial expressions are the same regardless of culture
Assumption: any ability to detect a lie requires that the lie arouse emotion (i.e., the liar has to be lying about something important)
“Low stakes” lies (e.g., lying about visiting a museum or about whether or not you like the taste of a drink) do not elicit emotion, so the lies would be hard to detect
- Nothing to detect
- It doesn’t matter
“High stakes” lies arouse emotions that create facial expressions that “leak through” attempts to mask them (potentially detectable)
- Lies that are important
- Hard to conceal your emotions, they’ll leak through and show on your face, those small movements are
  - Micro-expressions
Do facial expressions differ in any measurable way when someone is lying compared to when they are telling the truth?
Many studies using low stakes lies say “no”
Using high stakes lies, Ekman, Friesen & O’Sullivan (1988) found measurable differences using the Facial Action Coding System
- Subtle facial movements associated with being happy occurred more often in the truthful condition
- Subtle facial movements associated with negative emotions occurred more often in the lying condition
- When you’re lying about something important, your facial expressions are different than if you’re telling the truth about something important
A general approach to studying whether or not people can detect when someone is lying
Ten confederates either lie or tell the truth in each condition (the lies are important, not trivial, and the liars are motivated to deceive) - they are high stakes lies
Five confederates are lying and five are telling the truth
Test-takers watch each person on tape and indicate whether or not the person on the tape is lying
- Guilty trials - 5 guilty of lying
- Innocent trials - 5 telling the truth
- They could’ve interrogated people, could’ve used the false evidence ploy
Three categories of high-stakes lies in this study
- Emotion Deception Judgment Task
  - A confederate talks about watching a nature video while watching an upsetting video (lying) - 5 lying
  - A confederate talks about watching a nature video while watching a nature video (telling the truth) - 5 telling truth
- Opinion Deception Judgment Task
  - Express an opinion (e.g., about the death penalty) that goes against your strong beliefs (lying)
  - Express an opinion that agrees with your strong beliefs (telling the truth)
- Crime Deception Judgment Task
  - Deny stealing $50 (lying)
  - Deny stealing $50 (telling the truth)
Secret service agents were better at detecting lies

Facial Action Coding System

Facial motion movements associated with emotions
Ex: face of anger
- Eyebrows pulled down and together
- Eyes opened wide, staring hard
- Margins of lips rolled inward, pressed together

Wizards of Lie Detection

O’Sullivan & Ekman (2004)
They found 50 individuals (out 20,000 screened) who performed very well on all three high-stakes lie detection tests (opinions, a mock crime, and emotion)
A concern is that, when so many people are tested, some will perform well merely due to random chance, not because they are good at detecting lies (Bond & Uysal, 2007)
Whether or not that is true, some “wizards” made the most of their special status
Some of these “wizards” make a career out of it
- Teaching people how to find liars

Ekman in Later Years

Trained law enforcement and TSA in lie detection
Stopped publishing scientific research on lie detection (to keep the information from hostile nations and terrorists)
In the absence of new science, skeptics doubt his body of work
His work has been the most influential and left a lasting legacy
Who killed my relative? Police officers’ ability to detect real-life high-stake lies
- The police officers in the study correctly detected lying 50% of the time (i.e. hit rate = .50)
- At random chance if the false alarm rate is 50%
- False alarm rate is important!!
Follow up study they actually have the false alarm rate but they focus more on the accuracy of liars and those who tell the truth

Wright Whelan, Wagstaff & Wheatcroft (2015)

Participants:
- 70 police officers & 37 undergraduates
Stimuli:
- Videos of people making public appeals for help with missing relatives (high stakes)
- 18 were honest, 18 deceptive
Participants decided if they were telling the truth or lying
Results:
Anytime hit rate = false alarm rate it = chance, there’s no information
This study is consistent with Paul’s idea that they have to be high-stakes lies to detect liars
Undergrads had a lower hit rate and a higher false alarm rate than police officers
If you were asked which is better police have hit rate of .70 and false alarm rate of .26, or the undergrads having hit rate of .60 and false alarm rate of .20
- You shouldn’t be able to tell easily which is better
- You just see one false and hit alarm rate for the police officers and one for the undergrads
- You want to plot them on the ROC and then you’ll see which one is higher and which one is better
Hidden false alarm and hit rates/ Confidence scale
- What’s hidden is that there’s an arbitrary decision that has been made in order to call things hits and false alarms
- Okay to thing of the underlying thing as confidence
- You’re picking an arbitrary point on a confidence scale
- Ex: when you’re watching a video, you automatically make the decision if they’re lying or telling the truth
  - You have some degree of confidence
- The farther you are from an arbitrary threshold, the more confident you are
- Matters because there is nothing special about that point, if you move it, it changes both your false alarm and hit rates
- Continous scale - can be 1 to whatever

Declaring more confidence to call them a liar

Those who previously ended in 5.1, 5.2, etc. were previously called liars, but now that the threshold changed, you call them truth-tellers because you changed your standard to require more confidence to call them liars
You’ll have fewer hits and fewer false alarms, both rates will go down

Calling more people liars
False alarm and hit rates both go up

ROC Analysis

Plot hit and false alarm rate
The black points more down to the left is conservative
The black points more up and to the right are more liberal
To get these points, you split the people into four groups, but have the same scale as before, some being more liberal and others more conservative
- Four groups and you tell them how conservative or liberal they should be about calling someone a liar
You give the same instructions, and get 4 ROC points again, but before you do you run them through a Paul Ekman training to get better at detecting liars
If the training is effective you’ll see a higher ROC
Higher ROC is objectively better

Actively Prompting Information to Detect Lies

In most deception-deception experiments, all judges have to go on is “sender“ demeanor, which makes sense because theories posit that deception is detectable based on demeanor (e.g. microexpressions of emotion)
However, police do not merely passively watch demeanor, but instead actively and strategically question a suspect (e.g. they compare what the person says to what is known about the crime)
In short, what makes an expert adept at lie detection is not the passive observation of sender demeanor, but the active prompting of diagnostic information (i.e. interrogation)
Tested by using a confederate to induce some participants to cheat on a test involving answering hard trivia questions
- You induce some people to cheat - guilty group
- Don’t induce others to cheat - innocent group
- Try to figure out when they deny cheating if they are lying or if they are telling the truth
The interviewers were five federal agents with substantial polygraph and interrogation experience
The interviews (3-18 minutes) were completely unscripted and varied substantially in style and substance from expert to expert
The hit rate was 39/40 = .98 and the false alarm rate was 1/49 = .02

Criticism of Levine et al. (2014)

The trivia questions were almost impossible for anyone to answer
- Therefore, if someone answered several of them correctly, it was already easy to tell that they were probably cheating
- Too easy to tell the liars
The fact that they were cheating would quickly become obvious if you asked them what else they knew about the trivia question (nothing)
- They only knew the answer to the question, asked them something similar and they wouldn’t get it
Study didn’t accomplish getting on a higher ROC

A Cognitive Approach to Lie Detection

Vrij, Fisher, & Blank (2017)
Method to pass lie detection in a police interrogation
- Impose cognitive load (because lying is more mentally demanding than truth-telling), such as asking interviewee to tell their story in reverse order
  - Harder to tell a lie than to tell the truth
  - Make interviewee engage in more mental labor
- Encourage interviewee to say more (because liars tend to “weave a tangled web” the more they talk)
  - Harder to talk for a liar
- Ask unexpected questions (because liars prepare themselves for anticipated interviews), such as general opening questions (expected) versus spatial questions (unexpected)
  - They think it’ll start with the beginning of the story
Results
- Better overall accuracy for the cognitive vs. the standard approach
- Standard approach: hit rate = .47, false alarm rate = .43
  - Almost equal means nothing, chance
- Cognitive approach: hit rate = .67, false alarm rate = .33
  - Don’t need ROC analysis because hit rate is a good amount larger

Summary

Passive lie-detection is really hard
High-stakes lies are necessary for it to be possible (so that emotion leaks through in the form of microexpressions)
Using videos of high-stakes expressions of true or false claims about missing relatives, people are not bad at detecting lies (and police are a little better than that)
Lie-detection may be much better for an interviewer who is able to actively question the individual (though more work is needed to verify these controversial new findings)

The Polygraph

A polygraph (poly = many, graph = write) is an instrument that simultaneously records changes in physiological processes such as heart rate, blood pressure, respiration and electrical resistance of the skin (galvanic skin response)
The underlying theory of the polygraph is that when people lie they also get measurably nervous about lying. Heart rate increases, blood pressure goes up, breathing rhythms change, perspiration increases, etc

History of Polygraph

William Moulton Marston (1893-1947)
- Student of Hugo Münsterberg at Harvard
- Discovered correlation between blood pressure and arousal during lying
- Invented Wonder Woman
John Augustus Larson
- Rookie police officer in the Berkeley, CA, police department
- Ph.D. in physiology from UC
  - Read Marston’s article “Physiological Possibilities of the Deception Test”
- Improved Marston’s test through continuous recording of blood pressure
First real-world application
- Berkeley sorority house - 1921
- Money and an expensive ring had been stolen from rooms
Larson asked suspects questions while hooked up
- “Are you interested in math?”
- “Did you steal the money?”
- “The test shows you stole it. Did you spend it?”
Suspect = a woman named Helen Graham
- Her record showed a large change in blood pressure, skipped heartbeats and a halt in her breathing
- She was arrested but later convinced Larson she was innocent

How to Question Suspect Taking Polygraph Test

Relevant/Irrelevant Test
- Ask some irrelevant questions (“Are you interested in math?”)
- Ask questions relevant to the crime (“Did you steal the money?”)
- If arousal(relevant ) > arousal(irrelevant ) = guilty of lying
Comparison Question (control question) Test
- Add a probable-lie comparison question (“Have you ever stolen anything?”)
- Arousal(relevant ) > Arousal(comparison) = guilty of lying
Concealed Information (i.e., Guilty Knowledge) Test
- Present one true and several false details of an incident that has not been publicized, so the true answer would be known only to the perpetrator
- Innocent suspects will not be more aroused in response to the true item, but guilty suspects will be
- Arousal(true detail) > Arousal(false details) = guilty of lying
- Some people believe this approach is the best

Comparison Question Test

Imagine that the examiner rates the difference on each physiological measure using a 0 → 3 scale (summed range 0 → 9)
Further imagine that the examiner concludes that a subject is lying whenever the summed score across three measures is 5 or more
This test-taker gets a score of 0 + 3 + 2 = 5 and is judged to be guilty of lying
Is it a hit or a false alarm?

Example of Guilty Knowledge Test

Carmel et al. (2003)
Participants were instructed to enter the office of a named TA in Psychology and steal a CD with a red case containing an exam
Told to convince the examiner that they are innocent
Hooked up to polygraph and given “guilty knowledge“ questions
- Color of the CD case (blue, green, red, purple) +1
- Name of TA (name 1, name 2, name 3, name 4) +3
- Subject of the exam (Anthro, Psych, Bio, Physics) +2
Scoring
- For each question, if the “guilty knowledge” item elicits the largest response, +3 points; if 2nd largest response, +2 points; if 3rd largest response, +1 point, otherwise 0
- Values are then summed up across all questions to produce a single score between 0 and 9 (this test-taker gets a score of 1 + 3 + 2 =6)
- Decision rule: if score > 5 → “guilty of lying“, so this test-taker is judged guilty
- Is it a hit or a false alarm?
Remember hit or false alarm rate involves an arbitrary criterion that is hidden from you

To find out if guilty knowledge or comparison question test, do the process for the ROC analysis for both
- Look at the other false and hit rates
- Split into conservative, liberal, and neutral groups
- Plot the points
- Higher ROC is better
- More conservative → hit and false alarm rates go down
- More liberal → hit and false alarm rates go up

Polygraph Accuracy

Many believe it is worst for the relevant-irrelevant method and best for the guilty knowledge method
The data suggest that all methods are about equally effective

Polygraph Accuracy Might be Lower in Practice

Analogy: medical tests seldom perform as well in general field use as their performance in initial evaluations seem to promise
- Initial evaluations are typically conducted on people whose disease status is uncomplicated by other conditions that might interfere with test accuracy
- Samples are drawn, tests conducted, and results analyzed under optimal conditions
Similar issues could pose problems for regular use of the polygraph
- In experimental tests, polygraph instrumentation and maintenance and examiner training and proficiency are typically well above field situations
- Testing is undertaken soon after the crime, so limited forgetting or “emotional distracting“ occurs
- Countermeasures might be more of a problem (e.g. biting one’s tongue)
- If polygraph examiners have preexisting knowledge of evidence against the suspect (e.g. fingerprint evidence), it can bias the interpretation of their lie-detection test

The Green River Killer

On July 15, 1982, the body of Wendy Lee Coffield was found in the Green River in a rural area of the City of Kent in King County, Washington. Another body was found in the river within a month. While surveying the riverbank near two bodies on August 15th, detectives discovered another body.
Thus began one of the longest and largest serial murder investigations in United States history. Eventually, the deaths of at least 48 women would be linked the Green River killer.
In the early months of 1987, investigators had a new suspect in relation to the Green River murders. Previously known to police, the newest suspect had been picked up for attempting to solicit an undercover police officer posing as a prostitute in May 1984. However, the man was released after he successfully passed a lie detector test.
They might’ve been to conservative with the ROC and that’s why they think he wasn’t guilty/he passed the test
- That’s why it’s so important going on a higher ROC
They ended up catching him because of DNA

A World of Difference

Polygraph testing applied to a suspect in a criminal investigation
- vs.
  - Polygraph testing used for screening purposes (e.g., screening applicants for a job in law enforcement)

Lie Detection in Connection with a Single Incident vs. Mass Screening

Polygraph testing has been used as a general screening device (e.g., for CIA applicants)
A much more problematic use of the polygraph
No specific incident, so only general questions can be asked (“have you ever had any unauthorized foreign contacts?” “Have you ever smoked marijuana?”)
Base rate of guilt (i.e., percent of examinees who are lying) is very low → more false alarms than hits

Base Rate Problem

this is what a good test looks like because the hit rate is way higher than the false alarm rate
Even though it’s a good test but if you test a lot of innocent people, people are still going to fail

Using a polygraph test in the "probable cause" scenario
- The police already have reason to suspect a particular person of a crime (e.g., Gary Ridgeway)
- For every 200 suspects given a polygraph, ~100 will be innocent and ~100 will be guilty
Using a polygraph test to search for a suspect (screening)
- Almost everyone tested will be innocent
- For every 200 employees given a polygraph, ~198 will be innocent and ~2 will be guilty
What’s the problem with screening?
- Far more of the examinees are innocent than guilty (i.e., base rate of innocent far exceeds base rate of guilty)
- Therefore, even a good test (high hit rate, low false alarm rate) will wrongly identify far more innocent people than guilty people

Legal Status of the Polygraph

Polygraph results are sometimes admissible in court
1993 Supreme Court decision (Daubert v. Merrell Dow Pharmaceuticals)
- Admissibility of scientific evidence determined case-by-case based on evidentiary hearings
- Judge makes the call (after considering if there is a scientific consensus, will the jury understand the results, etc.)
1998 Justice Clarence Thomas expressed concerns about polygraph testing
- Lack of consensus about scientific validity and reliability
- Concern that polygraph evidence usurps jury role

Polygraph as Coercion

Polygraphs are not used only to detect lies
Used to induce confessions
Presented as opportunity to prove innocence
“The results of a lie detector are not admissible in court, but if you confess during the course of interrogation, that’s admissible. The lie detector is essentially used in practice as a way to get people to confess to crimes” (Ken Alder, professor of history at Northwestern University and author of The Lie Detectors: The History of an American Obsession).
Even the lie detector’s harshest critics concede it can be a useful interrogation tool

Jurors and the Polygraph

Jurors generally find results persuasive
Results can change outcome of a trial
Experts are more skeptical than general public

Lies in the Brain

Functional magnetic resonance imaging (fMRI) detects “where” lying occurs in the brain
Does not measure physiological arousal, so it might even be able to detect low-stakes lies

Some areas of the brain began to light up when you told more low stake lies
fMRI shows you group differences but nothing about the individual that’s why it doesn’t help you