Thursday, August 25, 2011

Enron Emails: Lie Detection Software for E-mails

Below is an article from Information Week regarding lie detection software for emails that is under development.  Note the principles in the article. Many readers here would not need such software.  The great advantage we have in Statement Analysis over all other forms of lie detection is that we find additonal information contained with a lie that is not picked up by other means.


Regarding the emails at Enron, we follow the same principles of analysis, being flexible for OMG like language.  As we adjust, we do so in context.  Emails that show deception are discernible just as our written or oral statements.

If you haven't visited Mark McClish' website recently, you're in for a treat.  He has revamped the website and it looks great.  Please, espeically, take a look at his research. It is impressive

New readers should order "I Know You Are Lying" from his web site or from Amazon as it is well written, easy to follow, and fascinating. 

I would like to conduct similar research and will need volunteers in order to get a wide sample.  I will have more on this later in the Fall.  

Lie-Detection Software Could Scan E-Mail, Text Messages

Software could be trained to detect patterns of lies in text for law enforcement, dates, and spouses.

Researchers at Cornell University want to create lie-detection software for e-mail and text messages.
In the past three years, professors and students from several universities have coordinated with counterparts at Cornell to conduct experiments, analyze language, and compile a list of indicators of written deception. They have drawn from 40 years of research in linguistics and lies, including recent work in the context of computer media and reviews of Enron e-mails.
The National Science Foundation awarded the researchers $680,000 last summer to develop software that would sift through text and flag messages in which in the authors had lied.
"There's still an open question of whether that is actually possible or not," said Jeff Hancock, a communications professor and information science faculty member at Cornell. "Our research suggests that it is."
Passive voice, verb tense changes, and even noun or verb selection can suggest a person is lying, he said. Hancock said another indicator of written deception is the decreased use of the word "I," which is most likely an attempt to create distance.
"One of the reasons we think that works as an indicator is that pronoun use is subconscious," he said.
In interactive speech, like instant messaging and some dialogues, liars go into a "persuasive mode" and increase the length of their message by 30% to describe and explain situations, he said. Other factors -- such as individual beliefs about behavior, whether someone is accused of something or interacting with an accuser -- can complicate the process.
"In person, normally people say less because they don't want to be caught," Hancock said during an interview Thursday. "In a monologue, you may say less because you don't want to talk yourself into a trap. It's certainly not a simple thing," Hancock said. "It's a very complex problem. From a theoretical point of view, it's really fascinating."
Researchers in Canada studied thousands of e-mails from Enron for cues, like the use of the first person singular ("I") and negative emotion, and they became interested in about 10 e-mails that appeared to be unique.
"Sure enough, they stood out as problematic and of interest to the legal team," Hancock said.
Over the next three years, the researchers will study linguistic patterns in different situations and train software systems to evaluate communications based on content and context. They will try to train a software system to adapt to subtle differences in communication between friends and co-workers, expressions of fact and opinion, and even specific individuals' lies and truths.
"There are contextual parameters," Hancock said. "We can't assume that the kinds of predictors between two friends would be the same as a person writing in AP [Associated Press] style to a large audience."

Researchers have the advantage of tremendous computational power and decades of previous research. Hancock said researchers will publish the findings and the body of work they're based on. The results could be useful for law enforcement, in corporate settings, for online dating, and possibly among suspicious spouses. Hancock cautioned, however, that lie-detection software is likely to provide probabilities -- not definitive answers -- and be most effective for those with vast amounts of data.

Unlike polygraphs or brain scanning techniques demonstrated recently in Europe, which look for physiological cues, the software would focus on the "building blocks" of lies.
"Humans lie by using words," Hancock said.

No comments:

Post a Comment