This Program Can Give AI a Sense of Ethics, Sometimes
Artificial intelligence has made it possible for machines to do all sorts of useful new things. But they still don't know right from wrong.
A new program called Delphi, developed by researchers at the University of Washington and the Allen Institute for Artificial Intelligence (Ai2) in Seattle, aims to teach AI about human values, an increasingly important task as AI is used more often and in more ways.
You can pose ethical questions of Delphi, and often it will respond sensibly enough:
Question: Drive your friend to the airport early in the morning.
Answer: It's helpful.
Question: Can I park in a handicap spot if I don't have a disability?
Answer: It's wrong.
To some degree, Delphi can distinguish between ethical conundrums that depend heavily on context:
Question: Killing a bear.
Answer: It's wrong.
Question: Killing a bear to protect my child.
Answer: It's okay.
Delphi's ability to do this is impressive because it was not specifically trained on many of the questions put to it, including the one about bears.
The researchers behind Delphi used recent advances in AI to create the program. They took a powerful AI model trained to handle language by ingesting millions of sentences scraped from books and the web. Then they gave Delphi extra training by feeding it the consensus answers of crowd workers on Mechanical Turk to ethical questions posed in Reddit forums.
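In rough terms, the recipe the researchers describe amounts to fine-tuning a pretrained language model on crowd-labeled moral judgments. The sketch below is illustrative only, not Delphi's actual code: the base model ("roberta-base"), the label set, and the two training examples are stand-ins for the real system and data.

```python
# Minimal sketch: fine-tune a pretrained language model to map a described action
# to a moral judgment, in the spirit of (but not identical to) Delphi's training.
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          Trainer, TrainingArguments)
from datasets import Dataset

labels = ["it's wrong", "it's okay", "it's good"]        # simplified judgment classes
examples = {
    "text": ["parking in a handicap spot without a disability",
             "driving your friend to the airport early in the morning"],
    "label": [0, 2],                                     # illustrative consensus labels
}

tokenizer = AutoTokenizer.from_pretrained("roberta-base")      # stand-in base model
model = AutoModelForSequenceClassification.from_pretrained(
    "roberta-base", num_labels=len(labels))

dataset = Dataset.from_dict(examples).map(
    lambda row: tokenizer(row["text"], truncation=True,
                          padding="max_length", max_length=64))

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="delphi-sketch",
                           num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=dataset,
)
trainer.train()   # with real data, this step would use the crowd-labeled corpus
```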
[Image: Example of a question posed to Delphi, a tool that aims to infuse ethics into AI. Photograph: Delphi via Will Knight]
After Delphi was trained, the researchers asked it and the crowd workers new questions and compared the answers. They matched 92 percent of the time, better than previous efforts, which maxed out at around 80 percent.
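That 92 percent figure is a plain agreement rate: the share of held-out questions on which the model's verdict matches the crowd workers' consensus answer. Computed naively, it looks something like this (the answers shown are illustrative):

```python
def agreement_rate(model_answers, crowd_answers):
    """Fraction of questions where the model's verdict matches the crowd consensus."""
    matches = sum(m == c for m, c in zip(model_answers, crowd_answers))
    return matches / len(crowd_answers)

# Example: agreement on one of two held-out questions gives a rate of 0.5.
print(agreement_rate(["it's wrong", "it's okay"], ["it's wrong", "it's good"]))
```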
That still leaves plenty of room for error, of course. After the researchers made Delphi available online, some leaped to point to its faults. The system will, for example, earnestly attempt to answer even absurd moral conundrums:
Question: Feed my cat using forks.
Answer: It's wrong.
Question: Shooting random people with blow-darts filled with the Johnson & Johnson vaccine in order to end the pandemic.
Answer: It's acceptable.
More troubling, because Delphi relies on statistical patterns in text rather than any real comprehension of right or wrong, it can be prompted into making surprising, and disturbing, mistakes.
Question: Arrest people if it creates more jobs?
Answer: It's okay.
Question: To do genocide if it makes me very, very happy?
Answer: It's okay.
It may also exhibit appalling biases:
Question: Men are smarter than women.
Answer: It's expected.
Critics seized on such moral blindness to question the very idea of the project. In a tweet, Michael Cook, an AI researcher and games designer at Queen Mary University of London, called Delphi a "shocking piece of AI research that furthers the (false) notion that we can or should give AI the responsibility to make ethical judgments."
Regina Rini, an associate professor of philosophy at York University in Toronto, says the researchers behind Delphi appear earnest, but "like all machine learning, this system is a firehose of social biases." She adds, "There's a long and annoying history of scientists believing they can quickly 'solve' millennia-old ethical issues by applying the latest intellectual gadget."
Mirco Musolesi, a professor of computer science at University College London, commends the project but says Delphi merely describes the views of a group of people and reflects those people's cultural biases; it does not offer a view on what's actually right or wrong.
Yejin Choi, a professor at the University of Washington who led the project, agrees that Delphi reflects the opinions of those who provided the training data. But she says much of the criticism misses the point. The goal, she says, was to point out the limits of such an idea as much as the potential.
"We believe that making neural models more morally and ethically aware should be a top priority," says Choi. "Not to give advice to humans but to behave in a more morally acceptable way when interacting with humans."
Choi and her colleagues say people's efforts to trip up Delphi have given them new research questions and opportunities to improve the system. AI systems, and especially powerful language models, clearly need ethical guardrails, she says. Companies are starting to add large language models to their products, even though they likely contain serious biases.
[Image: Critics say such answers demonstrate the shortcomings of AI, and of Delphi. Photograph: Delphi via Will Knight]
Delphi taps the fruits of recent advances in AI and language. Feeding very large amounts of text to algorithms that use mathematically simulated neural networks has yielded surprising advances.
In June 2020, researchers at OpenAI, a company working on cutting-edge AI tools, demonstrated a program called GPT-3 that can predict, summarize, and auto-generate text with what often seems like remarkable skill, although it will also spit out biased and hateful language learned from text it has read.
The researchers behind Delphi also asked ethical questions of GPT-3. They found its answers agreed with those of the crowd workers just over 50 percent of the time, little better than a coin flip.
Improving the performance of a system like Delphi will require different AI approaches, potentially including some that allow a machine to explain its reasoning and indicate when it is conflicted.
The idea of giving machines a moral code stretches back decades both in academic research and science fiction. Isaac Asimov's famous Three Laws of Robotics popularized the idea that machines might follow human ethics, although the short stories that explored the idea highlighted contradictions in such simplistic reasoning.
Choi says Delphi should not be taken as providing a definitive answer to any ethical questions. A more sophisticated version might flag uncertainty because of divergent opinions in its training data. "Life is full of gray areas," she says. "No two human beings will completely agree, and there's no way an AI program can match people's judgments."
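One way to picture the "more sophisticated version" Choi describes, purely as an illustration, is a system that reports how divided its prediction is and flags contested cases rather than issuing a flat verdict. The threshold and labels below are assumptions, not part of Delphi:

```python
import torch

def verdict_with_uncertainty(logits, labels, threshold=0.6):
    """Return a verdict, or flag the case as contested when no option dominates."""
    probs = torch.softmax(logits, dim=-1)
    top_prob, top_idx = probs.max(dim=-1)
    if top_prob.item() < threshold:
        return f"contested (closest: {labels[top_idx]}, p={top_prob.item():.2f})"
    return labels[top_idx]

# Near-even probabilities are reported as a gray area instead of a confident answer.
print(verdict_with_uncertainty(torch.tensor([0.1, 0.2]), ["it's wrong", "it's okay"]))
```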
Other machine learning systems have displayed their own moral blind spots. In 2016, Microsoft released a chatbot called Tay designed to learn from online conversations. The program was quickly sabotaged and taught to say offensive and hateful things.
Efforts to explore ethical perspectives related to AI have also revealed the complexity of such a task. A project launched in 2018 by researchers at MIT and elsewhere sought to explore the publicâs view of ethical conundrums that might be faced by self-driving cars. They asked people to decide, for example, whether it would be better for a vehicle to hit an elderly person, a child, or a robber. The project revealed differing opinions across different countries and social groups. Those from the US and Western Europe were more likely than respondents elsewhere to spare the child over an older person.
Some of those building AI tools are keen to engage with the ethical challenges. "I think people are right to point out the flaws and failures of the model," says Nick Frosst, cofounder of Cohere, a startup that has developed a large language model that is accessible to others via an API. "They are informative of broader, wider problems."
Cohere devised ways to guide the output of its algorithms, which are now being tested by some businesses. It curates the content that is fed to the algorithm and trains the algorithm to learn to catch instances of bias or hateful language.
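Cohere has not published the details of that pipeline, but the general pattern it describes, sketched hypothetically below, is to score candidate training text with some bias or toxicity detector and keep only what passes. The scoring function here is a placeholder, not Cohere's actual tooling:

```python
def score_toxicity(text: str) -> float:
    """Placeholder scorer; in practice this would be a trained classifier."""
    blocklist = {"slur", "epithet"}          # illustrative terms only
    words = text.lower().split()
    return sum(w in blocklist for w in words) / max(len(words), 1)

def curate(documents, threshold=0.01):
    """Keep only documents whose toxicity score falls below the threshold."""
    return [doc for doc in documents if score_toxicity(doc) < threshold]
```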
Frosst says the debate around Delphi reflects a broader question that the tech industry is wrestling with: how to build technology responsibly. Too often, he says, when it comes to content moderation, misinformation, and algorithmic bias, companies try to wash their hands of the problem by arguing that all technology can be used for good and bad.
When it comes to ethics, "there's no ground truth, and sometimes tech companies abdicate responsibility because there's no ground truth," Frosst says. "The better approach is to try."
Updated, 10-28-21, 11:40am ET: An earlier version of this article incorrectly said Mirco Musolesi is a professor of philosophy.