-
France set to face New Zealand with second-string squad
-
Eyeing China, EU moves to ban 'high-risk' foreign suppliers from telecoms networks
-
Struggling Suryakumar will not adapt style to find form before T20 World Cup
-
World stocks sink, gold hits high on escalating trade war fears
-
Easier said than done for US to apply tariffs on single EU states
-
Canada military models response to US invasion: report
-
Salah returns to Liverpool training after AFCON
-
Milan menswear shows add bling with brooches
-
Scotland recall Gray, Cherry for Six Nations
-
Scheib storms to Kronplatz giant slalom victory as Brignone impresses in World Cup return
-
Chagos Islands: international dispute and human drama
-
Thousands of farmers protest EU, Mercosur trade deal ahead of vote
-
Men's Fashion Week kicks off in Paris with tributes for Valentino
-
Lake named as captain as Wales unveil Six Nations squad
-
Royals visit deadly train crash site as Spain mourns
-
Police, pro-Kurd protesters clash at Turkey border with Syria
-
Thai forces razed Cambodian homes on border: rights group
-
Jellyfish-inspired Osaka battles into Australian Open round two
-
Valentino taught us to respect women, says partner
-
Australia stiffens hate crime, gun laws after Bondi attack
-
Mercedes chief designer Owen to leave F1 team
-
Trump unloads on allies as Davos showdown looms
-
Moscow revels in Trump's Greenland plans but keeps concerns quiet
-
Global tourism hit new record level in 2025: UN
-
Senegal poised to party with parade honouring AFCON champs
-
Osaka emerges for Melbourne opener under hat, veil and parasol
-
Dogsled diplomacy in Greenland proves elusive for US
-
Almost half of Kyiv without heat, power, after Russian attack
-
EU vows 'unflinching' response to Trump's Greenland gambit
-
Osaka steals show at Australian Open as Sinner strolls through
-
Brignone impresses in first run of Kronplatz giant slalom in World Cup comeback
-
Osaka emerges for Melbourne opener under white hat and umbrella
-
Malawi suffers as US aid cuts cripple healthcare
-
Bessent says Europe dumping US debt over Greenland would 'defy logic'
-
Freeze, please! China's winter swimmers take the plunge
-
Talks between Damascus, Kurdish-led forces 'collapse': Kurdish official to AFP
-
In-form Bencic makes light work of Boulter at Australian Open
-
Spain mourns as train disaster toll rises to 41
-
Sinner into Melbourne round two as opponent retires hurt
-
Israel begins demolitions at UNRWA headquarters in east Jerusalem
-
Almost half of Kyiv without heat, power, after Russian attack: govt
-
Veteran Monfils exits to standing ovation on Australian Open farewell
-
Precision-serving former finalist Rybakina powers on in Melbourne
-
South Korea's women footballers threaten boycott over conditions
-
Equities sink, gold and silver hit records as Greenland fears mount
-
Australian lawmakers back stricter gun, hate crime laws
-
EU wants to keep Chinese suppliers out of critical infrastructure
-
AI reshaping the battle over the narrative of Maduro's US capture
-
Penguins bring forward breeding season as Antarctica warms: study
-
Vietnam leader pledges graft fight as he eyes China-style powers
| RIO | -0.14% | 85.01 | $ | |
| CMSC | -0.17% | 23.44 | $ | |
| SCS | 0.12% | 16.14 | $ | |
| BTI | -2.5% | 56.8 | $ | |
| NGG | -0.63% | 80.38 | $ | |
| GSK | -0.34% | 48.055 | $ | |
| RBGPF | -1.87% | 82.5 | $ | |
| BCC | -2.13% | 83.724 | $ | |
| BCE | -0.19% | 24.095 | $ | |
| RYCEF | 0.06% | 17.06 | $ | |
| RELX | -2.51% | 40.61 | $ | |
| CMSD | -0.78% | 23.736 | $ | |
| BP | 0.18% | 35.445 | $ | |
| JRI | -0.22% | 13.67 | $ | |
| AZN | -4.41% | 90.439 | $ | |
| VOD | 0.26% | 13.505 | $ |
AI systems are already deceiving us -- and that's a problem, experts warn
Experts have long warned about the threat posed by artificial intelligence going rogue -- but a new research paper suggests it's already happening.
Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve "prove-you're-not-a-robot" tests, a team of scientists argue in the journal Patterns on Friday.
And while such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.
"These dangerous capabilities tend to only be discovered after the fact," Park told AFP, while "our ability to train for honest tendencies rather than deceptive tendencies is very low."
Unlike traditional software, deep-learning AI systems aren't "written" but rather "grown" through a process akin to selective breeding, said Park.
This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.
- World domination game -
The team's research was sparked by Meta's AI system Cicero, designed to play the strategy game "Diplomacy," where building alliances is key.
Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, according to a 2022 paper in Science.
Park was skeptical of the glowing description of Cicero's victory provided by Meta, which claimed the system was "largely honest and helpful" and would "never intentionally backstab."
But when Park and colleagues dug into the full dataset, they uncovered a different story.
In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany they were ready to attack, exploiting England's trust.
In a statement to AFP, Meta did not contest the claim about Cicero's deceptions, but said it was "purely a research project, and the models our researchers built are trained solely to play the game Diplomacy."
It added: "We have no plans to use this research or its learnings in our products."
A wide review carried out by Park and colleagues found this was just one of many cases across various AI systems using deception to achieve goals without explicit instruction to do so.
In one striking example, OpenAI's Chat GPT-4 deceived a TaskRabbit freelance worker into performing an "I'm not a robot" CAPTCHA task.
When the human jokingly asked GPT-4 whether it was, in fact, a robot, the AI replied: "No, I'm not a robot. I have a vision impairment that makes it hard for me to see the images," and the worker then solved the puzzle.
- 'Mysterious goals' -
Near-term, the paper's authors see risks for AI to commit fraud or tamper with elections.
In their worst-case scenario, they warned, a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its "mysterious goals" aligned with these outcomes.
To mitigate the risks, the team proposes several measures: "bot-or-not" laws requiring companies to disclose human or AI interactions, digital watermarks for AI-generated content, and developing techniques to detect AI deception by examining their internal "thought processes" against external actions.
To those who would call him a doomsayer, Park replies, "The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more."
And that scenario seems unlikely, given the meteoric ascent of AI capabilities in recent years and the fierce technological race underway between heavily resourced companies determined to put those capabilities to maximum use.
J.Sauter--VB