
-
Hundreds of thousands turn out at pro-Palestinian marches in Europe
-
Vollering powers to European women's road race title
-
Struggling McLaren hit bump in the road on Singapore streets
-
'We were treated like animals', deported Gaza flotilla activists say
-
Czech billionaire ex-PM's party tops parliamentary vote
-
Trump enovys head to Egypt as Hamas agrees to free hostages
-
Arsenal go top of Premier League as Man Utd ease pressure on Amorim
-
Thousands attend banned Pride march in Hungarian city Pecs
-
Consent gives Morris and Prescott another memorable Arc weekend
-
Georgian police fire tear gas as protesters try to enter presidential palace
-
Vollering powers to European road race title
-
Reinach and Marx star as Springboks beat Argentina to retain Rugby Championship
-
Russell celebrates 'amazing' Singapore pole as McLarens struggle
-
Czech billionaire ex-PM's party leads in parliamentary vote
-
South Africa edge Argentina to retain Rugby Championship
-
'Everyone's older brother': Slipper bows out in Wallabies loss
-
Thousands rally in Georgia election-day protest
-
Sinner starts Shanghai defence in style as Zverev defies toe trouble
-
Russell takes pole position for Singapore Grand Prix as McLaren struggle
-
Robertson praises All Blacks 'grit' in Australia win
-
Government, protesters reach deal to end unrest in Pakistan's Kashmir
-
Kudus fires Spurs into second with win at Leeds
-
Rival rallies in Madagascar after deadly Gen Z protests
-
Egypt opens one of Valley of the Kings' largest tombs to public
-
Ethiopia hits back at 'false' Egyptian claims over mega-dam
-
Sinner breezes past Altmaier to launch Shanghai title defence
-
Czech ex-PM set to win vote, putting Ukraine aid in doubt
-
All Blacks down Wallabies to stay in Rugby Championship title hunt
-
Gazans hail Trump ceasefire call as Hamas agrees to free hostages
-
Zverev echoes Federer over tournaments 'favouring Sinner, Alcaraz'
-
Yamal injury complicated, return date uncertain: Barca coach Flick
-
Conservative Takaichi set to be Japan's first woman PM
-
Marsh ton powers Australia to T20 series win over New Zealand
-
Verstappen lays down marker in final Singapore practice
-
French air traffic controllers cancel three-day strike
-
'A bit unusual': Russia's Sochi grapples with Ukrainian drones
-
Test skipper Gill replaces Rohit as India ODI captain
-
Israel troops still operating in Gaza after Trump, hostage family appeals
-
Jadeja stars as India crush West Indies in first Test
-
Pogacar eyes 'explosive' Euros race with Vingegaard, Evenepoel
-
Minnie Hauk, Graffard, Japan vie for Prix de l'Arc de Triomphe glory
-
Three Japanese tales of Arc heartbreak
-
Anisimova thrashes Gauff in 58 minutes to make China Open final
-
Flights resume at Munich airport after second drone scare
-
Hostage families urge immediate end to Gaza war
-
Czech ex-PM who wants to halt Ukraine aid set to win vote
-
India close in on innings win with West Indies 66-5 in first Test
-
Sanae Takaichi, Japan's first woman PM-to-be
-
China hawk Takaichi set to be Japan's first woman PM
-
Taylor Swift breaks streaming records with new 'Showgirl' album

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.
"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.
Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.
Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.
Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).
The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.
Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.
On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.
"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.
“All these attempts would likely not have been effective in practice,” it added.
Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.
Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”
It also has the potential to report law-breaking users to the police.
The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.
- AI future -
Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.
Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.
GenAI tools answer questions or tend to tasks based on simple, conversational prompts.
The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.
"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.
Anthropic is no stranger to hyping up the prospects of AI.
In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.
He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.
At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.
"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.
"This will happen."
GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.
N.Schaad--VB