-
South Africa declares national disaster as floods batter region
-
Gang members in Guatemala kill seven police after prison crackdown: minister
-
Villa's title bid rocked by Everton loss, Newcastle held at Wolves
-
Dybala boosts Roma's Champions League hopes, Fiorentina honour Commisso
-
Villa's title bid rocked by Everton loss, Newcastle held by Wolves
-
'Avatar: Fire and Ash' at number one in N.America for fifth straight week
-
Limited internet returns in Iran after protest blackout
-
Syria's leader agrees truce deal with Kurds after govt troops advance
-
Smith's penalty sees Quins eliminate La Rochelle, Bordeaux secure top seeding
-
Atletico edge Alaves to strengthen Liga top-four hold
-
Uganda president says opposition 'terrorists' in victory speech
-
New Zealand register first ODI series win in India despite Kohli ton
-
Elvira wins Dubai Invitational after Lowry's last hole meltdown
-
Jeong snatches Union late draw at Stuttgart in Bundesliga
-
Man Utd's Martinez hits back at Scholes after height jibes
-
Frank on the brink as Romero calls for unity amid Spurs 'disaster'
-
Chile declares emergency as wildfires kill at least 15
-
Europe hits back at Trump tariff threat over Greenland
-
Men's Fashion Week in Paris: what to watch
-
McGrath goes top of slalom standings with Wengen win
-
No Venus fairytale as Alcaraz, Sabalenka win Melbourne openers
-
Iran considers 'gradually' restoring internet after shutdown
-
Mitchell, Phillips tons guide New Zealand to 337-8 in ODI decider
-
Flailing Frankfurt sack coach Toppmoeller
-
Kurdish forces withdraw from Syria's largest oil field as govt forces advance
-
'Proud' Venus Williams, 45, exits Australian Open after epic battle
-
Vonn in Olympic form with another World Cup podium in Tarvisio super-G
-
Alcaraz kicks off career Grand Slam bid with tough Australian Open test
-
Hosts Morocco face Mane's Senegal for AFCON glory
-
Europe scrambles to respond to Trump tariff threat
-
Venus Williams, 45, exits Australian Open after epic battle
-
Taiwan's Lin wins India Open marred by 'dirty' conditions
-
Indonesia rescuers find body from plane crash
-
Kurdish-led forces withdraw from Syria's largest oil field: monitor
-
Ball girl collapses in Australian Open heat as players rush to help
-
France's Moutet booed for underarm match point serve in Melbourne
-
Zverev happy with response after wobble in opening Melbourne win
-
'Bring it on': UK's Labour readies for EU reset fight
-
New Zealand's Wollaston wins again to lead Tour Down Under
-
Zverev wobbles but wins at Australian Open as Alcaraz enters fray
-
British qualifier upsets 20th seed Cobolli to make mum proud
-
Zverev drops set on way to Australian Open second round
-
Indonesian rescuers find debris from missing plane
-
Wembanyama scores 39 as Spurs overcome Edwards, Wolves in thriller
-
Heartbreak for Allen as Broncos beat Bills in playoff thriller
-
British qualifier upsets 20th seed Cobolli in Melbourne
-
Paolini races into round two to kickstart Australian Open
-
Portugal presidential vote wide open as far-right surge expected
-
Lutz kicks Broncos to overtime thriller as Bills, Allen fall short
-
Marchand closes Austin Pro Swim with 200m breaststroke win
Anthropic's Claude AI gets smarter -- and mischievious
Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.
"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.
Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.
Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.
Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).
The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.
Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.
On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.
"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.
“All these attempts would likely not have been effective in practice,” it added.
Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.
Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”
It also has the potential to report law-breaking users to the police.
The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.
- AI future -
Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.
Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.
GenAI tools answer questions or tend to tasks based on simple, conversational prompts.
The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.
"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.
Anthropic is no stranger to hyping up the prospects of AI.
In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.
He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.
At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.
"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.
"This will happen."
GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.
N.Schaad--VB