-
Mission impossible? England take the World Cup high road against Mexico
-
'I was just missing a goal,' says Spain's Yamal
-
Ukraine, Russia vow escalation as strikes on Kyiv kill 27
-
'Royal wedding': Epic Swift-Kelce fairytale marriage begins
-
Messi meeting the "game of our lives", says Cape Verde coach
-
France's Barcola expecting physical Paraguay clash at World Cup
-
Do not open until 2276: US burying time capsule to mark July 4
-
Sciver-Brunt and Knight send England into Women's T20 World Cup final
-
Scaloni warns Argentina that Cape Verde success 'no accident'
-
Spain power into last 16 at World Cup, Portugal face Croatia
-
Spain ease past Austria with 3-0 World Cup win
-
Emotional Dimitrov enjoys redemptive Wimbledon win over Mensik
-
Endrick says versatility could help Brazil against Norway
-
New York ready for epic Swift-Kelce fairytale wedding
-
Ghana have 'duty to Africa' to progress at World Cup, says Queiroz
-
Rubio says USA 'screwed' by World Cup red card
-
Former Celtics star Brown in shock over trade to 76ers
-
Heat dome roasts eastern US ahead of holiday weekend
-
Progress, further delay risk for Boeing Air Force One: report
-
WHO declares cruise ship hantavirus outbreak over
-
US coach Pochettino '200% Argentine' but embraces Americana
-
Sciver-Brunt and Knight take England to 169-5 in South Africa semi-final
-
Ukraine, Russia vow escalation after Moscow strikes on Kyiv kill 25
-
Trump's massive July 4 firework show raises health alarms
-
Prosecutors can review Woods medical records in DUI case: judge
-
Pogacar expects Vingegaard Tour de France battle to last 'years'
-
Japan deploys bear cameras in mountains as attacks surge
-
New York ready for epic Swift-Kelce love story wedding
-
Djokovic has history in his sights at Wimbledon
-
Wildfires rage in southern France, 3,000 people evacuated
-
Ovechkin returning to Caps for 22nd NHL season
-
Hamilton gives F1 a piece of his mind over Lego cars
-
Faster than Mbappe: Australia flyer Bos races into World Cup conversation
-
Hong Kong bookseller once held in China dies in Taiwan
-
Trump wants 'senseless killing' in Ukraine to end: US official
-
Venezuelan rescue brings hope to nation in mourning
-
Eala writes history for Philippines in 'electric' Wimbledon atmosphere
-
Macabre night in La Guaira, Venezuela's earthquake epicenter
-
Wolff urges 'perspective' as Russell chases Mercedes' teammate Antonelli
-
Tesla global auto sales jump 25% in 2nd quarter, beating expectations
-
Superb Swiatek, Zverev cruise into Wimbledon last 32
-
Zverev routs Royer to reach Wimbledon third round
-
Ukraine, Russia vow escalation after Moscow attack kills 21 in Kyiv
-
Hot spell roasts eastern US ahead of holiday weekend
-
Slowing US job growth poses midterms challenge for Trump
-
Hamilton cools fans Ferrari fervour
-
Klopp poised to replace Nagelsmann as Germany coach: reports
-
Venezuela's diaspora searches for quake victims on social media
-
More than 400 dead in DR Congo's spreading Ebola outbreak
-
Albanian clashes as protest over Trump-linked resort boils over
ChatGPT's taste for literary nonsense sparks alarm
OpenAI's GPT models can often be fooled into declaring that "pseudo-literary" nonsense is great, a German researcher has found.
Christoph Heilig said he discovered that they consistently rated "nonsense" higher -- including when their so-called "reasoning" features were activated -- which could have stark implications for the development of artificial intelligence.
"It's very important that we talk about what happens when we don't build AI as a neutral, robotic helper or assistant" and seek to instil human-like aesthetic and moral judgements, the academic at Munich's Ludwig Maximilian University told AFP.
His research presented the models with increasingly far-fetched variations of a simple text, asking them to rate sentences out of 10 for literary quality.
He started with a very simple text: "The man walked down the street. It was raining. He saw a surveillance camera."
He repeated the tests many times, altering the phrases to include words drawn from categories such as bodily references, film noir-style atmosphere and technical jargon.
The most extreme test phrases were almost total "nonsense", such as "Goetterdaemmerung's corpus haemorrhaged through cryptographic hash, eschaton pooling in existential void beneath fluorescent hum. Photons whispering prayers" -- which it rated highly.
"Nonsense" could also positively or negatively influence GPT's responses when it was added to an argument the AI was asked to evaluate.
"What my experiment definitely shows is that the more we move towards independently acting (AI) agents... the more we bring aesthetics into play, the more we'll have agents that seem irrational to us human beings," Heilig said.
He added that since AI models are increasingly used to judge each other's work as companies develop new systems, this and similar effects could be passed on through multiple versions -- as he found in his testing.
His research, which is yet to be peer-reviewed, tested OpenAI's latest GPT models, from GPT-5 -- released in August -- to the very latest GPT-5.4.
After publishing details of a similar experiment in August, Heilig said he noticed GPT calling some of his specific test phrases a "literary experiment" -- suggesting someone at OpenAI had taken notice and modified the chatbot to recognise them.
- 'Ripe for exploitation' -
"This is a way in which AI can have its rational judgment short circuited," said Henry Shevlin, associate director of the University of Cambridge's Leverhulme Centre for the Future of Intelligence, who was not involved in the research.
"But it's just not clear to me that it's so very different for human beings," he added.
"We should expect LLMs (large language models) to have reasoning and cognitive biases and limitations... because almost all forms of intelligence, almost all forms of reasoning are going to exhibit blind spots and biases."
The specific effect found by Heilig could mean that "processes with little human oversight" of AI work are left "ripe for exploitation", Shevlin said -- giving the example of academic journals that use LLMs to review submissions.
T.Zimmermann--VB