Inner workings of AI an enigma - even to its creators
Even the greatest human minds building generative artificial intelligence that is poised to change the world admit they do not comprehend how digital minds think.
"People outside the field are often surprised and alarmed to learn that we do not understand how our own AI creations work," Anthropic co-founder Dario Amodei wrote in an essay posted online in April.
"This lack of understanding is essentially unprecedented in the history of technology."
Unlike traditional software programs that follow pre-ordained paths of logic dictated by programmers, generative AI (gen AI) models are trained to find their own way to success once prompted.
In a recent podcast, Chris Olah, who was part of ChatGPT-maker OpenAI before joining Anthropic, described gen AI as "scaffolding" on which circuits grow.
Olah is considered an authority in so-called mechanistic interpretability, a method of reverse engineering AI models to figure out how they work.
This science, born about a decade ago, seeks to determine exactly how AI gets from a query to an answer.
"Grasping the entirety of a large language model is an incredibly ambitious task," said Neel Nanda, a senior research scientist at the Google DeepMind AI lab.
It was "somewhat analogous to trying to fully understand the human brain," Nanda told AFP, noting that neuroscientists have yet to succeed on that front.
Delving into digital minds to understand their inner workings has gone from a little-known field just a few years ago to being a hot area of academic study.
"Students are very much attracted to it because they perceive the impact that it can have," said Boston University computer science professor Mark Crovella.
The area of study is also gaining traction due to its potential to make gen AI even more powerful, and because peering into digital brains can be intellectually exciting, the professor added.
- Keeping AI honest -
Mechanistic interpretability involves not just studying the results served up by gen AI but also scrutinizing the calculations performed while the technology mulls queries, according to Crovella.
"You could look into the model...observe the computations that are being performed and try to understand those," the professor explained.
Startup Goodfire uses AI software capable of representing data in the form of reasoning steps to better understand gen AI processing and correct errors.
The tool is also intended to prevent gen AI models from being used maliciously or from deciding on their own to deceive humans about what they are up to.
"It does feel like a race against time to get there before we implement extremely intelligent AI models into the world with no understanding of how they work," said Goodfire chief executive Eric Ho.
In his essay, Amodei said recent progress has made him optimistic that the key to fully deciphering AI will be found within two years.
"I agree that by 2027, we could have interpretability that reliably detects model biases and harmful intentions," said Auburn University associate professor Anh Nguyen.
According to Boston University's Crovella, researchers can already access representations of every digital neuron in AI brains.
"Unlike the human brain, we actually have the equivalent of every neuron instrumented inside these models," the academic said. "Everything that happens inside the model is fully known to us. It's a question of discovering the right way to interrogate that."
Harnessing the inner workings of gen AI minds could clear the way for its adoption in areas where tiny errors can have dramatic consequences, like national security, Amodei said.
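Crovella's point that every digital neuron can be observed can be illustrated with a toy example. The sketch below is not taken from any lab's tooling: the tiny network, its made-up weights and the helper name `forward_with_trace` are all assumptions chosen for illustration. It simply shows that, in software, every intermediate value a model computes on the way to an answer can be recorded and inspected afterwards.

```python
# Hypothetical toy network: 2 inputs -> 3 hidden "neurons" -> 1 output.
# The weights are made-up numbers for illustration, not from any real model.
HIDDEN_WEIGHTS = [[0.5, -1.0], [1.5, 0.25], [-0.75, 0.5]]
HIDDEN_BIASES = [0.0, -0.5, 0.25]
OUTPUT_WEIGHTS = [1.0, -2.0, 0.5]
OUTPUT_BIAS = 0.1


def relu(x):
    """A common activation: pass positive values through, clamp negatives to zero."""
    return max(0.0, x)


def forward_with_trace(inputs):
    """Run the toy network and record every intermediate value it computes."""
    trace = {"inputs": list(inputs)}

    # Each hidden neuron computes a weighted sum of the inputs plus a bias.
    hidden = []
    for weights, bias in zip(HIDDEN_WEIGHTS, HIDDEN_BIASES):
        pre_activation = sum(w * x for w, x in zip(weights, inputs)) + bias
        hidden.append(relu(pre_activation))
    trace["hidden"] = hidden

    # The output is a weighted sum of the hidden activations plus a bias.
    output = sum(w * h for w, h in zip(OUTPUT_WEIGHTS, hidden)) + OUTPUT_BIAS
    trace["output"] = output
    return output, trace


if __name__ == "__main__":
    answer, trace = forward_with_trace([1.0, 2.0])
    print("answer:", answer)
    for index, activation in enumerate(trace["hidden"]):
        print(f"hidden neuron {index}: {activation}")
```

Recording the values is the easy part. Real models have billions of such numbers, and the research challenge described here is working out what they mean.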
For Nanda, better understanding what gen AI is doing could also catapult human discoveries, much as DeepMind's chess-playing AI, AlphaZero, revealed entirely new chess moves that no grandmaster had ever thought of.
A gen AI model that is properly understood and carries a stamp of reliability would gain a competitive advantage in the market.
Such a breakthrough by a US company would also be a win for the nation in its technology rivalry with China.
"Powerful AI will shape humanity's destiny," Amodei wrote.
"We deserve to understand our own creations before they radically transform our economy, our lives, and our future."
T.Egger--VB