Dubai Telegraph - Inner workings of AI an enigma - even to its creators

Photo: Kirill KUDRYAVTSEV - AFP

ECONOMY 13.05.2025

Even the greatest human minds building generative artificial intelligence, a technology poised to change the world, admit they do not comprehend how digital minds think.

"People outside the field are often surprised and alarmed to learn that we do not understand how our own AI creations work," Anthropic co-founder Dario Amodei wrote in an essay posted online in April.

"This lack of understanding is essentially unprecedented in the history of technology."

Unlike traditional software programs that follow pre-ordained paths of logic dictated by programmers, generative AI (gen AI) models are trained to find their own way to success once prompted.

In a recent podcast, Chris Olah, who was part of ChatGPT-maker OpenAI before joining Anthropic, described gen AI as "scaffolding" on which circuits grow.

Olah is considered an authority in so-called mechanistic interpretability, a method of reverse engineering AI models to figure out how they work.

This science, born about a decade ago, seeks to determine exactly how AI gets from a query to an answer.

"Grasping the entirety of a large language model is an incredibly ambitious task," said Neel Nanda, a senior research scientist at the Google DeepMind AI lab.

It was "somewhat analogous to trying to fully understand the human brain," Nanda told AFP, noting that neuroscientists have yet to succeed on that front.

Delving into digital minds to understand their inner workings has gone from a little-known field just a few years ago to being a hot area of academic study.

"Students are very much attracted to it because they perceive the impact that it can have," said Boston University computer science professor Mark Crovella.

The area of study is also gaining traction due to its potential to make gen AI even more powerful, and because peering into digital brains can be intellectually exciting, the professor added.

- Keeping AI honest -

Mechanistic interpretability involves not just studying the results served up by gen AI but scrutinizing the calculations performed while the technology mulls queries, according to Crovella.

"You could look into the model...observe the computations that are being performed and try to understand those," the professor explained.

Startup Goodfire uses AI software capable of representing data in the form of reasoning steps to better understand gen AI processing and correct errors.

The tool is also intended to prevent gen AI models from being used maliciously or from deciding on their own to deceive humans about what they are up to.

"It does feel like a race against time to get there before we implement extremely intelligent AI models into the world with no understanding of how they work," said Goodfire chief executive Eric Ho.

In his essay, Amodei said recent progress has made him optimistic that the key to fully deciphering AI will be found within two years.

"I agree that by 2027, we could have interpretability that reliably detects model biases and harmful intentions," said Auburn University associate professor Anh Nguyen.

According to Boston University's Crovella, researchers can already access representations of every digital neuron in AI brains.

"Unlike the human brain, we actually have the equivalent of every neuron instrumented inside these models," the academic said. "Everything that happens inside the model is fully known to us. It's a question of discovering the right way to interrogate that."

Harnessing the inner workings of gen AI minds could clear the way for its adoption in areas where tiny errors can have dramatic consequences, like national security, Amodei said.

For Nanda, better understanding what gen AI is doing could also catapult human discoveries, much as DeepMind's chess-playing AI, AlphaZero, revealed entirely new chess moves that none of the grandmasters had ever thought about.

Properly understood, a gen AI model with a stamp of reliability would grab a competitive advantage in the market.

Such a breakthrough by a US company would also be a win for the nation in its technology rivalry with China.

"Powerful AI will shape humanity's destiny," Amodei wrote.

"We deserve to understand our own creations before they radically transform our economy, our lives, and our future."

B.Krishnan--DT