Dubai Telegraph - Inner workings of AI an enigma - even to its creators

EUR -
AED 4.326058
AFN 77.139899
ALL 96.549397
AMD 445.222644
ANG 2.10837
AOA 1079.46412
ARS 1698.693815
AUD 1.696726
AWG 2.120054
AZN 1.991648
BAM 1.953756
BBD 2.372917
BDT 144.08925
BGN 1.977975
BHD 0.444005
BIF 3486.310929
BMD 1.177808
BND 1.50053
BOB 8.140518
BRL 6.211168
BSD 1.178167
BTN 106.473605
BWP 15.597747
BYN 3.374769
BYR 23085.03183
BZD 2.369421
CAD 1.613214
CDF 2626.511201
CHF 0.916676
CLF 0.025853
CLP 1020.817577
CNY 8.171689
CNH 8.173762
COP 4350.232911
CRC 584.088911
CUC 1.177808
CUP 31.211905
CVE 110.507883
CZK 24.258172
DJF 209.319869
DKK 7.46659
DOP 74.352211
DZD 153.163736
EGP 55.196195
ERN 17.667116
ETB 183.5728
FJD 2.606429
FKP 0.862372
GBP 0.870123
GEL 3.168063
GGP 0.862372
GHS 12.926468
GIP 0.862372
GMD 86.565372
GNF 10317.595829
GTQ 9.036546
GYD 246.482124
HKD 9.204037
HNL 31.120441
HRK 7.531959
HTG 154.558297
HUF 379.805904
IDR 19869.086669
ILS 3.674695
IMP 0.862372
INR 106.344965
IQD 1543.38527
IRR 49615.151504
ISK 144.799462
JEP 0.862372
JMD 184.267215
JOD 0.835086
JPY 184.980006
KES 151.93744
KGS 102.99914
KHR 4755.045332
KMF 491.146061
KPW 1060.062311
KRW 1730.806135
KWD 0.362105
KYD 0.981819
KZT 581.062078
LAK 25322.506925
LBP 105507.31126
LKR 364.588558
LRD 219.141892
LSL 19.033287
LTL 3.47776
LVL 0.712444
LYD 7.463192
MAD 10.813487
MDL 20.022137
MGA 5212.546496
MKD 61.579789
MMK 2473.140934
MNT 4203.780708
MOP 9.481064
MRU 46.995832
MUR 54.226305
MVR 18.208707
MWK 2042.862703
MXN 20.569647
MYR 4.648834
MZN 75.097215
NAD 19.033287
NGN 1609.510075
NIO 43.354641
NOK 11.5385
NPR 170.357767
NZD 1.976408
OMR 0.452871
PAB 1.178177
PEN 3.960257
PGK 5.121642
PHP 69.236319
PKR 329.876375
PLN 4.224973
PYG 7779.860505
QAR 4.293908
RON 5.093072
RSD 117.368304
RUB 90.396418
RWF 1719.581228
SAR 4.416898
SBD 9.498604
SCR 15.920008
SDG 708.45608
SEK 10.670308
SGD 1.501946
SHP 0.883661
SLE 28.914899
SLL 24698.038676
SOS 672.096835
SRD 44.603273
STD 24378.242367
STN 24.474394
SVC 10.308215
SYP 13026.052983
SZL 19.024177
THB 37.451938
TJS 11.027263
TMT 4.128216
TND 3.413828
TOP 2.835878
TRY 51.277982
TTD 7.977654
TWD 37.306474
TZS 3044.633176
UAH 50.838711
UGX 4205.59999
USD 1.177808
UYU 45.462436
UZS 14450.881107
VES 445.192896
VND 30570.000059
VUV 140.969068
WST 3.21111
XAF 655.302006
XAG 0.015944
XAU 0.000245
XCD 3.183084
XCG 2.123288
XDR 0.813984
XOF 655.271438
XPF 119.331742
YER 280.701005
ZAR 19.144735
ZMK 10601.69265
ZMW 21.88429
ZWL 379.253614
  • SCS

    0.0200

    16.14

    +0.12%

  • RBGPF

    0.1000

    82.5

    +0.12%

  • CMSD

    0.0200

    23.89

    +0.08%

  • CMSC

    0.0300

    23.55

    +0.13%

  • BCC

    -1.0700

    89.16

    -1.2%

  • RYCEF

    -0.0600

    16.62

    -0.36%

  • NGG

    -0.9000

    86.89

    -1.04%

  • BCE

    -0.7700

    25.57

    -3.01%

  • RIO

    -5.3600

    91.12

    -5.88%

  • RELX

    0.3100

    30.09

    +1.03%

  • GSK

    1.9400

    59.17

    +3.28%

  • VOD

    -1.0900

    14.62

    -7.46%

  • JRI

    -0.1500

    13

    -1.15%

  • BTI

    0.3300

    61.96

    +0.53%

  • AZN

    -0.2900

    187.16

    -0.15%

  • BP

    -1.0300

    38.17

    -2.7%

Inner workings of AI an enigma - even to its creators
Inner workings of AI an enigma - even to its creators / Photo: Kirill KUDRYAVTSEV - AFP

Inner workings of AI an enigma - even to its creators

Even the greatest human minds building generative artificial intelligence that is poised to change the world admit they do not comprehend how digital minds think.

Text size:

"People outside the field are often surprised and alarmed to learn that we do not understand how our own AI creations work," Anthropic co-founder Dario Amodei wrote in an essay posted online in April.

"This lack of understanding is essentially unprecedented in the history of technology."

Unlike traditional software programs that follow pre-ordained paths of logic dictated by programmers, generative AI (gen AI) models are trained to find their own way to success once prompted.

In a recent podcast Chris Olah, who was part of ChatGPT-maker OpenAI before joining Anthropic, described gen AI as "scaffolding" on which circuits grow.

Olah is considered an authority in so-called mechanistic interpretability, a method of reverse engineering AI models to figure out how they work.

This science, born about a decade ago, seeks to determine exactly how AI gets from a query to an answer.

"Grasping the entirety of a large language model is an incredibly ambitious task," said Neel Nanda, a senior research scientist at the Google DeepMind AI lab.

It was "somewhat analogous to trying to fully understand the human brain," Nanda added to AFP, noting neuroscientists have yet to succeed on that front.

Delving into digital minds to understand their inner workings has gone from a little-known field just a few years ago to being a hot area of academic study.

"Students are very much attracted to it because they perceive the impact that it can have," said Boston University computer science professor Mark Crovella.

The area of study is also gaining traction due to its potential to make gen AI even more powerful, and because peering into digital brains can be intellectually exciting, the professor added.

- Keeping AI honest -

Mechanistic interpretability involves studying not just results served up by gen AI but scrutinizing calculations performed while the technology mulls queries, according to Crovella.

"You could look into the model...observe the computations that are being performed and try to understand those," the professor explained.

Startup Goodfire uses AI software capable of representing data in the form of reasoning steps to better understand gen AI processing and correct errors.

The tool is also intended to prevent gen AI models from being used maliciously or from deciding on their own to deceive humans about what they are up to.

"It does feel like a race against time to get there before we implement extremely intelligent AI models into the world with no understanding of how they work," said Goodfire chief executive Eric Ho.

In his essay, Amodei said recent progress has made him optimistic that the key to fully deciphering AI will be found within two years.

"I agree that by 2027, we could have interpretability that reliably detects model biases and harmful intentions," said Auburn University associate professor Anh Nguyen.

According to Boston University's Crovella, researchers can already access representations of every digital neuron in AI brains.

"Unlike the human brain, we actually have the equivalent of every neuron instrumented inside these models", the academic said. "Everything that happens inside the model is fully known to us. It's a question of discovering the right way to interrogate that."

Harnessing the inner workings of gen AI minds could clear the way for its adoption in areas where tiny errors can have dramatic consequences, like national security, Amodei said.

For Nanda, better understanding what gen AI is doing could also catapult human discoveries, much like DeepMind's chess-playing AI, AlphaZero, revealed entirely new chess moves that none of the grand masters had ever thought about.

Properly understood, a gen AI model with a stamp of reliability would grab competitive advantage in the market.

Such a breakthrough by a US company would also be a win for the nation in its technology rivalry with China.

"Powerful AI will shape humanity's destiny," Amodei wrote.

"We deserve to understand our own creations before they radically transform our economy, our lives, and our future."

B.Krishnan--DT