Dubai Telegraph - Anthropic's Claude AI gets smarter -- and mischievous

Anthropic's Claude AI gets smarter -- and mischievous / Photo: Julie JAMMOT - AFP

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The startup, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers' intentions," the Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is for AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” leaving it up to society to decide how evenly wealth is distributed, Amodei reasoned.

F.Saeed--DT