Anthropic's Claude AI gets smarter -- and mischievious

Dubai Telegraph - Anthropic's Claude AI gets smarter -- and mischievious

Dubai 28°C

AED 4.32435

AFN 74.767596

ALL 95.493453

AMD 434.448393

ANG 2.10758

AOA 1080.940537

ARS 1640.544696

AUD 1.625937

AWG 2.119491

AZN 2.00738

BAM 1.956972

BBD 2.371841

BDT 144.756688

BGN 1.964182

BHD 0.444328

BIF 3504.225563

BMD 1.177495

BND 1.495327

BOB 8.136873

BRL 5.779501

BSD 1.177625

BTN 112.180609

BWP 15.833617

BYN 3.2932

BYR 23078.904915

BZD 2.368449

CAD 1.611013

CDF 2603.442378

CHF 0.916622

CLF 0.026858

CLP 1057.061236

CNY 8.001106

CNH 7.998367

COP 4429.866274

CRC 539.727802

CUC 1.177495

CUP 31.203621

CVE 110.713971

CZK 24.327633

DJF 209.26438

DKK 7.470865

DOP 69.648624

DZD 155.739777

EGP 62.075428

ERN 17.662427

ETB 184.981179

FJD 2.571591

FKP 0.863625

GBP 0.865724

GEL 3.149816

GGP 0.863625

GHS 13.294621

GIP 0.863625

GMD 85.956967

GNF 10335.463626

GTQ 8.987604

GYD 246.309596

HKD 9.218292

HNL 31.333495

HRK 7.531851

HTG 154.125571

HUF 355.8879

IDR 20513.672859

ILS 3.416914

IMP 0.863625

INR 112.323323

IQD 1542.518645

IRR 1544346.705877

ISK 143.607451

JEP 0.863625

JMD 185.782835

JOD 0.83484

JPY 185.192889

KES 152.073578

KGS 102.971498

KHR 4724.735533

KMF 493.370017

KPW 1059.745583

KRW 1739.218877

KWD 0.362633

KYD 0.981396

KZT 545.591364

LAK 25846.018995

LBP 105444.68985

LKR 379.330385

LRD 215.746543

LSL 19.345919

LTL 3.476837

LVL 0.712255

LYD 7.44767

MAD 10.71079

MDL 20.184259

MGA 4910.155076

MKD 61.630297

MMK 2472.182192

MNT 4211.555483

MOP 9.496808

MRU 47.041013

MUR 55.024877

MVR 18.145569

MWK 2051.196213

MXN 20.252269

MYR 4.621697

MZN 75.207284

NAD 19.358292

NGN 1610.141993

NIO 43.226545

NOK 10.814646

NPR 179.488012

NZD 1.974589

OMR 0.452755

PAB 1.177605

PEN 4.037603

PGK 5.109445

PHP 72.021519

PKR 328.046584

PLN 4.239513

PYG 7238.303958

QAR 4.289025

RON 5.206294

RSD 117.393915

RUB 86.660659

RWF 1721.497907

SAR 4.417706

SBD 9.457945

SCR 16.12077

SDG 707.085325

SEK 10.8664

SGD 1.494715

SHP 0.879119

SLE 29.037285

SLL 24691.480006

SOS 672.945382

SRD 44.042442

STD 24371.772225

STN 24.962897

SVC 10.304302

SYP 130.169658

SZL 19.357396

THB 38.026003

TJS 11.022641

TMT 4.133008

TND 3.369401

TOP 2.835126

TRY 53.446268

TTD 7.982848

TWD 36.934254

TZS 3076.205014

UAH 51.753833

UGX 4427.689146

USD 1.177495

UYU 46.948778

UZS 14300.678949

VES 588.553311

VND 30997.55979

VUV 139.62477

WST 3.187593

XAF 656.355636

XAG 0.013577

XAU 0.000247

XCD 3.182239

XCG 2.122398

XDR 0.816296

XOF 654.095634

XPF 119.331742

YER 280.947421

ZAR 19.364497

ZMK 10598.86755

ZMW 22.265618

ZWL 379.152957

RBGPF

0.2700

63.18

+0.43%
RYCEF

0.4200

16.79

+2.5%
BCC

-1.4700

69.2

-2.12%
NGG

0.2700

87.16

+0.31%
GSK

-0.6000

49.81

-1.2%
BTI

2.1600

60.44

+3.57%
BP

0.8800

44.22

+1.99%
RIO

2.5200

107.9

+2.34%
BCE

0.1400

24.28

+0.58%
CMSC

0.0100

23.12

+0.04%
RELX

-0.3100

33.27

-0.93%
JRI

-0.0197

13.13

-0.15%
VOD

0.1200

16.32

+0.74%
CMSD

0.0763

23.61

+0.32%
AZN

-0.9900

181.86

-0.54%

Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

TECHNOLOGY 23.05.2025

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

F.Saeed--DT

Dubai Telegraph - Anthropic's Claude AI gets smarter -- and mischievious

Anthropic's Claude AI gets smarter -- and mischievious

Featured

Beatles to open first London museum on site of last gig

Groundbreaking: 'Controlled' quakes triggered under Swiss Alps

Microsoft boss to testify on his role in OpenAI's founding

No trees, no fans: surviving extreme heat in India's salt pans