Dubai Telegraph - As AI data scrapers sap websites' revenues, some fight back

EUR -
AED 4.186804
AFN 72.962441
ALL 94.259056
AMD 418.549568
ANG 2.041136
AOA 1045.418899
ARS 1684.10666
AUD 1.651889
AWG 2.052077
AZN 1.936931
BAM 1.955487
BBD 2.296633
BDT 140.257564
BGN 1.927676
BHD 0.429931
BIF 3386.658257
BMD 1.140043
BND 1.475464
BOB 7.880051
BRL 5.900179
BSD 1.140318
BTN 107.028002
BWP 15.497201
BYN 3.307171
BYR 22344.835632
BZD 2.293293
CAD 1.616934
CDF 2587.896628
CHF 0.921609
CLF 0.026661
CLP 1049.283409
CNY 7.756679
CNH 7.75807
COP 3917.562706
CRC 517.717184
CUC 1.140043
CUP 30.21113
CVE 110.246881
CZK 24.264557
DJF 203.065532
DKK 7.474507
DOP 66.999283
DZD 151.982519
EGP 56.441918
ERN 17.10064
ETB 183.847154
FJD 2.583449
FKP 0.86269
GBP 0.862499
GEL 3.015381
GGP 0.86269
GHS 12.857451
GIP 0.86269
GMD 83.222763
GNF 9991.401736
GTQ 8.699608
GYD 238.651244
HKD 8.940488
HNL 30.510119
HRK 7.535342
HTG 149.03616
HUF 354.147428
IDR 20362.5295
ILS 3.418629
IMP 0.86269
INR 107.599675
IQD 1493.761052
IRR 1567615.623977
ISK 143.998889
JEP 0.86269
JMD 179.591272
JOD 0.808274
JPY 184.289059
KES 147.646835
KGS 99.696357
KHR 4577.267802
KMF 494.7783
KPW 1026.03877
KRW 1752.35789
KWD 0.35298
KYD 0.95029
KZT 553.271497
LAK 25028.996263
LBP 102117.195723
LKR 383.315495
LRD 207.715883
LSL 18.744002
LTL 3.366249
LVL 0.689601
LYD 7.319797
MAD 10.692496
MDL 20.218652
MGA 4823.143858
MKD 61.655153
MMK 2393.462693
MNT 4081.628965
MOP 9.21159
MRU 45.50872
MUR 54.39115
MVR 17.613684
MWK 1977.361744
MXN 19.968844
MYR 4.661976
MZN 72.849226
NAD 18.744002
NGN 1572.118647
NIO 41.963287
NOK 11.298147
NPR 171.247607
NZD 2.018041
OMR 0.438339
PAB 1.140368
PEN 3.888378
PGK 5.004156
PHP 69.892026
PKR 317.357353
PLN 4.286982
PYG 6959.856149
QAR 4.156517
RON 5.241007
RSD 117.374218
RUB 88.643027
RWF 1670.006102
SAR 4.282215
SBD 9.179569
SCR 16.010093
SDG 684.025293
SEK 11.076665
SGD 1.475445
SHP 0.851157
SLE 28.272923
SLL 23906.128197
SOS 651.724331
SRD 42.546623
STD 23596.580793
STN 24.496082
SVC 9.97736
SYP 126.011304
SZL 18.733003
THB 38.047216
TJS 10.553828
TMT 3.990149
TND 3.379908
TOP 2.74495
TRY 53.154875
TTD 7.749624
TWD 36.346152
TZS 2989.981828
UAH 51.183064
UGX 4185.220382
USD 1.140043
UYU 45.774685
UZS 13697.40965
VES 707.684868
VND 29983.121282
VUV 136.749145
WST 3.175585
XAF 655.852087
XAG 0.019615
XAU 0.000282
XCD 3.081022
XCG 2.055071
XDR 0.816787
XOF 655.849211
XPF 119.331742
YER 272.042682
ZAR 18.768497
ZMK 10261.75068
ZMW 20.541075
ZWL 367.093263
  • RYCEF

    0.7000

    18.7

    +3.74%

  • CMSC

    -0.0860

    21.96

    -0.39%

  • RELX

    0.1600

    31.08

    +0.51%

  • RIO

    -0.4850

    94.625

    -0.51%

  • BP

    -0.3600

    37.36

    -0.96%

  • NGG

    -0.1200

    83.3

    -0.14%

  • AZN

    2.9200

    188.6

    +1.55%

  • GSK

    0.6900

    52.58

    +1.31%

  • RBGPF

    0.0000

    61.3

    0%

  • BTI

    0.3150

    62.795

    +0.5%

  • VOD

    -0.0350

    13.825

    -0.25%

  • CMSD

    -0.1000

    21.83

    -0.46%

  • BCE

    -0.2100

    22.99

    -0.91%

  • JRI

    0.1700

    12.75

    +1.33%

  • BCC

    0.0400

    79.8

    +0.05%

As AI data scrapers sap websites' revenues, some fight back
As AI data scrapers sap websites' revenues, some fight back / Photo: PATRICIA DE MELO MOREIRA - AFP

As AI data scrapers sap websites' revenues, some fight back

A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies -- all without permission or payment, upending the online economy.

Text size:

Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

B.Gopalan--DT