Dubai Telegraph - As AI data scrapers sap websites' revenues, some fight back

EUR -
AED 4.400314
AFN 77.881147
ALL 96.814682
AMD 454.172547
ANG 2.144834
AOA 1098.729057
ARS 1730.745379
AUD 1.690809
AWG 2.158218
AZN 2.042821
BAM 1.959124
BBD 2.414607
BDT 146.498583
BGN 2.012185
BHD 0.451686
BIF 3551.270346
BMD 1.198178
BND 1.512786
BOB 8.284057
BRL 6.227767
BSD 1.198839
BTN 110.119313
BWP 15.686617
BYN 3.408698
BYR 23484.290754
BZD 2.411101
CAD 1.620506
CDF 2683.918435
CHF 0.917625
CLF 0.026186
CLP 1033.955485
CNY 8.33291
CNH 8.319544
COP 4397.74497
CRC 595.019577
CUC 1.198178
CUP 31.75172
CVE 110.45288
CZK 24.298095
DJF 213.48135
DKK 7.46704
DOP 75.429249
DZD 154.714803
EGP 56.109364
ERN 17.972671
ETB 186.414713
FJD 2.618439
FKP 0.869432
GBP 0.866031
GEL 3.229063
GGP 0.869432
GHS 13.103234
GIP 0.869432
GMD 87.466656
GNF 10519.982279
GTQ 9.197645
GYD 250.81559
HKD 9.348245
HNL 31.637684
HRK 7.534031
HTG 156.996396
HUF 379.901498
IDR 20117.410294
ILS 3.70231
IMP 0.869432
INR 110.191403
IQD 1570.47137
IRR 50473.252638
ISK 144.787493
JEP 0.869432
JMD 187.928883
JOD 0.849516
JPY 183.431525
KES 154.589225
KGS 104.78044
KHR 4819.23774
KMF 493.649685
KPW 1078.290613
KRW 1708.440222
KWD 0.367097
KYD 0.999099
KZT 604.037467
LAK 25827.933287
LBP 107356.012463
LKR 371.221447
LRD 221.78726
LSL 19.062325
LTL 3.537908
LVL 0.724766
LYD 7.528744
MAD 10.839493
MDL 20.104197
MGA 5349.076452
MKD 61.600431
MMK 2516.151613
MNT 4280.660921
MOP 9.634588
MRU 47.858006
MUR 54.097074
MVR 18.523892
MWK 2078.827408
MXN 20.521616
MYR 4.695675
MZN 76.395464
NAD 19.062325
NGN 1673.830778
NIO 44.115408
NOK 11.440744
NPR 176.1907
NZD 1.969217
OMR 0.460694
PAB 1.198834
PEN 4.011306
PGK 5.131772
PHP 70.569096
PKR 335.375273
PLN 4.204707
PYG 8050.626917
QAR 4.358915
RON 5.095247
RSD 117.400304
RUB 91.721686
RWF 1749.067864
SAR 4.49358
SBD 9.678495
SCR 17.176644
SDG 720.702641
SEK 10.541367
SGD 1.511975
SHP 0.898944
SLE 29.118971
SLL 25125.194783
SOS 683.960562
SRD 45.640962
STD 24799.867551
STN 24.541951
SVC 10.489843
SYP 13251.340431
SZL 19.054412
THB 37.190847
TJS 11.203157
TMT 4.193623
TND 3.428532
TOP 2.884925
TRY 52.020807
TTD 8.136841
TWD 37.52634
TZS 3043.372756
UAH 51.245655
UGX 4292.283258
USD 1.198178
UYU 45.36717
UZS 14504.672432
VES 429.518272
VND 31224.521278
VUV 143.387393
WST 3.265465
XAF 657.071937
XAG 0.010054
XAU 0.000214
XCD 3.238136
XCG 2.160575
XDR 0.817187
XOF 657.06919
XPF 119.331742
YER 285.649307
ZAR 18.761325
ZMK 10785.036009
ZMW 23.826529
ZWL 385.812859
  • SCS

    0.0200

    16.14

    +0.12%

  • RBGPF

    0.0000

    82.4

    0%

  • CMSD

    -0.0457

    24.0508

    -0.19%

  • BTI

    -0.1800

    60.16

    -0.3%

  • BCC

    -0.8900

    80.85

    -1.1%

  • GSK

    -0.7000

    50.1

    -1.4%

  • BCE

    -0.2500

    25.27

    -0.99%

  • CMSC

    -0.1000

    23.7

    -0.42%

  • RELX

    -0.9800

    37.38

    -2.62%

  • AZN

    -2.3800

    93.22

    -2.55%

  • NGG

    0.3700

    84.68

    +0.44%

  • RIO

    0.4600

    93.37

    +0.49%

  • BP

    0.0800

    37.7

    +0.21%

  • RYCEF

    -0.5500

    16.6

    -3.31%

  • VOD

    0.0700

    14.57

    +0.48%

  • JRI

    -0.6900

    12.99

    -5.31%

As AI data scrapers sap websites' revenues, some fight back
As AI data scrapers sap websites' revenues, some fight back / Photo: PATRICIA DE MELO MOREIRA - AFP

As AI data scrapers sap websites' revenues, some fight back

A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies -- all without permission or payment, upending the online economy.

Text size:

Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

B.Gopalan--DT