Dubai Telegraph - As AI data scrapers sap websites' revenues, some fight back

Photo: PATRICIA DE MELO MOREIRA - AFP

A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies, all without permission or payment, and upending the online economy in the process.

Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

B.Gopalan--DT