Re: Dealing with bot traffic - what tools/services are you using?

From: George Macgregor <George.Macgregor_at_nyob>
Date: Fri, 1 May 2026 07:48:53 +0000
To: CODE4LIB_at_LISTS.CLIR.ORG
Hi Shannon,

At our institution we have employed a great diversity of techniques, including experimentation with Anubus and Cloudflare Turnstile — depending on the nature of the service. But, in general, the approach has been a combination of a) monitoring traffic more effectively and blocking where necessary, b) rate limiting, and c) web application firewalls (WAFs), as above.

On WAFs: We have deployed a WAF on only two services, and only out of necessity. And I would say that they have indeed been effective, though they are not a silver bullet and the other techniques mentioned remain necessary. I dislike using WAFs because it is difficult to anticipate honourable bot actors in our domain, and it is highly likely that indiscriminate blocking of 'welcome' bots is frequently occurring, despite the ingenuity of the WAF approach. My understanding is that Cloudflare Turnstile is free to universities (at least in the UK); my institution has a bunch of other Cloudflare products, so I imagine we pay for Turnstile indirectly. Anubus is OS, as you note. So is 'Go Away', though I don't have experience of using it — it might be superior to Anubus.

Apologies if you have already checked it out, but COAR have a website dedicated to 'dealing with the bots'. It includes some useful advice, suggested strategies, and suggested solutions, vendors, etc. See: https://dealing-with-bots.coar-repositories.org/

Hope some of this helps!

Cheers

George

--

Dr George Macgregor | Assistant Director – Digital Library

Information Services | University of Glasgow

Web: https://purl.org/g3om4c  | Fediverse: @g3om4c@code4lib.social<https://code4lib.social/@g3om4c>
[ORCID logo]orcid.org/0000-0002-8482-3973<http://orcid.org/0000-0002-8482-3973>

Mobile: +44 (0)7977 858281
--

The University of Glasgow is a registered Scottish charity: Registration Number SC004401

--

________________________________
From: Code for Libraries <CODE4LIB_at_LISTS.CLIR.ORG> on behalf of Lucky, Shannon <shannon.lucky_at_USASK.CA>
Sent: Thursday, April 30, 2026 19:35
To: CODE4LIB_at_LISTS.CLIR.ORG <CODE4LIB_at_LISTS.CLIR.ORG>
Subject: [CODE4LIB] Dealing with bot traffic - what tools/services are you using?

[Some people who received this message don't often get email from shannon.lucky@usask.ca. Learn why this is important at https://aka.ms/LearnAboutSenderIdentification ]

Hi all,

I am curious what methods folks are using to deal with aggressive AI harvesting on websites - particularly digital project sites. Many of our servers are being hammered with traffic that impacts our service delivery and the methods we have been using cannot keep up.

Specifically I am wondering who is using services like Cloudflare or implementing OS solutions like Anubis, or are you using something else? I'm gathering information about what services or methods are being using at academic libraries hosting DH/digital projects so we can look at investing in some kind of service or process solution.

What are you using? Are you happy with it? What kinds of costs are associated?)


Shannon Lucky, MLIS MA

she/her

Associate Librarian


University of Saskatchewan

University Library

Ph: 306-966-2740

ORCID 0000-0001-9134-8560<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Forcid.org%2F0000-0001-9134-8560&data=05%7C02%7CGeorge.Macgregor%40glasgow.ac.uk%7Cc38c746aa30b4c5e961208dea6e75509%7C6e725c29763a4f5081f22e254f0133c8%7C1%7C0%7C639131709662824787%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C80000%7C%7C%7C&sdata=G53Egsj1Plb2pkWMrgtvYoGIlIuCwQWq%2BpuNJ%2FG0MoE%3D&reserved=0<https://orcid.org/0000-0001-9134-8560>>


I acknowledge that I live and work on Treaty 6 Territory and the Homeland of the Métis. We pay our respect to the First Nations and Métis ancestors of this place and reaffirm our relationship with one another.


image.png
Received on Fri May 01 2026 - 03:48:55 EDT