On 04/03/2026 04:16, Nick Holland wrote:
> cvsweb does not care about your browser.  It only cares about the IP
> address.

maybe it should care, tho

simple user agent filtering will already go far, i have several systems
under heavy bot traffic and most AI bots use either specific user agent
headers, or broken/empty ones

create a matching list of most common browser user-agents headers, and
match that against the traffic

on desktops webkit will lead the charge in that regard,
safari/chrome/opera will account to some 70-75% of the traffic, with
firefox following with 8-10%

on mobile webkit will show 85-90%, firefox around 6%

...which means you will only need several regex entries to handle almost
all the legit traffic

nginx map dynamic regex will go far here, nginx map proves to be
extremely high performing

speaking from experience here, running a project dealing with 1000-25000
RPS during calmer hours up to 10k RPS during busy hours every day, user
agent whitelisting helps a lot






Reply via email to