If you ever see a cute anime girl asking you for a short break when visiting our forum,

then this might be the reason:

I absolutely adore the behavior of AI companies. DDoSing infra with their scraping.
Illegally downloading copyrighted content for training.
Openly demanding that copyrighted content should be free for them “or their business model wouldn’t work”.

Oh, and now the public should also build them AI datacenters, preferably free of charge for them.

8 Likes

I wouldn’t be mad if we put that app in front of the forum.

1 Like

@darix
I won’t ‘heart’ your post. Their arrogance just makes me angry and sad.

1 Like

If anyone is interested, the anime girl in question is GitHub - TecharoHQ/anubis: Weighs the soul of incoming HTTP requests using proof-of-work to stop AI crawlers

GNOME’s GitLab instance is already using it, and I’ve been seeing it pop up more and more.
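For anyone curious how the “weighing” works: the gist is a client-side proof-of-work challenge, where the browser brute-forces a nonce until a hash meets a difficulty target, which is cheap for one visitor but expensive at crawler scale. Here is a rough sketch of the idea in Python (my own illustration; Anubis’s actual challenge format, difficulty, and verification are implementation details not reproduced here):

```python
import hashlib
import itertools

def solve_pow(challenge: str, difficulty: int = 4) -> tuple[str, int]:
    """Brute-force a nonce so that SHA-256(challenge + nonce) starts with
    `difficulty` hex zeroes. Returns (nonce, number_of_hashes_tried).
    This mirrors Anubis-style proof-of-work in spirit only."""
    target = "0" * difficulty
    for attempts, nonce in enumerate(itertools.count(), start=1):
        digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
        if digest.startswith(target):
            return str(nonce), attempts

# At difficulty 4 (16 zero bits) this needs ~65,000 hashes on average,
# in the same ballpark as the "over 57,000 hashes" mentioned below.
nonce, attempts = solve_pow("example-challenge", difficulty=4)
print(f"found nonce {nonce} after {attempts} hashes")
```

The server only has to do a single hash to verify the answer, which is what makes the asymmetry work against bulk crawlers.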

6 Likes

It seems to also block search engines from indexing you. That makes it look like a horrible solution, and counterproductive to the goals of this forum.

It’s a good solution for a lot of services, but I wonder what effect it would have here, especially when it comes to attracting new users.

1 Like

Interesting, because I’ve just been told the following on another site:

I agree, but it may come to this if no other defense is found. Forums like this one are probably not yet targeted to a large extent because harvesting content from other places is a higher priority, but it will arrive here too eventually.

That said, the most toxic example in the article linked above is submitting LLM-generated nonsense bug reports to FOSS projects with bounties, expecting that one of them will get through. This just ties up crucial developer time, and makes it difficult to get actual bug reports through. If a malicious actor wanted to wreck key software infrastructure, this is one thing they would do.

4 Likes

Yeah, of course, if there’s no other solution and the crawling starts saddling pixls.us with huge bandwidth costs, it is what it is. I guess user outreach can be improved by means other than search engines.

Lots of FOSS communities already suffer from uniform userbases that end up forming echo chambers; it would be a shame for this forum to become like that.

1 Like

There is no danger of that as long as we can invent new tonemappers :wink:

3 Likes

Well, you can also start talking to your representatives in your parliaments and demand that AI companies be regulated, because their current free-for-all attitude in all regards is the problem. Yes, tools like Anubis will probably just delay them; I also wouldn’t be surprised if they start solving the JS puzzle themselves, and then we’re back to square one.

So if you want to help with the real solution: speak to your representatives and make your voice heard. Depending on the country you live in, this might also be an important step to take for other reasons.

3 Likes

What led you to this conclusion?

2 Likes

Can we conclude that “other traffic” is also crawlers? If so, that’s an insane amount relative to real users.

I would guess yes, because AI crawlers are known to use residential IPs and to send no User-Agent header.
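If you want to estimate that share from your own server logs, a quick sketch is below. It assumes the common Combined Log Format, where a `-` in the last quoted field means no User-Agent was sent; the regex and function names are my own, not anything from this thread:

```python
import re

# Matches the tail of a Combined Log Format line:
# ... "<request>" <status> <bytes> "<referer>" "<user-agent>"
TAIL = re.compile(r'"[^"]*" \d{3} \S+ "[^"]*" "(?P<ua>[^"]*)"\s*$')

def missing_ua_share(lines):
    """Return (requests_without_user_agent, parsed_requests)."""
    missing = total = 0
    for line in lines:
        m = TAIL.search(line)
        if not m:
            continue  # skip lines that don't look like Combined Log Format
        total += 1
        if m.group("ua") in ("", "-"):
            missing += 1
    return missing, total

sample = [
    '1.2.3.4 - - [01/Jan/2025:00:00:00 +0000] "GET / HTTP/1.1" 200 512 "-" "Mozilla/5.0"',
    '5.6.7.8 - - [01/Jan/2025:00:00:01 +0000] "GET / HTTP/1.1" 200 512 "-" "-"',
]
print(missing_ua_share(sample))  # -> (1, 2)
```

A missing User-Agent is only a rough signal, of course; plenty of crawlers send fake browser strings, so this gives a lower bound at best.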

The reason we don’t feel it so much is that we have a bit of headroom on our server.

1 Like

Ignorance :wink:

1 Like

Malicious compliance: we let them in, but we serve the code sorted.
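The joke is trivially implementable; a toy sketch (my own, not something any project actually ships):

```python
def serve_sorted(source: str) -> str:
    """Technically still serves every line of the code --
    just alphabetically, which makes it useless as training data."""
    return "\n".join(sorted(source.splitlines()))

print(serve_sorted("import os\ndef main():\n    return 1"))
```

Every byte is delivered as requested; it just happens to arrive in an order only a crawler would accept.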

3 Likes

For my own personal infrastructure, which is also getting railed by LLMs, I’ve been looking at https://iocaine.madhouse-project.org/

2 Likes

I tried Anubis, and it only shows up for less than half a second, and only the first time you visit. In that time it computed over 57,000 hashes to satisfy the proof-of-work requirement.

What’s the unit on the y axis?

Page views. The x-axis is the date.

1 Like

IRL LOL

1 Like