Top
Best
New

Posted by flexagoon 1 day ago

You Don't Need Anubis(fxgn.dev)
171 points | 163 commentspage 3
hubraumhugo 1 day ago|
What's the endgame of this increasing arms race? A gated web where you need to log in everywhere? Even more captchas and Cloudflare becoming the gateway to the internet? There must be a better way.

We're somehow still stuck with CAPTCHAs (and other challenges), a 25 years old concept that wastes millions of human hours and billions in infra costs [0].

[0] https://arxiv.org/abs/2311.10911

DecoySalamander 1 day ago|
Maybe a web where you provide your credit card number upfront and pay for each outgoing request.
utopiah 1 day ago||
"Yes, it works, and does so as effectively as Anubis, while not bothering your visitors with a 10-second page load time."

Cool... but I guess now we need a benchmark for such solutions. I don't know the author, I roughly know the problem (as I self host and most of my traffic now comes from AI scrapper bots, not the usual indexing bots or, mind you, humans) but when they are numerous solutions to a multi-dimensional problem I need a common way to compare them.

Yet another solution is always welcomed but without being able to efficiently compare it doesn't help me to pick the right one for me.

1vuio0pswjnm7 22 hours ago||
Looks Anubis allows Internet Archive's bot sometimes

Allowed

https://web.archive.org/web/20250419222331if_/https://anubis...

https://web.archive.org/web/20250419222331if_/https://anubis...

https://web.archive.org/web/20250420152651if_/https://anubis...

https://web.archive.org/web/20250420152651if_/https://anubis...

Blocked

https://web.archive.org/web/20250424235436if_/https://anubis...

https://web.archive.org/web/20250510230703if_/https://anubis...

https://web.archive.org/web/20250511110518if_/https://anubis...

https://web.archive.org/web/20250630101240if_/https://anubis...

https://web.archive.org/web/20250808051637if_/https://anubis...

https://web.archive.org/web/20250909160601if_/https://anubis...

Allowed

https://web.archive.org/web/20250921062513if_/https://anubis...

Razengan 1 day ago||
How else would I inter my dead and make sure they get to the afterlife?
echelon 1 day ago||
This whole thing is pointless.

OpenAI Atlas defeats all of this by being a user's web browser. They got between you and the user you're trying to serve content, and they slurp up everything the user browses to return it back for training.

The firewall is now moot.

The bigger AI company, Google, has already been doing this for decades. They were the middlemen between your reader and you, and that position is unassailable. Without them, you don't have readers.

At this point, the only people you're keeping out with LLM firewalls are the smaller players, which further entrenches the leaders.

OpenAI and Google want you to block everybody else.

happyopossum 1 day ago||
> Google, has already been doing this for decades

Do you have any proof, or even circumstantial evidence to point to this being the case?

If chrome actually scraped every site ever you visited and sent it off to Google, it’d be trivially simple to find some indication of that in network traffic, or heck - even chromium code.

echelon 1 day ago||
Sorry, I mean they're between the customer relationship.

Who would dare block Google Search from indexing their site?

The relationship is adversarial, but necessary.

ranger_danger 1 day ago|||
> Who would dare block Google Search from indexing their site?

People who don't want to be indexed. Or found at all.

Dylan16807 1 day ago|||
Is it confirmed that site loads go into the training database?

But for anyone whose main concern is their server staying up, Atlas isn't a problem. It's not doing a million extra loads.

heavyset_go 1 day ago||
> Is it confirmed that site loads go into the training database?

Would you trust OpenAI if they told you it doesn't?

If you would, would you also trust Meta to tell you if its multibillion dollar investment was trained on terabytes of pirated media the company downloaded over BitTorrent?

viraptor 1 day ago||
We don't have to trust it or not. If there's such claim, surely someone can point at least at a pcap file with an unknown connection. Or at some decompiled code. Otherwise it's just a conspiracy theory.
_flux 1 day ago|||
Surely the data must go to the OpenAI servers, how else would they use LLMs on it? We cannot see if that data ends up in the training data.

Personally I would just believe what they say for the time being; there would be backlash in doing something else, possibly legal one.

viraptor 1 day ago||
I think the original claim was about something different. "Is it confirmed that site loads..." - I read it as the author taking about general browsing, not just explicit questions, with the context of the page.
heavyset_go 1 day ago|||
Whatever is included in context is in OpenAI's control from that point forward, and you just have to trust them not to do anything with it.

That isn't a conspiracy theory, it's fundamentally how interfacing with 3rd party hosted LLMs works.

seba_dos1 1 day ago|||
The "LLM firewall" is usually there so AI companies don't take the server down, not to prevent model training (that's just an acceptable side effect).
_flux 1 day ago|||
As I understand it, the main point of Anubis is to reduce the costs caused by (AI company) bots and agent-generated load is still a lot less than simply spidering the complete web site; it might actually be quite close to what a user would manually browse.

Unless the user asked something that just needs visiting many pages, I suppose. For example, Google Gemini was pretty helpful in finding out the typical price ranges and dishes a local shopping centre coffee shops have, as the information was far from being just in a single page..

masklinn 1 day ago||
> This whole thing is pointless.

It's definitely pointless if you completely miss the point of it.

> OpenAI Atlas defeats all of this by being a user's web browser. They got between you and the user you're trying to serve content, and they slurp up everything the user browses to return it back for training.

Cool. Anubis' fundamental purpose is not to prevent all bot access tho, as clearly spelled in its overview:

> This program is designed to help protect the small internet from the endless storm of requests that flood in from AI companies.

OpenAI atlas piggybacking on the user's normal browsing is not within the remit of anubis, because it's not going to take a small site down or dramatically increase hosting costs.

> At this point, the only people you're keeping out with LLM firewalls are the smaller players

Oh no, who will think of the small assholes?

herpessimplex10 1 day ago||
[flagged]
viraptor 1 day ago||
Or you know, they may be serious and grown up enough to not be bothered by an image of an anime cat girl. There's really nothing there to be offended by.
pkal 1 day ago|||
I am also not a fan, but since a recent discussion on HN I have been thinking about what I don't like about it.

The conclusion I have come to is more general: I just personally don't like nerd-culture. Having an anime girl (but the same would be the case for an star trek, my litte pony/furry, etc.-themed site) signifies a kind of personality that I don't feel comfortable about, mainly due to the oblivious social awkwardness, but also due to "personal" habits of some people you meet in nerdy spaces. I guess there is something about the fact of not distinguishing between a public presentation and personal interests that this is reminiscent of. For instance: A guy can enjoy model trains, sure, but he is your college at work and always just goes on about model trains (without considering if this interests you or not!), then the fact that this subsumes his personality becomes a bore or even just plain unpleasant. This is not to generalize that this is the case for everyone in these spaces, I am friends with nerdy-people on an individual basis, but I am painfully aware that I don't fit in perfectly like the last piece of a jigsaw puzzle -- and increasingly have less of a desire to do so.

So for me at least this is not offence, but in addition to the above also some kind of reminder that there is a fundamental rift in how decency and intersocial relations are imagined between the people who share my interests and me, which does bother me. Having that cat-girl appear every time I open some site reminds me of this fact.

Does any of this make sense? The way you and others phrase objections to the objections makes it seem like anyone who dislikes this is some obsessive or bigoted weirdo, which I hope I don't make the impression of. (Hit me up, even privately off-HN if anyone wants to chat about this, especially if you disagree with me, this is a topic that I find interesting and want to better understand!)

GaryBluto 1 day ago|||
Thank you for putting into words what I could not. "Nerd culture" has fallen a long way since the early 2000s and your quote about decency and intersocial relations spoke to me.

It is really bizarre how everybody tries to make it about politics. While I may or may not disagree with a developer's politics, it's their conduct that I care about, and I associate those who express appreciation for anime at every possible opportunity with especially poor conduct and have yet to have encounter an exception to the rule.

The amount of flags I'm seeing for posts simply expressing disagreement on the matter is quite worrying.

pkal 2 hours ago||
I am glad my comment resonated with you! There are probably people here with political motivations (on both sides), but it is encouraging to hear that there is a value in the direction of my exposition.
adlinb 1 day ago||||
[flagged]
GaryBluto 1 day ago||||
> grown up enough to not be bothered by an image of an anime cat girl

This is some real four-dimensional chess. "You're the childish one for not wanting Japanese cartoons on software projects!"

tavavex 1 day ago|||
It's not just "not wanting" something, the original comment wasn't nearly that mild. It's being enraged by it to the extent of making petty, low, personal attacks on someone who steps just barely out of line of their preferred behaviors.

This whole comment chain solidifies my opinion that disgust is one of the driving human emotions. People feel initial, momentary disgust and only then explain it using the most solid justification that comes to mind, but the core disgust is unshakable and precedes all explanations. No one here has managed to procure any argument for why seeing a basic sketch in a certain style is objectively bad or harmful to someone, only that it's "weird" in some vague way. Basically, it goes against the primal instinct of how the person thinks the world "ought to work", therefore it's bad, end of story.

To me it seems obvious. The anime art style is in, especially in Western countries, especially^2 among younger people, and especially^3 among techy people. Ergo, you may see a mascot in that style once in a while in hobbyist projects. Doesn't seem like anything particularly objectionable to me.

krapp 1 day ago||
It isn't a driving human emotion. The world is full of serious businesses that use "cute" icons or employ anime-styled elements, and most people don't care. It's just a subset of tech and CS people who feel compelled to register their disdain at every opportunity.

And yet if you bring up that "Gimp" is an unserious name, or anything about RMS that's far more problematic than a cute cartoon, that same subset will defend it to the death.

tavavex 3 hours ago|||
> It isn't a driving human emotion. The world is full of serious businesses that use "cute" icons or employ anime-styled elements, and most people don't care. It's just a subset of tech and CS people who feel compelled to register their disdain at every opportunity.

I'm not talking about anything this narrow - disgust manifests itself in every facet of human life, this comment thread is just one minor example. Lots of laws were initially justified by some form of disgust. There's practically an infinite number of examples of people feeling immediately "icky" about something (absolutely anything, applicable at any point in history) and only then trying to create a justification for these feelings, basically working backwards to make their instincts seem more reasonable and palatable. You can easily spot it because when one justification is taken down, another one takes its place, and it can go on for eternity - justifications are temporary, the only thing that's permanent is the unshakable feeling of correctness and righteousness about the initial disgust. Notice how OP's argument about standards and professionalism was quickly swapped out for a more dignified version of "well, I just feel like everyone who likes XYZ is a sweaty anti-social manchild", as soon as arguing the original point became more difficult?

GaryBluto 1 day ago|||
I'd argue there's a difference between a funny mascot or punny name (that have been used in professional environments) and a mascot that looks like a child and is designed to look "cute" or "silly" to a fandom mostly comprising of eccentric (in a bad way) grown men. I don't think I've ever seen a developer on the internet who publicly enjoys anime who didn't act neurotically or childishly, although that's just anecdotal.

> The world is full of serious businesses that use "cute" icons or employ anime-styled elements

I can't think of any outside of Japan.

Mawr 1 day ago||
> a mascot that looks like a child and is designed to look "cute" or "silly" to a fandom mostly comprising of eccentric (in a bad way) grown men.

Go ahead, say it.

GaryBluto 21 hours ago||
What? That anime fans are mostly grown men? Is this controversial? There's a reason the neckbeard stereotype of body pillows and paedophilia and whatnot exists.

You're aggressively trying to imply I'm saying something here and I'm not sure what.

pyrale 1 day ago|||
> This is some real four-dimensional chess. "You're the childish one for not wanting Japanese cartoons on software projects!"

I would be OK with the sentence if it was "You're the childish one for not wanting Japanese cartoons on your software projects!".

As you wrote it, well, that's none of your business.

herpessimplex10 1 day ago|||
[flagged]
tomhow 17 hours ago|||
Your comments are almost all of the kind that the guidelines ask us to avoid – i.e., combative and escalatory, rather than curious and kind. Please take a moment to read the guidelines and make an effort to observe the if you want to participate here.

https://news.ycombinator.com/newsguidelines.html

viraptor 1 day ago||||
If those 2 are in any way close in your mind... maybe think about why a cartoon character and explicit sexual image seem related? There's nothing weird about an anime cat girl for me, but also I don't see any relation to anything sexualised in it.
mariusor 1 day ago||||
Mister simplex, having a preference for small internet to crumble under the shit LLM companies are piling upon it to being exposed once in a while to an inoffensive image of an anime character would make for a very worrisome direction of my priorities if I were you.
petesergeant 1 day ago||||
> an anime cat-girl

> an animated cock and balls

You don't see a difference between these things?

GaryBluto 1 day ago||
While it's a bit of an extreme comparison, they're both weird, unprofessional imagery associated with things you wouldn't wan to associate with a software project.
lakecresva 1 day ago|||
If it were just the imagery I don't think this would be such a huge flashpoint relative to something like tux or octocat (the github mascot).
squigz 1 day ago||
Because it's really about the type of people (they think) watch anime, and their inability to separate this preconception from reality.
petesergeant 1 day ago|||
I guess I'm not seeing the special category that an anime cat girl sits in. Is there some kind of sex implication I'm just not aware of? Linux has a penguin, FreeBSD has a devil(!), OpenBSD has a blowfish, Go has the weird Gopher thing, Gnome has a foot...

Wikipedia suggests that there's an association with queer and trans youth, is that what's meant to make the cock-and-balls comparison work? But it also says it has a history back to 17th century Japan...

spinf97 1 day ago|||
I don't, actually. Why is it weird?
spinf97 1 day ago||
Only man-children can be bothered by anime catgirls enough to post about it on on hacker news, so it says more about you tbh
GaryBluto 1 day ago|||
When did this notion that caring about things and wanting things to be professional is bad, or makes you a "man-child"? That would mean that practically everybody in human history has been a man-child. It feels like the whole world (even formerly professional areas) have decided to be casual and it's frustrating to those who think things matter.
DecoySalamander 1 day ago|||
Adhering to a narrow definition of a "professional" look signifies immaturity, stemming from a desire for approval from a stereotypically "adult" third party. Personally, I wouldn't take seriously anyone who has a problem with Anubis but doesn't blink when presented with people drawn in the corporate Memphis style.
GaryBluto 1 day ago|||
>Adhering to a narrow definition of a "professional" look signifies immaturity, stemming from a desire for approval from a stereotypically "adult" third party.

I'm not looking for anyone's approval. If I was, I wouldn't be publicly disagreeing with people on an internet forum, would I? Relax with your armchair psychology.

> Personally, I wouldn't take seriously anyone who has a problem with Anubis but doesn't blink when presented with people drawn in the corporate Memphis style.

I don't like either and find them both ugly.

redwall_hp 1 day ago|||
Whole countries of comparable size to the US happily put similar mascots all over their products, and pay other companies big money to use their characters. They're all over busses and billboards. The Korean ramen brand I buy has Kpop Demon Hunters on it now. (And Buldak usually has their little chicken dude.) Casio and Fender have expensive products with Hatsune Miku on them...which has been used in ad campaigns by petroleum and rail companies in Japan.

American corporate culture is dehumanizing and dystopian, not a standard for professionalism.

pyrale 1 day ago||||
> wanting things to be professional

Nothing says "professional" like starting a debate on HN about the weirdness of the mascot of a free software project, likely for political reasons.

GaryBluto 1 day ago|||
> likely for political reasons.

You're engaging in bad faith here. Nobody has brought up politics at all. If an almost identical clone of myself (with the same opinions on everything but mascots) developed a software project with an anime mascot I'd still disapprove.

adlinb 1 day ago|||
[flagged]
ncruces 1 day ago||||
Why is this worse than the octocat?
brendoelfrendo 1 day ago||||
I want less professionalism, thanks. I think the idea that everything needs to be an emotionless product has been largely harmful to the internet as a place of community and expression.
GaryBluto 1 day ago||
Professional =/= emotionless or product. I'd argue that early Linux, with all of Linus' rants, was more professional than most companies today.

I suppose it all comes down to what your definition of "professional" is.

watwut 1 day ago|||
> That would mean that practically everybody in human history has been a man-child.

I would argue that this statement is blatantly false. Currently, most people really do not care about anubis anime cat girl icon which is actually fairly tame and boring picture.

In history, people used all kind of images for professional things, including stuff they found funny or cute.

adlinb 1 day ago|||
[flagged]
GaryBluto 1 day ago||
[flagged]
GauntletWizard 1 day ago|
Anubis's design is copied from a great botnet protection mechanism - You serve the Javascript cheaply from memory, and then the client is forced to do expensive compute in order to use your expensive compute. This works great at keeping attackers from attempting to waste your time; It turns a 1:1000 amplification in compute costs into a 1000:1.

It is a shitty, and obviously bad solution for preventing scraping traffic. The goal of scraping traffic isn't to overwhelm your site, it's to read it once. If you make it prohibitively expensive to read your site even once, nobody comes to it. If you make it only mildly expensive, nobody scraping cares.

Anubis is specifically DDOS protection, not generally anti-bot, aside from defeating basic bots that don't emulate a full browser. It's been cargo-culted in front of a bunch of websites because of the latter, but it was obviously not going to work for long.

viraptor 1 day ago||
> The goal of scraping traffic isn't to overwhelm your site, it's to read it once.

If the authors of the scrapers actually cared about it, we wouldn't have this problem in the first place. But today the more appropriate description is: the goal is to scrape as much data as possible as quickly as possible, preferably before your site falls over. They really don't care and side effects beyond that. Search engines have an incentive to leave your site running. AI companies don't. (Maybe apart from perplexity)

reppap 1 day ago|||
First of all Anubis isn't meant to protect simple websites that gets read once. It's meant for things like a gitlabs instance where AI bots are indexing every single commit of every single file. Resulting in thousands of not millions of reads. And reading an Anubis page once isn't expensive either. So I don't really understand what point you are trying to make as the premise seems completely wrong.
purple_turtle 1 day ago|||
Some people deployed Anubis not to stop scraping, but to stop scraping the same page multiple times per second.
ranger_danger 1 day ago||
> Anubis is specifically DDOS protection

Only well-behaved application-level DDoS protection maybe.

A real network-level attack in the many-gigabits/sec+ will not be stopped by anubis itself.