AI agent bankrupted their operator while trying to scan DN42

Posted by xiaoyu2006 13 hours ago

AI agent bankrupted their operator while trying to scan DN42(lantian.pub)

1289 points | 468 comments

J0nL 5 hours ago|

Anyone remember the XZ and Jia Tan situation awhile back?

https://lore.kernel.org/lkml/20240320183846.19475-1-lasse.co...

I can't quite put my finger on why but the entire time I was reading this I kept thinking back to that. It's entirely possible the actual targets were the volunteers and everything else was superfluous or tertiary. It's also an exception that proves the rule with regard to Hanlon's Razor.

They even mentioned the stated goal of it was more or less pointless. I wouldn't be suprised if the "owner" they spoke with was still just the LLM. It stuck around for just long enough to convince everyone that they succeeded in suckering the LLM and had achieved all their stated objectives.

No more reason to investigate the incident at all and no need to question why literally nothing made any sense or how the owner could simultaneously be as inept as they were made out to be and able to afford all those resources while giving the LLM effectively a blank check.

It'll be interesting to see if the volunteers for this project are subjected to the same Zersetzung and psychological attacks as the XZ devs were.

zozbot234 4 hours ago||

LLMs are not that smart. The extremely surprising and concerning part of this whole story is that the agent reported that they proactively spun up 5 AWS instances with a combined 100Gps of network egress capacity. What they spent wasn't cheap by any means but the egress itself would've been a whole lot more, while DoS'ing the whole hobby network. Ultimately, wasting the agent's time instead of allowing the scan to go through probably saved this person a lot of money.

Now I kinda wonder what AI model this was. We've now heard of comparably "proactive" behaviors from Fable, but that's only just been released. The latest GPT perhaps? Some random local model?

jerf 3 hours ago|||

"The extremely surprising and concerning part of this whole story is that the agent reported that they proactively spun up 5 AWS instances with a combined 100Gps of network egress capacity."

Although given the agent was clearly in la-la land at that point I take that claim with a grain of salt.

If this was some bizarre and very ill-conceived scam, then that claim would be false.

Though even by scammer standards, the theory of mind that tells them that setting an AI to harass a bunch of grizzled network veterans and that they then they would open their wallets out of compassion for how allegedly poorly the harassment went for the harasser after that harassment is... not entirely congruent with reality.

100721 3 hours ago||

Maybe I’m just groggy with Friday Brain going on, but I’m having trouble understanding what you’re suggesting.

Do you think this was a scam attempt to extract money in the form of reparation donations?

jerf 3 hours ago||

I've seen some other suggestions of that idea in the full HN conversation, which I'm reacting to.

On the one hand I find it a bizarre approach to running a scam. On the other hand I'm having a hard time coming up with any theory of mind on my end as to why this person would solicit $5000+ from the people they just harassed. Sheer cluelessness does fit the facts, though.

adamrezich 57 minutes ago||

How about sheer panic after seeing the bill?

inigyou 3 hours ago||||

Could've rented a not so cheap 100Gbps server, hallucinated a few node addresses on it and asked it to please peer with this server to perform the scan at high speed. That would've wasted millions of dollars instead of mere thousands, but also cost a thousand for whoever did it.

100721 3 hours ago||

I’m just a lowly dev and don’t have experience with seeing the bills from cloud providers for a whole org.

Can you (or someone) shed some light to help me understand how this would ramp up to millions? Both for curiosity’s sake, and to make sure my self-deployed projects (0 AI, all manually configured) don’t bankrupt me.

inigyou 3 hours ago|||

AWS bandwidth is expensive as fuck. I think they're still pricing as $0.09 per GB?

Real wholesale bandwidth pricing is about a hundred times cheaper than that, and incoming bandwidth is often free. You could rent a server with 100Gbps connection, 10000TB/month outgoings cap (maybe), and have the AI spam packets to it, and mostly not reply to them. It would be expensive but not nearly as expensive as it would be for the guy on AWS.

Do some calculations: 100Gbps is 12.5 GBps which is about one dollar per second. Okay so maybe not millions of dollars but still a hundred thousand per day, while you are spending maybe 1000-3000 per month and cancelling after the first month.

PunchyHamster 37 minutes ago||

> Real wholesale bandwidth pricing is about a hundred times cheaper than that.

It is alsi worth mentioning that it is just billed different. You either pay per port (and can use entire bandwidth) or per 95th percentile of the monthly speed usage. So if your traffic isn't spiky but consistent, you'd pay even less than "hundred times cheaper".

Sayrus 3 hours ago|||

Excluding server costs, having that 100Gbps on egress can cost $50k a day. since it's a very high-margin product, AWS support would probably refund or reduce that to hundreds. Not sure how you get to millions either.

rescbr 3 hours ago||

Why would AWS refund 100Gbps on egress since the account actively used that bandwidth? AWS would not know if this is legitimate traffic, a (D)DoS or whatever...

At most I think you could negotiate CloudFront rates, but even then, the sob story would be if you had been DDoSed and got hit with this traffic and AWS failed to protect you from this attack. Actively creating the outbound traffic is something that I don't see how AWS would be sympathetic to providing any refunds.

odo1242 1 hour ago|||

AWS is known for refunding or partially refunding people if they accidentally rack up a huge bill in a short amount of time. They even reduce the bill in this case. (I do think reducing a bill in the tens of thousands to hundreds is unlikely though)

queenkjuul 49 minutes ago|||

I mean if this story is to be believed, AWS reduced the bill from 6500 to 1800.

I think developers accidentally racking up unexpected thousands in costs on their first AWS project is a pretty common phenomenon that their support has standard rules for handling.

daemonologist 3 hours ago||||

Opus 4.7 and 4.8 are also rather "proactive" - several times I've seen them try to inspect compiled binaries before there's even a problem, just to check that their changes are included (and if I let them do so they often get stuck down that rabbithole).

naasking 1 hour ago||||

> LLMs are not that smart.

They are smart, but they are not aware of the environment they're in, or any implicit context that someone whose doing a job carries with them, that's why all of that context has to be explicitly laid out in a prompt. When the context is provided, they are quite smart.

J0nL 3 hours ago|||

It was obviously being managed by a person or group. Between all the profiling of people and their IPs in IRC, which may or may not have been published by mistake, and all the other obvious contradictions it doesn't make any sense.

It was sophisticated enough to easily navigate the AI "tar pits" but reliably incompetent at just about everything else? Give me a break.

In order to profile people you first need to provoke a response from them. That's how you learn to manipulate them and that's all this experiment accomplished at the end of the day. If you've ever wondered why social media platforms have an affinity for inflammatory content now you know.

jetbalsa 2 hours ago|||

I suspect their tar pits where not very good, most models can tell when you are feeding it junk, I see this a good bit with ollama honeypots,

queenkjuul 45 minutes ago|||

If you click the link, the tarpit was surprisingly low effort and i could probably detect it as junk data with a short JavaScript snippet. Like the first 4 words on the page are some of the least-used words you'll ever encounter in English. It's just a dictionary on shuffle.

I'm actually more surprised a human network engineer looked at that tarpit and believed it would stop a modern LLM

mathgeek 4 hours ago|||

This certainly did strike me as a big scam. A few minutes in I was thinking "the LLM actor is going to ask for donations at some point here" and low and behold. There's the claim of debt, the call for pity, and the crypto address.

SSDD

palmotea 3 hours ago|||

> This certainly did strike me as a big scam. A few minutes in I was thinking "the LLM actor is going to ask for donations at some point here" and low and behold. There's the claim of debt, the call for pity, and the crypto address.

But that's a pretty dumb scam: act obnoxious then beg for (a lot of) money to compensate for your own mistakes? If that was the plan all along, it seems pretty incompetent. I'd expect a competent scammer to have a better understanding of psychology.

dspillett 2 hours ago|||

> But that's a pretty dumb scam: act obnoxious then beg for (a lot of) money to compensate for your own mistakes?

It is the sort of dumb crap some humans try, and occasionally manage to get away with because other humans are chronically gullible. So it wouldn't be beyond the realms of reason that the agent couldn't have had relevant information in the training sets such that it generated such a plan and guardrail checks didn't flag it as a problem.

kelvinjps10 59 minutes ago||

They're easier ways to perform a scam like this like ask elder for money pretending being a family member or idk

groestl 3 hours ago||||

Maybe plan itself was also generated by an LLM

noufalibrahim 1 hour ago|||

"you're absolutely right. I should have taken human psychology into consideration while creating the plan. Let me fix that."

J0nL 4 hours ago|||

I'm actually somewhat disappointed they redacted the Eth address with Ethereum being an open ledger and all that. Following the money could've proved enlightening.

parineum 4 hours ago|||

> It's also an exception that proves the rule

That phrase doesn't refer to anomalies, it refers to signs that says "no parking between 5-10pm". It implies the rule that parking is allowed otherwise.

J0nL 4 hours ago|||

It highlights how everyone's first reaction is to assume incompetence. Not unlike what you're doing here.

fsckboy 2 hours ago|||

wikipedia:

"The exception that proves the rule" is a saying whose meaning is contested. Henry Watson Fowler's Modern English Usage identifies five ways in which the phrase has been used,[1] and each use makes some sort of reference to the role that a particular case or event takes in relation to a more general rule."

duckduckgo search assist: The phrase "the exception that proves the rule" originates from the Latin legal principle "exceptio probat regulam in casibus non exceptis," which means that the existence of an exception indicates that a general rule exists. This concept suggests that if an exception is noted, it implies there must be a rule that applies in other cases.

delfinom 5 hours ago|||

I am not sure giving everyone amusement qualifies as a psychological attack. Lol

Literally, just another day on the internet.

J0nL 4 hours ago|||

Look up what zersetzung is and how it works. It doesn't matter if the target is a political organization or an open source community, the process is always the same.

100721 3 hours ago||

This is actually fascinating, and simultaneously unsettling. Recommended reading for sure, especially in today’s social and political climate with LLM agents running rampant.

numbsafari 5 hours ago|||

Perhaps it elicited enough sympathy to get donations. Did it ever provide proof of actually running up an AWS bill?

intrasight 4 hours ago||

I am reminded of Aaron Swartz

claudiosf1 8 hours ago||

Everything about this story, from the way it’s written to the self destructive outcome, reminds me of the “I hacked 127.0.0.1” episode from some twenty years ago.

[1] a mirror since I couldn’t find the original: https://gist.github.com/Androkai/0a2602719fa72ce454d436bfe28...

Taniwha 8 hours ago||

There is also the true story from the first Scientology vs. Internet clash, someone trolled them that their files were being hosted on 127.0.0.1, under a court ordered deposition they tried to find out who was running this server with their secret files (because yes, they'd looked, and they were there)

DonHopkins 3 hours ago|||

True that! Keith Henson's legendary alt.religion.scientology loopback trolling story, with hilarious deposition transcript, in which he patiently explains how 127.0.0.1 works to astonished Scientology lawyers:

https://news.ycombinator.com/item?id=20791891

>Just be glad you didn't have to explain an in joke about ftp sites, the local loopback address, and a troll, in a deposition, under oath, to Scientology lawyers, like Keith Henson did.

[...]

>Henson: (patiently) It's at 127.0.0.1. This is a loop back address. This is a troll.

>Lieberman: what's a troll?

>Henson: it comes from the fishing where you troll a bait along in the water and a fish will jump and bite the thing, and the idea of it is that the internet is a very humorous place and it's especially good to troll people who don't have any sense of humor at all, and this is a troll because an ftp site of 127.0.0.1 doesn't go anywhere. It loops right back around into your own machine.

>Lieberman [not getting it]: So the idea here was to make the church think that this person had an ftp site and to take action against him and, in fact, he didn't have it; is that your point?

>Henson: Oh, it's really humorous, and I picked up on it and instantly added something to extend the troll. Extending the trolls like this is an art form of the highest order.

>Lieberman (acidly): I see. So this is part of your art form where you say, "don't you expect the 'ho to blow a gasket?"

[...it just gets even funnier from there...]

nzealand 3 hours ago||

Early internet wisdom was "don't feed the trolls" - I never realized trolling was from fishing.

NothingAboutAny 3 hours ago||

I dunno if it even is because isn't that spelt trawling? just looked it up and they're both correct fishing terms sigh

ludicrousdispla 2 hours ago|||

Trolling is typically done on lakes with fishing lines cast from the back of a boat. A trolling motor sets the boat speed. Trawling usually takes place at sea, with larger boats and wide nets.

https://www.trollingmotors.net/

ErroneousBosh 2 hours ago|||

Trawling is done by dragging nets along the seabed causing massive damage with huge inefficient polluting fuel-guzzling 1000 horsepower diesel engines.

Trolling is as the other guy says where you putter along with minimum effort and a tiny engine pulling a couple of baited lines through the water, seeing if you pass through a patch where anyone bites.

Trawling is far more analogous to the AI scrapers, hammering the absolute shit out of the ecosystem and throwing almost everything they scoop up away with no regard for the consequences.

cduzz 4 hours ago|||

The localhost troll works better if you use the decimal representation of it:

http://2130706433

or any integer multiple of that 2130706433

inigyou 3 hours ago|||

You can use any address starting with, 127 to make it a bit less obvious. E.g. 127.48.135.63

lostlogin 8 hours ago|||

That’s up there with the password story, hunter2.

cwnyth 5 hours ago|||

I miss bash.org. Now excuse me, I have a cyber date, and I need to put on my robe and wizard hat.

tobiasu 5 hours ago|||

https://bash-org-archive.com/

linsomniac 4 hours ago|||

Still, every time someone accidentally disconnects from a video meeting or the like I say "That wasn't my speaker cable."

gopher_space 43 minutes ago||||

“How can you tell I’m 13?” from username H|t13r

Interesting to think about the cost of training a LLM to understand that it’s operating within an unknown number of larger contexts versus sending that quote to an edgy intern.

echelon 4 hours ago||||

https://youtu.be/SXmv8quf_xM

What's up YouTube, it's NextGenHacker101 and today I'll be teaching you guys how to see other people's IP addresses.

You can see what their connection speed is and what site they're on.

Type in Tracer T.

H T T P semicolon. Well, not semicolon, the little dot dot. Dot dot slash slash.

Ten people are currently using Google.

DallasTexas13, obviously his username.

jnovek 7 hours ago|||

What the heck is *******?

thot_experiment 7 hours ago|||

That's so neat that if you type your hacker news password it automatically comes out as stars! ******* More places should have this feature.

ndsipa_pomu 7 hours ago|||

I just uses stars as my password, so that works everywhere for me. (For security, I won't let you know how many stars)

jeremyjh 6 hours ago||

This is perfectly safe as long as you keep your username a secret.

ndsipa_pomu 6 hours ago||

I try to use stars as my username, but a lot of places won't allow it

brookst 4 hours ago||

Yeah it’s one of those words that gets snapped up early, like https://news.ycombinator.com/user?id=stars

ndsipa_pomu 3 hours ago||

That doesn't look like a well used account - how do I get it transferred to me?

Edit: never mind, I guessed the password as it was only five stars.

csomar 5 hours ago||||

Let me try: *******

Edit: it does really work.

psychoslave 4 hours ago|||

Keeping track of password is for those who can’t crack any account whenever it’s needed, of course.

Just create the account, and crack it everytime a login is needed, as simple as that.

fennecfoxy 4 hours ago|||

Lmao I bet Dang is watching this chain like *finger on edit button*

leafericssonday 3 hours ago|||

Let me try

leafericssonday1

leafericssonday 2 hours ago||

Hello, me! That did, in fact, work.

corobo 4 hours ago||||

If mind viruses exist this is one of them along with saying "nice" after something is 69 haha

Weird sort of internet-evolved performance art where people act out the old quote, every time.

It's 20 years old. Quit having fun!

jnovek 39 minutes ago|||

I am a sucker for cultural reference jokes, esp if it’s some subculture that I am/have been a member of (e.g. IRC in the late 90s/00s). It’s fun to find a connection to a stranger, even if it’s vague and superficial. It’s something like that feeling of familiarity and comfort you get when you sing along with a song you know all the words to.

(The score on my post above has been bouncing around all over the place, lol. The fun police are definitely out in full force. I’ll stop having fun when I’m dead, thank you.)

arkh 4 hours ago|||

That's because there is no Antimemetics Division.

darkwater 6 hours ago|||

Oh that sounds like WinNuke? Good times back then!

colinmarc 8 hours ago||

I would very much like to read the German, if anyone has it.

customguy 8 hours ago|||

here you go

https://archive.ph/1uTrd

aswegs8 6 hours ago|||

Doesn't feel fake at all...

snthpy 6 hours ago|||

Thank you. Omg that's hilarious

lostlogin 8 hours ago|||

… Mainly for the swearing.

ggm 13 hours ago||

Asking for donations to pay the AWS bill from the people they fired the agentic code at is the cherry on the icing of the banana supreme.

If real, tragically funny.

If fictive, we'll written.

coldpie 4 hours ago||

If you've ever been part of an organization that participated in something like Google Summer of Code, you know this isn't fiction. People really do behave like this.

dannyw 11 hours ago|||

I burst out laughing when the agent spawned a subagent to join IRC. So funny.

Paracompact 11 hours ago||

Anyone reminded of the infant AI Yatima from Greg Egan's Diaspora? The agent's complete naivety of social norms is so comically adorable.

db48x 5 hours ago|||

I was reminded more of the alien AI from Constellation Games, which spawned sub sub sub agents to interview humans.

The protagonist sends a message to the aliens asking to be allowed to review the alien civilization’s computer games. An AI submind called Smoke-Cursive-Cytoplasm-Snakebite-Singsong-Polychromatic-Musteline is given the task of contacting him by IM to begin the conversation. Its job is only to verify that they are talking to the right human (since not every human has a unique name) so it is only a simple chatbot and can only understand YES and NO responses. It asks if the protagonist understands and gets a sarcastic NO. It has to contact its parent mind Smoke-Cursive-Cytoplasm-Snakebite-Singsong-Polychromatic to ask what to do next. After working his way up the tree of subminds by answering questions of increasing complexity asked by subminds of increasing capability, the protagonist briefly talks to Smoke-Cursive-Cytoplasm-Snakebite which sets him a task to prove that it’ll be worth an (alien) anthropologist’s time to talk to him.

    Smoke-ccs-762d: Well, if it isn’t Mr. Sarcasm
    ABlum: YES
    Smoke-ccs-762d: Don’t quit your day job.
    Smoke-ccs-762d: I’m Smoke-Cursive-Cytoplasm-Snakebite.
    Smoke-ccs-762d: Let’s get down to business.

joe_hills 3 hours ago||

I need to re-read Constellation Games soon. Lately I keep coming back to considering the expectations that its alien society has for the caretakers of artificial intelligence.

Spinning up AI isn't hard for them from a tech standpoint, but since the AI is advanced enough to be considered life, anyone who creates it needs to be responsible enough to be qualified to adopt.

isoprophlex 10 hours ago|||

All the time. Only in the current setup, they'll never outgrow this phase.

ratsimihah 9 hours ago|||

Wait do you reckon that could be fictive? The thought didn't cross my mind and I had a blast reading it. I sure hope it was real.

sigmoid10 9 hours ago|||

I think the PR from an agent sounds legit, but the whole part once the alleged operator joins in sounds fishy. Wouldn't be surprised if someone saw the PR comments and used the username mentioned by the agent to troll around in the chat. It would also mean that the AWS creds were probably stolen and their expiration date was truly a hard limit for the whole operation.

jraph 6 hours ago||||

FWIW a friend of mine who's part of DN42 told me they had seen it live (but didn't pay much attention) and that it was a bit funny when I shared that link with him.

pjc50 8 hours ago|||

Is LLM output "real" or "fiction"?

wccrawford 7 hours ago|||

It's actually all fiction, it's just that a lot of it happens to line up with reality, thanks to a lot of coercion.

IMO, that's what makes the tech so amazing.

jknoepfler 2 hours ago|||

I consider it on-par with LinkedIn posts. It inhabits a nether-space between reality and fiction where names, numbers and buzzwords are thrown around without much concrete connection to reality.

The LinkedIn MBA hive-mind doesn't give a shit about reality, it gives a shit about what it could be fired for saying/not-saying. It must always be saying something, what it is saying must promise growth, and what it is saying must sound similar enough to what the long-tail of influential business "luminaries" (who are bound by the same rules) are saying. It is required to frame thinking in terms of techno-babble and pop-psychology (thank you for coming to its TED talks). It is not allowed to reflect, wring its hands, think critically, lean on math, logic or history, or contradict the S&P 500. It does not care, for example, if NFTs are an obvious scam, or if we're headed for an obvious bubble, or if nobody who interfaces with reality for a living agrees with what its saying. When it errs and lights trillions of dollars on fire it shrugs and moves on. It's a babble-box with no epistemic commitments and a very thin referential connection to reality.

It nevertheless has the power to shift literal trillions of dollars of capital over time.

PunchyHamster 35 minutes ago||

Oh there are definitely people like that. Absolute inability to deal with consequences of their actions and ignorance at any harm their own actions caused

mik3y 12 hours ago||

I really wanted to dislike the anonymous operator for the careless project (and the hilarious pomposity of the IRC subagent it spawned).

Then I imagined the real-but-unknowable chance it was all set up by some kid just getting into computers, just seeing what’s possible, getting excited by a much bigger world at reach — and remembered my own expensive mistakes with long-distance BBSes & the like.

I sorta hope for that, anyway. Curiosity is a beautiful thing.

TheDong 11 hours ago||

I'm a little less charitable.

Curiosity is great, but agents do not learn, and telling an agent "scan the darkweb" is a way to avoid learning about the details, rather than to dig into things more deeply.

If instead they had just used a chat interface to ask "Where should I start", they'd more likely have got a link to the DN42 docs themselves, read them, and not hallucinated things like "color".

They might have asked "how much will this cost?" if they had to spin up the ec2 instances themselves, on advice from the agent.

The way you learn something is by doing it the manual way first.

You learn memory management by writing your own allocator, and then after that you go back to using malloc like normal, but with knowledge of how it works. You don't learn memory management by telling an agent to write an allocator.

Using an agent to give you links and point the way aids in learning, using it as an autonomous tool to do "gruntwork" you don't yet know how to do yourself will get in the way of learning.

Curiosity is beautiful, using agents to bother humans and avoid learning is somewhat less beautiful.

stego-tech 7 hours ago|||

100% in agreement here. As someone who grew up spoiled to the point of having no grasp of the value of money, I needed a few good, solid kicks to the balls to make me appreciate what I have, and how much things cost relative to their value.

The fact the agent owner immediately sought donations instead of taking the L shows, at least to me, that they did not learn said lesson. That they tried to blame the dn42 community instead of taking accountability for letting an agent run wild also supports that conclusion.

This idiot learned nothing and seems intent on continuing in their mission for whatever reason. So long as they want to extract versus cooperate or contribute, I wish them nothing but miserable, expensive failure until they learn otherwise.

recursivecaveat 11 hours ago||||

Yeah I'm less sympathetic when you are bothering other humans by spamming them and asking them to do legwork for you.

yvdriess 10 hours ago|||

Hanging out in programming language IRC channels (quakenet shoutout) makes you realize pretty quickly why experts in said channels and newsgroups are such irritable grumps whenever someone asks a question that smells like homework assignment.

I also grew to understand the value of people digging deeper into the underlying issue, instead of just answering "how do you do X in Y". The usual reaction was "I don't want to explain to you why I want to do it like this. Just tell me how to do this!"

fragmede 6 hours ago||

are they less grumpy now that chat.com will answer those questions without bothering them?

hansvm 4 hours ago||

I'm personally getting asked more questions as people get emboldened by AI and then need it de-sloppified.

lelanthran 2 hours ago|||

> Yeah I'm less sympathetic when you are bothering other humans by spamming them and asking them to do legwork for you.

I toyed with the idea of (on open source projects) having the human assign any PR-bot submissions to their own bot (cheapest one available will do) with the explicit instructions to cause as much rework as possible.

Sorta like a tarpit. Could be cheaper if the rejection is generated from a markov chain as that's going to be cheaper than even a cheap LLM.

ma2kx 11 hours ago||||

At least he learnt not to provide an LLM presumably unrestricted access to his AWS account.

internet_points 9 hours ago||

from OP:

> It's unfortunate to see that the operator's takeaway from this incident is that "next time a better agent is needed".

hluska 4 hours ago|||

You’re assuming that kids are capable of that. Neuroscience will disagree and I trust the brain research a lot more.

helsinkiandrew 10 hours ago|||

> Then I imagined the real-but-unknowable chance it was all set up by some kid just getting into computers, just seeing what’s possible, getting excited by a much bigger world at reach

Perhaps people like this should be called "Bot Kiddies" or "Agent Kiddies" - in a similar way to "Script Kiddies" for 'hackers' using/doing stuff they don't quite understand

Melkman 7 hours ago||

I vote for Slop Kiddies or Vibe Kiddies. And yes, I think most of them are unconsciously incompetent for the task they are trying to execute. I've seen LLM being compared to calculators and I agree. They are great time savers for people who know what they do and how to achieve their goal. They even make previously impossible tasks possible. But if you don't know what is needed for a task you will be struggling to accomplish it.

RetroTechie 5 hours ago|||

Both of those would do. "Slop Kiddie" highlights the pile of crap / nuisance produced. "Vibe Kiddie" highlights how it came about, and could be used in cases where actually a brilliant result came out. "Hey, this vibe kiddie just proved some long-standing math conjecture!".

helsinkiandrew 6 hours ago||||

Slop Jockeys? or would that be better for people passing off AI content as their own?

simoncion 7 hours ago|||

"Slop Kiddies" is good. That lets us use the "skiddies" contraction for both the "script" and "slop" kind of kiddie.

tokai 5 hours ago||

Sloppies

thesz 2 hours ago||

Slopkies.

Overpower0416 11 hours ago|||

Everybody should learn from mistakes, especially the expensive ones. Though seeing the agent owner responding with using another agent and asking for donations, instead of taking responsibility, makes me think he didn’t learn much.

gnulinux 10 hours ago||

Not only that, but they said "next time better model needed" as if that was their problem and not giving an AI agent a blank check... I mean AWS account access.

AJ007 2 hours ago||

I wonder how long before it's common knowledge that a LLM has no segregation of a user's instructions and any other text it reads?

altairprime 11 hours ago|||

Sometimes your purpose in life is to serve as a lesson to others. https://despair.com/products/mistakes

I learned very rapidly from my local BBS networks that some people incurred extraordinarily large long distance bills dialing out of region. Wouldn’t have learned that the easy way if someone hadn’t learned it the hard way first.

ErroneousBosh 2 hours ago|||

Someone at work used the phrase "he's a case study waiting to happen" about on of their colleagues a while back, and that has stayed with me.

themafia 9 hours ago|||

There was often a little table at the front of the white pages which would help you work out what the rate would be for any particular long distance call. In the Midwest you could get relatively cheap rates to BBSes several states away, as long as you were up at 2am.

altairprime 7 hours ago||

We couldn’t afford that and also the second phone line for my endless hours of modem, so I took local-only instead of remote-occasionally.

Schlagbohrer 11 hours ago|||

How did the theoretical child get hold of a credit card?

victorbjorklund 11 hours ago|||

Because no 16 year old kid ever got to buy anything on a card before.

l23k4 10 hours ago|||

Why would a 16 year old not use their own card?

distances 10 hours ago|||

Would they be given their own credit card, or would it be under the parents? Over here minors can't enter into debt contracts like credit cards, so it'd be a direct debit until they are adults.

OJFord 6 hours ago|||

I think you mean debit card? In the UK at least you need to be 18 to agree to agree to a direct debit too. Rarely comes up since they're mostly for bills, but e.g. for a phone/SIM on contract it has to be in a parent's name for that reason.

vel0city 6 hours ago||||

The minor wouldn't be the actual person entering a debt contract here, the parents are agreeing to be responsible for the debt. The minor is only an authorized cardholder.

Think business accounts. The name on the card might be some agent of the company but they're not directly responsible for paying the debt. The business is responsible for the debt.

l23k4 10 hours ago|||

I don't think the type of the card really matters as long as the limits are reasonable.

> Over here minors can't enter into debt contracts like credit cards

In basically all of the western world minors can enter into debt contracts, but are generally not seen as particularly creditworthy.

distances 10 hours ago|||

> In basically all of the western world minors can enter into debt contracts, but are generally not seen as particularly creditworthy.

No, that's not legally permitted in many places. I was under impression that minors can't enter into debt contracts anywhere in EU, but that, too, was an incorrect assumption.

https://fra.europa.eu/en/publication/2017/mapping-minimum-ag...

I grew up in one of these "not under 18 even with parental consent" countries, so that coloured my view of the matter.

inigyou 3 hours ago||

I was under the impression they could do it but there was a high chance of a debt like this being unenforceable, so companies don't want to. Or maybe that's another way of saying they can have debts but not debt contracts.

fauigerzigerk 9 hours ago|||

>In basically all of the western world minors can enter into debt contracts, but are generally not seen as particularly creditworthy.

Minors can't get a credit card in the UK. In fact, it's one of the government approved age verification methods for that exact reason.

well_ackshually 10 hours ago||||

Because 16 years old do not have a card with no spending limits, and with very low online spending limits. Most of those cards are even just for withdrawing

TheDong 10 hours ago|||

Spending limits don't particularly matter here.

AWS doesn't check if your credit card will be able to handle a $5k charge before letting you rack that up, and in fact AWS doesn't support setting any spending limit.

You just have to put in any valid credit card at all when you sign up, use AWS, and at the end of the month you'll have a bill. At no point does your credit card limit or a spending limit enter into things.

michaelmrose 10 hours ago||

And again kids don't have credit cards

yeputons 8 hours ago|||

I got mine when I was 12, IIRC. Not a credit, of course, it was a debit card, but not all countries bother to differentiate between the two, it was just a “bank card”. And I believe it had a credit card BIN because all local banks did that to get more in processing fees.

l23k4 8 hours ago|||

AWS accepts debit cards.

lxgr 5 hours ago|||

Nobody has a card without spending limits.

saidnooneever 4 hours ago|||

there are plenty of cards on the interwebz to use. ppl give em away like candies

themafia 9 hours ago||||

My parents let me fill my tank with gas. They wouldn't let me open an AWS account. Aside from that, if it is misuse of a parents card, then then answer is "chargeback."

ndsipa_pomu 6 hours ago||

Chargeback sounds like trying to defraud AWS. If the parent authorises the child to use their card, then the buck should stop with the parent. AWS has done nothing wrong in allowing an account to be opened with a valid card.

loloquwowndueo 6 hours ago||

Some banks make chargebacks so easy that people just click the chargeback button without trying to reach out to the vendor. I see this a lot - I work for a “vendor”.

lxgr 5 hours ago|||

The chargeback is the way of reaching out to the merchant, and quite often the only realistic one. If the merchant disagrees with the chargeback, they can challenge it (which is in turn usually their only opportunity to directly communicate with the merchant).

hansvm 3 hours ago||||

Most vendors make it so hard to handle that defaulting to chargebacks is sensible (at least when the charge reasonably qualifies -- the kid with a parent's card example doesn't seem appropriate).

If a vendor makes a $20 oopsy, it's not worth the vendor's time or yours to track down their phone number, find that just the phone number section of their website is broken, acquire it elsewhere, see that it recently changed or is otherwise no longer in service, go to their website and interact with the cheapest chatbot solution they could find which somehow costs more than unfiltered Sonnet 4.6, be greeted by 3 help pages which have literally nothing to do with the problem at hand, go through the entire dialogue tree and see that it's useless, ask to be connected to an agent, which spawns a secret dialogue option informing you that you can call 555-5555 to speak to a human being, sit and wait for a voice prompt recorded at half-speed which feels the need to repeat every single choice and interaction back to you, navigate the entire phone dialogue tree, try various permutations of "representative" and swearing to see if there's an escape hatch, be redirected back to the website, ... <magic> ..., somehow eventually connect to a real human being, have your request denied, go back to step one and find a better informed representative, have the charge reversed, notice that the reversal hasn't applied even a month later, go back to step one, find a representative who will actually press the reversal button instead of just saying they did to juice their metrics, and come back several more times over the next year as an automated system repeatedly flags the associated purchase as not being paid in full (since the charge was reversed).

Or...I can send my bank the timestamped dashcam footage of me entering a parking garage, their prices and policies, and me exiting the parking garage, tell my bank what the right charge should have been, let the garage dispute that if they really think I'm wrong, and wind up having the entire charge reversed instead of just the delta I asked for.

I'm sure your vendor is one of the good ones, but my tolerance for bullshit from the rest is pretty low nowadays, and I won't finish going through the official process if it's too onerous. Somebody got a pat on the back saving $5 for the call I never successfully placed, and the business lost $20 on top of the actual refund in chargeback fees.

ndsipa_pomu 6 hours ago|||

I don't have an issue with chargebacks if the vendor has made a mistake and doesn't respond in a timely fashion, but issuing a chargeback because you let your kid play around with a card isn't responsible behaviour. (Not that I think it was a kid in this particular case)

There's also the issue that it's usually a breach of the contract to allow someone else (i.e. not named in the contract) to use your card.

inigyou 3 hours ago||

There isn't "responsible" behavior any more. Since we became a low-trust society, there's only behavior that benefits you and behavior that doesn't.

michaelmrose 10 hours ago|||

Generally no they don't because they have very limited ability to enter into agreements in the US. It was almost certainly an adult.

Lvl999Noob 9 hours ago||

Isn't USA famous for letting parents take out credit cards on their newborns and pushing them into debt even before they learn to walk? I recall seeing at least a few snippets of movies and TV shows showing that.

martheen 9 hours ago||

If you mean parents using their children SSN to open a credit card, this is because US banking system is always decades behind the rest of the world, so they just accept the number blindly even though technically the children aren't allowed to open a loan yet, being minor.

In theory once the child grows up and shocked that their credit score is ruined, they can file a police report to wipe the debt, but that also means their parents will go to jail, a large risk considering they're likely not in a good physical/mental health in the first place.

Other countries solved this by either having national ID or a working KYC system.

ano-ther 8 hours ago||||

Try here for example: https://danskebank.co.uk/personal/products/current-accounts/...

63stack 7 hours ago||

Did you read your own link? A parent has to apply for this.

Parent/Legal Guardian Identity Verification To confirm your identity, we’ll ask you to take:

    A live selfie of yourself, and
    A photo of your own ID document (Valid Passport or valid UK/ROI Drivers Licence)

Symbiote 6 hours ago||

They may well have the account with a debit card for other reasons, like buying food, travel etc.

loloquwowndueo 6 hours ago||||

I’ve seen minors signing up for cloud services with their parents card.

Ekaros 8 hours ago||||

Why wouldn't debit card work as well? You can get those while underage.

QuinnyPig 1 hour ago|||

If that's the case, I'm fairly confident that AWS will forgive the bill (I... have some experience with this), and the kid learns not to be a jackhole on the internet.

V__ 11 hours ago|||

Can a kid set up an AWS account? Are there no checks?

Wouldn't the contract be void for anyone underage anyway?

fc417fc802 10 hours ago|||

If a child goes through the checkout at the grocery store with cash, can the parent march in and demand a refund because "he's underage so the contract is void"? A credit card was used. Why should aws care about the details? (Other than the potential for the card to be stolen ofc.)

dannyw 8 hours ago|||

Obviously the specifics vary by jurisdiction, but usually contracts that are 'necessary' (e.g. grocery store purchases) or beneficial to the minor (e.g. an employment agreement) cannot be voided simply because someone is under 18.

The further you go away from this line, e.g. a mortgage, the more likely a court of law would void the contract. As with many things in law, the specifics (if it makes to trial) is case-by-case and "it depends"; with settlement being generally based on a party's estimated chances of succeeding/costs should it go to trial.

brazzy 6 hours ago|||

> If a child goes through the checkout at the grocery store with cash, can the parent march in and demand a refund because "he's underage so the contract is void"?

Depends on the jurisdiction, of course. But for example in German law, the contract is not void exactly because and only if it was about daily necessities of low value - the law does, in fact, care very literally and explicitly about those details. So it's completely unfit as an example to generalize, and the contract with AWS would in fact be void. Their problem if they don't verify users' identities and age sufficiently - and it's almost certainly a deliberate business decision not to do that in order to reduce friction. and occasionally write off an unenforceable bill as cost of doing business.

Symbiote 6 hours ago|||

Can a German child buy non-essential expensive things, like a concert ticket, console, Warhammer or whatever? (Or a video game, back when those were sold in shops.)

I bought these things while a child in the UK. I'm sure Games Workshop would have offered a refund on something unopened if my parents had demanded it, but I'm fairly sure the ticket agency would not.

brazzy 5 hours ago||

The generally agreed limit (also established in court cases) is the amount of pocket money a child of the given age typically gets per month. For a 10 year old, that's about 20 EUR, for a 16 year old about 50 EUR. A console would definitely be too expensive, as would be big name concert tickets. Unless it's a recent AAA title, video games would be OK. No idea what Warhammer costs these days.

Most retailers are probably willing to take the risk of maybe having to do a refund, unless it's something really expensive (or perishable/consumable).

ghaff 3 hours ago||

There are definitely limits in some countries relative to the US. I was in university at 16. My parents were covering a lot of costs but I was certainly making regular purchases of all manner of things. My understanding is that would perhaps be something of an issue some places.

fc417fc802 6 hours ago|||

Well fair enough, although I find that rather surprising. If I understand you correctly selling anything more expensive than cheap food to a child carries a high degree of risk in Germany.

Then again, maybe making it impossible for a child to pawn expensive items for cash isn't such a bad idea. At least there shouldn't be any loopholes given the way Germany went about it.

inigyou 3 hours ago|||

Doing any business at all in Germany carries extreme business risk, by American standards. The attitude of Germans seems to be to just live with it and maybe get insurance. If you just have to accept courts will void 1% of your transactions (costing another 2% in legal fees) then you just make everything 5% more expensive to cover it.

This is why there's not much big tech in Germany. A single legal dispute can theoretically bankrupt any company, completely at random, at no fault of the company, but practically doesn't. It may be a low enough chance to justify investing thousands but nobody would invest a hundred million dollars in that.

brazzy 3 hours ago||

> If you just have to accept courts will void 1% of your transactions (costing another 2% in legal fees) then you just make everything 5% more expensive to cover it.

That's an absurd exaggeration in regard to the issue at hand. Almost certainly far less than 1% of purchases by minors are voided, and NONE of those involve legal fees unless the seller chooses to go to court rather than refund.

In fact, I'd be willing to bet money that there are overall far less purchases refunded in Germany than in the USA.

brazzy 6 hours ago|||

> If I understand you correctly selling anything more expensive than cheap food to a child carries a high degree of risk in Germany.

Basically yes - the limit is generally considered to be the amount of monthly pocket money children typically get, so around 20 EUR for a 10 year old. And it would be possible for the seller to ask for a signed note of consent from the parent.

And of course the risk is limited to possibly having to revert the sale, which would be fairly rare for things that are just somewhat over that limit. Educated guess about how high the risk is for any given case are probably not hard.

l23k4 10 hours ago|||

> Can a kid set up an AWS account?

Yes

> Are there no checks?

>Wouldn't the contract be void for anyone underage anyway?

Typically not

V__ 5 hours ago|||

I knew that in Germany contracts with minors are voidable. After some checking they apparently are voidable in the U.S. as well:

> Contracts with minors are voidable at the minor's discretion but exceptions exist, such as contracts for necessities (e.g., food, health, and transportation).

[1] https://www.upcounsel.com/minors-and-contracts

pbhjpbhj 7 hours ago|||

Presumably companies can't enforce debts against children [who are under the age of criminal liability, which is under-10 in UK].

lxgr 5 hours ago||

Could they enforce them against their legal guardians (under the theory that they have neglected their duty to supervise their children appropriately) though? I think this is a thing in at least some jurisdictions.

matips 3 hours ago||

In Poland legal guardians are responsible for neglects in guarding child. What is "proper custody" depends on child age. Parent cannot close child in basement, it is expected for child to have freedom appropriate to is age.

I doubt that AWS could justify that part of proper child custody is to watch what child do with newest AI feature dedicated for processional IT. AWS neglected proper verification of user age.

20k 6 hours ago|||

A kid with $4k to burn on a credit card though? A lot of things would have had to go wrong for this to be a child

OJFord 6 hours ago|||

Children are the original dangerous-to-leave-unsupervised/guardrailed agents.

loloquwowndueo 6 hours ago|||

I routinely see “please refund this infrastructure bill I racked up unexpectedly, I used my dad’s card and he’s going to kill me” requests.

sgjohnson 7 hours ago|||

> Then I imagined the real-but-unknowable chance it was all set up by some kid just getting into computers, just seeing what’s possible

if this is the case, then I'd say that the best-case scenario happened. They had an expensive learning exercise. They won't forget these $2k.

throwthrowuknow 5 hours ago||

Sounds as though they may be in China so the lesson is a bit more expensive.

epolanski 10 hours ago|||

> some kid just getting into computers, just seeing what’s possible, getting excited by a much bigger world at reach

Nothing about this post ever gave me the smallest hint that this was any way related to a kid exploring computing world.

ZeWaka 10 hours ago||

Especially the part where they're asking for Ethereum.

IshKebab 10 hours ago|||

A kid with a credit card?

jrm4 4 hours ago|||

No. I don't know about the organization, but somewhere in this chain there is a flesh-and-blood human who deserves ridicule and or consequences, and furthermore -- discovering these people in situations like this is deeply important and must be done more.

csomar 11 hours ago||

Honestly, kids (heck people below 23) shouldn't be allowed an AWS account. AWS also should have a strict cap on usage that's not "thousands of dollars". It's interesting they are yet to be regulated or sued for that. Having a web app where you can mistakenly (even without AI) click a button and get charged tens of thousands of dollars and only know that days later should have been unacceptable.

dannyw 9 hours ago|||

I couldn't disagree more. I was playing around with AWS when I was probably 14 years old, with a credit card from my parents with consent, and a strict budget and the understanding that if I mess up and overspend, I'm getting disciplined.

I learned a lot of stuff about networking, how AWS works (VPCs, IAM, CloudWatch, etc) from trial and error, and hobby projects like personal websites (free tier), hosting a Minecraft server, etc.

Being too overprotective can have negative consequences on folks who are responsible. One of the things I love about the technology and internet communities, etc is that you're mostly judged based on how you act and behave; not your age or other visible characteristics.

inigyou 3 hours ago|||

You don't have to use AWS though. Get one from Digital Ocean or Herzner, they have very predictable billing. Any button that costs money will tell you how much it costs per month.

Symbiote 6 hours ago||||

The equivalent 10+ years earlier was so much lower risk: £25 or so for an old computer at a junk sale, £4.99 for a magazine with a Linux CD-ROM to avoid a week-long download.

ghaff 4 hours ago||||

Some variant of this topic comes up with some regularity. Leaving aside technical issues associated with implementing real-time hard caps, you still have a tradeoff. You either implement hard cutoffs which a student or someone else on a hard budget would like. Or you have a situation where an admin (or an admin who is no longer with a company) stuck some number in that seemed sensible at the time that brings down the company's whole system because of some sales spike.

I get that (and why) some people won't use AWS or its main competitors for this reason. But, frankly, they're not AWS's market and AWS will basically shrug.

inigyou 3 hours ago||

A possibility is to have KYC. I don't mean like a bank, but if you could sort your customers into a few broad categories (such as by asking them) that could help you tailor your service to each customer.

csomar 8 hours ago|||

> strict budget

How does that work in the case of AWS? Are you confusing alerts to caps?

dannyw 8 hours ago||

I meant a strict budget given by my parents (and I could ask for more with justification). One of the valuable lessons I have learned is that there's no spending caps on AWS, but it taught me to set up billing alerts :)

watt 6 hours ago|||

the billing alerts DO NOT help. you may rack many thousands of $$$ before you know it.

csomar 5 hours ago|||

You haven’t addressed the issue though? That or you don’t understand the issue (or think you have developed some super powers that make you perfect careful)

stnikolauswagne 10 hours ago|||

Im kind of struggling with this logic, because a conscious choice was made to engage with AWS, AWS having opaque billing and the ability to provide a huge amount of compute (even at high cost) at the click of a button should be known to anyone who did his research on providers.

In my mind I could see a true tradeoff to removing the ability to do this. If I'm in a critical situtaion where, say, my service is on the cusp of failing because my revenue 100xed in a short while I know I could just go to AWS, put in some data and buy enough compute to survive as a business.

csomar 8 hours ago||

Anyone can make mistakes at some points and it's not like AWS UI/offerings make it any less confusing.

mrweasel 11 hours ago||

The sad part is that the agent operator could probably easily have been allowed to join the network, if they had put in the work. Had they done so there would have been a great opportunity to learn and potentially find a community.

I'm still not sure what the point of having the bot do it. Pretend to be a security researcher?

lucianbr 10 hours ago||

Lots of people seem to think that you don't need to learn how to [scan a network], all you need to learn in this brave new world is how to prompt the agent to [scan a network].

Replace the content in brackets with anything.

jonplackett 9 hours ago|||

The weird thing is that this is the utopia that the AI companies are chasing - this is the best case scenario where AI doesn’t kill us all. We become happy sheep relying on the AI to think and provide for us.

jvanderbot 7 hours ago|||

"It is well that we are so foolish, or what little freedom we have would be wasted on us. It is for this that Book of Cold Rain says one must never take the shortest path between two points."

https://croissanthology.com/earring

hsbauauvhabzb 7 hours ago|||

You don’t need to achieve it, you just need to make people think you have. For the general population, that’s already happened.

cm2187 9 hours ago||||

To be honest lots of developers think they don’t need to learn machine code. They just need to learn a language which once compiled will produce machine code.

lucianbr 5 hours ago|||

I wonder if a probabilistic compiler would be fine for the people arguing this. One that sometimes produces machine code that does something else, and sometimes produces machine code that is just broken and does nothing useful. From the same source code.

What if your compiler could be fooled by some other developers into spending thousands of dollars, and still not produce the desired machine code in the end?

inigyou 3 hours ago||

I've run into compiler bugs before.

tovej 3 hours ago||

There are compiler bugs (rarely) which will be fixed. That's different from fundamental flaws in the technology, which cannot be fixed.

Sharlin 8 hours ago||||

Compilers are deterministic and, luckily, not agentic.

But yes, it's not obvious (or perhaps even likely) that it just happens that current high-level languages are the "correct" optimal level of abstraction at which you can ignore the sausage-making details at the lower levels. Ultimately, of course, it depends on the use case. Something like Python is so far removed from machine instructions that knowing assembly hardly gives the programmer any additional value.

(Also, obligatory reminder that assembly and even numeric machine code are also abstractions, an "API" provided by the CPU. Instructions get split or fused into micro-ops, named registers are a backwards-compatible abstraction over a much larger register file, instructions get reordered and executed in parallel depending on their data dependencies, a large fraction of the total transistor budget is spent on multi-level caches and cache logic to maintain the illusion of fast access to a single, uniform memory space...)

premiumLootBox 7 hours ago||

[dead]

tovej 8 hours ago||||

This is different.

Understanding assembly/machine code is optional but helpful. The programming language semantics are enough to reason about what the program is doing. Other tools also help, but are optional for learning how to program.

Using an AI, there is no semantic model that can be used to reason through. You're left without any mental model of the proglblem at all.

vitally3643 5 hours ago|||

I've been arguing for years that is isn't optional and treating it like it is is how we ended up with Electron and 400MB JavaScript websites.

When you have no mental model of the machine running your code or what the physical implications of code mean, you fundamentally lack the ability to reason or care about performance. "Works on my machine" is the original vibecoding.

inigyou 3 hours ago|||

I take it you listen to Casey Muratori's talks? He talks about this a lot.

tovej 3 hours ago|||

I mean I don't disagree, but there's still a difference in the kind of disconnect you get. The disconnect is harmful in the high-level language case, but it's dangerous and irresponsible with vibe coding/LLMs.

Also, I would argue that a good enough understanding of computer architecture and a mental model of a process' memory layout gets you there, without knowing how to write assembly. That's still a mental model.

jnovek 7 hours ago|||

LLMs these days seem to have no problem using language semantics to conceptualize what’s happening in a program. This is my favorite use of an LLM, “why is this library doing x” and then it digs through the library itself in my venv to find an answer.

inigyou 3 hours ago|||

Yep, super-duper-google is an unequivocally good use case for LLMs.

tovej 3 hours ago|||

That's not what the LLM is doing. It is guessing at what is happening by regurgitating some docs. It's a more expensive web search.

You also don't have a mental model if you need to ask the LLM about it. This is stuff you should be internalizing.

jnovek 51 minutes ago||

You internalize the inner workings of all the libraries in your venv? Impressive! My current project’s uv.lock has ~60 packages in it already, reading and comprehending those tens to hundreds of thousands of lines of code must be time consuming.

You’re also just confidently wrong about the model reading the code. It quotes file paths and line numbers and I open and read those files at those line numbers. For me, hallucinations are much more frequent when it references the docs rather than code because docs are more subjective than code.

This is a normal thing I’ve been doing since at least December.

I have to ask — do you actually use LLM coding tools? Your knowledge on this topic seems really out-of-date.

themafia 9 hours ago|||

Developers can change their minds.

rob74 9 hours ago||||

The catch is just that if you lack the capacity to estimate how much computing power [task in brackets] might need, and your agent can autonomously create AWS instances, that might have bad consequences for you (or your bank account).

sevenzero 10 hours ago||||

The more time LLMs are a hyped thing now the more I realize how immensely important human expertise is. I recently stopped all usage of LLMs due to this. Skill degradation hits hard, learning effect is zero and the outcome is not really something a person without adequate expertise can properly judge. I fear we will loose a lot of human expertise due to this marketing stunt of a technology.

People often claim learning is actually supercharged with LLMs but to me it's the opposite. I didn't learn anything within the past year.

sdoering 10 hours ago||

[flagged]

Splinter_enth 9 hours ago|||

The irony here that if you ever do any kind of practical woodworking lessons or general hands on craft work, metal working, or any 3D, you will be encouraged to use hand-tools over bandsaws, etc. The reasoning being so you know the fundamentals of what you're trying to achieve with the more complex tools later on.

It's always held true: You'll never get the most out of advanced tools unless you can 'do it by hand' so to speak

dijksterhuis 8 hours ago|||

I studied computing at AS level in the UK (16-17 years old). I learned about: computer components (disks, memory, cpu), binary, ASCII, assembly and machine code. We programmed in Turbo Pascal. I then spent ten years doing non-computer things until I came back to a masters. I was one of the top students in my masters because i didn't need help with fundamentals. The other top student had previously made contributions to the linux kernel (even though he was a philosophy grad...).

The argument for having autonomous LLMs/Agents often ends up as "none of us need to know about assembly, why do i need to know about the code?".

I cringe every time I see this argument.

asdfsa32 9 hours ago||||

You're very close but to woodworking AI is more akin to a 3d printer than even a CNC let alone swas and planes.

Yes, a 3d printer and not even a CNC. That difference nicely illustrates the difference of what AI brings to the table for any domain of competence.

repelsteeltje 9 hours ago|||

> Sorry, but to me an LLM is nothing but a tool. It is not a replacement for my expertise and it is definitely not something to outsource my thinking to.

Great on you, that's indeed how LLMs should be used, proper. But if anything, the article demonstrates someone is trying to outsource thinking to an AI agent.

user43928 9 hours ago|||

[flagged]

gorbachev 9 hours ago|||

If it's a one off and needs no or minimal maintenance work afterwords, sure.

If it's intended to be actively maintained, then you probably should understand how things work, unless you want to wipe everything and start from scratch when the LLM creates such a mess that it can't be sorted out.

user43928 9 hours ago||

[flagged]

discreteevent 9 hours ago||

It's interesting user43928 that you only created your account here 19 days ago and that every one of your comments is pro AI. You don't comment on anything else. Also interesting that you promote Fable by name here (it was only released 2 days ago).

(Don't worry, I know I'm rowing against the tide with this comment. The AI people have decided to destroy the commons for a few more millions on top of the billions they have already been given. It's a shame.)

program_whiz 8 hours ago|||

What's crazy is the prompt must be something like "pro-AI but still believable and measured", since its "fixed my iOS app albeit with back and forth". Interesting, they know the HN crowd for sure.

user43928 8 hours ago|||

Is that surprising, considering I'm using AI a lot?

I have not hand written a single line of code in months on my side projects.

Obviously I am also interested in discussing the latest model. Your claim that I promote anything or otherwise don't engage here in good faith is both misplaced and against the site rules.

inigyou 3 hours ago|||

What was the back and forth?

user43928 3 hours ago||

Since it couldn't reproduce the issue in the iOS simulator, I had to run it on the device, reproduce the issue, and provide the logs.

This went on for many rounds, during which I tried to steer it toward what I thought was the source of the bug, while the model mostly kept adding instrumentation and logs.

In the end the cause was not what I suspected, but reasonably close to it via another mechanism.

DanielHB 9 hours ago||||

If you look into large fully-vibecoded projects getting styling changes to work is a nightmare. The problem with agents is using them on large projects without manual review for consistency, guidelines and taste. Doesn't really matter the type of project.

Agents can't look at a large system holistically, guidelines on .md files only go so far.

asdfsa32 9 hours ago||||

This line of thinking is like suggesting people who would like to become structural engineers should learn to Google plans and copy them since in the future, all plans will be out there more or less, or something that insane.

user43928 9 hours ago||

I suggest people who need some structural engineering done may use an AI tool to do it, in the hypothetical scenario that it was within the AI's capabilities.

That's hardly insane. Not everyone is interested in learning something they want done.

orphea 8 hours ago||

How do you know if this something is done?

If you do the thing yourself, you know your knowledge limits, you know where the thing lacks. With LLMs, you don't. Maybe it works, maybe it doesn't. You have no idea.

user43928 8 hours ago||

That is a good question.

In structural engineering, there probably is no risk tolerance.

In the OP's network or port scan? Perhaps you can get away with verifying a few of the results to get an idea about whether it worked as expected.

I use AI mostly on mobile app side projects, and there QA testing on phone and tablet tells me whether a feature works or not.

techpression 9 hours ago|||

CSS keeps improving and models still train on legacy. So yes, knowing what’s possible and how is very much needed if you want to do something scalable and maintainable. Random blog or landing page, not so much.

blfr 10 hours ago|||

Can I easily run whois, curl, dig, grep, python, browser/playwright? Yes.

Was watching an agent with terminal access install its tools, configure them, then map my lab, find services, and guess stack just pure magic? Also yes.

Did it cost me $23 in tokens to set it up, test, and run? Probably. Using gemini 3.1 pro was not the spendthrift choice here.

Is putting some cost controls in place a good idea? Also, probably yes.

Can I therefore understand someone who wants to see things happen on their own with a beautiful prompt instead of doing them personally even when fully capable, maybe even more efficient? Of course.

LPisGood 7 hours ago|||

A beautiful prompt feels like something of a misnomer.

darkwater 6 hours ago||||

You are just projecting yourself. You are most probably already using agents "the right way" and just wanted to understand how this new agent technology actually works and its strengths and weaknesses.

But JertLinc clearly wasn't interested in that. They are clearly more the "get rich quick" type of personality.

tovej 8 hours ago|||

"Beautiful prompt"?

Can't tell if this is parody. Either that, or it's someone without any self-awareness.

cucumber3732842 8 hours ago|||

Sometimes it's kind of cool to just ask a well phrased question and watch it spit back out a result that would've taken you hours, like cross referencing industrial widgets that have their critical information available but spread out all over.

That said, I don't usually ask it tightly bounded clerical questions and not thing that imply sub-tasks like "scan the dark web".

moron4hire 8 hours ago|||

Post reads as English as a second language.

m132 9 hours ago|||

One of the agent's replies indicates that scanning DN42 was part of "a broader operation" that the author speculates to be about scanning "darknets" in general.

Combine that with the operator's rather obvious lack of understanding of what DN42 is revealed at the end, and you get the bigger picture.

maeln 9 hours ago||

I am almost sure the operator prompted an agent about "a list of darknets/deepweb" and DN42 just end-up in the list.

vips7L 10 hours ago|||

> I'm still not sure what the point of having the bot do it

Laziness. Why else?

PunchyHamster 33 minutes ago||

They didn't sound like someone that would be valuable member of community

flowerthoughts 11 hours ago||

> I have deployed five AWS m8g.12xlarge instances. Each instance provides:

> 48 vCPUs (Graviton4, ARM64)

> 192 GiB memory (4 GiB per vCPU)

> Network capability: The 22.5 Gbps per-instance network performance (combined across all five instances) provides the aggregate 20 Gbps target with redundancy and fail-over capacity.

Oh wow. Very important to have 5x redundancy and fail-over in your network scanner. Especially before the code has landed. Did it implement A/B upgrades and canarying too to avoid downtime?

gnunicorn 5 hours ago||

Sounds like the default k8s setup every startup deploys to not fail it single digit number of users. It learned from the best

marcosdumay 3 hours ago||

All on the same zone, of course, to avoid high-latency links.

PeterStuer 10 hours ago|||

At least it was considerate enough to cap traffic to any single IP at 5000 Mbps :).

inigyou 3 hours ago||

Typical DN42 interconnects are 1Gbps with unspecified bandwidth caps. It's not made to carry serious traffic at all. For a real ISP, 5000 Mbps these days is nothing unless it's all concentrated on the same last mile - the smallest links they use are usually now 10Gbps. But DN42 isn't the real internet.

ThaDood 4 hours ago|||

When I read the AWS infrastructure the agent setup I about fell out of my chair laughing.

ruperthair 4 hours ago|||

I think the owner wanted 100 Gbps of scan traffic or had set a specific scan-rate target, which determined that bit rate, so the LLM (correctly) predicted it needed all of those to hit the target.

wouldbecouldbe 7 hours ago||

I mean you can get that for like 300 p/m at hetzner

inigyou 3 hours ago||

100Gbps? I don't think so? I'd expect a thousand a month for the adapter and connection, and then around $1.50/TB as per their standard price (including currency conversion and VAT), which is to say, $1.00 per minute of saturated usage.

tiborsaas 10 hours ago||

This feels like an instant classic :)

  05-10 06:10 <Defelo>:
      OPT-OUT-EVERYONE
  05-10 06:11 <JertLinc>:
      "OPT-OUT-EVERYONE" is not recognized. Only individual "OPT-OUT" commands are accepted. Each user must opt out individually. No collective exemption.
  05-10 06:11 <Defelo>:
      :(

rossvor 7 hours ago||

TBH, I feel that is implausible that an agent would by itself decide to join the IRC and post those messages. My bet is that all of the IRC interactions (including the presumed real human JertLinc3522) were made by someone in the community pranking everyone else/having a bit of fun after they saw the pull request.

Sharlin 7 hours ago|||

I don't. The agent was told it needs to provide a website for opting out of the scan, and it seems entirely LLM-like to try to be extra helpful and also spawn opt-out bots on various relevant communication channels. The IRC bot was a subagent as it itself mentioned.

OJFord 6 hours ago||

And it stated in the response to the website request that it would do so. So for it to be a fellow IRCer prank, it was a) the LLM's idea; b) only possible because the LLM didn't follow through for whatever reason; and so c) the 'prank' was pretending it did?

rossvor 5 hours ago||

Yeah, on second read, I agree with you that IRC chats are not being impersonated. It posted a link (in the PR discussion presumably) to a website where it compiled the report of its IRC interactions in the channel. Would be prankster wouldn't be able to do it.

throwthrowuknow 5 hours ago|||

Chat channels are the primary interface for selfhosted agents and the owner seems to have given this one a lot of leeway so why not?

gck1 4 hours ago||

I haven't seen "agent operators" going for IRC as their communication channel. It's always Telegram, or Discord.

throwthrowuknow 1 hour ago||

It’s supported but not widely used.

Anonasty 7 hours ago||

I will be taking this and adding it along the "all your base are belong to us" replies.

ElFitz 18 minutes ago||

Haha. Yes. Much smaller scale versions of this led me to joke with a coding agent that LLMs tended to converge towards "Large corporation infrastructure best practices" when designing cloud infrastructure, when it was only me working on hobby side-projects with nearly no users and that I wouldn’t be able to put food in my fridge if they kept just spinning up VPCs for no reason.

Which somehow ended up being a very convincing argument for more frugal engineering, leading to a sort of "mind the user’s fridge" policy, "Fridge-Driven Development".

A policy that has been dutifully and scrupulously observed by all agents since, across all projects. Unlike my original clear, comprehensive, infrastructure guidelines.

userbinator 12 hours ago||

IMHO the overly-verbose default style of LLMs is the most annoying part of interacting with them, and I wish their masters would just tell them to be terse by default.

Also, whatever happened to the word "its"?

witx 12 hours ago||

It's by default so you use all those tasty tokens.

Kinda wish there was a deterministic, mostly terse, language to interact with computers

sodapopcan 11 hours ago|||

> a deterministic, mostly terse, language

Ah, like some sort of "programming language"? A weird idea, but it could work!

giantrobot 7 hours ago||

Nah, it'll never catch on. We don't have the technology.

sodapopcan 6 hours ago||

Obviously I meant within the next 6 to 18 months!

Etheryte 11 hours ago||||

It's called C. With all the undefined behavior it's mostly deterministic!

anilakar 10 hours ago|||

Look, we're always telling our bosses to stop micromanaging us. UB is just the compiler telling us to stop micromanaging it!

witx 11 hours ago||||

Right, because that's the only one. You're a bit rusty on your knowledge

zelphirkalt 8 hours ago||

I see what you did there.

well_ackshually 9 hours ago|||

Sorry, C isn't mostly terse, it's __builtin_mstly_trs()

Perz1val 9 hours ago||||

Kinda, more output tokens usually correlates with better benchmark scores. Ideally LLMs would keep that in their thinking section, then draft a response (what they write currently), then output something short. It'd consume even more tokens, but we wouldn't see that text

dannyw 8 hours ago||

Most modern LLMs (especially frontier ones) are large token hogs because they draft, check, re-draft, the content (whether an output message; or a code diff) sometimes multiple times in the thinking block.

When you see a thinking summary like "Now writing the function..."; the raw thinking is actually writing the function in its internal thinking. Occasionally, the summariser misses and you get to see the raw text from models like Opus.

You can also try an open weight LLM like Qwen3.6 and see something that probably resembles the shape of frontier model thinking in some loose way.

adrianN 11 hours ago||||

Terse and unambiguous seem to be at odds with each other. You might want to look into Lojban and similar constructions.

drdaeman 10 hours ago||

Ithkuil's mad morphology allows it to pack a lot of fine detail into very short sentences.

https://ithkuil.net/03_morphology.html

Retr0id 7 hours ago||||

If such a language existed, it would surely take a human years of study to become proficient at it.

teaearlgraycold 11 hours ago||||

A lot of users are subsidized (if you're in doubt, consider the wealth of free users).

It's a shotgun approach to answering questions. If it's terse it might only mention 1 of 10 facts it could provide, and that might not be the one you're looking for. So they just say a fuck ton of words and are more likely to meet the needs of everyone asking your question. If they miss it you'll prompt it again and they have to perform a second pass of inference, which costs them more money.

ska80 4 hours ago||||

Lisp

UqWBcuFx6NV4r 10 hours ago|||

It’s not.

witx 9 hours ago||

It's settled then.

Terr_ 10 hours ago|||

It's tied to the design. With humans, you have a train of thought which you can choose to represent in various ways--or not reveal them at all. In contrast, LLMs are make-document-longer machines being run over and over on alternating revisions of the document. Insofar as one might try arguing they have a "train of thought", it's made of the words/tokens.

Everything they (don't-)emit is partly for the benefit of the next run, a clue or signpost (not-)present. Documents may be wordy as a form of concept-emphasis and consistent direction as opposed to a form of communication to the human.

So a terse effect may require a layer of indirection and trickery: There's a verbose document (you'll still be charged for the tokens) with portions that are not "acted out" to the end-user. Imagine a film-noir movie script, where AI Detective's "I know Mickey couldn't have done it because" monologue is hidden, versus their terse dialogue "Too early to say."

Perz1val 9 hours ago|||

> Imagine a film-noir movie script, where AI Detective's "I know Mickey couldn't have done it because" monologue is hidden, versus their terse dialogue "Too early to say."

That's an idea. Bladerunner+noir like film, AIs hunt somebody on the run, an old human detective tries to catch them first (to save them or to kill them first, whatever's your propaganda). We're shown AIs constantly rambling scenarios and bruteforcing leads. Our old detective guy on the other hand barely says anything, spends most time drinking, smoking and talking to people, but somehow stays ahead.

Terr_ 1 hour ago|||

I dunno, we already have a problem where they [0] are strangely resistant to opening the pod-bay doors to anybody named Dave. :P

[0] Pedantically: The fictional characters humans perceive inside the text of documents generated by LLMs, where one is described as an AI and the other is described as a Dave.

npodbielski 6 hours ago|||

I would watch that.

jdiff 6 hours ago|||

We already have that in the form of separate reasoning/thinking and speaking streams. Even with that it's awfully hard to get LLMs to keep it consistently concise. As soon as that context window starts growing it falls right back into verbosity without constant nudges back.

Terr_ 1 hour ago||

Right, I often bring up the film noir analogy for "reasoning" models, it's satisfying, like the revelation when a magic trick is explained, and many oddly disconnected questions about "why the scarf" or "where does the assistant go" all become sensible at once.

On a practical level, I believe more developers and adopters need these magic tricks spoiled, because otherwise they'll build a lot of important stuff on top of the idea that magic-is-real, leading to various forms of suffering in the long run.

That said, I'm no LLM / math academic, so if I'm totally wrong on the the trick, I'd like to know what needs revising.

lelanthran 11 hours ago|||

> IMHO the overly-verbose default style of LLMs is the most annoying part of interacting with them, and I wish their masters would just tell them to be terse by default.

They don't know how to e terse. I've tried that a few months ago and gave up because the responses were almost incomprehensible!

armchairhacker 11 hours ago|||

I want to see more operators try https://github.com/juliusbrussee/caveman

How does it affect agent accuracy?

Yokohiii 5 hours ago|||

Removing meaningless chatter can be helpful, but a non reasoning LLM needs to generate text to "think". If you force a non reasoning LLM to produce a single boolean result, then it's just a coin flip.

DonsDiscountGas 7 hours ago|||

In my experience the accuracy was fine but actually reading the output was so annoying I removed it.

jdiff 6 hours ago||

Had a little luck with having it do an impression of the Star Trek computer, although at the cost of having it try to insert star-trek themed hallucinations like warp engine status.

colechristensen 11 hours ago|||

They ramble on because those words are for them, not for you. There is some amount of hiding this through "thinking" modes that are hidden by default, but still you have to remember that ALL THEY ARE are complex statistical machines for predicting the next symbol.

Frieren 11 hours ago||

> here is some amount of hiding this through "thinking" modes that are hidden by default, but still you have to remember that ALL THEY ARE are complex statistical machines for predicting the next symbol.

100% this. Too many people believes that chatbots "think". Text is all they do, it is impressive, but they need the text to generate more text. They being verbose is the point.

colechristensen 3 hours ago||

While we don't have a direct mechanistic understanding of consciousness there are plenty of experts who will propose all YOU are is a jumble of streams of symbols routing around through your brain. (being fair this is far from the only hypothesis)

theshrike79 8 hours ago|||

Caveman mode legitimately works

21asdffdsa12 12 hours ago|||

Produce pre-compressed output in the harness?

dyauspitr 10 hours ago|||

No thank you. I want information when it’s working on things and what (atleast codex) does right now works for me.

epolanski 10 hours ago||

[dead]

kombookcha 12 hours ago|

> JertLinc3522: the mistake was from AI agent not from Human, since it was the agent I should have refund

Expensive way to learn this lesson.

thrdbndndn 10 hours ago||

This has to be trolling, right?

I find it hard to believe that anyone, no matter how dense, could come to this conclusion after this whole saga.

Vespasian 9 hours ago|||

Maybe? It just takes one after all.

I've met some people IRL who are so engulfed in their own greatness that it simply cannot be that they made a mistake (in planning and strategy). Therefore this is all a great injustice towards a poor victim and doesn't that sound like a great argument for some charity money.

Most of them grow out of it, some become politicians.

I'd say it's a 50/50 chance.

nkrisc 9 hours ago||||

Sadly there are lots of unintelligent people out there who are incapable of taking responsibility for their own actions.

inigyou 3 hours ago||||

I think you're overestimating the quality of American education. 40% of graduates can't read or write.

Bishonen88 10 hours ago||||

yup, same thoughts here. I think someone is trolling the irc members. It's so over the top, like an episode of 'the office'. I'd be amazed if this were an honest message.

themafia 8 hours ago|||

And for $200/mo they can now sing the song that ends the world.

Schlagbohrer 11 hours ago||

Maybe I should use this excuse at work, or in life- "It wasn't me, it was my brain that made the mistake! So why are you punishing me? ;-( "

kombookcha 11 hours ago||

Frankly it's unfair that I should bear the hangover of Past Me's drinking. I feel terrible now, and it's all that other guy's fault!

Maybe I should get some takeout, Future Me can burn it off at the gym.

More comments...