Posted by speckx 1 day ago
You're telling me I need to use 100% of my brain, reasoning power, and time to go over your code, but you didn't feel the need to hold yourself to the same standard?
It’s strange how AI style is so easy to spot. If LLMs just follow the style that they encountered most frequently during training, wouldn’t that mean that their style would be especially hard to spot?
Or someone made a call that emoji-infested text is "friendlier" and tuned the model to be "friendlier."
It's not because they can't write PRs indistinguishable from humans, or can't write code without Emojis. It's because they don't want to freak out the general public so they have essentially poisoned the models to stave off regulation a little bit longer.
That's a lot of expensive work they're doing, and ignoring, if they're just later poisoning the models!
So it's entirely possible that training in one area (eg: Reddit discourse) might influence other areas (such as PRs)
Later the cutest of the emojis paved their way into templates used by bots and tools, and it exploded like colorful vomit confetti all over the internets.
When I see this emojiful text, my first association is not with an LLM, but with a lumberjack-bearded hipster wearing thick-framed fake glasses and tight garish clothes, rolling on a segway or an equivalent machine while sipping a soy latte.
Jk, your comments don't seem at all to me like AI. I don't see how that could even be suggested
1. What is this change supposed to do?
2. Why is this change needed?
3. How was it tested?
4. Is there anything else reviewers should know?
5. Link to issue:
There's no "What changed?" because that's the diff. Explain your intent, why you think it's a good idea, how you know you accomplished your intent, and any future work needed or other concerns noticed while making the change. PR descriptions suffer from the same problem as code comments by beginners: they often just describe the "what" when that's obvious from the code, when the "why" is what's needed. So try very hard to avoid doing that.
i++; // increment i (by 1)
Generally small startups after initial pmf. I have no idea how to run a big company and pre pmf Im guilty of "all cowboy, all the time" - YMMV
Code review is one of the places where experience is transferred. It is disheartening to leave thoughtful comments and have them met with "I duno. I just had [AI] do it."
If all you do is 'review' the output of your prompting before cutting a CR, I'd prefer you just send the prompt.
Almost nobody uses it for that today, unfortunately, and code reviews in both directions are probably where the vast majority of learning software development comes from. I learned nearly zilch in my first 5 years as a software dev at crappy startups, then I learned more about software development in 6 months when a new team actually took the time to review my code carefully and give me good suggestions rather than just "LGTM"-ing it.
Firstly, we get important benefits even when there's nothing to talk about: we get to see what the other person is working on, which stops us getting siloed or working alone. Secondly, we do leave useful feedback and often link to full articles explaining concepts, and this can be a good enough explanation for the PR author to just make the requested change. Thirdly, we escalate things to in-person discussion when appropriate, so we end up having the most valuable discussions anyway, which are around architecture, ongoing code style changes, and teaching/learning new things.
I don't understand how someone could think that async code review has almost zero value unless they worked somewhere with a culture of almost zero effort code reviews.
Sometimes a PR either merits limited input or the situation doesn't merit a thorough and thoughtful review, and in those cases a simple "lgtm" is acceptable. But I don't think that diminishes the value of thoughtful non-in-person code review.
Which is awesome and essential!
But the reason that the value of code reviews drops if they aren't done live, conducted by the person whose code is being reviewed, isn't related to the quality of the feedback. It's because a very large portion of the value of a code review is having the dev who wrote the code walk through it, explaining things, to other devs. At least half the time, that dev will encounter "aha" moments where they see something they have been blind to before, see a better way of doing things, spot discontinuities, etc. That dev has more insight into what went into the code than any other, and this is a way of leveraging that insight.
The modern form of code review, where they are done asynchronously by having reviewers just looking at the code changes themselves, is not worthless, of course. It's just not nearly as useful as the old-school method.
Makes it a lot easier to ignore, at the very least.
Then the golden age of ascii encoded source, where all was easy to change.
Now we've forgotten that lesson and changed to ascii encoded binary.
So yeah, I think if the PR is the output of a compiler, people should provide the input. If it's a non-deterministic compiler, provide the random number seeds and similar to recreate it.
This is not just disheartening - this should be flat out refused. I'm sensitive to issues of firing people but honestly this is just someone not pulling their weight for their job.
But otherwise, writing code with LLM‘s is more than just the prompt. You have to feed it the right context, maybe discuss things with it first so it gets it and then you iterate with it.
So if someone has done the effort and verified the result like it‘s their own code, and if it actually works like they intended, what’s wrong with sending a PR?
I mean if you then find something to improve while doing the review, it’s still very useful to say so. If someone is using LLMs to code seriously and not just to vibecode a blackbox, this feedback is still as valuable as before, because at least for me, if I knew about the better way of doing something I would have iterated further and implemented it or have it implemented.
So I don‘t see how suddenly the experience transfer is gone. Regardless if it’s an LLM assisted PR or one I coded myself, both are still capped by my skill level not the LLMs
I’ve noticed in empirical studies of informal code review that most humans tend to have a weak effect on error rates which disappears after reading so much code per hour.
Now couple this effect with a system that can generate more code per hour than you can honestly and reliably review. It’s not a good combination.
> But once that’s done, why not?
Do you have the same understanding of the code?Be honest here. I don't think you do. Just like none of us have the same understanding of the code somebody else wrote. It's just a fact that you understand the code you wrote better than code you didn't.
I'm not saying you don't understand the code, that's different. But there's a deeper understanding to code you wrote, right? You might write something one way because you had an idea to try something in the future based on an idea to had while finding some bug. Or you might write it some way because some obscure part of the codebase. Or maybe because you have intuition about the customer.
But when AI writes the code, who has responsibility over it? Where can I go to ask why some choice was made? That's important context I need to write code with you as a team. That's important context a (good) engineering manager needs to ensure you're on the right direction. If you respond "well that's what the AI did" then how that any different from the intern saying "that's how I did it at the last place." It's a non-answer, and infuriating. You could also try to bullshit an answer, guessing why the AI did that (helpful since you promoted it), but you're still guessing and now being disingenuous. It's a bit more helpful, but still not very helpful. It's incredibly rude to your coworkers to just bullshit. Personally I'd rather someone say "I don't know" and truthfully I respect them more for that. (I actually really do respect people that can admit they don't know something. Especially in our field where egos are quite high. It's can be a mark of trust that's *very* valuable)
Sure, the AI can read the whole codebase, but you have hundreds or thousands of hours in that codebase. Don't sell yourself short.
Honestly I don't mind the AI acting as a reviewer to be a check before you submit a PR, but it just doesn't have the context to write good code. AI tries to write code like a junior, fixing the obvious problem that's right in front of you. But it doesn't fix the subtle problems that come with foresight. No, I want you to stumble through that code because while you write code you're also debugging and designing. Your brain works in parallel, right? I bet it does even if you don't know it. I want you stumbling through because that struggling is helping you learn more about the code and the context that isn't explicitly written. I want you to develop ideas and gain insights.
But AI writing code? That's like measuring how good a developer is by the number of lines of code they write. I'll take quality over quantity any day of the week. Quality makes the business run better and waste fewer dollars debugging the spaghetti and duct tape called "tech debt".
If the AI writes the code, you can still understand the code, but you will never know why the code is written that way. The AI itself doesn’t know, beyond the fact that that’s how it is in the training data (and that’s true even if it could generate a plausible answer for why, if you asked it).
If people are letting the LLM decide how the code will be written then I think they're using them wrong and yes 100% they won't understand the code as well as if they had written it by hand.
LLMs are just good pattern matchers and can spit out text faster than humans, so that's what I use them for mostly.
Anything that requires actual brainpower and thinking is still my domain. I just type a lot less than I used to.
And that's a problem. By typing out the code, your brain has time to process its implications and reflect on important implementation details, something you lose out on almost entirely when letting an LLM generate it.
Obviously, your high-level intentions and architectural planning are not tied to typing. However, I find that an entire class of nasty implementation bugs (memory and lifetime management, initialization, off-by-one errors, overflows, null handling, etc.) are easiest to spot and avoid right as you type them out. As a human capable of nonlinear cognition, I can catch many of these mid-typing and fix them immediately, saving an significant amount of time compared to if I did not. It doesn't help that LLMs are highly prone to generate these exact bugs, and no amount of agentic duct tape will make debugging these issues worthwhile.
The only two ways I see LLM code generation bring any value to you is if:
* Much of what you write is straight-up boilerplate. In this case, unless you are forced by your project or language to do this, you should stop. You are actively making the world a worse place.
* You simply want to complete your task and do not care about who else has to review, debug, or extend your code, and the massive costs in capital and human life quality your shitty code will incur downstream of you. In this case, you should also stop, as you are actively making the world a worse place.
The best time to review is when writing code.
The best time to iterate on design is when writing code.
Writing code is a lot more than typing. It's the whole chimichanga
> I know why the LLM wrote the code that way. Because I told it to and _I_ know why I want the code that way.
That's a different "why". > If people are letting the LLM decide how the code will be written then I think they're using them wrong
I'm unconvinced you can have an LLM produce code and you do all the decision making. These are fundamentally at odds. I am convinced that it will tend to follow your general direction, but when you write the code you're not just writing either.I don't actually ever feel like the LLMs help me generate code faster because when writing I am also designing. It doesn't take much brain power to make my fingers move. They are a lot slower than my brain. Hell, I can talk and type at the same time, and it isn't like this is an uncommon feat. But I also can't talk and type if I'm working on the hard part of the code because I'm not just writing.
People often tell me they use LLMs to do boilerplate. I can understand this, but at the same time it begs the question "why are you writing boilerplate?" or "why are you writing so much boilerplate?" If it is boilerplate, why not generate it through scripts or libraries? Those have a lot of additional benefits. Saves you time, saves your coworkers time, and can make the code a lot cleaner because you're now explicitly saying "this is a routine". I mean... that's what functions are for, right? I find this has more value and saves more time in the long run than getting the LLMs to keep churning out boilerplate. It also makes things easier to debug because you have far fewer things to look at.
There needs to be some responsible entity that can discuss the decisions behind the code. Those decisions have tremendous business value[0]
[0] I stress because it's not just about "good coding". Maybe in a startup it only matters that "things work". But if you're running a stable business you care if your machine might break down at any moment. You don't want the MVP. The MVP is a program that doesn't want to be alive but you've forced into existence and it is barely hanging on
It undoubtedly saved me time vs learning all that first, and in fact was itself a good chance to “review” some decent TS myself and learn about the stdlib and some common libraries. I don’t think that effort missed many critical idioms and I would say I have decent enough taste as an engineer that I can tell when something is janky and there must be a better way.
> but I’m not a TS expert
Although this is ultimately related. How can you verify that it is working as intended? You admit to not having those skills. To clarify, I'm sure "it's working" but can you verify the "as intended" part? This is the hard part of any coding. Getting things working isn't trivial, but getting things working right takes a lot more time. > So the most recent thing that I did
I'll share a recent thing I tried too...I was working on a setup.py file and I knew I had done something small and dumb, but was being blind to it. So I pulled up claude code and had it run parallel to my hunt. Asked it to run the build command and search for the error. It got caught up in some cmake flags I was passing, erroneously calling them errors. I get a number of prompts in and they're all wrong. I fixed the code btw, it was a variable naming error (classic!).
I've also had success with claude, but it is super hit or miss. I've never gotten it to work well for anything remotely complicated if there also isn't the code in a popular repo I could just copy paste. But it is pretty hit or miss for even scripts, which I write a lot of bash. People keep telling me it is great for bash and honestly guys, just read the man pages... (and use some god damn functions!)
That's why it isn't necessary to add the "to be fair" comment i see crop up every time someone complains about the low quality of AI.
Dealing with low effort people is bad enough without encouraging more people to be the same. We don't need tools to make life worse.
If a company builds an industrial poop delivery system that lets anyone with dog poop deliver it directly into my yard with the push of a button I have a much different and much bigger problem
It's as if someone created a device that made cancer airborne and contagious and you come in to say "to be fair, cancer existed before this device, the device just made it way worse". Yes? And? Do you have a solution to solving the cancer? Then pointing it out really isn't doing anything. Focus on getting people to stop using the contagious aerosol first.
It's 90% AI, but that 90% was almost entirely boilerplate and would have taken me a good chunk of time to do for little gain other than the fact I did it.
I don’t think they are (telling you that). The person who sends you an AI slop PR would be just as happy (probably even happier) if you turned off your brain and just merged it without any critical thinking.
I can see the code, I know what changed. Give me the logic behind this change. Tell me what issues you ran into during the implementation and how you solved them. Tell me what other approaches you considered and ruled out.
Just saying "This change un-links frobulation from reticulating splines by doing the following" isn't useful. It's like adding code comments that tell you what the next line does; if I want to know that I'll just read the next line.
The AI has far more energy than I do when it comes to writing PR summaries, I have done it so many times, it's not the main part of my job. I have already provided all the information for a PR, why should I repeat myself? What happened to DRY?
A good PR summary should be the why of the PR. Not redundantly repeat what changed, give me description of why it changed, what alternatives were tested, what you think the struggles were, what you think the consequences may be, what you expect the next steps to be, etc.
I've never seen an AI generated summary that comes close to answering any of those questions. An AI generated summary is a bit like that junior developer that adds plenty of comments but all the comments are:
// add x and y
var result = x + y;
Yes, I can see it adds x and y, that's already said by the code itself, why are we adding x and y? What's the "result" used for?I'm going to read the code anyway to review a PR, a summary of what the code already says it does is redundant information to me.
I have worked at smaller firms, mostly, early stage (< 50 engineers), and folks are super busy. Having AI support in writing better thoughtful commentary, provide deeper context is a boon.
In the end, I'll have to say "it depends" -- you can't just throw slop at people but there's definitely a middle ground where everyone wins.
This is like reviewing your own PRs, it completely defeats the purpose.
And no, using different models doesn’t fix the issue. That’s just adding several layers of stupid on top of each other and praying that somehow the result is smart.
As insulting as it is to submit an AI-generated PR without any effort at review while expecting a human to look it over, it is nearly as insulting to not just open the view the reviewer will have and take a look. I do this all the time and very often discover little things that I didn't see while tunneled into the code itself.
Yes. You just have to be in a different mindset. I look for cases that I haven't handled (and corner cases in general). I can try to summarize what the code does and see if it actually meets the goal, if there's any downsides. If the solution in the end turns out too complicated to describe, it may be time to step back and think again. If the code can run in many different configurations (or platforms), review time is when I start to see if I accidentally break anything.
In the sense that you double check your work, sure. But you wouldn’t be commenting and asking for changes, you wouldn’t be using the reviewing feature of GitHub or whatever code forger you use, you’d simply make the fixes and push again without any review/discussion necessary. That’s what I mean.
> open the view the reviewer will have and take a look. I do this all the time
So do I, we’re in perfect agreement there.
It is, but for all the reasons AI is supposed to fix. If I look at code I myself wrote I might come to a different conclusion about how things should be done because humans are fallible and often have different things on their mind. If it's in any way worth using an AI should be producing one single correct answer each time, rendering self PR review useless.
> This is like reviewing your own PRs, it completely defeats the purpose.
I've been the first reviewer for all PRs I've raised, before notifying any other reviewers, for so many years that I couldn't even tell you when I started doing it. Going through the change set in the Github/Gitlab/Bitbucket interface, for me, seems to activate an different part of my brain than I was using when locked in vim. I'm quick to spot typos, bugs, flawed assumptions, edge cases, missing tests, to add comments to pre-empt questions ... you name it. The "reading code" and "writing code" parts of my brain often feel disconnected!
Obviously I don't approve my own PRs. But I always, always review them. Hell, I've also long recommended the practice to those around me too for the same reasons.
You don’t, we’re on the same page. This is just a case of using different meanings of “review”. I expanded on another sibling comment:
https://news.ycombinator.com/item?id=45723593
> Obviously I don't approve my own PRs.
Exactly. That’s the type of review I meant.
The point of most jobs is not to get anything productive done. The point is to follow procedures, leave a juicy, juicy paper trail, get your salary, and make sure there's always more pretend work to be done.
That's certainly not my experience. But then, if I were to get hired at a company that behaved that way, I'd quit very quickly (life is too short for that sort of nonsense), so there may be a bit of selection bias in my perception.
ultimately I'm happy to fight fire with fire. there was a time I used to debate homophobes on social media - I ended up writing a very comprehensive list of rebuttals so I could just copy and paste in response to their cookie cutter gotchas.
It's a joke.
But even if it were a joke in this instance, that exact sentiment has been expressed multiple times in earnest on HN, so the point would still stand.
That is literally how civilization works.
As an example, knowing that a service is offered by a registered company with presence in my area gives me the knowledge "that they know that I know" that if something goes wrong, I can sue them for negligence, possibly up to piercing the corporate veil the company and having the directors serve prison time. From that I can somewhat rationally derive that if the company has been in business offer similar services for years, it is likely that they have processes in place to maintain a level of professionalism that would lower the risk of such lawsuits. And on an organisational level, even if I still have good reason to think that most of the employees are incompetent, the fact that the company is making it work gives me a significantly higher preference in the "result" than I would in any individual "stupid" component.
And for a closer-to-home example, the internet is well known to be a highly reliable system built from unreliable components.
So, your minimum bar for a useful AI is that it must always be perfect and a far better programmer than any human that has ever lived?
Coding agents are basically interns. They make stupid mistakes, but even if they're doing things 95% correctly, then they're still adding a ton of value to the dev process.
Human reviewers can use AI tools to quickly sniff out common mistakes and recommend corrections. This is fine. Good even.
You are transparently engaging in bad faith by purposefully straw manning the argument. No one is arguing for “far better programmer than any human that has ever lived”. That is an exaggeration used to force the other person to reframe their argument within its already obvious context and make it look like they are admitting they were wrong. It’s a dirty argument, and against the HN guidelines (for good reason).
> Coding agents are basically interns.
No, they are not. Interns have the capacity to learn and grow and not make the same mistakes over and over.
> but even if they're doing things 95% correctly
They’re not. 95% is a gross exaggeration.
> This makes no sense, and it’s absurd anyone thinks it does. If the AI PR were any good, it wouldn’t need review. And if it does need review, why would the AI be trustworthy if it did a poor job the first time?
This is an entirely unfair expectation. Even the best human SWEs create PRs with significant issues - it's absurd by the parent to say that if a PR is "any good, it wouldn’t need review"; it's just an unreasonable bar, and I think that @latexr was entirely justified in pushing back against that expectation.
As for the "95% correctly", this appears to be a strawman argument on your end, as they said "even if ...", rather than claiming that this is the situation at the moment. But having said that, I would actually like to ask both of you - what does it even mean for a PR to be 95% correct - does it mean that that 95% of the LoC are bug-free, or do you have something else in mind?
Waiting for the rest of the comment to load in order to figure out if it's sincere or parody.
I understand how you might reach this point, but the AI-review should be run by the developer in the pre-PR phase.
Do you review your comments too with AI?
I first read that as "coworkers (who are) fully AI generated" and I didn't bat an eye.
All the AI hype has made me immune to AI related surprises. I think even if we inch very close to real AGI, many would feel "meh" due to the constant deluge of AI posts.
This reminds me of an awesome bit by Žižek where he describes an ultra-modern approach to dating. She brings the vibrator, he brings the synthetic sleeve, and after all the buzzing begins and the simulacra are getting on well, the humans sigh in relief. Now that this is out of the way they can just have a tea and a chat.
It's clearly ridiculous, yet at the point where papers or PRs are written by robots, reviewed by robots, for eventual usage/consumption/summary by yet more robots, it becomes very relevant. At some point one must ask, what is it all for, and should we maybe just skip some of these steps or revisit some assumptions about what we're trying to accomplish
I've been thinking this for a while, despairing, and amazed that not everyone is worried/surprised about this like me.
Who are we building all this stuff for, exactly?
Some technophiles are arguing this will free us to... do what exactly? Art, work, leisure, sex, analysis, argument, etc will be done for us. So we can do what exactly? Go extinct?
"With AI I can finally write the book I always wanted, but lacked the time and talent to write!". Ok, and who will read it? Everybody will be busy AI-writing other books in their favorite fantasy world, tailored specifically to them, and it's not like a human wrote it anyway so nobody's feelings should be hurt if nobody reads your stuff.
In the dating scenario what's really absurd and disgusting isn't actually the artificiality of toys.. it's the ritualistic aspect of the unnecessary preamble, because you could skip straight to tea and talk if that is the point. We write messages from bullet points, ask AI to pad them out uselessly with "professional" sounding fluff, and then on the other side someone is summarizing them back to bullet points? That's insane even if it was lossless, just normalize and promote simple communications. Similarly if an AI review was any value-add for AI PR's, it can be bolted on to the code-gen phase. If editors/reviewers have value in book publishing, they should read the books and opine and do the gate-keeping we supposedly need them for instead of telling authors to bring their own audience, etc etc. I think maybe the focus on rituals, optics, and posturing is a big part of what really makes individual people or whole professions obsolete
> No, don't use it to fix your grammar, or for translations, or for whatever else you think you are incapable of doing. Make the mistake. Feel embarrassed. Learn from it. Why? Because that's what makes us human!
It would be more human to handwrite your blog post instead. I don’t see how this is a good argument. The use of tools to help with writing and communication should make it easier to convey your thoughts, and that itself is valuable.
(Edit: in Diet Coke. Not too sure about Coke Zero).
I don’t think having a ML-backed proofreading system is an intrinsically bad idea; the oft-maligned “Apple Intelligence” suite has a proofreading function which is actually pretty good (although it has a UI so abysmal it’s virtually useless in most circumstances). But unless you truly, deeply believe your own writing isn’t as good as a precocious eighth-grader trying to impress their teacher with a book report, don’t ask an LLM to rewrite your stuff.
Just ask it to write "in the style of" a few famous writers with a recognizable style. It just can't do it. It'll do an awfully cringe attempt at it.
And that's just how bad LLMs are at it. There's a more general problem. If you've ever read a posthumous continuation of a literary series by a different but skilled author, you know what I mean.
For example, "And another thing..." by Eoin Colfer is written to be the final sequel to the Hitchhiker's Guide, after Douglas Adams died. And to their absolute credit, the author Eoin Colfer, in my opinion, pretty much nails Douglas Adams's tone to the extent it is humanly possible to do so. But no matter how close he got, there's a paradox here. Colfer can only replicate Adams's style. But only Adams could add a new element, and it would still be his style. While if Colfer had done exactly the same, he'd have been considered "off".
Anyway, if a human writer can't pull it off, I doubt an LLM can do it.
That's no crime, so far. It's very normal to have writers and editors.
But it's highly abnormal for everyone to have the _same_ editor, famous for the writing exactly the text that everybody hates.
It's like inviting Uwe Boll to edit your film.
If there's a good reason to send outgoing slop, OK. But if your audience is more verbally adept, and more familiar with its style, you do risk making yourself look bad.
What is a writing "voice"? It's more than just patterns and methods of phrasing. ChatGPT would say "rhythm and diction and tone" and word choice. But that's just the paint. A voice is the expression of your conscious experience trying to convey an idea in a way that reflects your experience. If it were just those semi-concrete elements, we would have unlimited Dickens; the concept could translate to music, we could have unlimited Mozart. Instead—and I hope you agree—we have crude approximations of all these things.
Writing, even technical writing, is an art. Art comes from experience. Silicon can not experience. And experiencers (ie, people with consciousness) can detect soullessness. To think otherwise is to be tricked; listen to anything on suno, for example. It's amazing at first, and then you see through the trick. You start to hear it the way most people now perceive generated images as too "shiny". Have you ever generated an image and felt a feeling other than "neat"?
This is why heavily assisted ai writing is still slop. That fundamental learning that is baked in is gone. It is the same reason why corporate speak is so hated. It is basically intentional slop.
Whether I hand write a blog post or type it into a computer, I'm the one producing the string of characters I intend for you to read. If I use AI to write it, I am not. This is a far, far, far more important distinction than whatever differences we might imagine arise from hand writing vs. typing.
> your thoughts
No, they aren't! Not if you had AI write the post for you. That's the problem!
That apparently is not the case for a lot of people.
I cannot tell you that it objectively matters whether or not an article was written by a human or an LLM, but it should be clear to anybody that it is at least a significant difference in kind vs. the analogy case of handwriting vs. typing. I think somebody who won't acknowledge that is either being intellectually dishonest, or has already had their higher cognitive functions rotted away by excessive reliance on LLMs to do their thinking for them. The difference in kind is that of using power tools instead of hand tools to build a chair, vs. going out to a store and buying one.
> I think somebody who won't acknowledge that is either being intellectually dishonest, or has already had their higher cognitive functions rotted away by excessive reliance on LLMs to do their thinking for them.
This feels too aggressive for a good faith discussion on this site. Even if you do think that, there's no point in insulting the humans who could engage with you in that conversation.
My interpretation of your comment was that it related to my use of the word "important", which has a more subjective connotation than "significant" and arguably allows my comment to be interpreted in two ways. The second way (that I feel people should care more about the distinction I highlighted) was not my intended meaning, since obviously people can care about whatever they want. It was a relevant observation of imprecise wording on my part.
> there's no point in insulting the humans who could engage with you in that conversation.
There would be no point in engaging them in that conversation, either.
Disagreeing with me that the difference in kind I highlighted is important is fine, and maybe even an interesting conversation for both sides. Disagreeing with me that there is a significant difference in kind is just nonsensical, like arguing that there's no meaningful difference, at any level, between painting a painting yourself and buying one from a store. How can you approach a conversation like that? Yet positions like that appear in internet arguments all the time, which are generally arguments between anonymous strangers who often have no qualms about embracing total intellectual dishonesty because the[ir] goal is just to make their opponent mad enough that they forget the original point they were trying to make and go chasing the goalposts all over the room.
The only winning move is not to play, which requires being honest with yourself about who you're talking to and what they're trying to get out of the conversation. I am willing to share that honesty.
I am, to be clear, not saying you are one of these people.
“Blog” stands for “web log”. If it’s on the web, it’s digital, there was never a period when blogs were hand written.
> The use of tools to help with writing and communication should make it easier to convey your thoughts
If you’re using an LLM to spit out text for you, they’re not your thoughts, you’re not the one writing, and you’re not doing a good job at communicating. Might as well just give people your prompt.
I’ve seen exactly that. In one case, it was JPEG scans of handwriting, but most of the time, it’s a cursive font (which may obviate “handwritten”).
I can’t remember which famous author it was, that always submitted their manuscripts as cursive writing on yellow legal pads.
Must have been thrilling to edit.
For example, there was never a period when movies were made by creating frames as oil paintings and photographing them. A couple of movies were made like that, but that was never the norm or a necessity or the intended process.
This is just pedantic nonsense
The thoughts I put into a text are mostly independent of the sentences or _language_ they're written in. Not completely independent, but to claim thoughts are completely dependent on text (thus also the language) is nonsense.
> Might as well just give people your prompt.
What would be the value of seeing a dozen diffs? By the same logic, should we also include every draft?
Not even true! Turning your thoughts into words is a very important and human part of writing. That's where you choose what ambiguities to leave, which to remove, what sort of implicit shared context is assumed, such important things as tone, and all sorts of other unconscious things that are important in writing.
If you can't even make those choices, why would I read you? If you think making those choices is unimportant, why would I think you have something important to say?
Uneducated or unsophisticated people seem to vastly underestimate what expertise even is, or just how much they don't know, which is why for example LLMs can write better than most fanfic writers, but that bar is on the damn floor and most people don't want to consume fanfic level writing for things that they are not fanatical about.
There's this weird and fundamental misconception in pro-ai realms that context free "information" is somehow possible, as if you can extract "knowledge" from text, like you can "distill" a document and reduce meaning to some simple sentences. Like, there's this insane belief that you can meaningfully reduce text and maintain info.
If you reduce "Lord of the flies" to something like "children shouldn't run a community", you've lost immense amounts of info. That is not a good thing. You are missing so much nuance and context and meaning, as well as more superficial (but not less important!) things like the very experience of reading that text.
Like, consider that SOTA text compression algorithms can reduce text to 1/10th of it's original size. If you are reducing a text by more than that to "summarize" or "reduce to it's main points" a text, do you really think you are not losing massive amounts of information, context, or meaning?
You can have the thoughts in a different language and the same ideas are still there.
You can tell an LLM to tweak a paragraph to better communicate a nuance until you're happy with it.
---
Language isn't thought. It's extremely useful in that it lets us iterate on our thoughts. You can add in LLMs in that iteration loop.
I get you wanted to vent because the volume of slop is annoying and a lot of people are degrading their ability to think by using it poorly, but "If you’re using an LLM to spit out text for you, they’re not your thoughts" is just motivated reasoning.
To be honest, and I hate to say this because it's condescending, it's a matter of literacy.
Some people don't see the value in literature. They are the same kind of people who will say "what's the point of book X or movie Y? All that happens is <sequence of events>", or the dreaded "it's boring, nothing happens!". To these people, there's no journey, no pleasure with words, the "plot" is all that matters and the plot can be reduced to a sequence of A->B->C. I suspect they treat their fiction like junk food, a quick fix and then move on. At that point, it makes logical sense to have an LLM write it.
It's very hard to explain the joy of words to people with that mentality.
Did you use AI to write this...? Because it does not follow from the post you're replying to.
It's like listening to Bach's Prelude in C from WTCI where he just came up with a humdrum chord progression and uses the exact same melodic pattern for each chord, for the entire piece. Thanks, but I can write a trivial for loop in C if I ever want that. What a loser!
Edit: Lest HN thinks I'm cherry picking-- look at how many times Bach repeats the exact same harmony/melody, just shifting up or down by a step. A significant chunk of his output is copypasta. So if you like burritos filled with lettuce and LLM-generated blogs, by all means downvote me to oblivion while you jam out to "Robo-Bach"
And unless I'm misunderstanding, it's literally the exact point you made, with no exaggeration or added comparisons.
Like, I’m totally on board with rejecting slop, but not all content that AI was involved in is slop, and it’s kind of frustrating so many people see things so black and white.
It’s not a literal suggestion. “Might as well” is a well known idiom in the English language.
The point is that if you’re not going to give the reader the result of your research and opinions and instead will just post whatever the LLM spits out, you’re not providing any value. If you gave the reader the prompt, they could pass it through an LLM themselves and get the same result (or probably not, because LLMs have no issue with making up different crap for the same prompt, but that just underscores the pointlessness of posting what the LLM regurgitated in the first place).
IMO a lot of the dumb and bad behavior around LLMs could be solved by a “just share the prompts” strategy. If somebody wants to generate an email from bullet points and send it to me: just send the bullet points, and I can pass them into an LLM if I want.
Blog post based on interesting prompts? Share the prompt. It’s just text completion anyway, so if a reader knows more about the topic than the prompt-author, they can even tweak the prompt (throw in some lingo to get the LLM to a better spot in the latent space or whatever).
The only good reason not to do that is to save some energy in generation, but inference is pretty cheap compared to training, right? And the planet is probably doomed anyway at this point so we as well enjoy the ride.
What does bother me is when clearly AI-generated blog posts (perhaps unintentionally) attempt to mask their artificial nature through superfluous jokes or unnaturally lighthearted tone. It often obscures content and makes the reading experience inefficient, without the grace of a human writer that could make it worth it.
However, if I’m reading a non-technical blog, I am reading because I want something human. I want to enjoy a work a real person sank their time and labor into. The less touched by machines, the better.
> It would be more human to handwrite your blog post instead.
And I would totally ready handwritten blog posts!
But it can make for tiresome reading. Like, a 2000 word post can be compressed to 700 or something had a human editor pruned it.
Somehow this is currently the top comment. Why?
Most non-quantitative content has value due to a foundation of distinct lived experience. Averages of the lived experience of billions just don't hit the same, and are less likely to be meaningful to me (a distinct human). Thus, I want to hear your personal thoughts, sans direct algorithmic intermediary.
This is similar to the common objection for AI-coding that the hard part is done before the actual writing. Code generation was never a significant bottleneck in most cases.
All I care about is content, too, but people using LLMs to blog and make readmes is routinely getting garbage content past the filters and into my eyeballs. It's especially egregious when the author put good content into the LLM and pasted the garage output at us.
Are there people out there using an LLM as a starting point but taking ownership of the words they post, taking care that what they're posting still says what they're trying to say, etc? Maybe? But we're increasingly drowning in slop.
It's not an assumption. Look at this example: https://news.ycombinator.com/item?id=45591707
The author posted their input to the LLM in the comments after receiving critcism, and that input was much better than their actual post.
In this thread I'm less sure: https://news.ycombinator.com/item?id=45713835 - it DOES look like there was something interesting thrown into the LLM that then put garbage out. It's more of an informed guess than an assumption, you can tell the author did have an experience to share, but you can't really figure out what's what because of all the slop. In this case the author redid their post in response to criticism and it's still pretty bad to me, and then they kept using an LLM to post comments in the thread, I can't really tell how much non-garbage was going in.
This whole AI thing is rapidly becoming very tiresome. But the trend seems to be to push it everywhere, regardless of merit.
I cannot blame people for using software as a crutch when human-based writing has become too hard and seldom rewarded anymore unless you are super-talented, which statistically the vast majority of people are not.
An LLM generated blog post is by definition derivative and bland.
> I use ChatGPT to learn about a variety of different things, so if someone came up with an interesting set of prompts and follow ups and shared a summary of the research ChatGPT did, it could be meaningful content to me.
Then say so, up front.
But that's not what people do. They're lazy or lack ideas but want "content" (usually for some kind of self-promotional reason). So you get to read that.
0% of your HN comments include URLs for sources that support the positions and arguments you've expressed at HN.[1] Do you generally not care about the sources of ideas? For example, when you study public policy issues, do you not differentiate between research papers published in the most prestigious journals and 500-word news articles written at the 8th-grade level by nonspecialist nobodies?
[1] https://hn.algolia.com/?type=comment&query=author:alyxya+htt...
The content itself does have value, yes.
But some people also read to connect with other humans and find that connection meaningful and important too.
I believe the best writing has both useful content and meaningful connection.
Maybe humans aren't so unique after all, but that's its own topic.
https://archive.ph/20250317072117/https://www.bloomberg.com/...
But if I'm truly honest with myself, I think in the long run I wouldn't care. I grew up on Science Fiction, and the stories I've always found most interesting were ones that explored human nature instead of just being techno fetishism. But the reality is I don't feel a human connection to Asimov, or Cherryh, or any of the innumerable short form authors who wrote for the SF&F magazines I devoured every chance I got. I remember the stories, but very rarely the names. So they might as well have been written by an AI since the human was never really part of the equation (for me as a reader).
And even when I do remember the names, maybe the human isn't one I want a lot of "human connection" with anyway. Ender's Game, the short story and later the novel were stories I greatly enjoyed. But I feel like my enjoyment is hampered by knowing that the author of a phenomenal book that has some interesting things to day on the pains caused by de-humanizing the other has themselves become someone who dehumanizes others often. The human connection might be ironic now, but that doesn't make the story better for me. Here too, the story might as well have been written by an AI for all that the current person that the author is represents who they were (either in reality or just in my head) when I read those stories for the first time.
Some authors I have been exposed to later in life, I have had a degree of human connection with. I felt sadness and pain when Steve Miller died and left his spouse and long time writing partner Sharon Lee to carry on the Liaden series. But that connection isn't what drew me to the stories in the first place and that connection is largely the same superficial parasocial one that the easy access into the private lives of famous people gives us. Sure I'm saddened, but honesty requires me to note I'm more sad that it reminds me eventually this decades spanning series will draw to a close, and likely with many loose ends. And so even here, if an AI were capable of producing such a phenomenal series of books, in a twisted way as a reader it would be better because they would never end. The world created by the author would live on forever, just like a "real" world should.
Emotionally I feel like I should care that a book was or wasn't written by an AI. But if I'm truly honest with myself, the author being a human hasn't so far added much to the experience, except in some ways to make it worse, or to cut short something that I wish could have continued forever.
All of that as a longwinded way of answering, "no, I don't think I would care".
It's like getting an unsolicited text with a "Let Me Google That For You" link. Yes, we can all ask ChatGPT about the thing. We don't need you to do it for us.
Why do you trust the output? Chatbots are so inaccurate you surely must be going out of your way to misinform yourself.
And it will make them up just like it does everything else. You can’t trust those either.
In fact, one of the simplest ways to find out a post is AI slop is by checking the sources posted at the end and seeing they don’t exist.
Asking for sources isn’t a magical incantation that suddenly makes things true.
> It isn’t guaranteed that content written by humans is necessarily correct either.
This is a poor argument. The overwhelming difference with humans is that you learn who you can trust about what. With LLMs, you can never reach that level.
In tech-related matters such as coding, I've come to expect every link ChatGPT provides as reference/documentation is simply wrong or nonexistent. I can count with fingers from a single hand the times I clicked on a link to a doc from ChatGPT that didn't result in a 404.
I've had better luck with links to products from Amazon or eBay (or my local equivalent e-shop). But for tech documentation which is freely available online? ChatGPT just makes shit up.
If you have a habit of asking random lay persons for technical advice, I can see why an idiot chatbot would seem like an upgrade.
(I'm not saying not to read books, but seriously: there are shortcuts)
There are many 100's of professions, and most of them take a significant fraction of a lifetime to master, and even then there usually is a daily stream of new insights. You can't just toss all of that information into a bucket and expect that to outperform the < 1% of the people that have studied the subject extensively.
When Idiocracy came out I thought it was a hilarious movie. I'm no longer laughing, we're really putting the idiots in charge now and somehow we think that quantity of output trumps quality of output. I wonder how many scientific papers published this year will contain AI generated slop complete with mistakes. I'll bet that number is >> 0.
I've done small plumbing jobs after asking AI if it was safe, I've written legal formalia nonsense that the government wanted with the help of AI. It was faster, cheaper and I didn't bother anyone with the most basic of questions.
Try asking ChatGPT or whatever is your favorite AI supplier about a subject that you are an expert about something that is difficult, on par with the kind of evaluations you'd expect a qualified doctor or legal professional to do. And then check the answer given, then extrapolate to fields that you are clueless about.
AI will just make stuff up instead of saying it doesn't know, huh? Have you talked to real people recently? They do the same thing.
It's about to find the sweet spot.
Vibe coding is crap, but I love the smarter autocomplete I get from AI.
Generating whole blog posts from thin air is crap, but I love smart grammar, spelling, and diction fixes I get from AI.
It's like being okay with reading the entirety of generated ASM after someone compiles C++.
I write pretty long blog posts that some enjoy and dump them into various llms for review. I am pretty opinionated on taste so I usually only update grammar but it can be dangerous for some.
To be more concrete, often ai tells me to be more “professional” and less “irreverent” which i think is bullshit. The suggestions it gives are pure slop. But if english isnt first language or you dont have confidence, you may just accept the slop.
If yes, there's probably value in putting it out. I don't care if you used paper and ink, a text editor, a spell checker, or asked an LLM for help.
On the flip side, if anyone could've asked an LLM for the exact same text, and if you're outsourcing a critical thinking to the reader - then yeah, I think you deserve scorn. It's no different from content-farmed SEO spam.
Mind you, I'm what you'd call an old-school content creator. It would be an understatement to say I'm conflicted about gen AI. But I also feel that this is the most principled way to make demands of others: I have no problem getting angry at people for wasting my time or polluting the internet, but I don't think I can get angry at them for producing useful content the wrong way.
If it's not substantially their own writing or ideas, then sure, they shouldn't pass it off as such and claim individual authorship. That's a different issue entirely. However, if someone just wanted to share, "I'm 50 prompts deep exploring this niche topic with GPT-5 and learned something interesting; quoted below is a response with sources that I've fact-checked against" or "I posted on /r/AskHistorians and received this fascinating response from /u/jerryseinfeld", I could respect that.
In any case, if someone is posting low-quality content, blame the author, not the tools they happened to use. OOP may as well say they only want to read blog posts written with vim and emacs users should stay off the internet.
I just don't see the point in gatekeeping. If someone has something valuable to share, they should feel free to use whatever resources they have available to maximize the value provided. If using AI makes the difference between a rambling draft riddled with grammatical and factual errors, and a more readable and information-dense post at half the length with fewer inaccuracies, use AI.
What about plagiarism? If a person hacks together a blog post that is arguably useful but they plagiarized half of it from another person, is that acceptable to you? Is it only acceptable if it's mechanized?
One of the arguments against GenAI is that the output is basically plagiarized from other sources -- that is, of course, oversimplified in the case of GenAI, but hoovering up other people's content and then producing other content based on what was "learned" from that (at scale) is what it does.
The ecological impact of GenAI tools and the practices of GenAI companies (as well as the motives behind those companies) remain the same whether one uses them a lot or a little. If a person has an objection to the ethics of GenAI then they're going to wind up with a "binary take" on it. A deal with the devil is a deal with the devil: "I just dabbled with Satan a little bit" isn't really a consolation for those who are dead-set against GenAI in its current forms.
My take on GenAI is a bit more nuanced than "deal with the devil", but not a lot more. But I also respect that there are folks even more against it than I am, and I'd agree from their perspective that any use is too much.
I think we have a better shot at making that argument for music, visual art, etc. Most of it is utilitarian and most people don't care where it comes from, but we have a cultural heritage of recognizing handmade items as more valuable than the mass-produced stuff.
I don't think GenAI or LLMs are going away entirely - but I'm not convinced that they are inevitable and must be adopted, either. Then again, I'm mostly a hold-out when it comes to things like self checkout, too. I'd rather wait a bit longer in line to help ensure a human has a job than rush through self-checkout if it means some poor soul is going to be out of work.
Sadly, I agree. That's why I removed my works from the open web entirely: there is no effective way for people to protect their works from this abuse on the internet.
The way LLMss are now, outside of the tech bubble the average person has no use for them.
> on some ultimately muddied arguments along the lines of "it's OK for humans to train on books, but it's not OK for robots"
This is a bizarre argument. Humans don't "train" on books, they read them. This could be for many reasons, like to learn something new or to feel an emotion. The LLM trains on the book to be able to imitate it without attribution. These activities are not comparable.
I think that's the best use case and it's not AI related as spell-checkers and translation integrations exist forever, now they are just better.
Especially for non-native speakers that work in a globalized market. Why wouldn't they use the tool in their toolbox?
Grammatical deviations constitute a large part of an author's voice. Removing those deviations is altering that voice.
This ship sailed a long time ago. We have been exposed to AI-generated text content for a very long time without even realizing it. If you read a little more specialized web news, assume that at least 60% of the content is AI-translated from the original language. Not to mention, it could have been AI-generated in the source language as well. If you read the web in several languages, this becomes shockingly obvious.
Maybe someone will build an AI model that's succinct and to the point someday. Then I might appreciate the use a little more.
I will also take a janky script for a game hand-translated by an ESL indie dev over the ChatGPT House Style 99 times out of 100 if the result is even mostly comprehensible.
This will invalidate even ispell in vim. The entire point of proofreading is to catch things you didn’t notice. Nobody would say “you don’t need the red squiggles underlining strenght because you already know it is spelled strength.”
My wife is ESL. She's asked me to review documents such as her resume, emails, etc. It's immediately obvious to me that it's been run through ChatGPT, and I'm sure it's immediately obvious to whomever she's sending the email. While it's a great tool to suggest alternatives and fix grammar mistakes that Word etc don't catch, using it wholesale to generate text is so obvious, you may as well write "yo unc gimme a job rn fr no cap" and your odds of impressing a recruiter would be about the same. (the latter might actually be better since it helps you stand out.)
Humans are really good at pattern matching, even unconsciously. When ChatGPT first came out people here were freaking out about how human it sounded. Yet by now most people have a strong intuition for what sounds ChatGPT-generated, and if you paste a GPT-generated comment here you'll (rightfully) get downvoted and flagged to oblivion.
So why wouldn't you use it? Because it masks the authenticity in your writing, at a time when authenticity is at a premium.
These type of complains about LLMs feel like the same ones people probably said about using a typewriter for writing a letter vs. a handwritten one saying it loses intimacy and personality.
Now you could argue but you don't know it was AI it could just be really mediocre writing - it could indeed but I hit the back button there as well so it's a wash either way.
I'd sooner have a ship painting from the little shop in the village with the little old fella who paints them in the shop than a perfect robotic simulacrum of a Rembrandt.
Intention matters but it matters less sometimes but I think it matters.
Writing is communication, it's one of the things we as humans do that makes us unique - why would I want to reduce that to a machine generating it or read it when it has.
I've been learning piano too, and I find more joy in performing a piece poorly, than listening to it played competently. My brother asked me why I play if I'm just playing music that's already been performed (a leading question, he's not ignorant). I asked him why he plays hockey if you can watch pros play it far better. It's the journey, not the destination.
I've been (re-)re-re-watching Star Trek TNG and Data touches on this issue numerous times, one of which is specifically about performing violin (but also reciting Shakespeare). And the message is what you're sharing: to recite a piece with perfect technical execution results an in imperfect performance. It's the _human_ aspects that lend a piece deep emotion that other humans connect with, often without being able to concretely describe why. Let us feel your emotions through your work. Everyting written on the page is just the medium for those emotions. Without emotion, your perfectly recited piece is a delivered blank message.
https://www.poetryfoundation.org/poems/43745/andrea-del-sart...
The Matrix was and is fantastic on many levels.
At this point, I don't know there's much more to be said on the topic. Lines of contention are drawn, and all that's left is to see what people decide to do.
Also, reminds me of this cartoon from March 2023. [0]
[0] https://marketoonist.com/2023/03/ai-written-ai-read.html
Because I've never seen anyone actually use a summarizing AI willingly. And especially not for blogs and other discretionary activities.
That's like getting the remote from the hit blockbuster "Click" starring Adam Sandler (2006) and then using it to skip sex. Just doesn't make any sense.
Isn't that the same with AI-generated source code? If lazy programmers didn't bother writing it, why should I bother reading it? I'll ask the AI to understand it and to make the necessary changes. Now, let's repeat this process over and over. I wonder what would be the state of such code over time. We are clearly walking this path.
Programming languages were originally invented for humans to write and read. Computers don't need them. They are fine with machine code. If we eliminate humans from the coding process, the code could become something that is not targeted for humans. And machines will be fine with that too.
But you are saying that is wrong, you should judge the messenger, not the message.
pre-AI scientists would publish papers and then journalists would write summaries which were usually misleading and often wrong.
An AI operating on its own would likely be no better than the journalist, but an AI supervised by the original scientist quite likely might do a better job.
AI is a tool to help you _finish_ stuff, like a wood sander. It's not something you should use as a hacksaw, or as a hammer. As long as you are writing with your own voice, it's just better autocorrect.
It can also double as a peer reviewer and point out potential counterarguments, so you can address them upfront.
I recently wrote about the dead internet https://punkx.org/jackdoe/zero.txt out of frustration.
I used to fight against it, I thought we should do "proof of humanity", or create rings of trust for humans, but now I think the ship has sailed.
Today a colleague was sharing their screen on google docs and a big "USE GEMINI AI TO WRITE THE DOCUMENT" button was front and center. I am fairly certain that by end of year most words you read will be tokens.
I am working towards moving my pi-hole from blacklist to whitelist, and after that just using local indexes with some datahorading. (squid, wikipedia, SO, rfcs, libc, kernel.git etc)
Maybe in the future we just exchange local copies of our local "internet" via sdcards, like in Cuba's Sneakernet[1] El Paquete Semenal[2].
Where are the explanations what all of them mean? What is (nothing) vs `maxi` vs `mini` vs `nopic`? What is `100` vs `all` vs `top1m` vs `top` vs `wp1-0.8`?
Mini is the introduction and infobox of all articles, nopic is the full articles with no pictures, maxi is full articles with (small) images. Other tags are categories (football, geography, etc.)
100 is the top 100 articles, top1m is top 1 million, 0.8 is (inexplicably) the top 45k articles.
My recommendation: sort by size and download the largest one you can accommodate in the language you prefer. wikipedia_en_all_maxi_2025-08.zim is all wikipedia articles, with images, as of 2025-08 and it's a paltry 111G.
Kiwix publishes a library here, but it's equally unhelpful: https://library.kiwix.org/
I thought about this in another context and then I realized: what system is going to declare you're human or not? AI of course