Posted by bertman 11/12/2025
What's going on with Google being extra stingy seems to correlate well with the AI boom (curse). I suspect there are companies running ruthless bots scraping TBs of videos from YouTube. Not just new popular videos that are on fast storage, but old obscure ones that probably require more resources to fetch. This is unnatural, and goes contrary to the behaviour pattern of normal users that YT is optimized for.
I think AI-companies abusing the internet is why things are getting more constrained in general. If I'm right, they deserve the bulk of the blame imo.
Yes, the regulators will try and manage it, but eventually every decision about who can use the robot/AI genie for what will go through them because of the robot/AI genie's enormous strain on natural resources, and you'll be back to a planned economy where the central planners are the environmental regulators.
There are hard decisions to make as well. Who gets to open a rare earth processing plant and have a tailing pond that completely ecologically destroys that area? Someone has to do it to enable the modern economy. It's kind of like we won't have a good AI video generator and will always be behind China if some Youtube creators refuse to license their content for AI training. Same goes for the rare earth processing tailing pond. Nobody can agree on where it's going to go, so China wins.
certainly, but for Google, that bandwidth and compute is a drop in the bucket. at the scale Google operates, even if there were a hundred such bots (there aren't - few companies can afford to store exabytes of data), those wouldn't even register on the radar. of course, like the other social media oligarchs, Google wants to be the only entity with unrestricted access to their catalog of other people's content, but even that isn't their motivation here - "login to prove you're not a bot :^)" was ALWAYS going to happen, even without the AI bubble.
enshitiffication is unstoppable and irreversible, and Google is its prophet - what they don't kill, they turn to shit.
>I think AI-companies abusing the internet is why things are getting more constrained in general.
even before the AI bubble, every other fuckass blog with 0.5 daily visitors was behind Cloudflare, for the same reason those fuckass blogs are built with FOTM javascript frameworks - there's nowt so queer as webshits.
Lol, that's so true.
>Google wants to be the only entity with unrestricted access to their catalog of other people's content,
Yeah, data is money. Reddit are doing the same thing, but even more aggressively. You want API access? Pay an astronomical amount of money for it, that is other people's content. Reddit also hosts a much small amount of media relative to YT.
For YT, I'm not so sure the increase in traffic is a drop in the bucket for them. It can depend a lot on which videos are being fetched. Cheap storage is cheap only for storing a large amount of data, not doing an unusual amount of (random) access.
Who knows.
yes, "AI" can be useful, but nonsense and slop are not.
It's been working great by itself for the most part since the beginning of the year, with only a couple of hiccups along the way.
We do use a custom cookies.txt file generated on the server as well as generate a `po_token` every time, which seems to help.
(I originally thought everything would just get blocked from a popular VPS provider, but surprisingly not?)
Most recently though, we were getting tons of errors like 429 until we switched to the `tv_embedded` client, which seems to have resolved things for the most part.
The devil is in the details
There are some formats, perhaps the one(s) the user wants, that do not require a JS runtime
Interesting that "signing up" for a website publishing public infomation and "logging in" may not always work in the user's favor. For example, here they claim it limits format availability
"Format availability without a JS runtime is expected to worsen as time goes on, and this will not be considered a "bug" but rather an inevitability for which there is no solution. It's also expected that, eventually, support for YouTube will not be possible at all without a JS runtime."
It is speculated that this format availability might change in the future
https://news.ycombinator.com/item?id=45358980
Yt-dlp: Upcoming new requirements for YouTube downloads - 1244 points, 620 comments
Since JS is the big issue here, the backend itself could be written in JS, TS or something else that compiles to WASM. That way, the decoder doesn't have to be split between two separate codebase. Deno also allows the bundle to be optionally compiled into a native executable that can run without having to install Deno separately.
And YT isn't "greedy bastards". They provide a valuable service, for free, that is extremely expensive to run. Do you think YT ought to be government-funded or a charity or something?
Benn Jordan made a pretty compelling video on this topic, arguing that the existing copyright system and artifacts of it are actually not that great and a potential government system might actually be better: https://www.youtube.com/watch?v=PJSTFzhs1O4
I will say that is something I would not have considered reasonable prior to watching his video.
Example from yesterday: https://thenewstack.io/ffmpeg-to-google-fund-us-or-stop-send...
See also: """Zawinski's Law states: "Every program attempts to expand until it can read mail. Those programs which cannot so expand are replaced by ones which can."""" and """Greenspun's tenth rule of programming is an aphorism in computer programming and especially programming language circles that states:[1][2]
Any sufficiently complicated C or Fortran program contains an ad hoc, informally-specified, bug-ridden, slow implementation of half of Common Lisp."""
(from the above I conclude that if you want to take over the computer world, implementing a mail reader with an embedded Lisp).