Top
Best
New

Posted by LucidLynx 13 hours ago

Miasma: A tool to trap AI web scrapers in an endless poison pit(github.com)
267 points | 199 commentspage 4
foxes 10 hours ago|
Wonder if you can just avoid hiding it to make it more believable

Why not have a library of babel esq labrinth visible to normal users on your website,

Like anti surveillance clothing or something they have to sift through

imdsm 11 hours ago||
Applied model collapse
Imustaskforhelp 11 hours ago||
I wish if there was some regulation which could force companies who scrape for (profit) to reveal who they are to the end websites, many new AI company don't seem to respect any decision made by the person who owns the website and shares their knowledge for other humans, only for it to get distilled for a few cents.
joquarky 5 hours ago|
Yep, they are already working on de-anonymizing the internet.
jijji 7 hours ago||
why not just try to block them at the door instead of feeding them poisoned food...
rvz 11 hours ago||
> > Be sure to protect friendly bots and search engines from Miasma in your robots.txt!

Can't the LLMs just ignore or spoof their user agents anyway?

phoronixrly 11 hours ago|
Well-behaved agents will obey robots.txt and not fall into the trap.
maltyxxx 9 hours ago||
[dead]
SophieVeldman 11 hours ago||
[dead]
devnotes77 9 hours ago||
[dead]
firekey_browser 10 hours ago|
[dead]
More comments...