Top
Best
New

Posted by kcorbitt 10/23/2024

Show HN: Agent.exe, a cross-platform app to let 3.5 Sonnet control your machine(github.com)
406 points | 232 commentspage 5
Simon321 10/23/2024|
Does it support AWS Bedrock instead of Anthropic as a provider?
mt_ 10/23/2024|
Feature request
waihtis 10/24/2024||
Windows Defender now flags this as a trojan?
DeathArrow 10/23/2024||
Ok, now I can install this on my work laptop and go on vacation for a few months. :)
binary132 10/23/2024||
kinda want to run this in a vm just to see how fast it bricks it
mensetmanusman 10/23/2024||
I hope this is the start of SkyNet.
danudey 10/23/2024||
SkyNet with ADHD: https://x.com/anthropicai/status/1848742761278611504
bloomingkales 10/23/2024|||
So long as we make the launch nuke methods private, we should be okay I think.

But there’s an insurgent class of developers who insist on letting the AI rewrite its own code, which is terrible news in the grand scheme of things.

meindnoch 10/23/2024||
Ok, this is funny :D

For those who don't know: there's an old movie titled "Terminator", and in this movie a military AI (Artificial Intelligence) takes over the world and wages a war against humanity. The name of this AI in the movie is "SkyNet", so this is what the parent comment is referring to :D

another_devy 10/23/2024||
can this be used for desktop/ mobile app testing?
tadeegan 10/23/2024||
This is literally how Skynet happens lol
ImHereToVote 10/23/2024|
Doomers like you have completely lost touch with reality. Anything that happens in sci-fi movies can't happen in reality. Don't you guys know anything?
charlierguo 10/23/2024||
It's fascinating/spooky how different LLMs are slowly developing their own "personalities," so to speak. And they seem to be emerging as we're giving them access to more tools and modalities which are harder to do broad RLHF on.

With computer use, we first learned that Claude sometimes takes breaks to browse pictures of Yosemite, and now this:

> Claude really likes Firefox. It will use other browsers if it absolutely has to, but will behave so much better if you just install Firefox and let it go to its happy place.

abixb 10/23/2024||
>Claude really likes Firefox.

I don't mind being reigned over by AI overlords that'll choose FOSS over proprietary.

dangsux 10/23/2024||
[dead]
photonthug 10/23/2024|||
>> > Claude really likes Firefox. It will use other browsers if it absolutely has to, but will behave so much better if you just install Firefox and let it go to its happy place.

It's hard to ignore the glimpse into the future of engineering that we're seeing here. Deterministic processes are out the door, no specs, no tolerances, no design. When did undefined behaviour become a cute thing that we're bragging about and compensating for, something to work around rather than something to understand and to fix?

It's not a big deal until you realize that software always gets stacked on software, and the only thing that ever made that complexity manageable was the fundamental assumption that it was all pretty deterministic. Of course users will sacrifice the strategic (good engineering) for the tactical (mere convenience) all day long, but the fact that so many engineers are all-in on the same short-sighted POV has been surprising to me.

danudey 10/23/2024|||
> we first learned that Claude sometimes takes breaks to browse pictures of Yosemite

We learned what now?

abixb 10/23/2024||
For those lacking context: https://x.com/anthropicai/status/1848742761278611504

From the Anthropic tweet (X post?):

"Even while recording these demos, we encountered some amusing moments. In one, Claude accidentally stopped a long-running screen recording, causing all footage to be lost.

Later, Claude took a break from our coding demo and began to peruse photos of Yellowstone National Park."

danudey 10/23/2024|||
SkyNet with ADHD, great.
fullstackchris 10/23/2024|||
I dont know about you, but sounds like every lazy developer I know... this must be proof of AGI! :D
m463 10/23/2024|||
step 2: make posts to hacker news with source code link, causing reproduction of Agent.exe, possibly with mutations via forking
tomjen3 10/23/2024|||
I mean if the goal is to humanize and make AIs more relatable, then fine.

If it had stopped the coding task to browse hackernews, I would have to start to march for AI rights.

tacone 10/23/2024||
> Claude really likes Firefox. It will use other browsers if it absolutely has to, but will behave so much better if you just install Firefox and let it go to its happy place.

Good boy!

Oras 10/23/2024|
There might be a reason. I played around with Playwright before and once you run chromium for few times, it will get blocked and you start seeing captcha.

Never happened when I tried Firefox

cibyr 10/23/2024|
20 years ago: "I would never let the AI out of the box! I'm not an idiot!"

Today: "Sure, I'll give the AI full control over my computer. WCGW?"

CaptainFever 10/23/2024||
Similarly...

20 years ago: "Don't meet strangers from the Internet. Don't get into strangers' cars."

Today: Literally summon strangers from the Internet to get into their cars

dr_kiszonka 10/23/2024|||
I wonder how their safety team goes about monitoring Claude's actions. Would it be possible for multiple instances of Claude to coordinate their actions via their users' machines? What I have in mind is, is there a malicious sequence of benign subsequences of actions such that the malicious intent can be achieved by different AI instances completing the benign subsequences in a distributed, yet coordinated manner? If yes, how to catch it?
More comments...