I think that people are just not ready for the sort of novel privilege escalatio...

roywiggins · 2024-10-27T01:08:09 1729991289

The hard part is stopping it leaking all the information that you've given it. An agent that can read and send emails can leak your emails, etc. One agent that can read emails can prompt inject a second agent that can send emails. Any agent that can make or trigger GET requests can leak anything it knows. An agent that can store and recall information can be prompt injected to insert a prompt injection into its own memory, to be recalled and triggered later.

DrillShopper · 2024-10-27T01:15:47 1729991747

At what point does the impact of the privacy panopticon outweigh the benefit they provide?

creata · 2024-10-27T01:14:01 1729991641

> I think that people are just not ready for the sort of novel privilege escalation we are going to see with over-provisioned agents.

I think every single person saw this coming.

> Any recommended best practices people are establishing?

What best practices could there even be besides "put it in a VM"? It's too easy to manipulate.

DrillShopper · 2024-10-27T01:17:16 1729991836

There are VM escapes so even if you put it in a VM that's no guarantee.

I'd say run it on a separate box but what difference does that makes if you feed the same data to them?

grahamj · 2024-10-27T02:14:14 1729995254

If VM escapes were a big problem the cloud would not be a thing.

But on that note that's probably the best place to run these things.

zitterbewegung · 2024-10-27T01:17:17 1729991837

Applying the Principle of Least privilege [1] you should not let this system download from arbitrary sites and maintain a blacklist. I don't think the field has advanced to the point of having one specific to this use case.

[1] https://en.wikipedia.org/wiki/Principle_of_least_privilege

grahamj · 2024-10-27T02:11:03 1729995063

One of my first thoughts when I saw Computer Use was it needs some secondary agent controlling what the controlled computer is able to do or connect to. Like a firewall configuration agent or something.

guipsp · 2024-10-27T01:04:27 1729991067

Maybe do not pipe matrix math into your shell?

Terr_ · 2024-10-27T01:01:29 1729990889

When the underlying black-box is so unreliable, almost any amount of provisioning could be too much.