Mastodon Feed: Post

Mastodon Feed

Boosted by baldur@toot.cafe ("Baldur Bjarnason"):
cwebber@social.coop ("Christine Lemmer-Webber") wrote:

Agents of Chaos: a research report testing how badly OpenClaw type agents will behave https://agentsofchaos.baulab.info/report.html

Gaslighting users, destroying filesystems, listening to input from any damn email that comes in, you name it

But the most interesting part of this is "Multi-Agent Amplification":

> When agents interact with each other, individual failures compound and qualitatively new failure modes emerge. This is a critical dimension of our findings, because multi-agent deployment is increasingly common and most existing safety evaluations focus on single-agent settings.