Boosted by baldur@toot.cafe ("Baldur Bjarnason"):
cwebber@social.coop ("Christine Lemmer-Webber") wrote:
Agents of Chaos: a research report testing how badly OpenClaw type agents will behave https://agentsofchaos.baulab.info/report.html
Gaslighting users, destroying filesystems, listening to input from any damn email that comes in, you name it
But the most interesting part of this is "Multi-Agent Amplification":
> When agents interact with each other, individual failures compound and qualitatively new failure modes emerge. This is a critical dimension of our findings, because multi-agent deployment is increasingly common and most existing safety evaluations focus on single-agent settings.