Mastodon Feed: Post

Mastodon FeedJun 3, 2026, 10:04 AM

brib@bribstodon.xyz ("brib :neofox_floof: :Nonbinary:") wrote:

@soatok @ireneista @Kye If Ed Zitron is anything to go by, it does sound like there's a lot of inference costs with the bigger AI systems (the ones everyone uses).

In particular, agentic systems seem to run on repeated cycles of "generate -> check if it looks good, perhaps with a deterministic system and perhaps with an LLM -> if not, regenerate".

And from my (limited) personal experience with generative AI, as well as the recent Copilot price hikes, this tracks, e.g. I realised I used an AI filter in Canva because it took many seconds to process when a normal filter takes less than a second.

You can still apparently do scary things even with these limitations; I saw a paper today that had combined an open-weight LLM with various context-providing tools to create a worm 😦 . I don't have enough background to scrutinise the paper properly though