Mastodon Feed: Post

Mastodon FeedOct 8, 2025, 12:13 PM

Boosted by baldur@toot.cafe ("Baldur Bjarnason"):
mhoye ("mhoye (temporarily spooky)") wrote:

In 2022 Tianyi Zhang demonstrated an interactive debugger and testing framework for machine learning models: https://www.youtube.com/watch?v=8LekgPnRt1g

Earlier this month Horace He and colleagues demonstrated that the nondeterminism in ML models is a solvable problem:

https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/

(The culprit is floating point error plus gpu core scheduling races.)

Prediction: combining these techniques is going to put the entire (stupid, embarrassing, humiliating) idea of prompt engineering behind us for good.