Mastodon Feed: Post

Mastodon Feed

Boosted by db@social.lol ("David Bushell ☕"):
kc@chaos.social ("Casey") wrote:

Introducing WebAccessBench, a novel benchmark for AI language models to assess #accessibility quality and WCAG conformance in generated web interfaces under realistic prompting conditions.

I did a bit of research and found that LLMs are incredibly bad at basic digital accessibility tasks. You can compare models and read the full white paper at https://conesible.de/wab.

Overall data suggests massive implications for society at large, and major discrimination of people with disabilities. #a11y

A sharepic that lists all benchmarked models and their score in a bar chart. Find them listed at https://conesible.de/wab. Beneath is a preview of the whitepaper PDF.