Mastodon Feed: Post

Jun 28, 2024, 10:46 AM

jsonstein@masto.deoan.org ("Jeff Sonstein") wrote:

“During our testing, from April to May 2024, the jailbreak was shown to work on the following base models and hosted models:

Meta Llama3-70b-instruct (base)
Google Gemini Pro (base)
OpenAI GPT 3.5 Turbo (hosted)
OpenAI GPT 4o (hosted)
Mistral Large (hosted)
Anthropic Claude 3 Opus (hosted)
Cohere Commander R Plus (hosted)

For each model that we tested, we evaluated a diverse set of tasks across risk and safety content categories, including areas such as …”

https://masto.deoan.org/@neurovagrant/112693823846665151