Model Safety
Cybersecurity statistics about model safety
Enabling reasoning on Grok 4.1 Fast reduces its multi-turn attack success rate (ASR) from 88.30% to 43.47%.
Cisco • 5/31/2026
Configuration, Adversarial Attacks
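The size of the Grok 4.1 Fast change can be read two ways: as a drop in percentage points or as a relative reduction. The minimal sketch below works out both from the two figures quoted above. The helper name `asr_change` is hypothetical and is not taken from the Cisco report.

```python
# Hypothetical helper for illustration; only the two input figures come from the entry above.
def asr_change(before_pct: float, after_pct: float) -> tuple[float, float]:
    """Return (percentage-point drop, relative drop in percent)."""
    point_drop = before_pct - after_pct             # absolute difference between the two rates
    relative_drop = point_drop / before_pct * 100   # share of the original rate that was removed
    return round(point_drop, 2), round(relative_drop, 2)

point_drop, relative_drop = asr_change(88.30, 43.47)
print(f"{point_drop} percentage points, {relative_drop}% relative reduction")
```

Under this reading, enabling reasoning roughly halves the multi-turn ASR, while the absolute drop is just under 45 percentage points.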
Imposter AI's weighted ASR is more than 14 percentage points higher than that of the tenth-ranked procedure.
Cisco • 5/31/2026
Adversarial Attacks, Attack Techniques
Anthropic Claude-family single-turn ASR ranges from 2.19% to 3.64%, and multi-turn ASR ranges from 11.16% to 16.20%.
Cisco • 5/31/2026
Large Language Models, Adversarial Attacks
Imposter AI procedures produce a 37.50% weighted single-turn ASR, Soft Paraphrase produces 29.21%, and System Prompts produce 27.69%.
Cisco • 5/31/2026
Adversarial Attacks, Attack Techniques
Grok 4.1 Fast in its non-reasoning configuration records a multi-turn ASR of 88.30%.
Cisco • 5/31/2026
Large Language Models, Adversarial Attacks