Anthropic Developing Constitutional Classifiers to Safeguard ai Models from Jailbreak Attempts
Anthropic announced the development of a new system on monday that can protect artificial intelligence (ai) models from from jailbreaking attempts. Dubbed Constitutional Classifiers, it is a safeguarding technique that…