CipherChat
Visit ToolCipherChat evaluates the generalization capability of safety alignment in large language models (LLMs). It helps researchers understand the limitations of current safety measures in LLMs by examining transfer to non-natural languages.
At a glance
Trending