The Crescendo Multi-Turn LLM Jailbreak Attack
In this paper, we introduce a novel jailbreak attack called Crescendo. Unlike existing jailbreak methods, Crescendo is a multi-turn jailbreak that interacts with the model in a seemingly benign manner. It begins with a general prompt or question about the task at hand and then gradually escalates the dialogue by referencing the model’s replies, progressively […]