Hacking Back the AI-Hacker: Prompt Injection by your LLM as a Defense Against LLM-driven Cyberattacks
Large language models (LLMs) are increasingly being harnessed to automate cyberattacks, making sophisticated exploits more accessible and scalable. In response, we propose a new defense strategy tailored to counter LLM-driven cyberattacks. We introduce Mantis, a defensive framework that exploits LLMs’ susceptibility to adversarial inputs to undermine malicious operations. Upon detecting an automated cyberattack, Mantis plants Read more about Hacking Back the AI-Hacker: Prompt Injection by your LLM as a Defense Against LLM-driven Cyberattacks[…]