Hello, you are using an old browser that's unsafe and no longer supported. Please consider updating your browser to a newer version, or downloading a modern browser.
Training Camp • Cybersecurity Glossary
AI red teaming adversarially tests LLMs and AI systems for prompt injection, jailbreaks, data leakage, and harmful outputs to harden them.
AI Red Teaming Definition: AI red teaming adversarially tests LLMs and AI systems for prompt injection, jailbreaks, data leakage, and harmful outputs to harden them.
AI red teaming is the practice of adversarially testing artificial intelligence systems, especially large language models, to uncover vulnerabilities such as prompt injection, jailbreaks, data leakage, harmful content generation, and bias. It combines traditional security testing with techniques specific to model behavior, probing both the model and its surrounding application and tooling. Frameworks like the NIST AI Risk Management Framework and OWASP guidance for LLM applications inform structured AI red-teaming programs.
Turn knowledge into credentials with our instructor-led cybersecurity boot camps.
View All Courses →