Hello, you are using an old browser that's unsafe and no longer supported. Please consider updating your browser to a newer version, or downloading a modern browser.
Training Camp • Cybersecurity Glossary
System prompt leakage is when an LLM exposes its hidden instructions, revealing sensitive configuration, guardrails, or secrets to attackers.
System Prompt Leakage Definition: System prompt leakage is when an LLM exposes its hidden instructions, revealing sensitive configuration, guardrails, or secrets to attackers.
System prompt leakage occurs when an LLM application inadvertently reveals its hidden system or developer instructions, which may contain sensitive configuration details, guardrail logic, internal rules, or embedded credentials. Attackers elicit this through crafted prompts that coax the model into reproducing or summarizing its instructions, undermining the assumption that system prompts remain confidential. It appears in the OWASP Top 10 for LLM Applications and can expose information that aids further attacks.
Turn knowledge into credentials with our instructor-led cybersecurity boot camps.
View All Courses →