What is Model Extraction Attack?

A model extraction attack steals an ML model by querying its API and training a substitute model that mimics its behavior, exposing IP and enabling further attacks.

Model Extraction Attack Definition: A model extraction attack steals an ML model by querying its API and training a substitute model that mimics its behavior, exposing IP and enabling further attacks.

Understanding Model Extraction Attack

A model extraction (or model stealing) attack is an adversarial machine-learning technique in which an attacker repeatedly queries a target model's prediction API and uses the inputs and outputs to train a substitute model that approximates the original's behavior. This steals intellectual property, exposes the victim to query-based monetary cost, and can serve as a stepping stone for crafting transferable adversarial examples or inferring training data. Defenses include rate limiting, query monitoring, output perturbation, and watermarking.

Learn More About Model Extraction Attack:

Ready to Get Certified?

Turn knowledge into credentials with our instructor-led cybersecurity boot camps.

View All Courses →

Popular Searches

Quick Links

Popular Searches

Quick Links

Trending Topics

What is Model Extraction Attack?

Understanding Model Extraction Attack

Learn More About Model Extraction Attack:

Ready to Get Certified?

What is Model Extraction Attack?

Understanding Model Extraction Attack

Learn More About Model Extraction Attack:

Related Cybersecurity Terms

Ready to Get Certified?