AI Glossary: What Is Model Extraction (ME)? Definition & Meaning

Modellextraktion

Model extraction refers to the process by which an individual or entity attempts to replicate a maschinellem Lernen model’s behavior and functionality by querying it. This is often done to gain access to the proprietary knowledge embedded in the original model without direct access to the model itself. The extracted model may not be identical to the original but can exhibit similar performance on specific tasks.

Der Prozess umfasst typischerweise das Senden einer Reihe von Eingaben an das Zielmodell und die Analyse der generierten Ausgaben. Durch systematisches Variieren der Eingaben und Beobachten der Ausgaben kann ein Angreifer die zugrunde liegenden Muster und Entscheidungsgrenzen des Modells ableiten. Diese Technik ist besonders bedenklich, wenn das Originalmodell auf sensiblen oder proprietären Daten trainiert wurde, da sie zu Diebstahl geistigen Eigentums oder unbeabsichtigter Offenlegung privater Informationen führen kann.

Model extraction attacks can occur in various contexts, including cloud-based machine learning services, where companies provide access to their models via APIs. Sicherheit measures such as Ratenbegrenzung, Ausgaberauschen addition, and input sanitization can help mitigate the risks associated with model extraction. However, as machine learning technologies continue to evolve, so too do the tactics employed by attackers, making it essential for organizations to remain vigilant about the security of their KI-Modelle.

Overall, model extraction presents significant challenges for the protection of intellectual property and sensitive data in the Bereich der künstlichen Intelligenz verwendet wird.