
The Technical Anatomy of Model Extraction in 2026 (The Great AI Theft of the Century?)
Article Brief
Why this article matters
Model extraction has industrialized this year and perhaps last year as well. Attackers now run phased pipelines of synthetic queries, hydra-style account multiplexing, and targeted logit harvesting to distill proprietary models at scale. This post presents the mathematics behind knowledge distillation and temperature manipulation, explains why watermarking fails under character-level perturbations, and covers the 2026 disclosures from Anthropic, Google GTIG, and NDSS. The defense section goes beyond API rate limits to behavioral fingerprinting, semantic embedding clustering, and optional logit poisoning—giving you a structured mental model for why perimeter controls alone are insufficient.
Continue Reading
Next steps in the archive
Keep Exploring
Related reading
Continue through adjacent topics with the strongest tag overlap.

Rules vs. Skills: Creating Secure AI Context in Engineering Teams
At my company we ran into a familiar question while scaling AI coding assistants: when should context live in a Rule or `CLAUDE.md`, and when does it deserve a Skill...

MCP Security for Enterprise Organizations: Real-world experiences and advanced defense
A personal reflection and technical analysis on the MCP protocol, from the challenge of presenting to the community to the real-world methods and risks in AI Security, MCP Server, and recommended defenses for organizations. Includes resources, papers, and key sites for modern research in AI agent security.

A2AS: A New Standard for Security in Agentic AI Systems
Reflection, explanation, and analysis of the A2AS paper, the BASIC model, and the A2AS framework, from the perspective of real-world challenges in controls and attack mitigation in AI Security and GenAI Applications.

