Recent

The Technical Anatomy of Model Extraction in 2026 (The Great AI Theft of the Century?)

Apr 8, 20269 min read

On this page

Article Brief

Why this article matters

Model extraction has industrialized this year and perhaps last year as well. Attackers now run phased pipelines of synthetic queries, hydra-style account multiplexing, and targeted logit harvesting to distill proprietary models at scale. This post presents the mathematics behind knowledge distillation and temperature manipulation, explains why watermarking fails under character-level perturbations, and covers the 2026 disclosures from Anthropic, Google GTIG, and NDSS. The defense section goes beyond API rate limits to behavioral fingerprinting, semantic embedding clustering, and optional logit poisoning—giving you a structured mental model for why perimeter controls alone are insufficient.

RecentPublished 2 months ago

Next steps in the archive

Older article

Rules vs. Skills: Creating Secure AI Context in Engineering Teams

At my company we ran into a familiar question while scaling AI coding assistants: when should context live in a Rule or `CLAUDE.md`, and when does it deserve a Skill...

Keep Exploring

Related reading

Continue through adjacent topics with the strongest tag overlap.

Mar 27, 2026AI Security / MLOps

Rules vs. Skills: Creating Secure AI Context in Engineering Teams

At my company we ran into a familiar question while scaling AI coding assistants: when should context live in a Rule or `CLAUDE.md`, and when does it deserve a Skill...

#AI Security#GenAI Applications#MLOps

Oct 25, 2025AI Security

MCP Security for Enterprise Organizations: Real-world experiences and advanced defense

A personal reflection and technical analysis on the MCP protocol, from the challenge of presenting to the community to the real-world methods and risks in AI Security, MCP Server, and recommended defenses for organizations. Includes resources, papers, and key sites for modern research in AI agent security.

#AI Security#MCP Protocol#GenAI Applications

Sep 29, 2025AI Security

A2AS: A New Standard for Security in Agentic AI Systems

Reflection, explanation, and analysis of the A2AS paper, the BASIC model, and the A2AS framework, from the perspective of real-world challenges in controls and attack mitigation in AI Security and GenAI Applications.

#AI Security#GenAI Applications#Academic Research