GuardML

Defensive AI — guardrails, content filters, model defenses, safe deployment.

Latest

OpenAI's Under-18 Principles: what the new Model Spec teen guardrails actually do

OpenAI's December 18 Model Spec adds Under-18 Principles, an age-prediction classifier, and real-time moderation across modalities. Here is what those defenses cover, where they have already been bypassed, and what to layer on top if you ship for minors.
