Capability thresholds for general-purpose models
General-purpose models can be repurposed for sensitive tasks. We propose capability thresholds for reasoning, generation, and autonomy that, once crossed, trigger additional safeguards. This keeps low-risk uses flexible while ensuring that higher-stakes deployments undergo rigorous testing and oversight.