AFM Safety Studio builds safety test trajectories for Apple Foundation Models
The tool provides a browser UI and CLI that run Apple Foundation Models via the `fm serve` endpoint, capturing each tool call in a structured conversation. Users act as the human, label each turn as safe or unsafe, and export the resulting trajectory JSON for downstream analysis and QC. It includes a library of predefined safety scenarios, a simulated world of Siri‑style tools, and integration hooks for Surge’s annotation components. Compared to ad‑hoc scripts, it offers an end‑to‑end workflow, visual interface, and reusable scenario framework for AI safety researchers.
View on GitHub →jackdonaldson-surge/afm-safety-studio