Create monitoring tools to track performance, costs, and errors in production.
Create monitoring tools to track performance, costs, and errors in production.
This project is part of the Production category and is recommended for learners at Level 4. Expected difficulty: Advanced
Compare frontier models with cost, latency, and red-team checks using Promptfoo.
Ship structured outputs, agent tools, and long-context workflows with GPT-5.4.
Track traces, metrics, and evaluation runs for LLM and agent workflows.