Defined, testable conditions used to determine whether a deliverable or service meets requirements (e.g., performance thresholds, minimum evaluation scores, latency limits, groundedness targets, required safety-test results). In AI agreements, acceptance criteria are often tied to a specific intended use and evaluation methodology.
See: Benchmark; Evaluation (evals); Intended use; SLA/SLO; Testing
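Illustration: one way to make criteria "testable" is to state each as a metric, a comparison, and a threshold, then check measured results against them. The following is a minimal sketch only; the metric names, threshold values, and the `Criterion` and `evaluate_acceptance` helpers are hypothetical and not drawn from any particular agreement or library.

```python
from dataclasses import dataclass
import operator


@dataclass(frozen=True)
class Criterion:
    """One acceptance criterion: a metric compared against a threshold."""
    metric: str
    comparator: str  # ">=" (at least) or "<=" (at most)
    threshold: float


_COMPARATORS = {">=": operator.ge, "<=": operator.le}


def evaluate_acceptance(criteria, measured):
    """Return (accepted, failures) for measured results against the criteria.

    A criterion with no measurement counts as a failure, so missing
    evidence never results in acceptance.
    """
    failures = []
    for c in criteria:
        value = measured.get(c.metric)
        if value is None:
            failures.append(f"{c.metric}: no measurement")
        elif not _COMPARATORS[c.comparator](value, c.threshold):
            failures.append(f"{c.metric}: {value} fails {c.comparator} {c.threshold}")
    return (not failures, failures)


# Hypothetical criteria and results, for illustration only.
criteria = [
    Criterion("groundedness", ">=", 0.90),
    Criterion("p95_latency_ms", "<=", 800.0),
    Criterion("safety_pass_rate", ">=", 0.99),
]
measured = {
    "groundedness": 0.93,
    "p95_latency_ms": 950.0,
    "safety_pass_rate": 0.995,
}

accepted, failures = evaluate_acceptance(criteria, measured)
print(accepted)
print(failures)
```

In this sketch the deliverable is not accepted, because the measured latency exceeds its limit even though the other two criteria pass. In practice, the agreement would also need to specify the evaluation methodology (dataset, sample size, measurement conditions) that produces the measured values.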