LangWatch Evaluation

Checks whether an evaluation is in progress, records results, and runs evaluators and records their results. Optionally writes outputs to the dataset; in auto mode, outputs are written to the dataset only when the run was not started from a dataset or batch.
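The auto rule for dataset writes can be illustrated with a small sketch. This is not LangWatch's implementation or API: the function name `should_write_outputs`, the `write_mode` values `"off"` and `"auto"`, and the `from_dataset_or_batch` flag are all hypothetical names chosen for illustration, assuming the option is either disabled or set to auto.

```python
def should_write_outputs(write_mode: str, from_dataset_or_batch: bool) -> bool:
    """Decide whether outputs should be written to the dataset.

    Hypothetical helper: both parameter names and the mode strings are
    assumptions for illustration, not part of any LangWatch API.
    """
    if write_mode == "off":
        # Writing outputs to the dataset is optional; "off" skips it.
        return False
    if write_mode == "auto":
        # Auto: write only when the run was NOT started from a dataset/batch.
        return not from_dataset_or_batch
    raise ValueError(f"unknown write_mode: {write_mode!r}")


if __name__ == "__main__":
    # A run triggered outside a dataset/batch writes its outputs in auto mode.
    print(should_write_outputs("auto", from_dataset_or_batch=False))
    # A run started from a dataset/batch does not.
    print(should_write_outputs("auto", from_dataset_or_batch=True))
```

Under this reading, auto mode avoids writing outputs to the dataset when the run itself was started from a dataset or batch.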
