plan_type
and only want to run evaluations on traces from your enterprise customers). See adding metadata to your traces for more information.Run
(reference). This represents the sampled run to evaluate.{"correctness": 1, "silliness": 0}
would create two types of feedback on the run, one saying it is correct, and the other saying it is not silly.