Import judgments + calibrate¶
About this walkthrough
Estimated time: 3 minutes Tags: judgments, calibration, ground-truth
Skip LLM generation by importing pre-curated judgments, then run the kappa calibration to measure agreement against human ground truth.
Trouble playing? Download the walkthrough video.



