med-routing — evaluate on your data

  1. Upload
  2. Generate ground truth
  3. Evaluate routers
  4. Eval report

1 — upload your eval data

CSV with columns question (required) and ground_truth, subject, difficulty, qid (all optional). If ground_truth is present, we skip generation; otherwise step 2 generates it with a frontier model.
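To check a file locally before uploading, a minimal sketch along these lines may help. It uses only the Python standard library and the column names listed above. The function name check_eval_csv and the sample row are hypothetical, and treating a missing ground_truth column as "needs generation" is an assumption about how the tool decides; it is not taken from the tool itself.

```python
import csv
import io

# Column names come from the upload instructions; everything else is illustrative.
REQUIRED = {"question"}
OPTIONAL = {"ground_truth", "subject", "difficulty", "qid"}


def check_eval_csv(text):
    """Parse CSV text and return (rows, needs_generation).

    Raises ValueError if the required 'question' column is missing.
    """
    reader = csv.DictReader(io.StringIO(text))
    header = set(reader.fieldnames or [])
    missing = REQUIRED - header
    if missing:
        raise ValueError(f"missing required column(s): {sorted(missing)}")
    rows = list(reader)
    # Assumption: ground truth must be generated when the column is absent.
    needs_generation = "ground_truth" not in header
    return rows, needs_generation


# Made-up sample with no ground_truth column.
sample = "question,subject\nExample question text,cardiology\n"
rows, needs_generation = check_eval_csv(sample)
print(len(rows), needs_generation)
```

Columns outside the listed set are passed through untouched here; whether the uploader accepts or rejects them is not stated above.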