From 72fa1f86fc7de91cb5193f9fbc468f5139b33c37 Mon Sep 17 00:00:00 2001
From: Kaloyan Nikolov
Date: Mon, 27 Apr 2026 19:03:52 +0200
Subject: [PATCH] docs: note pi-mono vs opencode harness usage

---
 README.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/README.md b/README.md
index 340915f..a81047e 100644
--- a/README.md
+++ b/README.md
@@ -15,6 +15,8 @@ Head-to-head evaluation of six coding LLMs across eight low-level ML kernel task
 
 **Take every score with a grain of salt.** LLM judges can be consistent but are not infallible. The relative rankings are more useful than the exact numbers.
 
+**Tooling:** The first 3 challenges (KV-Cache, Fused Softmax+TopK, Layer Norm Backward) were generated using **pi-mono** as the harness. The remaining 5 challenges were generated using **opencode**.
+
 ---
 
 ## TL;DR — Final Rankings