shikumi-eval: Typed evaluation framework for shikumi LM programs (EP-8)
The evaluation framework for shikumi: the owned data model (Example,
Prediction, Dataset, Metric, Score, Report — MasterPlan integration
point #5), built-in pure and LM-backed metrics, an evaluate runner that
scores a Shikumi.Program.Program over a typed dataset with bounded
parallelism and per-example error boundaries, and golden testing that pins a
program's behaviour deterministically under a mock or replayed LM.
Modules
[Index] [Quick Jump]
Downloads
- shikumi-eval-0.1.0.0.tar.gz [browse] (Cabal source package)
- Package description (as included in the package)
Maintainer's Corner
For package maintainers and hackage trustees
Candidates
- No Candidates
| Versions [RSS] | 0.1.0.0 |
|---|---|
| Change log | CHANGELOG.md |
| Dependencies | aeson, baikai, base (>=4.20 && <5), bytestring, containers, effectful, generic-lens, lens (>=5.3 && <5.4), shikumi (>=0.1.0.0 && <0.2), tasty, tasty-golden, text (>=2.1 && <2.2), vector [details] |
| License | BSD-3-Clause |
| Author | Nadeem Bitar |
| Maintainer | nadeem@gmail.com |
| Uploaded | by shinzui at 2026-06-13T15:20:25Z |
| Category | AI |
| Distributions | |
| Reverse Dependencies | 1 direct, 0 indirect [details] |
| Downloads | 4 total (4 in the last 30 days) |
| Rating | (no votes yet) [estimated by Bayesian average] |
| Your Rating | |
| Status | Docs uploaded by user Build status unknown [no reports yet] |