shikumi-eval: Typed evaluation framework for shikumi LM programs (EP-8)

[ ai, bsd3, library ] [ Propose Tags ] [ Report a vulnerability ]

The evaluation framework for shikumi: the owned data model (Example, Prediction, Dataset, Metric, Score, Report — MasterPlan integration point #5), built-in pure and LM-backed metrics, an evaluate runner that scores a Shikumi.Program.Program over a typed dataset with bounded parallelism and per-example error boundaries, and golden testing that pins a program's behaviour deterministically under a mock or replayed LM.

Modules

[Index] [Quick Jump]

Shikumi
- Shikumi.Eval

Downloads

shikumi-eval-0.2.0.1.tar.gz [browse] (Cabal source package)
Package description (as included in the package)

Maintainer's Corner

Package maintainers

shinzui

For package maintainers and hackage trustees

edit package information

Candidates

No Candidates

Versions [RSS]	0.1.0.0, 0.1.0.1, 0.1.1.0, 0.2.0.0, 0.2.0.1
Change log	CHANGELOG.md
Dependencies	aeson, baikai (>=0.4 && <0.5), base (>=4.20 && <5), bytestring, containers, effectful, generic-lens, lens (>=5.3 && <5.4), shikumi (>=0.3.0.0 && <0.4), tasty, tasty-golden, text (>=2.1 && <2.2), vector [details]
License	BSD-3-Clause
Author	Nadeem Bitar
Maintainer	nadeem@gmail.com
Uploaded	by shinzui at 2026-07-21T02:42:14Z
Category	AI
Distributions
Reverse Dependencies	1 direct, 0 indirect [details]
Downloads	28 total (12 in the last 30 days)
Rating	(no votes yet) [estimated by Bayesian average]
Your Rating	λ λ λ
Status	Docs uploaded by user Build status unknown [no reports yet]