shikumi-eval: Typed evaluation framework for shikumi LM programs (EP-8)

[ ai, bsd3, library ] [ Propose Tags ] [ Report a vulnerability ]

The evaluation framework for shikumi: the owned data model (Example, Prediction, Dataset, Metric, Score, Report — MasterPlan integration point #5), built-in pure and LM-backed metrics, an evaluate runner that scores a Shikumi.Program.Program over a typed dataset with bounded parallelism and per-example error boundaries, and golden testing that pins a program's behaviour deterministically under a mock or replayed LM.

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

  • No Candidates
Versions [RSS] 0.1.0.0
Change log CHANGELOG.md
Dependencies aeson, baikai, base (>=4.20 && <5), bytestring, containers, effectful, generic-lens, lens (>=5.3 && <5.4), shikumi (>=0.1.0.0 && <0.2), tasty, tasty-golden, text (>=2.1 && <2.2), vector [details]
License BSD-3-Clause
Author Nadeem Bitar
Maintainer nadeem@gmail.com
Uploaded by shinzui at 2026-06-13T15:20:25Z
Category AI
Distributions
Reverse Dependencies 1 direct, 0 indirect [details]
Downloads 4 total (4 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs uploaded by user
Build status unknown [no reports yet]