Projects

Things I build in the open.

Small, focused tools. Most started as a frustration that wouldn't go away. PRs welcome.

evalkit

Python

Tiny, opinionated eval harness for LLM apps with replayable traces.

1.2k · github →

tinyrag

TypeScript

A minimal, dependency-light retrieval-augmented generation core.

640 · github →

model-card-cli

Python

Generate transparent model cards from training artifacts.

318 · github →

vec-bench

Go

Reproducible vector-DB benchmark suite with cost modeling.

511 · github →

promptlint

Rust

Static analysis for prompt templates. Catches stray braces and PII risks.

204 · github →

noir-ui

TSX

A dark, gold-accented React component kit. Powers this site.

98 · github →