26 December 2025

2025 Advent of Agentic Humps: Building a useful O(x)Caml library every day

An exploration of agentic programming through building useful OCaml libraries daily using Claude Code while establishing groundrules for responsible development.

Agentic programming has been getting a hilariously bad rap in the OCaml community recently, but it's definitely here to stay despite the security and legal concerns. I realised that to form a useful opinion on all this, I needed to really get into using Claude with OCaml for real outputs and not just toy code. So this holiday month, I'm going to release a new useful OCaml library per day until Christmas using Claude Code: the advent of agentic humps is here!

Day 1: Crockford for Crockford Base32 encoding.
Day 2: Jsonfeed for an implementation of the JSONFeed 1.1 spec.
Day 3: XDGe for an XDG Base Directory Specification library with Eio capabilities.
Day 4: Claudeio for a Claude OCaml/Eio SDK so I can use Claude to write more Eio.
Day 5: Bytesrw-eio for a Bytesrw/Eio adapter, and automating opam metadata via a custom Claude skill.
Day 6: Yamlrw for a pure OCaml Yaml 1.2 library, to replace ocaml-yaml's C binding.
Day 7: Yamlt to allow jsont codecs to be serialised to Yaml as well as JSON.
Day 8: Sortal: a contacts management CLI using Yaml, Git and Cmdliner.
Day 9: Sortal-Bonsai: adding a Bonsai_term terminal UI to Sortal via Async.
Day 10: Sortal-Mosaic: adding a Mosaic terminal UI to Sortal via Eio.
Day 11: Cookeio, Public-suffix, Punycode: parsing Internet RFCs to build cookie libraries.
Day 12: Conpool: Eio TLS/TCP connection pooling and self-contained performance viz.
Day 13: Requests: Heckling an OCaml HTTP client from 50 other implementations.
Day 14: Karakeep: Live agentic API construction for the Karakeep app.
Day 15: Htmlrw: Vibespiling Rust/Python into a 100% compliant HTML5 manipulation library.
Day 16: Json-pointer: Vibesplaining specifications by generating OCaml Javascript notebooks.
Day 17: Jmap: Vibemailing little CLI agents to bring my JMAP messages under control.
Day 18: Tomlt: Elegant TOML 1.1 codecs inspired by the jsont data soup paper.
Day 19: Zulip, INIt: Zulip bot framework and INI codecs compatible with Python configparser.
Day 20: Langdetect: Statistical detection for human languages in OCaml, JavaScript and wasm.
Day 21: Html5rw_check: Vibespiling the Nu HTML Validator from Java to typed OCaml checkers.
Day 22: Monopam: Monorepo workflow with dune vendoring for cross-cutting fixes.
Day 23: Unpac: Unifying git and opam package management with branch-based monorepos.
Day 24: Tuatara: an evolving Atom aggregator that mutates its own code.
Day 25: OCaml Claude Marketplace: Wrapping up my Claude skills into a reusable bundle.
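
To give a flavour of how small and spec-shaped most of these libraries are, here is a minimal sketch of Crockford Base32 for non-negative integers. It is not the API of the Day 1 Crockford library: the names alphabet, encode, value_of_char and decode are made up for illustration, and the optional check symbols from Crockford's specification are left out.

```ocaml
(* Crockford's Base32 alphabet: the digits followed by the uppercase
   letters, omitting I, L, O and U. *)
let alphabet = "0123456789ABCDEFGHJKMNPQRSTVWXYZ"

(* Encode a non-negative int, most significant symbol first.
   (Hypothetical helper, not the Day 1 library's interface.) *)
let encode n =
  if n < 0 then invalid_arg "encode: negative input";
  let rec go n acc =
    let acc = String.make 1 alphabet.[n land 31] ^ acc in
    if n < 32 then acc else go (n lsr 5) acc
  in
  go n ""

(* Map one symbol to its 5-bit value. Lowercase is accepted, and the
   look-alike letters O, I and L decode as 0, 1 and 1 respectively. *)
let value_of_char c =
  match Char.uppercase_ascii c with
  | 'O' -> Some 0
  | 'I' | 'L' -> Some 1
  | c -> String.index_opt alphabet c

(* Decode a string, skipping hyphens. Returns None on any invalid
   symbol (such as 'U'). Requires OCaml 4.13+ for String.fold_left. *)
let decode s =
  let step acc c =
    match acc, c with
    | None, _ -> None
    | Some _, '-' -> acc
    | Some n, c -> Option.map (fun v -> (n lsl 5) lor v) (value_of_char c)
  in
  String.fold_left step (Some 0) s
```

With these definitions, encode 1234 gives "16J", and decode accepts "16j" or "1-6J" and returns Some 1234 for both.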

Claude is also very good at automating non-coding tasks like opam metadata

I'm working through a large backlog of ideas that I'll figure out as each day goes on. Ideas thrown on the pile by colleagues include a TCP connection reuse and pooling library with TLS support, HTTP cookie jar handling using Eio, a batteries-included HTTP(S) client library with redirects and cookies, digesting vast amounts of Git and summarising it (see a preview), Zulip bindings using Requests and Eio, the Kitty graphics protocol to show graphics in your terminal, client bindings for the JMAP protocol, client bindings for the Immich self-hosted photo service, client bindings for the Peertube video service, generating image srcsets in various resolutions for websites, DOI resolution of papers to structured metadata, and a Parquet library in pure OCaml. I'm also working on an io_uring OxCaml webserver, if I can stop the Linux kernel crashing on me before Santa visits...

My overall goal is to accelerate the heck out of how I manage the growing data in this website. I've been building it as homebrew infrastructure for the past twenty-five years, and now I want it to move from ad-hoc scripts to principled data management. I am also using the libraries for data processing in the day job, for the remote sensing of nature and for evidence synthesis. I'll edit the above list every day to link to what I actually did.

I've picked these fairly carefully: they're not "core" libraries that are difficult to write and require functional ingenuity, but problems that involve a fair amount of boilerplate code that is typically quite tedious to write in OCaml. Hand writing code might be on the ropes, but it's not quite out of action just yet! But first, let's establish some groundrules for whether this is a good idea or not.

1 Isn't this just more AI slop code?

There's a definite gag reflex involved with releasing so much code: by prioritising quantity over quality, aren't I just contributing to the world of AI slop? However, the hypothesis I am exploring is that using agents fundamentally shifts the software engineering process towards specification-driven development, which has always been the holy grail of functional programming.

There's been extensive discussion recently about the role of LLMs in open source elsewhere that informed my thinking. I liked Thibaut Mattio stating how he's approaching his own agentic software development:

AI writes a significant amount of the initial code, and I review, revise, and iterate on a large portion of it. That’s how I work these days. But the architecture, design, and core logic are very much the result of deliberate iteration and manual refinement. -- Thibaut Mattio, OCaml Discuss, 2025

Bryan Cantrill came up with a superb set of principles for LLM usage at Oxide. In particular, he separates out using LLMs for reading, writing and coding. I totally agree with him on writing: I hate people sending me LLM-generated prose to review, and would rather get the raw prompt and use my own LLM+context than read through other people's slop.

LLM-generated prose undermines a social contract of sorts: absent LLMs, it is presumed that of the reader and the writer, it is the writer that has undertaken the greater intellectual exertion. (That is, it is more work to write than to read!) For the reader, this is important: should they struggle with an idea, they can reasonably assume that the writer themselves understands it — and it is the least a reader can do to labor to make sense of it. -- Using LLMs at Oxide, RFD0576, Dec 2025

However, there is an undeniable (and growing) power in the ability to generate code at scale using LLMs. I've been doing a lot of this with Python in recent months, but I find myself increasingly frustrated by the lack of typing guardrails involved with agentic coding there.

I believe that a strongly typed, modular language like OCaml could become one of the best languages for agentic coding in the longer term, with advances happening rapidly to cure the data deficiency problem for relatively obscure languages with smaller corpuses. Also, with OxCaml on the horizon, getting help with increasingly complex (but rewarding) code annotations such as modes and kinds seems essential.
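
As a rough illustration of what "typing guardrails" means here, the sketch below uses a module signature as a tiny specification. Everything in it (PORT, Port, scheme, default_port) is a hypothetical example rather than code from any library in this series. The point is only that an agent, or a human, must satisfy the signature and cover every variant case before the compiler accepts the code.

```ocaml
(* The signature is the specification: t is abstract, so the only way
   to obtain a port is through the validating constructor. *)
module type PORT = sig
  type t
  val of_int : int -> t option
  val to_int : t -> int
end

(* An implementation that does not match PORT is rejected at compile
   time, whoever (or whatever) wrote it. *)
module Port : PORT = struct
  type t = int
  let of_int n = if n >= 1 && n <= 65535 then Some n else None
  let to_int n = n
end

type scheme = Http | Https

(* Adding a constructor to scheme makes the compiler flag this match
   as non-exhaustive until the new case is handled. *)
let default_port = function
  | Http -> 80
  | Https -> 443
```

In this sketch, Port.of_int 0 is None, and a caller cannot pass a raw int where a Port.t is expected, which is the sort of mistake that tends to slip through in untyped Python.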

2 Groundrules for the Advent of Agentic Humps

After reflecting on the recent discussions, I decided on these for my little December experiment:

No AI-driven contributions to other people's code. All my slop stays in my own lane unless the other person agrees. Luckily my own research group is easy to bribe with some festive beer so I hope to get them (or you, my dear reader) to voluntarily help me judge the success or failure.
Read every line of code that's tagged for release. Even if I haven't written it all, it's vital to look for howlers. However, intermediate pushes may have slop in them, so stick to the tagged releases.
The library has to be used somewhere in my production code stack, for example this website. Time to eat my own agentic slop on my own knowledge bases!
Build on great human-designed code. LLMs do not replace or compete with well-designed foundation libraries in the OCaml ecosystem like Eio, Core, Lwt or the Bunzli-verse. Each of these has a different design ethos, but if they didn't exist there would be no scaffolding over which to compose LLM-driven code outputs. So this is not a competition to beat them, but rather to use them more effectively.

And overall, this process should not only help me learn more about agentic workflows but also contribute to the wider discussion, so I'll capture what I learn in this blog series at the end.

Some non-rules:

Keeping agentic code separate from my "real code" seems pointless nowadays, with LLMs everywhere. I tried that earlier in the year, but I fear the poisoning will have to be dealt with by other means.
I'm trying to keep this specific to my own OCaml workflow, and not generalising this for a hypothetical other user. But you should feel free to fork this stuff.
I have no idea how I'm going to maintain all these libraries once released. A problem for 2026. I'm not particularly attached to any of these libraries, so maintenance/rewrite offers are all fine by me.
There's a reasonable chance some of this has some bad bugs, since it's not going through peer review. I'll do my best to handle test coverage, but please be tolerant. Bug reports are welcome.
I've done my best to manually scan code and attribute copyright where possible, but there remains a chance I have horribly screwed up. Any errors in attribution are my own, but I'm going to press on and take the risk.

If anyone else wants to join in the Advent of Agentic Humps, ping me on whatever communication medium you like. Just remember the groundrules: don't waste other maintainers' time without their permission first.

References

[1] Madhavapeddy (2025). Oh my Claude, we need agentic copilot sandboxing right now. 10.59350/aecmt-k3h39

[2] Madhavapeddy (2025). Is AI poisoning the scientific literature? Our comment in Nature. 10.59350/pbxew-d2j78

[3] Madhavapeddy (2025). Arise Bushel, my sixth generation oxidised website. 10.59350/0r62w-c8g63

[4] Madhavapeddy (2025). Holding an OxCaml tutorial at ICFP/SPLASH 2025. 10.59350/55bc5-x4p75

[5] Madhavapeddy (2025). GeoTessera 0.7 out with efficient sampling and Zarr support. 10.59350/nagwp-tnw89

[6] Boruch-Gruszecki et al. (2025). Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment. arXiv. 10.48550/arXiv.2508.04865
