.plan-26-18: From tropical forest protection to oi swallowing its oxcaml tail

Our REDD+ over-crediting paper hits Nature Communications just as Microsoft retreats from removals, we talk responsible evidence synthesis while LLMs appear in UK planning, and oi grows a self-update bootstrap.

1 REDD+ over-crediting in Nature Communications

Our paper on learning lessons from over-crediting in REDD+ projects came out this week in Nature Communications, led by Thomas Swinfield. The reception has been encouragingly positive, especially with the framing that "bad credits are not the same as bad projects".

The paper lands at a particularly low moment for the carbon market, which is reeling from Microsoft's abrupt retreat from carbon removals: the single largest buyer of removals is scaling back commitments it made only a few months ago. While this hits the whole stack, from direct-air-capture pipelines through to nature-based projects, the market can hopefully now rebuild with nature-based and technological removals/avoidance working in harmony rather than bickering about which specific method is 'the best'.

We need them all, and the necessary response is to raise the floor on both at once, not to try to pick a winner. The Cambridge University and Computer Lab writers have done a great job carrying that point through, and I've written up my own full argument for those who want to dig in.

2 The inevitable rise of LLMs in government decision-making

Sadiq Jaffer and Sam Reynolds also gave a great talk at the Digital Statecraft Academy on our evidence synthesis work, titled "The inevitable rise of Large Language Models in government decision making". Civil servants and policy folk in the room asked practical questions about how to do the right thing. One thread was about best practices for reducing hallucinations, with responses covering retrieval grounding, structured outputs, human-in-the-loop checkpoints, and maintaining proper evaluation harnesses.

The inevitable rise of Large Language Models in government decision making (Sadiq and Sam)

A second thread concerned the computational and power crunch involved in keeping all of this affordable as adoption scales across government. We discussed the use of smaller specialised models, on-prem inference for sensitive workloads, and the open question of whether the UK has the data-centre capacity to host serious sovereign deployments. A third was whether quantum computing changes the picture (quick answer: no).

Just as this was all happening, the government announced that Google had won a tender for planning-decision automation. English councils are now trialling a Google AI tool to speed up planning, which is precisely the kind of black-box deployment my red-pill/blue-pill argument was cautioning against. Decisions affecting people's homes are being filtered through opaque models with no public scrutiny of the reasoning chain.

Sadiq and I took a closer look at the tender notice and spotted that bidders were required to integrate with the incumbent planning systems, which effectively freezes out smaller UK players. Around this time last year, I was looking at how the UK might benefit from an open-data substrate via a national data library. Without a credible open layer, every public-sector AI tender will keep collapsing onto the same handful of incumbent vendors.

If there's a bright spot, it's that the questions from the Statecraft audience suggest civil servants increasingly understand this, and the government itself is awarding more contracts to UK firms in other sectors. We'll get there with the open AI story...

3 Hacking updates

3.1 Cyrus visits to talk Hazel

The week started with a fun visit from KC Sivaramakrishnan (over for a PaPOC keynote), Cyrus Omar, Andrew Blinn and Matthew Keenan (formerly an undergrad here at Cambridge, now doing a PhD with Cyrus at UMich).

We all sat down with Ryan Gibb to brainstorm over ideas for how to combine recent advances in Hazel with the OxCaml work going on around here.

The two most exciting outcomes were the emergence of a Hazel CLI (so we can integrate it more easily into both MDX workflows and agentic coding), and Ryan's package calculus as the basis for a brand new approach to expressing dependencies (in a language with no backwards-compatibility baggage to worry about). More on this as we convene next at PROPL3 in June!

4 oi gains self-update

I've been making steady progress on oi, the uv-style binary distributor I started working on a few weeks ago, and have been dogfooding it with a few others. Mark Elvers and I have been cross-checking that our tools are compatible while doing OCaml maintenance.

The big new feature this week is oi self update. Having distributed binaries via oi for a few weeks, the next obvious step was for oi itself to become one of the binaries it updates. This makes pushing fixes much less painful, and brings it closer to self-hosting.
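At its core, a self-update like this boils down to comparing the running binary's version against the latest published one and swapping the executable on a mismatch. Here's a minimal sketch of just the version check; the function names and dotted-version scheme are my own assumptions, not oi's actual code:

```ocaml
(* Hypothetical sketch of the version check behind a self-update
   subcommand; oi's real implementation may differ entirely. *)

(* Parse a dotted version string like "0.3.1" into an int list. *)
let parse_version s =
  String.split_on_char '.' s |> List.map int_of_string

(* True if [latest] is strictly newer than [current]. Structural
   comparison on int lists is lexicographic, so an equal prefix with
   an extra component (e.g. 1.0.1 vs 1.0) counts as newer. *)
let needs_update ~current ~latest =
  compare (parse_version latest) (parse_version current) > 0

let () =
  if needs_update ~current:"0.3.1" ~latest:"0.4.0" then
    print_endline "update available"
```

The nice property of doing this inside oi itself is that the same check can then drive updates for every other binary it distributes.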

The features I really want oi to nail at this point are:

  • how to quickly run a binary in the uv style where you don't install anything. I've added oix for this now, so that oix utop just works — backed by source tracking and a local cache.
  • easy static-binary builds so you can ship a single binary that runs anywhere without thinking about which libc/arch the target is on. oi handles this by shelling out to Docker for the static-build pipeline. I'm still working on wiring the binary builds through so that it just works (and I need to investigate fat binaries to see if they're worth it, but I'm guessing not).
  • updates as a library, so that binaries distributed via oi can check for and apply their own updates.
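For the oix-style run-without-installing flow, the essential mechanism is a cache lookup that only fetches on a miss. A minimal sketch under my own assumptions about layout and naming (this is illustrative, not oi's actual implementation):

```ocaml
(* Hypothetical sketch of the lookup an oix-style runner might do.
   The cache directory and the [fetch] callback are stand-ins for
   the real download-and-unpack pipeline. *)

let cache_dir = Filename.concat (Filename.get_temp_dir_name ()) "oix-cache"

(* Return the cached path for [tool], calling [fetch] only on a miss. *)
let resolve ~fetch tool =
  let path = Filename.concat cache_dir tool in
  if Sys.file_exists path then path
  else begin
    if not (Sys.file_exists cache_dir) then Sys.mkdir cache_dir 0o755;
    fetch ~dest:path;  (* populate the cache on first use *)
    path
  end

let () =
  let fetch ~dest =
    let oc = open_out dest in
    output_string oc "placeholder";
    close_out oc
  in
  let first = resolve ~fetch "utop" in   (* miss: fetches into the cache *)
  let second = resolve ~fetch "utop" in  (* hit: served from the cache *)
  assert (String.equal first second)
```

With this shape, `oix utop` is just resolve-then-exec, and the source tracking mentioned above would decide when a cached entry is stale.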

I've also been hacking with Thomas Gazagnaire to merge his monopampam tree back into the agentic-libraries trees from last year. Thomas has been doing an enormous amount of new coding for space protocols and has built a lovely CCSDS protocol stack. I've merged almost all of his changes back into my OCaml trees, and will look at OxCaml merges next. More on this mega monorepo as it stabilises in the next few weeks!

References

[1] Madhavapeddy (2026). Discussing effective conservation with all the UK Chief Scientists. doi:10.59350/qjrmv-38130
[2] Madhavapeddy (2025). Thoughts on the National Data Library and private research data. doi:10.59350/fk6vy-5q841
[3] Swinfield et al (2026). Learning lessons from over-crediting to ensure additionality in forest carbon credits. Nature Communications. doi:10.1038/s41467-026-71552-3
[4] Iyer et al (2025). Careful design of Large Language Model pipelines enables expert-level retrieval of evidence-based information from syntheses and databases. doi:10.1371/journal.pone.0323563
[5] Gibb et al (2026). Package Managers à la Carte: A Formal Model of Dependency Resolution. arXiv. doi:10.48550/arXiv.2602.18602
[6] Pfeifer et al (2026). Sliding Into Silence? We Are Speaking 300 Daily Words Fewer Every Year. doi:10.1177/17456916261425131