Most of my week has been dominated by being on the BBC/ITV/etc and so just a short weeknote this time around! Aside from all the media coverage, there have been some fun events and hacking going on.
1 Pint of Science Cambridge about TESSERA
Sadiq Jaffer spoke at the Cambridge Pint of Science, which was held in the congenial surroundings of the Station Tavern near the train station. The room itself was quite small and long, and was hugely noisy due to being right beside the actual pub, but it was a wonderfully quirky and informal way to present science to a generalist audience. The specific theme of the event we attended was "How is AI accelerating science?", and other speakers ranged from Moe Vali talking about ultrasound detection of Adenomyosis to Anna Breger talking about reconstructing medieval music from ancient transcripts! Sadiq had the pressure piled on being the last speaker at a point when the audience had mostly had their third pint:
As always though, Sadiq pulled off explaining these complex ideas just brilliantly! His slides were some of the best I've seen yet, as it was done with a combination of his expert knowledge about TESSERA and the use of Claude Design (the latest iteration of an AI vision model from Anthropic).

You can also see another iteration of Sadiq speaking about TESSERA in this news piece on ITV the day after this talk.
2 Cambridge Part II projects due in
The Cambridge undergrads had to get their Part II dissertation projects in, and in particular I really enjoyed two (which I couldn't directly mark/supervise as I'm on sabbatical, but I cheered on from the sidelines).
2.1 Conservation actions using multimodal foundation models
The first is Radhika Iyer on applying TESSERA to test the effectiveness of conservation actions which was a very bold foray into the unknown that she did a brilliant job of writing up. I'll talk more about this one in a while, but the execution in combining two very separate (and cutting edge) projects into a cohesive thesis was remarkable work.
2.2 Hybrid vector databases in O(x)Caml
The other was as ambitious but in a totally different dimension: the first OxCaml project we've supervised here with an undergraduate, with Ryan Gibb keeping a close eye. Oliver Fogelin went off and built a hybrid vector RAG database in OxCaml, including embedding a big stack of arxiv papers and showing a beautiful browser visualisation of the embeddings in an interactive way. You can try it for yourself on his site.


We had an entertaining session in my office with Oli demonstrating it to us. I almost got it to run out of the box via oix --toolchain=oxcaml --with=https://github.com/olifog/gvecdb-ocaml gvecdb-server, except we got scuppered by a few (valid) build failures in dependent libraries that he had patched locally but not committed. It's very close to being able to work, though!
This is some of the most impressive hacking I've seen in a Part II project in a while, and I'm very much going to try to replace my shaky website search with Oli's code when things calm down a bit this summer!
3 TESSERA v1.1 updates
We've had a flurry of activity on preparing a TESSERA 1.1 model update with Frank Feng pulling out all the stops on training with help from Nvidia. This will be the first updated iteration of the TESSERA mode that we've issued. More on the cool new features next week, but in the meanwhile some headlines are:
- We've put up the weights on Huggingface in a new
geotesseraorg. You can find both TESSERA-V-1.0 and TESSERA-V-1.1 there now. - Mark Elvers has an impressive OxCaml inference pipeline we've been using to do spot generation of 1.1, while also collaborating with partners on a full AWS inference run that'll go to Zarr directly. If you need some specific 1.1 embeddings urgently, let me know or raise an issue.
- I'm working on a geotessera 0.9 that'll support the switch to hosting on S3 and also support the 1.1 model variant.
4 Fun Links
- I enjoyed this video interview of economist Clara Mattei speaking about what alternatives to capitalism might look like (via Hamed Haddadi).
- Listened to a fantastic podcast on Amol Rajan's Radical on The Future of Food: Can Regenerative Farming Save Our Soil?
- Read some interesting preprints on how geospatial embedding models are complementary and also on how language models need sleep (in order to improve long context attention).
- I got
oix --toolchain=oxcaml-minus37 strace_tuiworking with a very WIP release from David Allsopp. This will all make sense if you read Jane Street's post about terminal UIs making a comeback. - Resumed work on Eio again with Thomas Leonard returning, so been refreshing a bunch of my trees.
- Next week I'm off to Edinburgh to speak at Rewilding the Web with Jon Crowcroft. It's due to be a heat wave, so wish me luck!
