These are research ideas for students at various levels (Part II, MPhil, PhD, and postdoctoral). Browse through the ideas below to find projects that interest you. You're also welcome to propose your own research ideas that align with our ongoing projects.

%lc

Organising the world's species information requires us to pull together lots and lots of information. This included data about the natural world, ranging from GBIF to iNaturalist to the IUCN's databases. It also need information about what humans are doing, via OpenStreetMap and other maps.

Then, it also needs observational models of the planet such as TESSERA derived from remote sensing data, in order to figure out habitat information.

And after all that, we need to pull together literature with information not yet captured in models, and also derived statistical models such as LIFE.

Enki's our codename for a new project to work on this, beginning with a dashboard for bringing the IUCN Red List to life. Tune in for more in 2026 as this develops!

Ongoing with Shane Weisz
%lc

The Conservation Evidence team at the University of Cambridge has spent years screening 1.6m+ scientific papers on conservation, as well as manually summarising 8600+ studies relating to conservation actions. However, progress is limited by the specialised skills needed to screen and summarise relevant studies -- it took more than 75 person years to manually curate the current database and only a few 100 papers can be added each year! We are working on AI-driven techniques to accelerate addition of robust evidence to the CE database via automated literature scanning, LLM-based copilots and scanning of grey literature. We aim to provide co-pilots that augment human decision making to figure out how to categorise interventions much more quickly and accurately, and ultimately accelerate the positive impact of conservation actions.

Available, with Alec Christie and Sadiq Jaffer
Ongoing with Radhika Agrawal, with Alec Christie and Sadiq Jaffer
Ongoing with Mark Jacobsen, with Sadiq Jaffer
Ongoing with Kittson Hamill, with Sadiq Jaffer
Completed by Radhika Iyer, with Sadiq Jaffer
Completed by Shrey Biswas and Kacper Michalik, with Sadiq Jaffer
Expired, with Sadiq Jaffer and Tom Worthington
Expired, with Sadiq Jaffer, Alec Christie and Bill Sutherland
%lc

Human-driven habitat loss is recognised as the greatest cause of biodiversity loss, but we lack robust, spatially explicit metrics quantifying the impacts of anthropogenic changes in habitat extent on species' extinctions. LIFE is our new metric that uses a persistence score approach that combines ecologies and land-cover data whilst considering the cumulative non-linear impact of past habitat loss on species' probability of extinction. We apply large-scale computing to map ~30k species of terrestrial vertebrates and provide quantitative estimates of the marginal changes in the expected number of extinctions caused by converting remaining natural vegetation to agriculture, and also by restoring farmland to natural habitat. We are also investigating many of the conservation opportunities opened up via its estimates of the impact on extinctions of diverse actions that change land cover, from individual dietary choices through to global protected area development.

Available, with Alison Eyres and Thomas Ball
Available, with Daniele Baisero and Michael Dales
Ongoing with Sam Forbes, with Michael Dales
Ongoing with Emilio Luz-Ricca, with Andrew Balmford
%lc

Measuring the world's forest carbon and biodiversity is made possible by remote sensing instruments, ranging from satellites in space (Landsat, Sentinel, GEDI) to ground-based sensors (ecoacoustics, camera traps, moisture sensors) that take regular samples and are processed into time-series metrics and actionable insights for conservation and human development. However, the algorithms for processing this data are challenging as the data is highly multimodal (multispectral, hyperspectral, synthetic aperture radar, or lidar), often sparsely sampled spatially, and not in a continuous time series. I work on various algorithms and software and hardware systems we are developing to improve the datasets we have about the surface of the earth, in close collaboration with the Planetary Computing and Trusted Carbon Credits projects.

Ongoing with Hannah McLoone, with Charles Emogor and Rob Fletcher
Ongoing with Gabriel Mahler, with Silviu Petrovan
Ongoing with Isabel Mansley, with David Coomes and Aland Chan
Ongoing with Dominico Parish, with Josh Millar
Ongoing with Beatrice Spence, Arissa-Elena Rotunjanu and Anna Yiu, with Tiffany Ki and Edgar Turner
Ongoing with Gabriel Mahler
Ongoing with Onkar Gulati, with Sadiq Jaffer and David Coomes
Ongoing with Josh Millar, with Hamed Haddadi
Ongoing with Andres Zuñiga-Gonzalez, with Ronita Bardhan
Ongoing with Charles Emogor, with Milind Tambe
Completed by Emily Morris, with David Coomes
Completed by Pedro Sousa, with Sadiq Jaffer
Completed by Sharan Agrawal, with Srinivasan Keshav
Completed by Martynas Sinkievič
Expired, with Sadiq Jaffer
%lc

Planetary computing is our research into the systems required to handle the ingestion, transformation, analysis and publication of global data products for furthering environmental science and enabling better informed policy-making. We apply computer science to problem domains such as forest carbon and biodiversity preservation (see Trusted Carbon Credits and Remote Sensing of Nature), and design solutions that can scalably process geospatial data that build trust in the results via traceability and reproducibility. Key problems include how to handle continuously changing datasets that are often collected across decades and require careful access and version control.

Available, with Michael Dales
Available, with Sadiq Jaffer
Ongoing with Becky Terefe-Zenebe, with Mark Elvers
Ongoing with Max Carroll, with Patrick Ferris and Cyrus Omar
Ongoing with Max Carroll, with Patrick Ferris
Ongoing with Max Smith, with Patrick Ferris
Ongoing with George Pool, with Michael Dales and Patrick Ferris
Ongoing with Jessica Man, with Martin Kleppmann
Ongoing with Patrick Ferris, with Srinivasan Keshav
Completed by Omar Tanner, with Sadiq Jaffer
Expired, with Patrick Ferris
Expired, with Patrick Ferris
%lc

The Cambridge Centre for Carbon Credits is an initiative I started with Andrew Balmford, David Coomes, Srinivasan Keshav and Thomas Swinfield, aimed at issuing trusted and verifiable carbon credits towards the prevention of nature destruction due to anthropogenic actions. We combine large-scale data processing from Remote Sensing of Nature and Planetary Computing with econometric techniques and decentralised Tezos smart contracts to design a carbon marketplace with verifiable transactions that link back to trusted primary observations.

Completed by Sophie Chapman, with Eleanor Toye Scott
Completed by Derek Sorensen, with Srinivasan Keshav
Completed by Keshav Sivakumar, with Srinivasan Keshav and David Coomes
%lc

There is now increased hardware support for improving the security and performance of privilege separation and compartmentalization techniques such as process-based sandboxes, trusted execution environments, and intra-address space compartments. We dub these "hetero-compartment environments" and observe that existing system stacks still assume single-compartment models (i.e. user space processes), leading to limitations in using, integrating, and monitoring heterogeneous compartments from a security and performance perspective. This project explores how we might deploy techniques such as fine-grained information flow control (DIFC) to allow developers to securely use and combine compartments, define security policies over shared system resources, and audit policy violations and perform digital forensics across hetero-compartments.

Completed by Samuel Wedgwood
Completed by Zahra Tarkhani
Completed by Jake Hillion
Completed by Malachy O'Connor Brown and Oscar Hill, with Zahra Tarkhani and Lorena Qendro
>_Interspatial OS2018–now
%lc

Digital infrastructure in modern urban environments is currently very Internet-centric, and involves transmitting data to physically remote environments. The cost for this is data insecurity, high response latency and unpredictable reliability of services. I am working on Osmose -- a new OS architecture that inverts the current model by building an operating system designed to securely connect physical spaces with extremely low latency, high bandwidth local-area computation capabilities and service discovery.

Available, with Michael Dales
Ongoing with Dan Kvit, with Josh Millar
Ongoing with Tom Thorogood, with Ryan Gibb
Ongoing with Roy Ang, with Ryan Gibb
Ongoing with Ryan Gibb, with Jon Crowcroft
Completed by Smita Vijayakumar, with Evangelia Kalyvianaki
Spatial Name System2022 · MPhil
Completed by Ryan Gibb, with Jon Crowcroft
Completed by Michał Mgeładze-Arciuch, with Patrick Ferris
Expired, with Trevor Agus
>_OCaml Labs2012–2021
%lc

I founded a research group called OCaml Labs at the University of Cambridge, with the goal of pushing OCaml and functional programming forward as a platform, making it a more effective tool for all users (including large-scale industrial deployments), while at the same time growing the appeal of the language, broadening its applicability and popularity. Over a decade, we retrofitted multicore parallelism into the mainline OCaml manager, wrote a popular book on the language, and helped start and grow an OCaml package and tooling ecosystem that is thriving today.

Available, with KC Sivaramakrishnan and Andy Ray
Ongoing with Jeremy Chen, with David Allsopp
Ongoing with Lucas Ma, with David Allsopp
Completed by Sky Batchelor, with Patrick Ferris
Completed by Rokas Urbonas, with Dmirtij Szamozvancev
Completed by Bartosz Modelski
Completed by Hari Chandrasekaran, with Tom Kelly and Liang Wang
Completed by Dimitar Popov
Completed by Heidi Howard
Completed by Sebastian Funk
Completed by Henry Hughes
>_Unikernels2010–2019
%lc

I proposed the concept of "unikernels" -- single-purpose appliances that are compile-time specialised into standalone bootable kernels, and sealed against modification when deployed to a cloud platform. In return they offer significant reduction in image sizes, improved efficiency and security, and reduce operational costs. I also co-founded the MirageOS project which is one of the first complete unikernel frameworks, and also integrated them to create the Docker for Desktop apps that are used by hundreds of millions of users daily.

Completed by Oliver Hope, with David Allsopp
Completed by Mohammed Daudali
Completed by Craig Ferguson
Completed by Johann Beleites, with David Sheets
Completed by Chris Harding and Ross McIlroy, with Robert M Watson
Completed by William Morland, with Robert M Watson
>_Personal Containers2009–2015
%lc

As cloud computing empowered the creation of vast data silos, I investigated how decentralised technologies might be deployed to allow individuals more vertical control over their own data. Personal containers was the prototype we built to learn how to stem the flow of our information out to the ad-driven social tarpits. We also deployed personal containers in an experimental data locker system at the University of Cambridge in order to incentivise lower-carbon travel schemes.

Completed by Gediminas Lelešius
Completed by Farhān Mannān
%lc

My PhD dissertation work proposed an architecture for constructing new implementations of standard Internet protocols with integrated formal methods such as model checking and functional programming that were then not used in deployed servers. A more informal summary is "rewrite all the things in OCaml from C!", which lead to a merry adventure into implementing many networks protocols from scratch in a functional style, and learning lots about how to enforce specifications without using a full blown proof assistant.

Completed by Hannes Mehnert and David Kaloper-Mersinjak, with Peter Sewell
Completed by Nicholas Skehin
%lc

I investigated how to interface the new emerging class of smartphone devices (circa 2002) with concepts from ubiquitous computing such as location-aware interfaces or context-aware computing. I discovered the surprisingly positive benefits of piggybacking on simple communications medium such as audible sound and visual tags. Our implementations of some of these ended up with new audio ringtone and visual smart tags that worked on the widely deployed mobile phones of the era.

In 2003, the mobile phone market had grown tremendously and given the average consumer access to cheap, small, low-powered and constantly networked devices that they could reliably carry around. Similarly, laptop computers and PDAs became a common accessory for businesses to equip their employees with when on the move. The research question then, was how to effectively interface them with existing digital infrastructure and realise some of the concepts of ubiquitous computing such as location-aware interfaces or context-aware computing.

Completed by Gareth P. Williams