Conservation Evidence Copilots

The Conservation Evidence team at the University of Cambridge has spent years screening 1.6m+ scientific papers on conservation, as well as manually summarising 8600+ studies relating to conservation actions. However, progress is limited by the specialised skills needed to screen and summarise relevant studies -- it took more than 75 person-years to manually curate the current database, and only a few hundred papers can be added each year! We are working on AI-driven techniques to accelerate the addition of robust evidence to the CE database via automated literature scanning, LLM-based copilots and the inclusion of grey literature. We aim to provide copilots that augment human decision-making, categorising interventions far more quickly and accurately, and ultimately accelerating the positive impact of conservation actions.

The goal of the Conservation Evidence project is to transform conservation so that evidence is routinely embedded in decisions to improve outcomes for biodiversity and society. CE is becoming the authoritative, most comprehensive, freely available platform for evidence-led conservation and is starting to profoundly change the way in which conservationists access and use evidence for improving the state of the planet.

Team AICN in the CCI building, Feb 2024

The CE collation and synthesis work has significantly improved the availability of evidence for use in conservation practice: it remains the only resource of evidence synopses for biodiversity conservation, and the largest database of effectiveness reviews of actions outside the field of medicine. By carrying out reviews on an industrial scale, CE can produce them for a fraction of the cost seen in comparable fields such as medicine. Using subject-wide evidence synthesis, CE systematically searches the literature and summarises the results of (and provides citations for) each study testing the effectiveness of an action. As of April 2024, CE has screened 1.6 million paper titles in 17 languages (including 326 non-English journals) and reviewed evidence for over 3600 conservation actions, freely available on their website, in collaboration with over 380 international academics and practitioners.

Accelerating literature surveys with LLMs

We got involved from the computer science side in 2023, as part of the AI@CAM competition, to harness the momentum behind machine learning to accelerate conservation actions. Our overall aim is to help CE dramatically accelerate its data searching and data extraction pipelines. Currently, searching the literature and summarising key data is undertaken by human experts. Although this way of working is time-consuming, it has the benefit of being thorough and replicable. The main difficulties lie in the subtleties of deciphering study designs and methodologies, and in judging whether controls are actually appropriate for testing the effectiveness of the specified action. Any LLM-based automation that we deploy must account for these as part of the validation pipeline.

Our evaluation of LLMs against human experts on conservation intervention questions showed that properly configured LLMs with retrieval augmentation can be competitive on evidence synthesis tasks. However, out-of-the-box general-purpose LLMs performed poorly and risk misinforming decision-makers, reinforcing our commitment to careful validation and human-in-the-loop approaches.
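
To make "retrieval augmentation" concrete, here is a minimal sketch of grounding an LLM's answer in retrieved study summaries. It is illustrative rather than our actual evaluation harness: the embedding model name is just a common default, and `ask_llm` stands in for a hypothetical call to a self-hosted model.

```python
# Minimal retrieval-augmented answering sketch (illustrative, not our harness).
import numpy as np
from sentence_transformers import SentenceTransformer

encoder = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative model choice

def top_k_passages(query: str, passages: list[str], k: int = 5) -> list[str]:
    """Rank passages by cosine similarity to the query embedding."""
    vecs = encoder.encode(passages, normalize_embeddings=True)
    q = encoder.encode([query], normalize_embeddings=True)[0]
    scores = vecs @ q                       # cosine similarity (unit vectors)
    return [passages[i] for i in np.argsort(-scores)[:k]]

def answer(query: str, passages: list[str], ask_llm) -> str:
    """Ground the answer in retrieved evidence; ask_llm is a hypothetical
    callable wrapping a self-hosted model."""
    context = "\n\n".join(top_k_passages(query, passages))
    prompt = (
        "Answer using ONLY the evidence below; say 'insufficient evidence' "
        f"if it does not support an answer.\n\nEvidence:\n{context}\n\n"
        f"Question: {query}"
    )
    return ask_llm(prompt)
```

The key design point is the constrained prompt: forcing the model to answer only from retrieved summaries is what separates the competitive configurations from the out-of-the-box ones in our experience.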

The collaboration originally began in 2022 as part of the Computer Science 1B group projects, when Bill Sutherland, Sam Reynolds and Alec Christie from Zoology proposed a group project related to CE. A team of undergraduate students (including Jamie Cao) trained an ML model to facilitate searching for papers and indexing relevant articles by species and habitat. After the group project concluded with encouraging results, Sadiq Jaffer and I joined the collaboration and -- with help from the Cambridge Office for Scholarly Communication -- built up a comprehensive (and legal!) corpus of millions of academic papers related to conservation evidence.

Through 2024, we evaluated ten different LLMs against human experts using the CE database, leading to our LLM evaluation paper, which showed promising but cautious results. In the summer of 2024 we were joined by three CST undergraduates -- Radhika Iyer, Shrey Biswas and Kacper Michalik -- who built out various elements of the system. Moving into 2025, our focus shifted to production deployment of hybrid retrieval systems, while maintaining rigorous validation against the expanding challenge of AI contamination in scientific literature.

Living Evidence Databases

In October 2025, we published AI-assisted Living Evidence Databases for Conservation Science, describing a complete, end-to-end pipeline for maintaining living evidence databases. Traditional systematic reviews become outdated quickly, but living evidence databases offer a dynamic alternative by continuously processing new evidence as it emerges. Our pipeline, designed to operate on local infrastructure using self-hosted models, ingests and normalises documents from academic publishers, screens them for relevance using a multi-stage process, and extracts structured data according to predefined schemas.
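
The multi-stage screen can be sketched as a cost-ordered cascade: a cheap title/abstract triage discards the clearly irrelevant bulk, so only borderline papers pay for a costlier full-text check. The sketch below is a simplification under assumed names; the scorer callables stand in for self-hosted classifiers, and the thresholds are illustrative.

```python
# Sketch of a two-stage relevance screen (thresholds illustrative).
from dataclasses import dataclass
from typing import Callable

@dataclass
class Paper:
    title: str
    abstract: str
    full_text: str | None = None

Scorer = Callable[[str], float]  # returns an estimated P(relevant)

def screen(paper: Paper, cheap: Scorer, strong: Scorer) -> bool:
    """Stage 1: fast title/abstract triage; stage 2: costlier full-text check.
    Only papers surviving stage 1 incur the expense of stage 2."""
    if cheap(paper.title + "\n" + paper.abstract) < 0.2:
        return False                  # confidently irrelevant: drop early
    if paper.full_text is None:
        return True                   # borderline, no full text: keep for humans
    return strong(paper.full_text) >= 0.5
```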

The system features a hybrid retrieval model combining keyword search with semantic understanding, and integrates a human-AI collaborative process for refining inclusion criteria from complex protocols. We also incorporated an established, statistically principled stopping rule to ensure efficiency. In a baseline evaluation against a prior large-scale manual review, the fully automated pipeline achieved 97% recall and identified a significant number of relevant studies not included in the original review, demonstrating its viability as a foundational tool for maintaining living evidence databases. This work was presented earlier at a workshop on AI for evidence synthesis in March 2025, which brought together policymakers, researchers and practitioners to discuss the responsible integration of AI into evidence synthesis workflows.
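
To illustrate the hybrid retrieval idea, the sketch below fuses a keyword ranker (BM25) with a dense semantic ranking via reciprocal rank fusion. The fusion scheme and constants are common defaults rather than necessarily what our deployed system uses, and `dense_rank` is assumed to come from an embedding index like the one sketched earlier.

```python
# Sketch of hybrid retrieval: BM25 keyword ranks fused with a precomputed
# dense (semantic) ranking by reciprocal rank fusion (RRF).
from rank_bm25 import BM25Okapi

def rrf(rankings: list[list[int]], k: int = 60) -> list[int]:
    """Fuse rankings of document ids; k=60 is the standard RRF constant."""
    scores: dict[int, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)

def hybrid_search(query: str, docs: list[str], dense_rank: list[int]) -> list[int]:
    """Combine keyword and semantic evidence for one query."""
    bm25 = BM25Okapi([d.lower().split() for d in docs])
    kw_scores = bm25.get_scores(query.lower().split())
    kw_rank = sorted(range(len(docs)), key=lambda i: -kw_scores[i])
    return rrf([kw_rank, dense_rank])
```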
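
The stopping rule can also be sketched. The version below is a hypergeometric test in the spirit of established criteria (e.g. Callaghan and Müller-Hansen, 2020): after prioritised screening has found most relevant papers, a random sample of the unscreened pool is checked, and screening stops once it is statistically implausible that the recall target has been missed. The formulation and parameter names here are our simplification, not necessarily the exact rule used in the paper.

```python
# Sketch of a hypergeometric stopping test for screening (simplified).
from math import floor
from scipy.stats import hypergeom

def can_stop(n_unscreened: int, n_sample: int, k_sample: int,
             r_before: int, recall_target: float = 0.95,
             alpha: float = 0.05) -> bool:
    """Prioritised screening found r_before relevant papers; a random sample
    of n_sample from the n_unscreened remainder then found k_sample more.
    H0: overall recall < recall_target. Stop once H0 is rejected."""
    r_total = r_before + k_sample
    # Smallest total number of relevant papers consistent with H0:
    k0 = floor(r_total / recall_target) + 1
    missed = k0 - r_before          # relevant papers H0 says the pool held
    if missed > n_unscreened:
        return True                 # H0 impossible given the pool size
    p = hypergeom.cdf(k_sample, n_unscreened, missed, n_sample)
    return p < alpha

# e.g. 50,000 papers unscreened after prioritised screening found 950
# relevant; a random sample of 5,000 yields just 1 more:
print(can_stop(50_000, 5_000, 1, 950))   # True: 95% recall is credible
```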

Challenges in the AI-Evidence Era

Our work on LLM-based evidence synthesis has gained urgency in 2025 as the body of scientific literature faces growing contamination from AI-generated papers. The publication of ever-larger numbers of problematic papers, including fake ones generated by AI, represents what our Nature paper "Will AI speed up literature reviews or derail them entirely?" argues is an existential crisis for the established way of doing evidence synthesis. Recent analysis suggests that up to 2% of submitted papers may be AI-generated, with some estimates running much higher. This contamination poses particular risks for evidence databases like CE, as fake papers describing plausible-sounding conservation interventions could mislead decision-makers if incorporated without rigorous validation.

The paper argues that while AI-generated papers pose serious threats to literature reviews, new approaches using AI might also offer solutions. As AI generation becomes more sophisticated, our validation pipeline needs to evolve beyond simple detection towards robust experimental design verification and cross-reference validation. The paradox is that AI is both the problem and potentially part of the solution, but only if we build systems with rigorous validation, traceability and human oversight at their core. See also A FAIR Case for a Live Computational Commons on the importance of FAIR principles throughout science.
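
As one example of what cross-reference validation might look like, the sketch below checks that a paper's cited DOIs resolve to real records via the public Crossref REST API, since fabricated citations typically do not. This is a single illustrative signal rather than our deployed pipeline, and the flagging threshold is arbitrary.

```python
# Illustrative cross-reference check against the public Crossref REST API.
import requests

def doi_resolves(doi: str) -> bool:
    """True if Crossref has a record for this DOI."""
    r = requests.get(f"https://api.crossref.org/works/{doi}", timeout=10)
    return r.status_code == 200

def suspicious_references(dois: list[str], max_bad_frac: float = 0.2) -> bool:
    """Flag a paper whose reference list has too many unresolvable DOIs."""
    bad = sum(not doi_resolves(d) for d in dois)
    return bool(dois) and bad / len(dois) > max_bad_frac
```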

Technology for Conservation, Not Division

We also conducted a horizon scan in 2024, which sparked discussions about ensuring AI serves conservation equitably. In response to concerns raised by Katie Murray and colleagues about the potential for AI to divide conservation, we published a response in Trends in Ecology & Evolution. As outlined in both that response and our reflections on technology uniting conservation, we are committed to developing CE tools that:

  • Respect and amplify human expertise via "human-in-the-loop" methods, rather than replacing it
  • Follow participatory design principles with conservation practitioners
  • Maintain open-source and open-data approaches, with thorough documentation to facilitate reproducible outputs
  • Address capacity-building needs around AI capability, particularly in the Global South
  • Keep conservation goals, not short-term technology trends, at the centre of our research

Our AI@CAM interview highlights this approach: we are building detailed models of the world that can be queried by policymakers to help them make informed decisions. The technology serves the evidence, and the evidence serves conservation practice.
