<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Reference Databases | del Campo Lab - Microbial Ecology and Evolution</title><link>https://delcampolab.com/tag/reference-databases/</link><atom:link href="https://delcampolab.com/tag/reference-databases/index.xml" rel="self" type="application/rss+xml"/><description>Reference Databases</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Wed, 27 Apr 2016 00:00:00 +0000</lastBuildDate><image><url>https://delcampolab.com/media/logo_hu3705697567409936087.png</url><title>Reference Databases</title><link>https://delcampolab.com/tag/reference-databases/</link></image><item><title>Reference databases for microeukaryote metabarcoding</title><link>https://delcampolab.com/project/pr2/</link><pubDate>Wed, 27 Apr 2016 00:00:00 +0000</pubDate><guid>https://delcampolab.com/project/pr2/</guid><description>&lt;p>High-throughput metabarcoding can only describe biodiversity as well as the reference databases used to interpret it. Turning millions of environmental reads into named, placed, and ecologically meaningful taxa depends on carefully curated reference resources — and microbial eukaryotes, the most diverse and least catalogued part of the tree of life, are where that need is most acute. We help build and curate several of these resources, anchored by the PR2 database, and develop the methods and frameworks that turn them into biological insight.&lt;/p>
&lt;h3 id="pr2-and-eukref">PR2 and EukRef&lt;/h3>
&lt;p>The PR2 database was initiated in 2010 in the frame of the BioMarks project, building on work developed over the previous decade in the Plankton Group at the Station Biologique de Roscoff. Its aim is to provide a reference database of carefully annotated 18S rRNA sequences using eight unique taxonomic fields (from kingdom to species). It currently contains over 184,000 sequences, with metadata fields including geo-localisation, culture or environmental origin, host type, and more.&lt;/p>
&lt;p>Annotation of PR2 is performed by experts from each taxonomic group. An important partner in this effort is &lt;a href="https://eukref.org/" target="_blank" rel="noopener">EukRef&lt;/a>, which has merged its bioinformatics pipelines and workshop-based curation efforts with PR2. EukRef has built tools used during multiple workshops dedicated to specific taxonomic groups, including ciliates, choanoflagellates, and others — work that has also produced standalone, phylogeny-based reference databases such as EukRef-Ciliophora, a manually curated SSU rRNA catalogue for one of the most diverse protist phyla.&lt;/p>
&lt;h3 id="beyond-18s-new-markers-and-groups">Beyond 18S: new markers and groups&lt;/h3>
&lt;p>The 18S rRNA gene is the workhorse of broad eukaryotic metabarcoding, but it can struggle to separate closely related species. To complement it, we helped build &lt;strong>eKOI&lt;/strong>, a curated reference database for the mitochondrial cytochrome oxidase subunit I (COI) gene with a focus on protists. eKOI integrates GenBank and mitochondrial-genome data with extensive manual curation, recovering nearly 16,000 sequences across 80 eukaryotic phyla, and — when used to re-annotate existing COI datasets — surfaces protist diversity that previous surveys had left in the dark.&lt;/p>
&lt;h3 id="from-sequences-to-biology">From sequences to biology&lt;/h3>
&lt;p>Well-curated databases also enable better ways of reading the data. Using a PR2-derived reference set of non-bilaterian animal sequences, we have benchmarked 18S phylogenetic-placement approaches — comparing the V4, V9, and full-length barcodes — to recover divergent and hidden lineages more reliably. Looking beyond taxonomy, we are part of efforts to connect these catalogues to organismal biology through a trait-based framework for protist ecology and evolution — a step toward reading metabarcoding data not as lists of taxa but as communities with measurable functions.&lt;/p>
&lt;p>&lt;strong>Key publications&lt;/strong>&lt;/p>
&lt;p>Guillou L, et al. (2013). &lt;a href="https://academic.oup.com/nar/article/41/D1/D597/1070586" target="_blank" rel="noopener">The Protist Ribosomal Reference database (PR2): a catalog of unicellular eukaryote small sub-unit rRNA sequences with curated taxonomy&lt;/a>. &lt;em>Nucleic Acids Research&lt;/em> 41:D597–D604.&lt;/p>
&lt;p>&lt;strong>del Campo J&lt;/strong>, et al. (2018). &lt;a href="https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.2004473" target="_blank" rel="noopener">EukRef: phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution&lt;/a>. &lt;em>PLOS Biology&lt;/em> 16:e2005849.&lt;/p>
&lt;p>Boscaro V, Santoferrara LF, Zhang Q, Gentekaki E, Syberg-Olsen MJ, &lt;strong>del Campo J&lt;/strong>, Keeling PJ (2018). &lt;a href="https://doi.org/10.1111/1462-2920.14264" target="_blank" rel="noopener">EukRef-Ciliophora: a manually curated, phylogeny-based database of small subunit rRNA gene sequences of ciliates&lt;/a>. &lt;em>Environmental Microbiology&lt;/em> 20:2218–2230.&lt;/p>
&lt;p>González-Miguéns R, Gàlvez-Morante À, Skamnelou M, Antó M, Casacuberta E, Richter DJ, Lara E, Vaulot D, &lt;strong>del Campo J&lt;/strong>, Ruiz-Trillo I (2025). &lt;a href="https://doi.org/10.1093/database/baaf057" target="_blank" rel="noopener">A novel taxonomic database for the eukaryotic mitochondrial cytochrome oxidase subunit I gene (eKOI), with a focus on protist diversity&lt;/a>. &lt;em>Database&lt;/em> 2025:baaf057.&lt;/p>
&lt;p>Jamy M, Ramond P, Bass D, &lt;strong>del Campo J&lt;/strong>, Dunthorn M, Lara E, Mitra A, Vaulot D, Santoferrara L (2026). &lt;a href="https://doi.org/10.1016/j.tim.2025.08.008" target="_blank" rel="noopener">Towards a trait-based framework for protist ecology and evolution&lt;/a>. &lt;em>Trends in Microbiology&lt;/em> 34:242–251.&lt;/p>
&lt;p>Arano-Ansola J, Galan-Luque I, Demenech M, Rico-Martin L, Moody ERR, Suresh M, Vaulot D, &lt;strong>del Campo J&lt;/strong>, Giacomelli M, Lozano-Fernandez J (2025). &lt;a href="https://www.biorxiv.org/content/10.64898/2025.12.09.693319v2" target="_blank" rel="noopener">Evaluating 18S phylogenetic placement accuracy to uncover hidden diversity in early branching animals&lt;/a>. &lt;em>bioRxiv&lt;/em>.&lt;/p>
&lt;p>&lt;strong>Issues and contributions&lt;/strong>&lt;/p>
&lt;p>Report issues or contribute on &lt;a href="https://github.com/vaulot/pr2_database/issues" target="_blank" rel="noopener">GitHub&lt;/a>.&lt;/p></description></item></channel></rss>