GBIF Conversion MDT

This installation of the GBIF Metabarcoding Data Toolkit (MDT) is available for processing real datasets when no other full-featured MDT installation is available. For testing or training purposes, please use the MDT Sandbox (Demo Installation) https://mdt.gbif-test.org/.

This toolkit is being developed as part of the pilot phase of the GBIF Metabarcoding Data Programme
 

Extract from the MDT User Guide


9. Background Material and Resources

9.1. GBIF and DNA

GBIF provides access to extensive biodiversity data, primarily focused on species occurrences, including where and when species have been recorded. These records, gathered from collections, surveys, and citizen science efforts, contain species names, observation dates, locations, and other important details. In addition to traditional biodiversity records, GBIF also supports DNA-associated data, such as DNA metabarcoding datasets. A general guide – Publishing DNA-derived data through biodiversity data platforms – was released in 2021 to assist with these specific data types. However, consultations with the GBIF network and research community in 2022 and 2023 highlighted the need for user-friendly tools to facilitate the formatting and publishing of DNA metabarcoding data in particular.

The Metabarcoding Data Toolkit (MDT) was developed to address this need.

9.2. The GBIF Metabarcoding Data Programme

In October 2024, GBIF launched a pilot phase for the Metabarcoding Data programme, intended to improve GBIF’s integration of DNA metabarcoding data on biodiversity. The pilot programme is open to GBIF node managers who wish to manage an instance of the MDT. Participating nodes are provided a hosted installation of the MDT. An MDT can be configured to operate in one of two modes:

  • Publishing mode: MDT users can register datasets for publication through GBIF through the organizations to which they’re associated. This mode functions similarly to an installation of GBIF’s Integrated Publishing Toolkit (IPT).
  • Conversion-only mode: MDT users can use it to reshape their dataset into a GBIF-ready Darwin Core Archive (DwC-A) but must download them for hosting and publication on another repository, such as an IPT.

Apply to manage an MDT in the pilot programme (GBIF node managers primarily).

9.3. GBIF guides

  • Publishing DNA-derived data through biodiversity data platforms – was released in 2021 and has valuable supplementary in-depth information on publishing DNA-derived biodiversity data. As the MDT will handle several data preparation steps automatically, it is advisable to follow the information in the MDT user guide when in doubt.

 

sections of central relevance: Rationale; Environmental DNA as a source…​; DNA metabarcodin data; Introduction to publishing; Taxonomy of sequences; Mapping metabarcoding data.

9.4. More context about the MDT and DNA-derived data in GBIF

  • Technical support hour for GBIF nodes: August 2025 (topic: MDT) (page+video)
  • Technical support hour for GBIF nodes: February 2024 (topic: DNA data publishing) (page+video)
  • eDNA data and GBIF at the 3rd National Workshop on Marine eDNA, Maryland, USA, 2024 (video)
  • A quickly made example use of the MDT (old). (video)
  • GBIF briefing on Digital Sequence Information (DSI) (web page, text)

9.5. Selected short general videos on eDNA and DNA metabarcoding

  • How we can detect pretty much anything - Hélène Morlon and Anna Papadopoulou - TED-Ed. (video)
  • Biomonitoring using eDNA (environmental DNA) - Henrik’s Lab. (video)
  • What is Environmental DNA (eDNA)? – Illumina AMEA. (video)
  • Environmental DNA - World Wildlife Fund. (video)
  • Environmental DNA - Towards Next-Generation Biomonitoring - LeeseLab. (video)

9.6. Podcasts on eDNA and DNA metabarcoding

  • What is Environmental DNA (eDNA) and How Can it Help? - A Conversation with Dr Shaun Wilkinson – Dr Adam Forbes. (podcast)

9.7. Various resources