For researchers, getting data out of GBIF nodes is easy…
…but sharing your own data is hard.
.xml)galaxias (and friends)galaxias: Build, check & publish DWCAs |
|
corella: Convert a tibble to Darwin Core |
|
delma: Convert markdown to EML or xml |
An archive is a .zip file containing three things:
|
|
|
|
|
data csv format |
metadata eml format |
schema xml format |
|
|
|
|
|
|
|
| data | metadata | schema | archive | validate | submit |
Load galaxias
delma and corella are loaded automatically
Load an example dataset
# A tibble: 2 × 5
latitude longitude date time species
<dbl> <dbl> <chr> <chr> <chr>
1 -35.3 149. 14-01-2023 10:23 Callocephalon fimbriatum
2 -35.3 149. 15-01-2023 11:25 Eolophus roseicapilla
How should we convert this dataset to Darwin Core?
If we follow that advice:
df_dwc <- df |>
set_occurrences(occurrenceID = sequential_id(),
basisOfRecord = "humanObservation") |>
set_coordinates(decimalLatitude = latitude,
decimalLongitude = longitude) |>
set_datetime(eventDate = lubridate::dmy(date),
eventTime = lubridate::hm(time)) |>
set_scientific_name(scientificName = species,
taxonRank = "species")
df_dwc# A tibble: 2 × 8
basisOfRecord occurrenceID decimalLatitude decimalLongitude eventDate
<chr> <chr> <dbl> <dbl> <date>
1 humanObservation 01 -35.3 149. 2023-01-14
2 humanObservation 02 -35.3 149. 2023-01-15
# ℹ 3 more variables: eventTime <Period>, scientificName <chr>, taxonRank <chr>
Save as occurrences.csv:
|
|
|
|
|
|
|
| data | metadata | schema | archive | validate | submit |
|
|
Generate a metadata file
---
title: A Descriptive Title for your Dataset in Title Case
output: html_document
date: 2025-02-01
---
```{=html}
<!--
This is a metadata template.
It is formatted to render as an html document (using the "Knit" button or
`knitr::knit()`) AND to Ecological Metadata language (EML) using the
{delma} R package. Sections can
be added, re-arranged or removed to suit the dataset being described. Some
features to be aware of:Convert to EML
<?xml version="1.0" encoding="UTF-8"?>
<eml:eml xmlns:eml="https://eml.ecoinformatics.org/eml-2.2.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" packageId="the-doi-for-this-archive" system="https://doi.org" scope="system" xsi:schemaLocation="http://rs.gbif.org/schema/eml-gbif-profile/1.3/eml-gbif-profile.xsd">
<dataset>
<title>A Descriptive Title for your Dataset in Title Case</title>
<creator>
<individualName>
<givenName>Firstname</givenName>
<surName>Lastname</surName>
</individualName>
<address>
<deliveryPoint>215 Road Street</deliveryPoint>
<city>Canberra</city>
<administrativeArea>ACT</administrativeArea>
<postalCode>2601</postalCode>
<country>Australia</country>
|
|
|
|
|
|
|
| data | metadata | schema | archive | validate | submit |
|
|
|
Automated process for zipping the /data-publish folder.
We can check that the correct files are present.
The schema file (eml.xml) has been built automatically.
|
|
|
|
|
|
|
| data | metadata | schema | archive | validate | submit |
|
|
|
|
|
|
|
|
|
|
|
|
| data | metadata | schema | archive | validate | submit |
|
|
|
|
|
|
Run submit_archive() to create an issue on data-publication repository
|
|
|
|
|
|
|
| data | metadata | schema | archive | validate | submit |
|
|
|
|
|
|
|
galaxias.xml)| Peggy Newman |
| Martin Westgate |
| Amanda Buyan |
| Dax Kellie |
| Shandiya Balasubramaniam |
galaxias |
|
corella |
|
delma |
|
galah |