
Use Case 5 - Downloading Entire Data Packages Using DOIs

Julien Brun, Mitchell Maier and Irene Steves, NCEAS

2024-08-16

Summary

This vignette shows how to download all the datasets in a data package with the metajam function download_d1_data_pkg.

In this example, we use a CSV file that lists the data packages returned by a search for soil bulk density on the Arctic Data Center and KNB data repositories.

Libraries and constants

# devtools::install_github("NCEAS/metajam")
library(metajam)  
library(readr)
library(purrr)
# Directory to save the data set
path_folder <- "./Soil_bulk"

# URL to read the search results stored as a csv on Google Drive
csv_search_results_url <- "https://drive.google.com/uc?export=download&id=1WTLP2BcXCXmUyv4kmntyhuPfrBNdPIqV"

Download all the datasets from data packages using DOIs

# Create the local directory to store data sets
dir.create(path_folder, showWarnings = FALSE)

# Read the data listing from Google Drive: https://drive.google.com/open?id=1WTLP2BcXCXmUyv4kmntyhuPfrBNdPIqV
data_listing <- read_csv(csv_search_results_url)


### Download the data and metadata ----

# Create the list of unique DOIs
dois <- unique(data_listing$identifier)

# Batch download the datasets, one data package per DOI
data_folders <- map(dois, ~download_d1_data_pkg(.x, path_folder))
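
In a batch download like the one above, a single identifier that fails (for example because of a network error or an invalid DOI) stops the whole map() call. The sketch below shows one possible way to keep going and record which identifiers failed. It is not part of metajam: download_pkgs_safely and fake_downloader are hypothetical names introduced here for illustration, and the stand-in downloader only builds a path so that the example runs offline. It uses base R (lapply and tryCatch) rather than purrr to stay dependency-free.

```r
# Hypothetical helper (not part of metajam): apply `downloader` to each DOI and
# catch errors so that one failure does not stop the remaining downloads.
download_pkgs_safely <- function(dois, path, downloader) {
  results <- lapply(dois, function(doi) {
    tryCatch(
      list(doi = doi, result = downloader(doi, path), error = NA_character_),
      error = function(e) {
        list(doi = doi, result = NULL, error = conditionMessage(e))
      }
    )
  })
  names(results) <- dois
  results
}

# Stand-in downloader, used only to demonstrate the helper without a network.
# In the workflow above, download_d1_data_pkg would be passed instead.
fake_downloader <- function(doi, path) {
  if (!startsWith(doi, "doi:")) stop("not a DOI: ", doi)
  file.path(path, gsub("[^A-Za-z0-9]", "_", doi))
}

# Made-up identifiers: one well-formed, one that triggers an error
demo <- download_pkgs_safely(
  c("doi:10.0000/example", "bad-id"),
  "./Soil_bulk",
  fake_downloader
)

# Identifiers whose download failed, with their error messages
is_failure <- !vapply(demo, function(x) is.na(x$error), logical(1))
failed <- names(demo)[is_failure]
failed_messages <- vapply(demo[is_failure], function(x) x$error, character(1))
```

With real data, the call would presumably look like download_pkgs_safely(dois, path_folder, download_d1_data_pkg), after which the identifiers in failed can be retried or inspected individually.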
