dataset: Create Data Frames for Exchange and Reuse

The 'dataset' package extends tidy data frames with machine-readable metadata, semantic definitions, and provenance information. It supports incremental semantic stabilization, interoperable dataset exchange, and FAIR-oriented publication workflows by preserving contextual metadata directly within R objects. The package facilitates the creation, exchange, reuse, and RDF serialization of datasets in line with ISO and W3C standards.

Version: 0.4.5
Depends: R (≥ 3.5)
Imports: assertthat, haven, ISOcodes, labelled, pillar, stats, tibble, utils, vctrs
Suggests: dplyr, jsonld, knitr, rdflib, rmarkdown, spelling, tidyr, testthat (≥ 3.0.0)
Published: 2026-06-03
DOI: 10.32614/CRAN.package.dataset
Author: Daniel Antal ORCID iD [aut, cre], Marcelo Perlin ORCID iD [rev], Anna Márta Mester ORCID iD [rev], Mauro Lepore ORCID iD [rev]
Maintainer: Daniel Antal <daniel.antal at dataobservatory.eu>
BugReports: https://github.com/ropensci/dataset/issues
License: GPL (≥ 3)
URL: https://docs.ropensci.org/dataset/, https://github.com/ropensci/dataset, https://dataset.dataobservatory.eu
NeedsCompilation: no
Language: en-GB
Citation: dataset citation info
Materials: README, NEWS
CRAN checks: dataset results

Documentation:

Reference manual: dataset.html , dataset.pdf
Vignettes: Modernising Citation Metadata in R: Introducing 'bibrecord' (source, R code)
dataset_df: Create Datasets that are Easy to Share Exchange and Extend (source, R code)
defined: Semantically Enriched Vectors (source, R code)
Design Principles & Future Work Semantically Enriched, Standards-Aligned Datasets in R (source, R code)
Example Dataset Definitions (source)
An Introduction to the dataset Package (source, R code)
Handling Semantic Ambiguity with prelabelled Vectors (source, R code)
From R to RDF (source, R code)

Downloads:

Package source: dataset_0.4.5.tar.gz
Windows binaries: r-devel: dataset_0.4.4.zip, r-release: dataset_0.4.5.zip, r-oldrel: dataset_0.4.4.zip
macOS binaries: r-release (arm64): dataset_0.4.5.tgz, r-oldrel (arm64): dataset_0.4.5.tgz, r-release (x86_64): dataset_0.4.5.tgz, r-oldrel (x86_64): dataset_0.4.5.tgz
Old sources: dataset archive

Reverse dependencies:

Reverse imports: retroharmonize

Linking:

Please use the canonical form https://CRAN.R-project.org/package=dataset to link to this page.