COPYRIGHTS

This package includes a processed subset of an external public dataset and
synthetic datasets generated by the package authors. The relevant copyright
and license notices are listed below.

1. Adult dataset

Source:
  UCI Machine Learning Repository, Adult dataset
  Also known as the Census Income dataset.

Original dataset contributors:
  Barry Becker and Ronny Kohavi

License:
  Creative Commons Attribution 4.0 International (CC BY 4.0)

Use in this package:
  This package includes a processed numerical subset of the Adult dataset.
  The included dataset is named `adult` and retains the following five
  numerical variables from the original data file:

    - age
    - education_num
    - capital_gain
    - capital_loss
    - hours_per_week

  The selected numerical variables contain no missing values in the original
  data file used here. The processed dataset contains 32,561 observations.

Attribution:
  Becker, B. and Kohavi, R. (1996). Adult dataset.
  UCI Machine Learning Repository.

License notice:
  The Adult dataset is licensed under the Creative Commons Attribution 4.0
  International (CC BY 4.0) license. The license permits sharing and adaptation
  of the dataset, provided that appropriate credit is given.

2. Synthetic Gaussian cluster datasets

Datasets:
  - gau
  - gau_g

Source:
  Generated by the authors of this package.

Use in this package:
  The dataset `gau` is a synthetic 20-dimensional Gaussian cluster dataset.
  It was generated from group-specific multivariate normal distributions for
  reproducible examples of principal component analysis and differentially
  private PCA visualization.

  The dataset `gau_g` is the grouped version of `gau` with an additional
  group label column.

License:
  These synthetic datasets are distributed under the same license as the
  package.
