datasauRus

This package wraps the awesome Datasaurus Dozen datasets. The Datasaurus Dozen show us why visualisation is important – summary statistics can be the same but distributions can be very different. In short, this package gives a fun alternative to Anscombe’s Quartet, available in R as anscombe.

The original Datasaurus was created by Alberto Cairo in this great blog post.

The other Dozen were generated using simulated annealing and the process is described in the paper Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing by Justin Matejka and George Fitzmaurice.

In the paper, Justin and George simulate a variety of datasets that have the same summary statistics as the Datasaurus but very different distributions.

Install

The latest stable version (0.1.2) is available on CRAN:

install.packages("datasauRus")

You can get the latest development version from GitHub, using devtools to install the package:

devtools::install_github("stephlocke/datasauRus")

Usage

You can use the package to produce Anscombe plots and more.

library(ggplot2)
library(datasauRus)
ggplot(datasaurus_dozen, aes(x=x, y=y, colour=dataset))+
  geom_point()+
  theme_void()+
  theme(legend.position = "none")+
  facet_wrap(~dataset, ncol=3)
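
To check the point made in the introduction, that the summary statistics can match while the distributions differ, the statistics can be computed per dataset. The following is a minimal base R sketch, not part of the package: `summarise_by_dataset` is a hypothetical helper name, and it assumes a data frame with the columns `dataset`, `x` and `y` used in the plot above.

```r
# Hypothetical helper (not part of datasauRus): per-dataset summary
# statistics for a data frame with columns `dataset`, `x` and `y`.
summarise_by_dataset <- function(df) {
  # One data frame per dataset name
  groups <- split(df, df$dataset)
  stats <- lapply(groups, function(g) {
    data.frame(
      mean_x = mean(g$x),
      mean_y = mean(g$y),
      sd_x   = sd(g$x),
      sd_y   = sd(g$y),
      cor_xy = cor(g$x, g$y)
    )
  })
  # Stack the one-row summaries and label each row with its dataset
  out <- do.call(rbind, stats)
  out <- cbind(dataset = names(groups), out)
  rownames(out) <- NULL
  out
}

# Only runs when the package is installed
if (requireNamespace("datasauRus", quietly = TRUE)) {
  print(summarise_by_dataset(datasauRus::datasaurus_dozen))
}
```

Comparing the rows of the result against the faceted plot shows how little the summary statistics reveal about the shape of each dataset.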

Tests

library(devtools)
test()
#> Loading datasauRus
#> Loading required package: testthat
#> 
#> Attaching package: 'testthat'
#> The following object is masked from 'package:devtools':
#> 
#>     setup
#> Testing datasauRus
#> v |  OK F W S | Context
#> v |  22       | datasets [0.4 s]
#> v |   1       | Raw files
#> 
#> == Results =======================================================================
#> Duration: 1.2 s
#> 
#> OK:       23
#> Failed:   0
#> Warnings: 0
#> Skipped:  0

Contributing to the package

Code of Conduct

Anyone getting involved in this package agrees to our Code of Conduct. If someone is breaking the Wil Wheaton rule, aka Don’t be a dick, or breaking the Code of Conduct, please let me know at steph@itsalocke.com.

Bug reports

When you file a bug report, please spend some time making it easy for us to follow and reproduce. The more time you spend on making the bug report coherent, the more time we can dedicate to investigating the bug as opposed to the bug report.

Ideas

Got an idea for how we can improve the package? Awesome stuff!

Please raise it with some succinct information on expected behaviour of the enhancement and why you think it’ll improve the package.

Package development

We really want people to contribute to the package. A great way to start doing this is to look at the help wanted issues and/or contribute an example.

Examples for this package are done in base R or with ggplot2 as an optional example, using the structure:

if(require(ggplot2)){
#ggplot2 code here
}

As this is a data package, most of the documentation is sitting in one file (R/Datasaurus-package.R) so we keep the examples in a separate directory (inst/examples).

If there isn’t a file for the dataset you want to write an example for, you can make one by just calling it datasetname.R. To reference an example file, add the line @example inst/examples/datasetname.R in the relevant documentation section of R/Datasaurus-package.R.
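
As an illustration of the structure described above, an example file might look something like the sketch below. The file name in the comment is an assumption following the datasetname.R pattern, and the outer `requireNamespace` guard is only there so the snippet runs on its own. Inside the package's own examples the data would already be available.

```r
# Sketch of an example file, e.g. inst/examples/datasaurus_dozen.R
# (hypothetical file name, following the datasetname.R pattern).
if (requireNamespace("datasauRus", quietly = TRUE)) {
  dozen <- datasauRus::datasaurus_dozen
  dino <- dozen[dozen$dataset == "dino", ]

  # Base R version: always runs
  plot(dino$x, dino$y, main = "dino", xlab = "x", ylab = "y")

  # Optional ggplot2 version, only run when ggplot2 is available
  if (require(ggplot2)) {
    print(
      ggplot(dino, aes(x = x, y = y)) +
        geom_point() +
        theme_minimal()
    )
  }
}
```

Keeping the base R plot outside the `require(ggplot2)` check means the example still produces output on machines without ggplot2.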

Conventions

We’re relatively loose on coding conventions.

Datasets are lower-case with underscores between words
R code should be formatted with the “Reformat code” option in RStudio
There are no standards for base R plots
My preferred ggplot2 themes are theme_minimal where axis labels matter and theme_void where they do not, but I’m OK with the default ggplot2 theming if you want to avoid writing longer ggplot2 code
