Title: Datasets with Strong and Spurious Correlations
Version: 0.1
Description: Provides datasets from Vigen (2015) https://web.archive.org/web/20230607181247/https://tylervigen.com/spurious-correlations, rescued from the Internet Archive's Wayback Machine. They are worth preserving for introductory statistics courses because they make it very clear that correlation is not causation.
License: CC0
Encoding: UTF-8
RoxygenNote: 7.3.2
Depends: R (≥ 2.10)
LazyData: true
NeedsCompilation: no
Packaged: 2025-09-21 13:40:21 UTC; pacha
Author: Tyler Vigen [cph], Mauricio Vargas Sepulveda [aut, cre]
Maintainer: Mauricio Vargas Sepulveda <m.vargas.sepulveda@gmail.com>
Repository: CRAN
Date/Publication: 2025-09-27 08:30:02 UTC

Spurious Correlations datasets

Description

A dataset that preserves the spurious correlations published on tylervigen.com.

Usage

spurious_correlations

Format

A tibble with 155 observations and 9 variables:

year

Year

var1

Variable 1

var2

Variable 2

var1_short

Variable 1 short name

var2_short

Variable 2 short name

var1_unit

Variable 1 unit

var2_unit

Variable 2 unit

var1_value

Variable 1 value

var2_value

Variable 2 value

source

Source

Source

https://web.archive.org/web/20230607181247/https://tylervigen.com/spurious-correlations

Examples

# Drownings by Falling into a Pool
# correlates with
# Films Nicolas Cage Appeared In
d <- spurious_correlations[spurious_correlations$var2_short == "Nicholas Cage", ]
cor(d$var1_value, d$var2_value)
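
The same idea can be extended to every variable pair at once. The following is a minimal sketch, not part of the package: it uses a small made-up data frame named `toy` (a hypothetical stand-in with only the columns needed here) so that it runs without the packaged data, and it assumes that a pair is identified by the combination of var1_short and var2_short.

```r
# Hypothetical stand-in with the same column names as `spurious_correlations`.
# The values are invented for illustration and are NOT taken from the package.
toy <- data.frame(
  year       = rep(2001:2005, times = 2),
  var1_short = rep(c("Series A", "Series C"), each = 5),
  var2_short = rep(c("Series B", "Series D"), each = 5),
  var1_value = c(1, 2, 3, 4, 5, 10, 8, 6, 4, 2),
  var2_value = c(2, 4, 6, 8, 10, 1, 2, 3, 4, 5),
  stringsAsFactors = FALSE
)

# Build one label per variable pair (assumption: short names identify a pair).
pair_id <- paste(toy$var1_short, toy$var2_short, sep = " vs ")

# Split the rows by pair and compute one Pearson correlation per pair.
pair_cor <- sapply(
  split(toy, pair_id),
  function(d) cor(d$var1_value, d$var2_value)
)

# Show the pairs ordered by the strength of the correlation, strongest first.
pair_cor[order(abs(pair_cor), decreasing = TRUE)]
```

To try this on the packaged data, replace `toy` with `spurious_correlations`. If the short names turn out not to be unique per pair, building the label from var1 and var2 instead may be safer.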
