The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.
pirateplot
Nathaniel Phillips
2017-04-18
What is a pirateplot()?
A pirateplot, is the RDI (Raw data, Descriptive statistics, and Inferential statistics) plotting choice of R pirates who are displaying the relationship between 1 to 3 categorical independent variables, and one continuous dependent variable.
A pirateplot has 4 main elements
- points, symbols representing the raw data (jittered horizontally)
- bar, a vertical bar showing central tendencies
- bean, a smoothed density (inspired by Kampstra and others (2008)) representing a smoothed density
- inf, a rectangle representing an inference interval (e.g.; Bayesian Highest Density Interval or frequentist confidence interval)
data:image/s3,"s3://crabby-images/cf7f8/cf7f8721cf74b8c7ebe2225593a707966237e334" alt=""
Main arguments
Here are the main arguments to pirateplot()
Main Pirateplot Arguments
formula |
A formula |
height ~ sex + eyepatch, weight ~ Time |
data |
A dataframe |
pirates, ChickWeight |
main |
Plot title |
‘Pirate heights’, ’Chicken Weights |
pal |
A color palette |
‘xmen’, ‘black’ |
theme |
A plotting theme |
0, 1, 2 |
inf |
Type of inference |
‘ci’, ‘hdi’, ‘iqr’ |
Themes
pirateplot()
currently supports three themes which change the default look of the plot. To specify a theme, use the theme
argument:
Theme 1
theme = 1
is the default
# Theme 1 (the default)
pirateplot(formula = weight ~ Time,
data = ChickWeight,
theme = 1,
main = "theme = 1")
data:image/s3,"s3://crabby-images/4c41c/4c41c659ba751beb6ec676222c40d0c36b604a2e" alt=""
Theme 2
Here is theme = 2
# Theme 2
pirateplot(formula = weight ~ Time,
data = ChickWeight,
theme = 2,
main = "theme = 2")
data:image/s3,"s3://crabby-images/7e81d/7e81dc7ed7d93e429e1dc3b383964b700a4944f0" alt=""
Theme 3
And now…theme = 3
!
# Theme 3
pirateplot(formula = weight ~ Time,
data = ChickWeight,
theme = 3,
main = "theme = 3")
data:image/s3,"s3://crabby-images/37a54/37a546efa90359b584f3f559ea12a734d733e4b9" alt=""
Theme 4
theme = 4
tries to maintain a classic barplot look (but with added raw data).
# Theme 4
pirateplot(formula = weight ~ Time,
data = ChickWeight,
theme = 4,
main = "theme = 4")
data:image/s3,"s3://crabby-images/e2b88/e2b88f9dab9c25874b25c6ac161c449f159bf7c3" alt=""
Theme 0
theme = 0
allows you to start a pirateplot from scratch – that is, it turns of all elements. You can then selectively turn elements on with individual arguments (e.g.; bean.f.o
, point.o
)
# Default theme
pirateplot(formula = weight ~ Time,
data = ChickWeight,
theme = 0,
main = "theme = 0\nStart from scratch")
data:image/s3,"s3://crabby-images/e25a2/e25a29a82d8c3a4ffa0e6fec087a6f81396d402d" alt=""
Color palettes
You can specify a general color palette using the pal
argument. You can do this in two ways.
The first way is to specify the name of a color palette in the piratepal()
function. Here they are:
data:image/s3,"s3://crabby-images/8100f/8100fe2e66bd87f5494a65efa3bdf671e3894e09" alt=""
For example, here is a pirateplot using the "pony"
palette
pirateplot(formula = weight ~ Time,
data = ChickWeight,
pal = "pony",
theme = 1,
main = "pony color palette")
data:image/s3,"s3://crabby-images/85c2b/85c2b17d2860c92eb35df7832c1e4b34fa4d2201" alt=""
The second method is to simply enter a vector of one or more colors. Here, I’ll create a black and white pirateplot from theme 2 by specifying pal = 'black'
pirateplot(formula = weight ~ Time,
data = ChickWeight,
theme = 2,
pal = "black",
main = "pal = 'black")
data:image/s3,"s3://crabby-images/c1042/c10427fb5b2b6a0b7df02bbca37743d931f2fbb9" alt=""
Customising elements
Regardless of the theme you use, you can always customize the color and opacity of graphical elements. To do this, specify one of the following arguments. Note: Arguments with .f.
correspond to the filling of an element, while .b.
correspond to the border of an element:
Customising plotting elements
points |
point.col, point.bg |
point.o |
beans |
bean.f.col, bean.b.col |
bean.f.o, bean.b.o |
bar |
bar.f.col, bar.b.col |
bar.f.o, bar.b.o |
inf |
inf.f.col, inf.b.col |
inf.f.o, inf.b.o |
avg.line |
avg.line.col |
avg.line.o |
For example, I could create the following pirateplots using theme = 0
and specifying elements explicitly:
pirateplot(formula = weight ~ Time,
data = ChickWeight,
theme = 0,
main = "Fully customized pirateplot",
pal = "southpark", # southpark color palette
bean.f.o = .6, # Bean fill
point.o = .3, # Points
inf.f.o = .7, # Inference fill
inf.b.o = .8, # Inference border
avg.line.o = 1, # Average line
bar.f.o = .5, # Bar
inf.f.col = "white", # Inf fill col
inf.b.col = "black", # Inf border col
avg.line.col = "black", # avg line col
bar.f.col = gray(.8), # bar filling color
point.pch = 21,
point.bg = "white",
point.col = "black",
point.cex = .7)
data:image/s3,"s3://crabby-images/3a379/3a3793541471d8067abddd0eef94dd1a297b2d5b" alt=""
If you don’t want to start from scratch, you can also start with a theme, and then make selective adjustments:
pirateplot(formula = weight ~ Time,
data = ChickWeight,
main = "Adjusting an existing theme",
theme = 2, # Start with theme 2
inf.f.o = 0, # Turn off inf fill
inf.b.o = 0, # Turn off inf border
point.o = .2, # Turn up points
bar.f.o = .5, # Turn up bars
bean.f.o = .4, # Light bean filling
bean.b.o = .2, # Light bean border
avg.line.o = 0, # Turn off average line
point.col = "black" # Black points
)
data:image/s3,"s3://crabby-images/eef4d/eef4d51e3960762f024b3dd2e4e6d532809dbce0" alt=""
Just to drive the point home, as a barplot is a special case of a pirateplot, you can even reduce a pirateplot into a horrible barplot:
pirateplot(formula = weight ~ Time,
data = ChickWeight,
main = "Reducing a pirateplot to a barplot",
theme = 0, # Start from scratch
bar.f.o = .7) # Just turn on the bars
data:image/s3,"s3://crabby-images/b54f1/b54f12c3bace54e8652e36c2ea10d6a46047f308" alt=""
Additional arguments
There are several more arguments that you can use to customize your plot:
Additonal pirateplot elements
Background color |
back.col |
back.col = ‘gray(.9, .9)’ |
Gridlines |
gl.col, gl.lwd, gl.lty |
gl.col = ‘gray’, gl.lwd = c(.75, 0), gl.lty = 1 |
Quantiles |
quant, quant.lwd, quant.col |
quant = c(.1, .9), quant.lwd = 1, quant.col = ‘black’ |
Average line |
avg.line.fun |
avg.line.fun = median |
Inference Calculation |
inf.method |
inf.method = ‘hdi’, inf.method = ‘ci’ |
Inference Display |
inf.disp |
inf.disp = ‘line’, inf.disp = ‘bean’, inf.disp = ‘rect’ |
Here’s an example using a background color, and quantile lines.
pirateplot(formula = weight ~ Time,
data = ChickWeight,
main = "Adding quantile lines and background colors",
theme = 2,
back.col = gray(.98), # Add light gray background
gl.col = "gray", # Gray gridlines
gl.lwd = c(.75, 0),
inf.f.o = .6, # Turn up inf filling
inf.disp = "bean", # Wrap inference around bean
bean.b.o = .4, # Turn down bean borders
quant = c(.1, .9), # 10th and 90th quantiles
quant.col = "black" # Black quantile lines
)
data:image/s3,"s3://crabby-images/1ee2b/1ee2bb8fecf48abb04b4e6373b290f4d6819fefc" alt=""
Multiple IVs
You can use up to 3 categorical IVs in your plot. Here are some examples:
pirateplot(formula = height ~ sex + eyepatch + headband,
data = pirates,
theme = 2,
inf.disp = "bean")
data:image/s3,"s3://crabby-images/dca05/dca0505c0d1c784ab89bfc05e59f7c4eb2b21c24" alt=""
Here’s a pirateplot with showing the relationship between movie running times based on movie genre and whether the movie is a sequel or not.
pirateplot(formula = time ~ sequel + genre + rating,
data = subset(movies,
genre %in% c("Action", "Adventure", "Comedy", "Horror") &
rating %in% c("G", "PG", "PG-13", "R") &
time > 0),
theme = 3,
cex.lab = .8,
inf.disp = "rect",
pal = "up")
data:image/s3,"s3://crabby-images/33fd6/33fd6d9afb9c39c71752b1e8ac1b9f17e7465292" alt=""
Output
If you include the plot = FALSE
argument to a pirateplot, the function will return some values associated with the plot.
times.pp <- pirateplot(formula = time ~ sequel + genre,
data = subset(movies,
genre %in% c("Action", "Adventure", "Comedy", "Horror") &
rating %in% c("G", "PG", "PG-13", "R") &
time > 0),
plot = FALSE)
Here’s the result. The most interesting element is $summary
which shows summary statistics for each bean:
## $summary
## sequel genre bean.num n avg inf.lb inf.ub
## 1 0 Action 1 233 114.73391 112.54208 116.9171
## 2 1 Action 2 80 120.47500 116.47585 124.5282
## 3 0 Adventure 3 206 106.36408 103.33926 109.1894
## 4 1 Adventure 4 78 118.64103 111.37081 124.0271
## 5 0 Comedy 5 400 102.01500 100.92166 103.1729
## 6 1 Comedy 6 51 101.21569 98.31366 103.8705
## 7 0 Horror 7 79 102.13924 98.43406 105.3444
## 8 1 Horror 8 23 97.65217 92.99968 102.9126
##
## $avg.line.fun
## [1] "mean"
##
## $inf.method
## [1] "hdi"
##
## $inf.p
## [1] 0.95
Contribute!
I am very happy to receive new contributions and suggestions to improve the pirateplot. If you come up a new theme (i.e.; customization) that you like, or have a favorite color palette that you’d like to have implemented, please contact me (yarrr.book@gmail.com) or post an issue at www.github.com/ndphillips/yarrr/issues and I might include it in a future update.
References
The pirateplot is really a knock-off of the great beanplot package and visualization from Kampstra and others (2008).
Kampstra, Peter, and others. 2008. “Beanplot: A Boxplot Alternative for Visual Comparison of Distributions.” Journal of Statistical Software 28 (1): 1–9.
These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.