
Type: Package
Title: A Comprehensive Collection of Penguin Datasets for Statistical Analysis and Experimental Design
Version: 0.1.0
Maintainer: Juan Pablo Vargas Perez <j.pablovargas340@gmail.com>
Description: Offers a comprehensive collection of penguin-related datasets suitable for descriptive statistics, hypothesis testing, and experimental design. Derived from open ecological and biological sources such as Palmer Station studies, the package integrates datasets covering adult morphology, clutch size, blood isotope composition, and heart rate. It is designed for researchers, students, and educators to explore statistical methods including ANOVA, regression, multivariate analysis, and design of experiments in an accessible and reproducible context.
License: GPL-3
URL: https://github.com/jpablovargas340/PenguinR, https://jpablovargas340.github.io/PenguinR/
BugReports: https://github.com/jpablovargas340/PenguinR/issues
Encoding: UTF-8
LazyData: true
Suggests: ggplot2, dplyr, testthat (≥ 3.0.0), knitr, rmarkdown
RoxygenNote: 7.3.3
Config/testthat/edition: 3
VignetteBuilder: knitr
NeedsCompilation: no
Packaged: 2025-10-28 00:07:17 UTC; Juan Pablo
Author: Juan Pablo Vargas Perez [aut, cre]
Depends: R (≥ 3.5.0)
Repository: CRAN
Date/Publication: 2025-10-31 18:10:14 UTC

PenguinR: A Comprehensive Collection of Penguin Datasets for Statistical Analysis and Experimental Design

Description

Offers a comprehensive collection of penguin-related datasets suitable for descriptive statistics, hypothesis testing, and experimental design.

Details

PenguinR: A Comprehensive Collection of Penguin Datasets

A comprehensive collection of penguin datasets for statistical analysis and experimental design.

Author(s)

Maintainer: Juan Pablo Vargas Perez j.pablovargas340@gmail.com

See Also

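Useful links:

https://github.com/jpablovargas340/PenguinR
https://jpablovargas340.github.io/PenguinR/
Report bugs at https://github.com/jpablovargas340/PenguinR/issues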


Size Measurements for Penguins near Palmer Station, Antarctica

Description

This dataset, peng_df, is a data frame containing morphometric measurements for penguins observed near Palmer Station in the Palmer Archipelago, Antarctica. It includes data on three species across multiple islands, with variables for bill dimensions, flipper length, body mass, sex, and year of observation. Some observations may contain missing values.

Usage

data(peng_df)

Format

A data frame with 333 observations and 8 variables:

species

Penguin species (factor with 3 levels).

island

Island of observation (factor with 3 levels).

bill_length

Bill length in millimeters (numeric).

bill_depth

Bill depth in millimeters (numeric).

flipper_length

Flipper length in millimeters (integer).

body_mass

Body mass in grams (integer).

sex

Sex of the penguin (factor with 2 levels).

year

Year of observation (integer).

Details

The data originate from the palmerpenguins package and are distributed in the heplots package for use in examples of multivariate statistical analysis.

The dataset is named 'peng_df' to mark it as part of the PenguinR package and to avoid confusion with similarly named datasets elsewhere in the R ecosystem. The suffix 'df' indicates that the object is a data frame. The original content has not been modified in any way.

Source

Data taken from the heplots package version 1.7.8
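
A minimal sketch of a descriptive summary by species, assuming the column names listed in the Format section above. The helper group_summary() and the toy data frame are hypothetical and not part of PenguinR; the toy values are made up so the sketch runs without the package installed.

```r
# Mean and standard deviation of a numeric column within each level of a
# grouping column. Column names are passed as strings.
# (Hypothetical helper, not provided by PenguinR.)
group_summary <- function(df, value, group) {
  means <- tapply(df[[value]], df[[group]], mean, na.rm = TRUE)
  sds   <- tapply(df[[value]], df[[group]], sd, na.rm = TRUE)
  data.frame(group = names(means),
             mean  = as.numeric(means),
             sd    = as.numeric(sds),
             row.names = NULL)
}

# Toy stand-in with made-up values. It is NOT real penguin data.
toy <- data.frame(
  species   = factor(c("A", "A", "B", "B")),
  body_mass = c(3000L, 3200L, 5000L, 5400L)
)
print(group_summary(toy, "body_mass", "species"))

# With PenguinR installed, the same helper applies to peng_df.
if (requireNamespace("PenguinR", quietly = TRUE)) {
  data("peng_df", package = "PenguinR")
  print(group_summary(peng_df, "body_mass", "species"))
}
```

Passing column names as strings keeps the helper reusable for the other numeric columns, such as bill_length or flipper_length.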


Size Measurements for Adult Foraging Penguins near Palmer Station, Antarctica

Description

This dataset, penguins_df, is a data frame with measurements on adult penguins from three species and three islands in the Palmer Archipelago, Antarctica. Variables include flipper length, body mass, bill length and depth, sex, and year of observation. Some observations may contain missing values.

Usage

data(penguins_df)

Format

A data frame with 344 observations and 8 variables:

species

Penguin species (factor with 3 levels).

island

Island of observation (factor with 3 levels).

bill_length_mm

Bill length in millimeters (numeric).

bill_depth_mm

Bill depth in millimeters (numeric).

flipper_length_mm

Flipper length in millimeters (integer).

body_mass_g

Body mass in grams (integer).

sex

Sex of the penguin (factor with 2 levels).

year

Year of observation (integer).

Details

The columns in penguins_df are a subset of those in the more extensive penguins_raw_df data frame, which also includes nesting observations and blood isotope data. Column names and data types differ between the two datasets; see the Format section of each for details.

The dataset is named 'penguins_df' to mark it as part of the PenguinR package and to avoid confusion with similarly named datasets elsewhere in the R ecosystem. The suffix 'df' indicates that the object is a data frame. The original content has not been modified in any way.

Source

Data taken from the palmerpenguins package version 0.1.1
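
Because some observations in penguins_df may contain missing values, it can help to count them before fitting a model. The following is a sketch using base R only. The helper na_report() and the toy data frame are hypothetical, and the toy values are made up.

```r
# Count missing values per column and the number of fully observed rows.
# (Hypothetical helper, not provided by PenguinR.)
na_report <- function(df) {
  list(
    na_per_column = colSums(is.na(df)),
    complete_rows = sum(complete.cases(df))
  )
}

# Toy stand-in with made-up values. It is NOT real penguin data.
toy <- data.frame(
  bill_length_mm = c(40.0, NA, 41.0),
  sex            = factor(c("male", "female", NA))
)
report <- na_report(toy)
print(report)

# Keep only the rows without any missing value.
toy_complete <- toy[complete.cases(toy), , drop = FALSE]

# With PenguinR installed, the same report applies to penguins_df.
if (requireNamespace("PenguinR", quietly = TRUE)) {
  data("penguins_df", package = "PenguinR")
  print(na_report(penguins_df))
}
```

Dropping incomplete rows is only one option; functions such as mean() also accept na.rm = TRUE when a single column is analysed.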


Penguin Size, Clutch, and Blood Isotope Data for Foraging Adults near Palmer Station, Antarctica

Description

This dataset, penguins_raw_df, is a data frame containing comprehensive biological data for adult penguins observed near Palmer Station, Antarctica. It includes size measurements, clutch completion status, blood isotope values, and additional observational metadata. The dataset covers three penguin species from three islands in the Palmer Archipelago.

Usage

data(penguins_raw_df)

Format

A data frame with 344 observations and 17 variables:

studyName

Name of the study or project.

Sample Number

Unique numeric sample identifier.

Species

Species name (character).

Region

Region of the Palmer Archipelago (character).

Island

Island of observation (character).

Stage

Stage of breeding or observation period (character).

Individual ID

Unique penguin identifier (character).

Clutch Completion

Clutch completion status (character).

Date Egg

Date the egg was recorded (Date).

Culmen Length (mm)

Bill length in millimeters (numeric).

Culmen Depth (mm)

Bill depth in millimeters (numeric).

Flipper Length (mm)

Flipper length in millimeters (numeric).

Body Mass (g)

Body mass in grams (numeric).

Sex

Sex of the penguin (character).

Delta 15 N (o/oo)

Nitrogen isotope ratio δ¹⁵N (per mil).

Delta 13 C (o/oo)

Carbon isotope ratio δ¹³C (per mil).

Comments

Additional notes and remarks (character).

Details

The columns of penguins_raw_df extend those of the penguins_df dataset, adding details such as study name, individual ID, clutch information, isotope composition (δ¹⁵N and δ¹³C), and additional observational notes. Some variables include missing values.

The dataset is named 'penguins_raw_df' to mark it as part of the PenguinR package and to avoid confusion with similarly named datasets elsewhere in the R ecosystem. The suffix 'df' indicates that the object is a data frame. The original content has not been modified in any way.

Source

Data taken from the palmerpenguins package version 0.1.1
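
Several column names in penguins_raw_df contain spaces and parentheses, so they are not syntactic R names and must be quoted with backticks or accessed with [[ ]]. The sketch below illustrates this on a toy data frame with made-up values that reuses two of the column names from the Format section. The object names toy_raw and clean are hypothetical.

```r
# Toy stand-in with made-up values. It is NOT real penguin data.
# check.names = FALSE preserves the non-syntactic column names.
toy_raw <- data.frame(
  `Culmen Length (mm)` = c(40.0, 41.0),
  `Body Mass (g)`      = c(3500, 3600),
  check.names = FALSE
)

# Access by string with [[ ]] or by backtick-quoted name with $.
culmen <- toy_raw[["Culmen Length (mm)"]]
mass   <- toy_raw$`Body Mass (g)`

# Optionally convert the names to syntactic ones on a copy.
clean <- toy_raw
names(clean) <- make.names(names(clean))
print(names(clean))

# With PenguinR installed, the same access pattern applies.
if (requireNamespace("PenguinR", quietly = TRUE)) {
  data("penguins_raw_df", package = "PenguinR")
  print(summary(penguins_raw_df[["Culmen Length (mm)"]]))
}
```

Renaming on a copy leaves the original data frame untouched, which is consistent with the package's statement that the content is unmodified.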


Penguin Heart Rate

Description

This dataset, pinguinos_df, is a data frame containing the slopes of regressions of mass-specific metabolic rate on heart rate for three groups of Macaroni Penguins. The data summarize physiological relationships between metabolic and cardiac activity under different breeding and sex conditions.

Usage

data(pinguinos_df)

Format

A data frame with 24 observations and 2 variables:

group

Penguin group (factor with 3 levels: "breeding female", "breeding male", "nonbreeding female").

slope

Slope of the regression of mass-specific metabolic rate on heart rate (numeric).

Details

The dataset includes one factor variable indicating the penguin group and one numeric variable representing the slope of the regression for each observation. Some observations may contain missing values.

The dataset is named 'pinguinos_df' to mark it as part of the PenguinR package and to avoid confusion with similarly named datasets elsewhere in the R ecosystem. The suffix 'df' indicates that the object is a data frame. The original content has not been modified in any way.

Source

Data taken from the abd package version 0.2-8
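
With one factor (group) and one numeric response (slope), pinguinos_df suits a one-way ANOVA. The following is a sketch under that assumption, using aov() from the stats package. The helper fit_group_anova() and the toy data frame are hypothetical, and the toy values and group labels are made up.

```r
# Fit a one-way ANOVA of slope on group.
# (Hypothetical helper, not provided by PenguinR.)
fit_group_anova <- function(df) {
  aov(slope ~ group, data = df)
}

# Toy stand-in with made-up values. It is NOT real penguin data.
toy <- data.frame(
  group = factor(rep(c("g1", "g2", "g3"), each = 3)),
  slope = c(0.30, 0.32, 0.31, 0.28, 0.27, 0.29, 0.33, 0.35, 0.34)
)
fit <- fit_group_anova(toy)
anova_table <- summary(fit)[[1]]
print(anova_table)

# With PenguinR installed, the same model applies to pinguinos_df.
if (requireNamespace("PenguinR", quietly = TRUE)) {
  data("pinguinos_df", package = "PenguinR")
  print(summary(fit_group_anova(pinguinos_df)))
}
```

The usual ANOVA assumptions (independent observations, roughly normal residuals, similar group variances) should be checked before interpreting the F test.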
