The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.

Title: Comprehensive Data Summarization for Statistical Analysis
Version: 0.1.1
Author: Immad Ahmad Shah [aut], Uzair Javid Khan [aut, cre], Sukhdev Mishra [aut]
Maintainer: Uzair Javid Khan <uzairkhan11w@gmail.com>
Description: Summarizes data frames by calculating various statistics including central tendency, dispersion, shape, and normality diagnostics. Handles numeric, character, and factor columns with NA-aware computations.
License: GPL-3
Encoding: UTF-8
RoxygenNote: 7.3.3
Imports: moments, dplyr, nortest, stats
Suggests: testthat (≥ 3.0.0)
Config/testthat/edition: 3
URL: https://github.com/Uzairkhan11w/DataSum
BugReports: https://github.com/Uzairkhan11w/DataSum/issues
NeedsCompilation: no
Packaged: 2025-11-12 16:58:31 UTC; Uzair
Repository: CRAN
Date/Publication: 2025-11-12 18:00:21 UTC

Summarize an Entire Data Frame

Description

Applies Datum() to each column of a data frame and binds the results.

Usage

DataSumm(data)

Arguments

data

A data frame (tibble is also ok).

Value

A data frame, one row per input column.

Examples

DataSumm(iris)

Summarize a Single Vector

Description

Summarizes a single vector by calculating a consistent set of statistics. Numeric vectors include mean/median/variance/sd/min/max/range/skewness/kurtosis and a normality decision. Character/factor vectors report the mode only.

Usage

Datum(data)

Arguments

data

A numeric, character, factor, or other vector.

Value

A one-row data.frame with summary statistics.

Examples

Datum(rnorm(100))
Datum(factor(sample(letters[1:3], 20, TRUE)))

Get Mode of a Vector (numeric/character/factor)

Description

Returns the mode(s) of a vector. By default, returns a single string with all modes collapsed by comma when there are ties. If you need only one mode, set collapse = FALSE to return the first mode deterministically.

Usage

getmode(x, collapse = TRUE)

Arguments

x

A vector (numeric, character, factor, etc.).

collapse

Logical; if TRUE (default), return all modes as a single comma-separated string. If FALSE, return the first mode only.

Value

A single value (first mode) or a comma-separated string of modes.

Examples

getmode(c(1, 2, 2, 3, 4))
getmode(c("a", "b", "b", "a"), collapse = TRUE)
getmode(c("a", "b", "b", "a"), collapse = FALSE)

Shapiro/Anderson-Darling Normality Decision

Description

Performs Shapiro-Wilk for sample sizes between 3 and 5000 (inclusive), otherwise uses Anderson–Darling. Returns "Normal" if p > 0.05, else "Not Normal".

Usage

shapiro_normality_test(data)

Arguments

data

A numeric vector.

Value

Character scalar: "Normal", "Not Normal", or NA if not applicable.

Examples

shapiro_normality_test(rnorm(100))

These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.