The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.

Title: Compute the Difference Between Data Frames
Version: 0.9.0
Description: Shows you which rows have changed between two data frames with the same column structure. Useful for diffing slowly mutating data.
License: MIT + file LICENSE
Imports: arrow, dplyr, janitor, rlang
BugReports: https://github.com/riazarbi/diffdfs
Encoding: UTF-8
RoxygenNote: 7.2.1
Suggests: testthat (≥ 3.0.0)
Config/testthat/edition: 3
NeedsCompilation: no
Packaged: 2022-08-05 10:38:53 UTC; jovyan
Author: Riaz Arbi [aut, cre]
Maintainer: Riaz Arbi <diffdfs@arbidata.com>
Repository: CRAN
Date/Publication: 2022-08-09 13:50:06 UTC

Check That A Dataframe Key Col Set Is Unique

Description

Checks that a provided vector of column names constitue a unique key (that is, no rows are duplicated) for a dataframe.

Usage

checkkey(df, key_cols, verbose = FALSE)

Arguments

df

a dataframe

key_cols

vector of column names

verbose

TRUE/FALSE should we print a message?

Value

TRUE if key cols have unique rows; FALSE if not

Examples

irisint = iris
irisint$rownum = 1:nrow(irisint)
key_cols = c("rownum")
checkkey(irisint, key_cols, TRUE)
checkkey(irisint, "Species", TRUE)

Compute the Difference Between Dataframes

Description

Returns a dataframe describing the modifications required to transform old_df into new_df. The dataframes needBugReports: https://github.com/tidyverse/dplyr/issues to have identical columns and column types and share unique index columns.

Usage

diffdfs(new_df, old_df = NA, key_cols = NA, verbose = FALSE)

Arguments

new_df

A dataframe of new data.

old_df

A dataframe of old data. new_df and old_df can (and usually do) have overlapping data.

key_cols

optional vector of column names that constitute a unique table key. If NA, colnames(old_df) will be used.

verbose

logical, default FALSE. Should the processing be chatty?

Value

a dataframe.

Examples

iris$key <- 1:nrow(iris)

old_df <- iris[1:100,]
old_df[75,1] <- 100
new_df <- iris[50:150,]
diffdfs(new_df, old_df, key_cols = "key")

These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.