The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.
DataExplorer 0.8.3
Enhancements
- #154
PR: Added YAML option to allow HTML elements when choosing PDF
report.
- #165:
Added
geom_jitter
option to plot_boxplot
and
plot_scatterplot
.
- #176
PR: Improved legend ordering in
plot_missing
.
- #177
PR: Added group color customization in
plot_missing
.
DataExplorer 0.8.2
Enhancements
- #139:
Added
by
argument to plot_bar
.
Bug Fixes
- #148:
Address CRAN removal due to vignette build failure.
DataExplorer 0.8.1
Enhancements
- #111:
Continuous distributions can now be plotted with different scales, i.e.,
histogram, density, boxplot, scatterplot.
- #126:
Cleaned up labels in legend guide.
- #127
(PR): Added option to plot columns with missing values only in
plot_missing
.
- Cleaned up code for
create_report
.
Bug Fixes
- #109:
Fixed a bug causing unordered bar charts.
- #114:
Removed redundant message in
dummify
.
- #116:
Fixed pandoc document conversion error 99.
- #120:
Fixed type
logical
being parsed as symbol
in
configure_report
.
- #121:
Fixed missing value bug when
split_columns(..., binary_as_factor = TRUE)
.
- #130
(PR):
plot_prcomp
now drops columns with zero
variance.
DataExplorer 0.8.0
New Features
- #92:
Added
update_columns
to transform any selected
columns.
Enhancements
- #87:
Added
configure_report
function to customize report
content.
- #89:
Added option to customize
geom_text
and
geom_label
arguments.
- #91:
create_report
now displays full report directory after
completion.
- #95:
Added better exception handling for
plot_bar
.
- #98:
Added band customization to
plot_missing
.
- #100:
Switched
geom_text
to geom_label
.
- #103:
Report title can now be customized in
create_report
.
- #108:
Added option to treat binary features as discrete in
plot_bar
, plot_histogram
,
plot_density
and plot_boxplot
.
- Updated d3.min.js to v5.9.2.
Bug Fixes
- #88:
Added
plot_intro
to report config.
- #90:
Added first plot in
plot_prcomp
to output and
page_0
.
- #94:
Fixed typo for PCA.
DataExplorer 0.7.1
Enhancements
- #86:
Replaced
gridExtra::grid.arrange
with facets.
- Added seeds to vignette and README for re-producible examples.
- Hid all internal functions.
DataExplorer 0.7.0
New Features
- #72:
Added
plot_qq
for QQ plot.
- #76:
Added
plot_intro
to visualize results of
introduce
.
Enhancements
- #42:
Applied S3 methods for plotting functions.
- #77:
dummify
now works on selected columns.
- #78: All
ggplot objects from
plot_*
are now invisibly returned. As a
result, extracted profile_missing
from
plot_missing
for missing value profiles.
- #83:
Removed all deprecated functions.
- #85:
Users can now specify number of rows/columns for plot page layout.
plot_prcomp
now passed scale. = TRUE
to
prcomp
by default.
- Added
sampled_rows
argument to
plot_scatterplot
.
- Added option to parallelize plot object construction.
- Updated default config for
create_report
.
Bug Fixes
- #74:
Fixed a bug causing
create_report
failure due to zero
complete rows.
- #75:
Fixed a bug in
plot_str
when plotting data.frame with more
than 100 columns.
- #82:
Removed hard-coded scales from all plot functions.
- Fixed a bug causing wrong column indices in
split_columns
.
- Fixed a bug using standard deviation instead of variance in
plot_prcomp
.
DataExplorer 0.6.1
Enhancements
- Updated vignette for better clarity.
- #71:
Added better error handler for
plot_prcomp
.
Bug Fixes
- #69:
Fixed bug causing
create_report
failure (specifically from
plot_prcomp
) when y
is specified.
- Added more unit tests for
create_report
and
plot_prcomp
.
DataExplorer 0.6.0
New Features
- #15:
Added
plot_prcomp
to visualize principal component
analysis.
- #54:
Extracted
dummify
from plot_correlation
as a
new function.
- #59:
Added
introduce
for basic metadata.
Enhancements
- #41:
create_report
can now be customized.
- #53:
Added page number for plots that span multiple pages.
- #56:
Added support for theme and customization for individual
components.
- #62:
plot_bar
now supports optional measures (in addition to
categorical frequency) using argument with
.
- #66:
Feature engineering functions works on other classes in addition to just
data.table.
plot_missing
:
- Percentage text labels from output plot now has 2 decimals to
prevent small percentages from being truncated to 0%.
- Added example to quickly drop columns with too many missing
values.
- Added
.ignoreCat
and .getAllMissing
to
helper.
Bug Fixes
- #55:
Fixed bugs and updated vignette with latest functions.
- #57:
Fixed
plot_str
bug for not supporting S4 objects.
- #63:
Fixed
plot_histogram
and plot_density
not
working with column names containing spaces.
DataExplorer 0.5.0
New Features
- #48:
Added
plot_scatterplot
to visualize relationship of one
feature against all other.
- #50:
Added
plot_boxplot
to visualize continuous distributions
broken down by another feature.
Enhancements
- #44:
Added option to exclude categories in
group_category
.
- #45:
Added title option for all plots.
- #46:
Added option to exclude columns in
set_missing
.
- #49
[Breaking Change]: Switched package to tidyverse style. All old
functions are in
.Deprecated
mode. List of name changes in
alphabetical order:
BarDiscrete
-> plot_bar
CollapseCategory
-> group_category
CorrelationContinuous
->
plot_correlation(..., type = "continuous")
CorrelationDiscrete
->
plot_correlation(..., type = "discrete")
DensityContinuous
-> plot_density
DropVar
-> drop_columns
GenerateReport
-> create_report
HistogramContinuous
->
plot_histogram
PlotMissing
-> plot_missing
PlotStr
-> plot_str
SetNaTo
-> set_missing
SplitColType
-> split_columns
- #52:
Combined
CorrelationContinuous
and
CorrelationDiscrete
into one function, and added option to
view correlation of all features at once.
- Optimized layout for multiple plots.
Bug Fixes
- #47:
Fixed color scale for correlation heatmap.
DataExplorer 0.4.0
New Features
- #33:
Added
PlotStr
to visualize data structure.
- #40:
Added network graph to
GenerateReport
.
Bug Fixes
- #32:
Fixed pandoc requirement error in unit test on cran.
- #34:
Fixed error message when
quiet
is not supplied. In
addition, report directory are printed through message()
instead of cat()
.
- #35:
Fixed rprojroot not found error.
Enhancements
- #12:
Added vignette: dataexplorer-intro.
- #36:
Fixed warnings from data.table in
DropVar
.
- #37:
Changed all
cat()
to message()
.
- #38:
Added option to order bars in
BarDiscrete
.
- #39:
Extended
SetNaTo
to discrete features.
- Added more examples to README.md.
DataExplorer 0.3.0
New Features
- #25:
Added
SetNaTo
to quickly reset missing numerical
values.
- #29:
Added
DropVar
to quickly drop variables by either name or
column position.
Bug Fixes
- #24:
CorrelationDiscrete
now displays all factor levels instead
of full rank matrix from model.matrix
.
Enhancements
- #11:
Functions with return values will now match the input class and set it
back.
- #22:
Added documentation for
num_all_missing
in
SplitColType
.
- #23:
Added additional measures (in addition to frequency) to
CollapseCategory
.
- #26:
Removed density estimation section from report template.
- #31:
Added flexibility to name the new category in
CollapseCategory
.
Other notes
- #30: In
CollapseCategory
, update = TRUE
will only work
with input data as data.table
. However, it is still
possible to view the frequency distribution with any input data class,
as long as update = FALSE
.
DataExplorer 0.2.6
Bug Fixes
- #20:
Fixed permission denied bug due to intermediates_dir argument in
knitr::render
.
Enhancements
- #16:
Improved handling of missing values.
DataExplorer 0.2.5
Bug Fixes
- #18:
GenerateReport
now handles data without discrete or
continuous features.
Enhancements
- #14:
Updated rmarkdown template for
GenerateReport
.
- #1:
Features with all
NA
values will be ignored in
BarDiscrete
.
DataExplorer 0.2.4
Bug Fixes
- Fixed a major bug in
GenerateReport
function due to
package renaming.
Enhancements
GenerateReport
will now print the directory of the
report to console.
DataExplorer 0.2.3
New Features
- Added function
CollapseCategory
to collapse sparse
categories for discrete features.
- Added correlation heatmap for both continuous and discrete
features.
- Added density plot for continuous features.
Bug Fixes
- Fixed a bug in
BarDiscrete
and
CorrelationDiscrete
for not plotting non-factor class.
- Minor changes for CRAN re-submission.
Enhancements
- Changed grid layout for
BarDiscrete
and
HistogramContinuous
.
- Features with all missing values will be ignored.
- Switched position between continuous and discrete features in report
template.
- Renamed package name to DataExplorer.
- Added NEWS.md.
- Removed
BoxplotContinuous
.
These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.