In many cases, raw markdown files and their HTML-rendered equivalents are both available. For instance, GitHub repositories store raw markdown but also render it as HTML. In these cases, you can either use readMDTable to parse the raw markdown or use rvest to parse the HTML. This vignette provides benchmarks to help you decide which option is better for your use case.
The main benefit of readMDTable is that raw markdown is more lightweight than its HTML-rendered alternative, so it takes less time to retrieve from the web. See the benchmarking below.
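To make the size difference concrete, here is a minimal base R sketch that writes the same small table once as raw markdown and once as HTML, then compares their sizes in bytes. The table contents and the variable names (`md_table`, `html_table`, `size_ratio`) are hypothetical and chosen only for illustration; they are not part of readMDTable or rvest.

```r
# Hypothetical two-row table, written once as raw markdown and once as HTML.
md_table <- paste(
  "| model      | mpg  |",
  "|------------|------|",
  "| Mazda RX4  | 21.0 |",
  "| Datsun 710 | 22.8 |",
  sep = "\n"
)

html_table <- paste(
  "<table>",
  "<thead><tr><th>model</th><th>mpg</th></tr></thead>",
  "<tbody>",
  "<tr><td>Mazda RX4</td><td>21.0</td></tr>",
  "<tr><td>Datsun 710</td><td>22.8</td></tr>",
  "</tbody>",
  "</table>",
  sep = "\n"
)

# Size of each representation in bytes.
md_bytes <- nchar(md_table, type = "bytes")
html_bytes <- nchar(html_table, type = "bytes")

# How many times larger the HTML version is than the markdown version.
size_ratio <- html_bytes / md_bytes
print(c(markdown = md_bytes, html = html_bytes))
```

This only compares the table markup itself. A fully rendered web page usually also carries navigation, styling, and script markup, so the gap in practice is likely to be larger than this sketch suggests.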
test_extract_md_tables <- function(content) {
  extract_md_tables(content, show_col_types = FALSE)
}
test_read_md_table <- function(content) {
  read_md_table(content, show_col_types = FALSE)
}
test_rvest <- function(content) {
  rvest::read_html(content) |> rvest::html_table()
}
Comparing readMDTable and rvest on retrieving and parsing a markdown file with multiple tables and other non-table content.
mtcars_split_raw_md_url <- "https://raw.githubusercontent.com/jrdnbradford/readMDTable/refs/heads/main/inst/extdata/mtcars-split.md"
mtcars_split_html_url <- "https://github.com/jrdnbradford/readMDTable/blob/main/inst/extdata/mtcars-split.md"
resp <- microbenchmark::microbenchmark(
  test_extract_md_tables(mtcars_split_raw_md_url),
  test_rvest(mtcars_split_html_url),
  times = 100
)
ggplot2::autoplot(resp)
Comparing readMDTable and rvest on retrieving and parsing a markdown file that contains only a table.
mtcars_raw_md_url <- "https://raw.githubusercontent.com/jrdnbradford/readMDTable/refs/heads/main/inst/extdata/mtcars.md"
mtcars_html_url <- "https://github.com/jrdnbradford/readMDTable/blob/main/inst/extdata/mtcars.md"
resp <- microbenchmark::microbenchmark(
  test_read_md_table(mtcars_raw_md_url),
  test_rvest(mtcars_html_url),
  times = 100
)
ggplot2::autoplot(resp)
Comparing readMDTable and rvest on parsing a markdown file with multiple tables and other non-table content, already retrieved and stored in memory.
mtcars_raw_md_file <- readr::read_file(mtcars_split_raw_md_url)
mtcars_html_file <- readr::read_file(mtcars_split_html_url)
resp <- microbenchmark::microbenchmark(
  test_extract_md_tables(mtcars_raw_md_file),
  test_rvest(mtcars_html_file),
  times = 100
)
ggplot2::autoplot(resp)
Comparing readMDTable and rvest on parsing only a markdown table, already retrieved and stored in memory.
raw_md <- readr::read_file(mtcars_raw_md_url)
html <- readr::read_file(mtcars_html_url)
resp <- microbenchmark::microbenchmark(
  test_read_md_table(raw_md),
  test_rvest(html),
  times = 100
)
ggplot2::autoplot(resp)