The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.
archiveRetriever 0.4.0
- Replace deprecated functions of dependencies
- Fix bugs in archive_overview() and
retrieve_urls()
- New option nonArchive added to retrieve_links()
and scrape_urls(). This option allows users to scrape internet
pages not stemming from the Internet Archive.
- New feature added to the collapse option of
scrape_urls(). collapse can now also take a Xpath as
input, to collapse results based on a structuring Xpath. Unfortunately,
this works only with Xpaths and not with CSS selectors. If used,
Paths refers only to children of the structuring Xpath given in
collapse.
archiveRetriever 0.3.1
- Changes to the testing environment.
- Disable progress bar in non-interactive use.
archiveRetriver 0.3.0
- Fixes to filtering of links in retrieve_links() to enable
link scraping from domains with more than one domain ending.
- New option filter added to retrieve_links(). This
options allows to disable the filtering of links to be sub-domains of
the top-level domain.
- New option pattern added to retrieve_links(). This
option allows for custom patterns by which links are filtered before
output.
archiveRetriever 0.2.0
- New option collapseDate added to retrieve_urls().
This option allows users to choose whether retrieve_urls
outputs all or just one memento per requested day.
archiveRetriever 0.1.2
- Fixes to ignoreErrors option for html reading-errors in
scrape_urls()
- Fixes to retrieve_links() for Errors occurring in last
Url
- Improve compatibility between retrieve_links() and
scrape_urls()
archiveRetriever 0.1.1
- Fixes to ignoreErrors option for encoding errors in
retrieve_links()
archiveRetriever 0.1.0
- Fixes to function behavior in case of timeout.
- Changes to the preliminary output printed by scrape_urls()
function in case of error.
- Integration of more flexibility for using the scrape_urls()
option attachto
- New option collapse added to scrape_urls(). This
option allows users to choose whether html elements retrieved via the
archiveRetriever are collapsed into a single observation or are kept as
different observations in the output dataset.
- Update of the package documentation.
archiveRetriever 0.0.2
- Minor fixes and adjustments to Error-Messages
- More stable testing environment
archiveRetriever 0.0.1
- Final version for Cran submission
archiveRetriever 0.0.0.9000
- Added a
NEWS.md
file to track changes to the
package.
These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.