HTML Hacking Scripts
Here are a few useful web-related programs I've written lately.
You might also want to see this interesting program by Abigail.
Shaking Up the Web
- latro
- Latro finds idiotic PC sites open to perl.exe?FMH.pl
abuse and reports their little problem.
HTML Munging
- churl
- Extract URLs and verify validity;
currently only looks for FTP:, HTTP:, and FILE: schemata,
stored in A or IMG tags.
- striphtml
- Strip out all the HTML bits from a document, leaving (unformatted)
plain text in its wake.
- htdecom
- Strips out comments from an HTML document.
- htitle
- Retrieve the title from a URL.
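
To make the jobs of striphtml, htdecom, and htitle concrete, here is a minimal sketch in Python. It is not the code of those scripts (which are Perl programs); the names strip_html, strip_comments, and get_title are made up for illustration, and it works on an HTML string already in memory rather than fetching a URL. The comment-stripping regex is an approximation that assumes well-formed comments.

```python
import re
from html.parser import HTMLParser


class TextAndTitle(HTMLParser):
    """Collect the visible text and the <title> text of an HTML document."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.text_parts = []
        self.title_parts = []
        self._in_title = False
        self._skip_depth = 0  # non-zero while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._in_title:
            self.title_parts.append(data)
        elif not self._skip_depth:
            self.text_parts.append(data)

    # Comments go to handle_comment, whose default does nothing,
    # so they never reach the collected text.


def _parse(html):
    parser = TextAndTitle()
    parser.feed(html)
    parser.close()
    return parser


def strip_html(html):
    """Return the plain text of a document, whitespace collapsed (cf. striphtml)."""
    return " ".join("".join(_parse(html).text_parts).split())


def get_title(html):
    """Return the text of the <title> element, or '' if absent (cf. htitle)."""
    return " ".join("".join(_parse(html).title_parts).split())


def strip_comments(html):
    """Remove <!-- ... --> comments but keep all other markup (cf. htdecom)."""
    return re.sub(r"<!--.*?-->", "", html, flags=re.DOTALL)
```

For example, strip_html applied to a page whose body is "<p>Hello <b>world</b></p>" yields "Hello world", while strip_comments leaves the tags in place and removes only the comments.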
URL Munging
- surl
- Given a list of URLs, sorts them by last-modified date.
- xurl
- Given one URL, extract all URLs it contains. Uses the LWP
library, and is pretty complete.
- qxurl
- Short for "quick xurl". Somewhat like xurl, but expects to
read from files, not URLs, and doesn't canonicalize relative links.
It also runs about 100x faster and doesn't require an external library.
- reltree
- Fix up a tree's URLs to make them all relative instead of absolute.
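
The ideas behind these tools (pulling URLs out of A and IMG tags, canonicalizing relative links against a base, and turning absolute links back into relative ones) can be sketched in Python as follows. This is only an illustration under stated assumptions, not the scripts' own Perl code: the names LinkCollector, extract_urls, and make_relative are hypothetical, the input is an HTML string rather than a fetched URL, and make_relative looks only at the path, so query strings and fragments are dropped.

```python
import posixpath
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags and src values from <img> tags."""

    WANTED = {"a": "href", "img": "src"}

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        wanted = self.WANTED.get(tag)
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value:
                self.links.append(value)


def extract_urls(html, base=None):
    """Return the URLs found in html, in document order.

    With base set, relative links are canonicalized against it;
    without it they are returned as written.
    """
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    if base is None:
        return collector.links
    return [urljoin(base, link) for link in collector.links]


def make_relative(url, page_url):
    """Rewrite url relative to the page at page_url when both share a host.

    URLs on a different scheme or host are returned unchanged.
    """
    target, page = urlsplit(url), urlsplit(page_url)
    if (target.scheme, target.netloc) != (page.scheme, page.netloc):
        return url
    page_dir = posixpath.dirname(page.path) or "/"
    return posixpath.relpath(target.path or "/", start=page_dir)
```

Calling extract_urls without a base corresponds to the quicker, non-canonicalizing behavior described for qxurl; passing a base resolves each relative link the way a browser would.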
Netscape Munging
- ggh
- Grovel global history. Search or dump out the Netscape global
history file.