Link-Checker


Last modified: 1999/01/10 16:12:00

Tools for creating sitemaps and checking links

Linbot

Linbot is a professional site management tool for webmasters. It lets webmasters view the structure of a site, track down broken links, find potentially outdated web pages, list links pointing to external sites, view a portfolio of inline images, get a run-down of problems sorted by author, and do all of this periodically without user intervention. Linbot is a free clone of Linkbot and plans to incorporate many of Linkbot's features as well as enhancements of its own.

Linbot requires a Python interpreter.


Tree.pl

tree creates page overviews, so-called sitemaps, for HTML files. It does not follow the links in the files; instead it searches all files from a given directory downwards for images and HTML files. The contents of the title tags are used as the links. tree can be run as a normal script (e.g. from the shell or via crontab), and it can be deployed as a CGI script (Perl version) or as a servlet (Java version), provided the server supports that. In the latter two cases the output is generated on the fly, which can take a while with a large number of files.
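The directory walk described above can be sketched roughly as follows. This is a Python illustration, not tree's own Perl or Java code: the names page_title and build_sitemap are hypothetical, images are ignored for brevity, and the fallback to the file path for pages without a title is an assumption.

```python
import html
import os
import re

# Matches the first <title> element, regardless of case or line breaks.
TITLE_RE = re.compile(r"<title[^>]*>(.*?)</title>", re.IGNORECASE | re.DOTALL)


def page_title(path):
    """Return the text of the first <title> element in a file, or None."""
    with open(path, encoding="utf-8", errors="replace") as handle:
        match = TITLE_RE.search(handle.read())
    if match is None:
        return None
    # Decode entities and collapse whitespace inside the title.
    return " ".join(html.unescape(match.group(1)).split())


def build_sitemap(root):
    """Walk root downwards and return an HTML list linking every HTML file."""
    items = []
    for directory, subdirs, files in os.walk(root):
        subdirs.sort()  # keep the traversal order deterministic
        for name in sorted(files):
            if not name.lower().endswith((".html", ".htm")):
                continue  # no links are followed, only file names are inspected
            path = os.path.join(directory, name)
            href = os.path.relpath(path, root).replace(os.sep, "/")
            label = page_title(path) or href  # assumption: fall back to the path
            items.append('<li><a href="%s">%s</a></li>'
                         % (html.escape(href), html.escape(label)))
    return "<ul>\n" + "\n".join(items) + "\n</ul>"
```

Calling build_sitemap("/path/to/htdocs") returns the list as a string, which could be written to a file from a cron job or printed by a CGI wrapper.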

Requires Perl 5.003 or newer

Freeware - GPL


Checkbot

Checkbot is a tool to verify links on a set of HTML pages. Checkbot can check a single document, or a set of documents on one or more servers. Checkbot creates a report which summarizes all links which caused some kind of warning or error.

Checkbot collects all links from URLs matching the match string, and accessible through the start URL. It has two categories of links: internal links (to other URLs matching the match string), and external links. The external links are filed for later use, while all internal links are checked.
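The split into internal and external links might look roughly like the sketch below. It is written in Python for illustration and is not Checkbot's Perl code; LinkCollector and classify_links are hypothetical names, and treating the match string as a plain substring of the absolute URL is an assumption.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkCollector(HTMLParser):
    """Collect the targets of <a href> and <img src> attributes."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if value and (tag, name) in (("a", "href"), ("img", "src")):
                self.links.append(value)


def classify_links(page_url, page_html, match):
    """Return (internal, external) lists of absolute URLs found on a page."""
    collector = LinkCollector()
    collector.feed(page_html)
    internal, external = [], []
    for raw in collector.links:
        # Resolve relative links and drop "#fragment" parts.
        absolute = urldefrag(urljoin(page_url, raw)).url
        # Assumption: a URL is internal when it contains the match string.
        bucket = internal if match in absolute else external
        if absolute not in bucket:
            bucket.append(absolute)
    return internal, external
```

In such a scheme the internal list would be crawled further right away, while the external list is only stored until the internal pass is complete.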

After checking all internal links, Checkbot will check all external links found on the pages. It will always use the HEAD method for this. Checkbot does not adhere to the robots standard (which involves examining /robots.txt from a site first). However, its author reports that people have raised good arguments for doing so and intends to implement this in Checkbot eventually.
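A HEAD request asks the server for the status line and headers only, so the document body is never transferred. A minimal Python sketch of such a check, using only the standard library, is shown here; head_status is a hypothetical helper and the 10 second default timeout is an arbitrary assumption, neither is taken from Checkbot.

```python
import urllib.error
import urllib.request


def head_status(url, timeout=10.0):
    """Send a HEAD request; return the HTTP status code, or None on failure."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # The server answered, but with an error status such as 404.
        return error.code
    except (urllib.error.URLError, OSError):
        # No usable answer at all: DNS failure, refused connection, timeout.
        return None
```

A link checker could report any result other than a 2xx code as a warning or error for the page that contains the link.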

After an interval (which gets exponentially longer) Checkbot will write its current results to a file. This file contains some statistics, such as the number of links processed. It also contains a list of pages (sorted by server, and per server by page name) that contain links which generated an HTTP error code, along with that code.
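The reporting schedule and the sort order of the report can be illustrated with the short Python sketch below. The starting interval, the growth factor and both function names are assumptions made for the example; the source does not state the values Checkbot actually uses.

```python
from itertools import islice
from urllib.parse import urlsplit


def report_intervals(first=10.0, factor=2.0):
    """Yield waiting times between reports that grow exponentially."""
    interval = first
    while True:
        yield interval
        interval *= factor


def sort_problem_pages(problems):
    """Sort (page_url, link_url, status_code) tuples by server, then page name."""
    return sorted(
        problems,
        key=lambda item: (urlsplit(item[0]).netloc, urlsplit(item[0]).path),
    )


# First four waiting times with the assumed defaults: 10, 20, 40, 80.
first_four = list(islice(report_intervals(), 4))
```

Growing the interval keeps early feedback frequent on small sites while avoiding constant file rewrites during a long run.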

Checkbot requires the following additional software, all of which is also available at CPAN:

  • perl 5 (version 5.004 recommended)
  • LWP (the libwww-perl 5 module)
  • Net::FTP
  • Mail::Send (optional, needed to use --mailto option)


Sitemap

A little Perl script that makes a site map (HTML index page) from all HTML pages with META DESCRIPTION tags below the current directory.
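Picking out the META DESCRIPTION tag that decides whether a page is listed could be done along these lines. This is a Python sketch for illustration, not the Perl script itself; DescriptionParser and meta_description are hypothetical names.

```python
from html.parser import HTMLParser


class DescriptionParser(HTMLParser):
    """Remember the content of the first <meta name="description"> tag."""

    def __init__(self):
        super().__init__()
        self.description = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.description is not None:
            return
        attributes = dict(attrs)
        # The parser lowercases tag and attribute names, but not values.
        if (attributes.get("name") or "").lower() == "description":
            self.description = attributes.get("content")


def meta_description(page_html):
    """Return the META DESCRIPTION text of a page, or None if it has none."""
    parser = DescriptionParser()
    parser.feed(page_html)
    return parser.description
```

Pages for which meta_description returns None would simply be left out of the generated index.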


MOMspider

MOMspider is a web-roaming robot that specializes in the maintenance of distributed hypertext infostructures (i.e. wide-area webs). The program is written in Perl and, once customized for your site, should work on any UNIX-based system with Perl 4.036.

For more information on what MOMspider is, why it is needed, and how it was designed, see the MOMspider paper that was presented at the First International Conference on the World-Wide Web (WWW94).


lvrfy: A HTML Link Verifier

lvrfy is a script that verifies all the internal links in HTML pages on your server. Its operation is rather simple: it starts with one page, parses all the links (including inline images), and then recursively checks all the links.
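The idea can be sketched in Python as follows. This is not lvrfy's shell code: verify_links and LinkParser are hypothetical names, pages are read from a local document root rather than from a server, and an explicit work list stands in for the recursion.

```python
import os
import posixpath
from html.parser import HTMLParser
from urllib.parse import urlsplit


class LinkParser(HTMLParser):
    """Collect href targets of <a> tags and src targets of <img> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        wanted = {"a": "href", "img": "src"}.get(tag)
        for name, value in attrs:
            if name == wanted and value:
                self.links.append(value)


def verify_links(root, start):
    """Follow internal links from start; return sorted broken (page, target) pairs."""
    broken, visited, pending = set(), set(), [start]
    while pending:
        page = pending.pop()
        if page in visited:
            continue  # every page is parsed only once
        visited.add(page)
        parser = LinkParser()
        with open(os.path.join(root, page), encoding="utf-8", errors="replace") as handle:
            parser.feed(handle.read())
        for link in parser.links:
            parts = urlsplit(link)
            if parts.scheme or parts.netloc or not parts.path:
                continue  # external link or pure fragment: not checked here
            if parts.path.startswith("/"):
                target = posixpath.normpath(parts.path.lstrip("/"))
            else:
                target = posixpath.normpath(
                    posixpath.join(posixpath.dirname(page), parts.path))
            escapes_root = target == ".." or target.startswith("../")
            if escapes_root or not os.path.isfile(os.path.join(root, target)):
                broken.add((page, target))
            elif target.lower().endswith((".html", ".htm")):
                pending.append(target)  # check this page's links as well
    return sorted(broken)
```

For example, verify_links("/path/to/htdocs", "index.html") would return an empty list when every internal link and inline image reachable from index.html exists on disk.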

This is a regular shell script. Just make it executable, and you're ready to go. It assumes that the following programs are in your path: sed, awk, csh, touch, and rm.


SiteMap

SiteMap is a small bash script that creates an HTML sitemap of your *.*htm* files. Requires the Unix shell bash.

See it in action: http://www.klografx.de/misc/


Web Tree Scanner

Web Tree Scanner (WTS) is a program to visualize the tree of a WWW server and check the links. It was designed as part of a software project in 1997/1998 and uses the Xclasses X11 layout library. The scan results may be browsed in the GUI or printed as PostScript.


Shareware


Linklint - Fast html link checker

Linklint is a Perl shareware program that checks all local and remote links on a web site. It works with Perl 4 or Perl 5 on Windows and Unix platforms.



Commercial Tools


LinkScan

LinkScan automatically detects broken links caused by missing files and unreachable URLs to ensure that your Internet or intranet website retains a high-quality, professional appearance.
LinkScan runs on UNIX and Windows NT servers; its vendor describes it as the fastest, most accurate and most scalable tool of its kind. It requires Perl 5. Reports may be viewed using all industry-standard browsers.