LinkChecker

Check websites for broken links

View project on GitHub

Introduction

LinkChecker is a free, GPL licensed website validator. LinkChecker checks links in web documents or full websites. It runs on Python 2 systems, requiring Python 2.7.2 or later. Python 3 is not yet supported.

Features

  • recursive and multithreaded checking and site crawling
  • output in colored or normal text, HTML, SQL, CSV, XML or a sitemap graph in different formats
  • HTTP/1.1, HTTPS, FTP, mailto:, news:, nntp:, Telnet and local file links support
  • restriction of link checking with regular expression filters for URLs
  • proxy support
  • username/password authorization for HTTP and FTP and Telnet
  • honors robots.txt exclusion protocol
  • Cookie support
  • HTML5 support
  • Plugin support allowing custom page checks. Currently available are HTML and CSS syntax checks, Antivirus checks, and more.
  • Different interfaces: command line, GUI and web interface
  • ... and a lot more check options documented in the manual page.

Screenshots

CLI screenshot GUI screenshot CGI screenshot
Commandline interface GUI client CGI web interface

Basic usage

To check a URL like http://www.example.org/myhomepage/ it is enough to enter http://www.example.org/myhomepage/ in the GUI client or web interface, or execute linkchecker http://www.example.org/myhomepage/ on the command line.

This check will validate recursively all pages starting with http://www.example.org/myhomepage/. Additionally, all external links pointing outside of www.example.org will be checked but not recursed into.

Other linkcheckers

If this software does not fit your requirements, you can check out other free linkcheckers.

Test suite status

Linkchecker has extensive unit tests to ensure code quality. Travis CI is used for continuous build and test integration.

Build Status