Welcome to SmartCrawler

SmartCrawler is a java-based fully configurable, multi-threaded and extensible crawler, which is able to fetch and analyze the contents of a web site by using dinamically pluggable filters.

News

August 08 2005 - SmartCrawler 0.1 Alpha 4 Released

SmartCrawler 0.1 Alpha 4 has been released for testing. It includes a performances improvement and the refactoring of the xml configuration system.

July 15 2005 - SmartCrawler 0.1 Alpha 3 Released

SmartCrawler 0.1 Alpha 3 has been released for testing. It includes updated plugins management and a simpler startup procedure.

Download it here.

Getting Started with SmartCrawler

Read our Quick Start guide on how to set up an run SmartCrawler.

Release Information

The current SmartCrawler release is version 0.1a4 and can be obtained from the download page.

The current development version is only available from CVS at this point. For more information, see Source Repository.