Download of heritrix-2.0.0-dist.zip (heritrix-2.0.0-dist.zip ( external link: SF.net): 40,719,435 bytes) will begin shortly. If not so, click link on the left.

File Information

File Size
40,719,435 bytes
MD5
d1a67ce21e40252b5190fc535e49a074

프로젝트 설명

The archive-crawler project is building Heritrix: a flexible, extensible, robust, and scalable web crawler capable of fetching, archiving, and analyzing the full diversity and breadth of internet-accesible content.