Definitions for "Heritrix "
Heritrix is the Internet Archive's extensible, Web-scale, archival-quality Web crawler.
Heritrix is the Internet Archive’s web crawler which was specially designed for web archiving. It is open-source and written in Java. The main interface is accessible using a web browser, and there is a command-line tool that can optionally be used to initiate crawls.