SiteSucker is a relatively simple program and it has a number of limitations.
SiteSucker only examines CSS and HTML for URLs. When SiteSucker analyzes HTML, it looks for URLs in the style attribute and in the following tags:
If a link is specified elsewhere in an HTML file, SiteSucker will not see it.
SiteSucker does not scan PDFs, Flash files (swf), QuickTime movie files (mov), or other media files for embedded links.
SiteSucker ignores forms for the most part, although it will download the
action URL from a
form tag and the
src URL from an
input tag, and you can use the web browser to submit forms. See Password-protected Sites in the SiteSucker manual for more information.
SiteSucker cannot download videos. See Frequently Asked Questions in the SiteSucker manual for more information.
By default, SiteSucker honors robots.txt exclusions, the Robots META tag, and the X-Robots-Tag HTTP header. As a result, any files or directories disallowed by robot exclusions will not be downloaded by SiteSucker. See Robot Exclusions in the SiteSucker manual for more information.