General

General

The General screen provides the following settings:

Suppress Login Dialog

Whenever SiteSucker encounters a page that requires authentication, it displays the Login Dialog.

Switch on this control to suppress display of the Login Screen and skip the download of any pages that require authentication. For more information on authentication, see Password-protected Sites.

Switching on this control will also suppress display of the alert which is shown when there is a problem with a server's certificate. If the certificate for a server is invalid and this option is on, SiteSucker will not display the alert and will not download content from that server.

Ignore Robot Exclusions

Switch on this control to have SiteSucker ignore robots.txt exclusions, the Robots META tag, and the X-Robots-Tag HTTP header. See Robot Exclusions for more information about robots.txt, the Robots META tag, and the X-Robots-Tag HTTP header.

Note: SiteSucker always honors robots.txt directives aimed specifically at SiteSucker.

Warning: Ignoring robot exclusions is not recommended. Robot exclusions are usually put in place for a good reason and should be obeyed.

Always Download HTML and CSS

Switch on this control to have SiteSucker always download HTML and CSS files despite the File Replacement setting. This ensures, for instance, that SiteSucker downloads an up-to-date copy of a site when File Replacement is set to With Newer.

File Replacement

Use this control to specify when SiteSucker should replace existing files. You can choose from the following options:

  • Never - SiteSucker never replaces your local files and only downloads those files that haven't already been downloaded.
  • Always - SiteSucker always deletes your local files and replaces them with files downloaded from the Internet.
  • With Newer - SiteSucker only replaces existing files if a newer copy is found on the Internet.

Path Constraint

Use this control to limit downloaded files to those at a specific site, within a specific directory, or containing a specific path. This option works in conjunction with the Path settings. SiteSucker provides the following path constraints:

  • None - SiteSucker downloads the file specified in the URL text box and every file that it links to and every site that these files link to, etc. Be aware that this option could result in a HUGE download if allowed to continue forever.
  • Host - SiteSucker limits the download to those files on the host of the original file being downloaded. For example, if the URL is http://www.example.com/directory/home.html, this setting limits the download to those URLs beginning with http://www.example.com or https://www.example.com.
  • Subdomains - SiteSucker limits the download to those files within the second-level domain and all subdomains of the original file being downloaded. Extending the previous example, this setting will download URLs beginning with http://www.example.com, https://images.example.com, http://guide.example.com, or https://example.com.
  • Directory - SiteSucker only downloads those files that are within the directory of the original file being downloaded. Extending the previous example, this setting limits the download to those URLs beginning with http://www.example.com/directory/ or https://www.example.com/directory/.
  • Path Settings - SiteSucker only downloads the file specified in the URL text box and any files that have paths allowed by the Path settings.