Advanced |
![]() |
The Advanced tab in the Download Settings dialog provides the following settings: Check All LinksCheck this box to have SiteSucker check all links in all downloaded HTML files — including links to files that you are not downloading — and log any errors that occur. With this option turned on, SiteSucker will report many errors that you normally wouldn't see. This setting is intended as a debugging tool for Web designers who want to see if their own sites have any bad links. Export External LinksCheck this box to have SiteSucker export external links to an HTML page. This option allows you to download a site with one set of download settings and then download other sites that are linked to the original site with a different set of download settings. With this option turned on, SiteSucker creates a file named "_ExternalLinks.html" in the folder of the original site being downloaded. This page contains links to files the weren't on the original site and weren't downloaded. To use this setting:
Only Follow Image LinksCheck this box to have SiteSucker only follow image links, i. e., links that you would navigate in a Web browser by clicking on an image. Assume Ambiguous URLs Are FilesCheck this box to have SiteSucker treat ambiguous URLs as files. If a URL does not end with a '/' and the last path component does not have a file extension, SiteSucker considers it to be ambiguous. When this option is off, SiteSucker adds a '/' to the end of ambiguous URLs. Save Web URL as Spotlight CommentCheck this box to have SiteSucker store the Web URL of each downloaded file in the file's Spotlight Comments field. Download AttemptsUse this control to specify the number of times SiteSucker should attempt to download a file. SiteSucker will only retry downloading a file if a timeout error occurs. Download TimeoutUse this control to select the length of time that SiteSucker should wait for a response from the server. Download DelayUse this control to specify the length of time that SiteSucker should delay before it downloads a file. This feature can allow you to download sites while using very little bandwidth and can help avoid anti-mining safeguards employed by some sites. The delay can be set to None or to a fixed range of values (e.g., 20 - 40 seconds). If you select None, SiteSucker downloads the site as quickly as possible. If you select a delay range, SiteSucker will add a random delay (within the selected range) before it downloads a file. Furthermore, if a delay is specified, SiteSucker will only use a single active connection to download files since the whole purpose of using multiple connections is to reduce delays. IdentityUse this control to customize the way SiteSucker identifies itself when making a request. Some sites are very particular about which browsers they will allow. You can you use this feature to "fool" the site into thinking that you are using an approved browser. To change SiteSucker's identity, simply click on this control and select one of the Web browsers listed. (If you choose None, SiteSucker will not include any identifying information when making requests.) You can customize the list of available Web browsers by editing the user agent property list in your home folder at ~/Library/Application Support/SiteSucker/UserAgent.plist. |