HelpUiDialogsOptionsSpider
This screen allows you to configure the spider options.
Note that modifying most of these options also affects any Spider scan that is currently running.
The "Maximum depth to crawl" option defines the maximum depth in the crawling process at which a page must be found for it to be processed. Resources found deeper than this level are not fetched or parsed by the spider. A value of zero means unlimited depth.
The depth is calculated starting from the seeds, so if a Spider scan starts with only a single URL (e.g. a manually specified URL), the depth is calculated from that one. However, if the scan starts with multiple seeds (e.g. recursing a Sites tree node with children), a resource is processed if its depth relative to any of the seeds is less than the defined maximum.
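A minimal breadth-first sketch of this depth rule, not ZAP's actual implementation: seeds sit at depth 0, and the `fetch_links` callable stands in for the real fetch-and-parse step.

```python
from collections import deque

def crawl(seeds, fetch_links, max_depth=0):
    """Process a page only if its depth relative to some seed is within
    max_depth; max_depth == 0 means unlimited, as in the option above."""
    processed = set()
    queue = deque((seed, 0) for seed in seeds)
    while queue:
        url, depth = queue.popleft()
        if url in processed:
            continue  # BFS pops the smallest depth first, matching
        processed.add(url)  # "depth relative to any of the seeds"
        if max_depth and depth >= max_depth:
            continue  # links found on this page would exceed the limit
        for link in fetch_links(url):
            queue.append((link, depth + 1))
    return processed

# A tiny fake site: /a links to /b, /b links to /c.
links = {"/a": ["/b"], "/b": ["/c"], "/c": []}
print(crawl(["/a"], links.get, max_depth=1))  # {'/a', '/b'}: /c is too deep
```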
The spider is multi-threaded, and the "Number of threads used" option defines the maximum number of worker threads used in the crawling process. Changing this parameter does not affect any crawl already in progress.
The "Maximum duration" option sets the maximum length of time that the spider should run for, measured in minutes. Zero (the default) means that the spider will run until it has found all of the links that it is able to.
The "Maximum children to crawl" option limits the number of children that will be crawled at every node in the tree. This is useful for data-driven applications that have large numbers of "pages" which are in fact exactly the same code but contain different data, for example from a database. By default this is set to zero, which means no limit is applied to the number of child nodes crawled.
The "Maximum parse size" option defines the maximum size, in bytes, of a response that will be parsed. This allows the spider to skip big responses/files.
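These limits can also be set through the ZAP API instead of this screen. The snippet below is a sketch using the official Python client (`python-owasp-zap-v2.4`); the exact setter names, API key, and proxy address are assumptions based on that client's conventions, so verify them against your ZAP version.

```python
from zapv2 import ZAPv2

# Assumes a local ZAP instance listening on 127.0.0.1:8080 with the API
# enabled; replace the key with your own.
zap = ZAPv2(apikey="changeme",
            proxies={"http": "http://127.0.0.1:8080",
                     "https": "http://127.0.0.1:8080"})

zap.spider.set_option_max_depth(5)         # 0 = unlimited depth
zap.spider.set_option_thread_count(4)      # worker threads
zap.spider.set_option_max_duration(10)     # minutes; 0 = no time limit
zap.spider.set_option_max_parse_size_bytes(2621440)  # skip larger responses

# The child limit can also be passed per scan:
scan_id = zap.spider.scan("http://www.example.org", maxchildren=10)
```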
The "Domains Always in Scope" option allows you to manage the domains, as string literals or regular expressions, that are in the spider's scope. The normal behaviour of the spider is to only follow links to resources found on the same domain as the page where the scan started. However, this option allows you to define additional domains that are considered "in scope" during the crawling process. Pages on these domains are processed during the scan.
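A simplified illustration of that scope check, not ZAP's actual matcher: a domain is in scope if it equals one of the string literals or matches one of the regular expressions.

```python
import re

LITERALS = {"example.org", "static.example.org"}
PATTERNS = [re.compile(r".*\.example\.com")]

def in_scope(domain: str) -> bool:
    # String literals match exactly; patterns must match the whole domain.
    return domain in LITERALS or any(p.fullmatch(domain) for p in PATTERNS)

print(in_scope("static.example.org"))  # True  (string literal)
print(in_scope("cdn.example.com"))     # True  (regular expression)
print(in_scope("evil.example.net"))    # False (out of scope)
```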
When crawling, the Spider has an internal mechanism that marks which pages have already been visited, so they are not processed again. The "Query parameters handling" option sets how URI parameters are handled when this check is made. There are three available settings; the sketch after this list illustrates how each affects the "already visited" check:
- Ignore parameters completely - if www.example.org/?bar=456 is visited, then www.example.org/?foo=123 will not be visited
- Consider only parameter's name (ignore parameter's value) - if www.example.org/?foo=123 is visited, then www.example.org/?foo=456 will not be visited, but www.example.org/?bar=789 or www.example.org/?foo=456&bar=123 will be visited
- Consider both parameter's name and value - if www.example.org/?foo=123 is visited, any other URI that is different (including, for example, www.example.org/?foo=456 or www.example.org/?bar=abc) will be visited
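A toy version of that "already visited" key, purely illustrative and not ZAP's internal representation: two URLs are considered the same page when they produce the same key under the chosen policy.

```python
from urllib.parse import urlparse, parse_qsl

def visited_key(url: str, policy: str) -> str:
    """Build the 'already visited' key for a URL under each policy above."""
    parts = urlparse(url)
    base = parts.scheme + "://" + parts.netloc + parts.path
    params = parse_qsl(parts.query, keep_blank_values=True)
    if policy == "ignore":   # ignore parameters completely
        return base
    if policy == "names":    # consider names, ignore values
        return base + "?" + "&".join(sorted(name for name, _ in params))
    # consider both names and values
    return base + "?" + "&".join(f"{n}={v}" for n, v in sorted(params))

# Under the 'names' policy these collapse to the same key, so only the
# first would be crawled:
print(visited_key("http://www.example.org/?foo=123", "names"))
print(visited_key("http://www.example.org/?foo=456", "names"))
```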
The "Send 'Referer' header" option defines whether the spider requests should be sent with the "Referer" header.
The "Accept cookies" option defines whether the spider scans should accept cookies while spidering. If enabled, the Spider will properly handle any cookies received from the server and will send them back in subsequent requests; this, for example, allows the Spider to use the same session throughout a spidering scan. If disabled, the Spider will not send any cookies in its requests. Cookies are not shared between spider scans: each scan has its own cookie jar. This option has low priority; the Spider will respect other (global) options related to the HTTP state, so it is ignored if, for example, the Enable (Global) HTTP State option is enabled, when spidering as a User, or when an HTTP Session is active.
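The per-scan cookie jar described above can be pictured with the standard library, as a rough analogy rather than ZAP's implementation: each scan gets a fresh, independent jar.

```python
import http.cookiejar
import urllib.request

def new_scan_opener():
    """One opener per scan: cookies received during a scan are replayed
    within that scan only, never shared with other scans."""
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    return opener, jar

opener_a, jar_a = new_scan_opener()  # scan A's cookies accumulate here
opener_b, jar_b = new_scan_opener()  # scan B starts with an empty jar
```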
The "Process forms" option defines the behaviour of the spider when it encounters HTML forms during the crawling process. If disabled, HTML forms are not processed at all. If enabled, HTML forms whose method is HTTP GET are submitted with some generated values. The behaviour when encountering forms whose method is HTTP POST is configured by the next option.
As briefly described in the previous paragraph, the "POST forms" option configures the behaviour of the spider when "Process forms" is enabled and the HTML forms encountered have to be submitted via HTTP POST.
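A toy stand-in for that GET-form handling, assuming a trivial page and placeholder generated values; ZAP's own form processing is more involved.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode, urljoin

class FormParser(HTMLParser):
    """Collects each form's action, method, and input names."""
    def __init__(self):
        super().__init__()
        self.forms = []
    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self.forms.append({"action": attrs.get("action", ""),
                               "method": attrs.get("method", "get").lower(),
                               "fields": []})
        elif tag == "input" and self.forms and attrs.get("name"):
            self.forms[-1]["fields"].append(attrs["name"])

def form_request(base_url, form, generated_value="1"):
    """URL a GET form would be submitted to, with generated values."""
    query = urlencode({name: generated_value for name in form["fields"]})
    return urljoin(base_url, form["action"]) + "?" + query

parser = FormParser()
parser.feed('<form action="/search" method="get"><input name="q"></form>')
print(form_request("http://www.example.org/", parser.forms[0]))
# http://www.example.org/search?q=1
```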
The "Parse comments" option defines whether the spider should also search HTML comments for links to resources. Only resources found in valid HTML tags within comments will be processed.
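A toy version of that behaviour, not ZAP's parser: look inside HTML comments for tags that carry links.

```python
import re

HTML = '<!-- TODO remove: <a href="/old-admin">admin</a> --><p>visible</p>'

# First isolate the comments, then pull href values out of tags inside them.
comments = re.findall(r"<!--(.*?)-->", HTML, re.S)
links = [m for c in comments for m in re.findall(r'href="([^"]+)"', c)]
print(links)  # ['/old-admin']
```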
The "Parse robots.txt" option defines whether the spider should also parse the robots.txt files found on websites, searching for links to resources. This option does not make the spider follow the rules imposed by the robots.txt file.
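A small sketch of that idea, assuming a hypothetical robots.txt: the paths and URLs mentioned in it are harvested as crawl candidates, while the rules themselves are deliberately not enforced.

```python
ROBOTS = """User-agent: *
Disallow: /private/
Allow: /private/public-report
Sitemap: https://www.example.org/sitemap.xml
"""

# Harvest everything after Allow/Disallow/Sitemap as potential resources.
candidates = [line.split(":", 1)[1].strip()
              for line in ROBOTS.splitlines()
              if line.lower().startswith(("allow:", "disallow:", "sitemap:"))]
print(candidates)
# ['/private/', '/private/public-report', 'https://www.example.org/sitemap.xml']
```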
The "Handle OData-specific parameters" option defines whether the spider should try to detect OData-specific parameters (i.e. resource identifiers) in order to process them properly, according to the rule defined by the "Query parameters handling" option.
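OData encodes resource identifiers in parentheses within the path, e.g. /Products(1) or /Orders(OrderID=10248). The detector below is a toy illustration of spotting such segments, not ZAP's actual handling.

```python
import re

# Matches an entity name followed by a parenthesised key, e.g. Products(1).
ODATA_ID = re.compile(r"^(?P<entity>\w+)\((?P<key>[^)]*)\)$")

for segment in ["Products(1)", "Orders(OrderID=10248)", "plain-page"]:
    m = ODATA_ID.match(segment)
    print(segment, "->", m.groupdict() if m else "not an OData identifier")
```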
See also:

| | |
| --- | --- |
| UI Overview | for an overview of the user interface |
| Spider | for an overview of the Spider |
| Spider Tab | for an overview of the Spider tab |