Request.meta keys
Keys that can be defined in Request.meta for zyte-spider-templates.
seed
Default: The seed URL (or value) from which the request originated.
This key is used by OffsiteRequestsPerSeedMiddleware and MaxRequestsPerSeedDownloaderMiddleware.
The seed meta key tracks and identifies the origin of a request. It is initially set on every request that originates from a start request, and it can be used to apply domain constraints to subsequent requests. You can also set this key to an arbitrary value to identify the seed source.
Here’s an example:
meta = {
"seed": "http://example.com",
}
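As a rough illustration of how a seed value might be carried along a crawl, the sketch below copies the seed key from a parent request's meta into the meta of a follow-up request. It uses plain dictionaries rather than Scrapy objects, and the helper name follow_up_meta is hypothetical, not part of zyte-spider-templates:

```python
from typing import Any, Dict, Optional


def follow_up_meta(
    parent_meta: Dict[str, Any],
    extra: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Build meta for a follow-up request, inheriting the parent's seed.

    Hypothetical helper for illustration; not the library's own code.
    """
    meta: Dict[str, Any] = dict(extra or {})
    if "seed" in parent_meta:
        # Keep an explicitly provided seed; otherwise inherit the parent's.
        meta.setdefault("seed", parent_meta["seed"])
    return meta


start_meta = {"seed": "http://example.com"}
child_meta = follow_up_meta(start_meta, {"depth": 1})
print(child_meta)
```

A seed passed explicitly in extra takes precedence here, which mirrors the option of setting the key to an arbitrary user-chosen value.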
is_seed_request
Default: False
This key is used by OffsiteRequestsPerSeedMiddleware.
The is_seed_request meta key is a boolean flag that identifies whether the request is a start request (i.e., originating from the initial seed URL). When set to True, the middleware extracts seed domains from the response.
Here’s an example:
meta = {"is_seed_request": True}
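To make the idea of extracting seed domains more concrete, here is a minimal sketch that collects the hostnames of an initial URL and of any URLs it redirected to. The function name domains_from_urls and the sample redirect chain are assumptions made for illustration; the middleware's actual extraction logic may differ:

```python
from typing import Iterable, List
from urllib.parse import urlparse


def domains_from_urls(urls: Iterable[str]) -> List[str]:
    """Return the unique hostnames of the given URLs, in first-seen order.

    Hypothetical helper for illustration; not the middleware's own code.
    """
    domains: List[str] = []
    for url in urls:
        host = urlparse(url).hostname
        # Skip strings without a hostname and avoid duplicates.
        if host and host not in domains:
            domains.append(host)
    return domains


# Assumed redirect chain: the initial URL followed by the final URL.
visited = ["http://example.com/", "https://www.example.com/home"]
print(domains_from_urls(visited))
```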
seed_domains
Default: The domains of the initial URL and of any redirected URLs
This key is used by OffsiteRequestsPerSeedMiddleware.
The seed_domains meta key is a list of domains that the middleware uses to check whether a request belongs to one of them. By default, the list contains the domain of the initial URL and, if the request was redirected, the domains of the redirected URLs. You can also set this list in the spider to specify additional domains for which the middleware should allow requests.
Here’s an example:
meta = {"seed_domains": ["example.com", "another-example.com"]}
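The kind of membership check described above could look roughly like the following sketch. Treating subdomains as belonging to a listed domain is an assumption made here for illustration, and is_within_seed_domains is a hypothetical name rather than a function provided by zyte-spider-templates:

```python
from typing import Iterable
from urllib.parse import urlparse


def is_within_seed_domains(url: str, seed_domains: Iterable[str]) -> bool:
    """Return True if the URL's host equals, or is a subdomain of, a seed domain.

    Subdomain matching is an assumption of this sketch.
    """
    host = urlparse(url).hostname or ""
    return any(
        host == domain or host.endswith("." + domain)
        for domain in seed_domains
    )


meta = {"seed_domains": ["example.com", "another-example.com"]}
print(is_within_seed_domains("https://shop.example.com/item", meta["seed_domains"]))
print(is_within_seed_domains("https://notexample.com/", meta["seed_domains"]))
```

Matching on a leading dot keeps look-alike hosts such as notexample.com from being accepted as part of example.com.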
only_feeds
Default: False
This key is used by OnlyFeedsMiddleware.
The only_feeds meta key is a boolean flag that identifies whether the spider should discover all links on the website or extract links from RSS/Atom feeds only.
Here’s an example:
meta = {"page_params": {"only_feeds": True}}