Changes

0.7.2 (2024-05-07)

0.7.1 (2024-02-22)

0.7.0 (2024-02-09)

  • Updated requirement versions:

  • With the updated dependencies above, the spider no longer sends two separate Zyte API requests (productNavigation and httpResponseBody) for the same URL. This issue only occurred when requesting product navigation pages.

  • Moved zyte_spider_templates.spiders.ecommerce.ExtractFrom into zyte_spider_templates.spiders.base.ExtractFrom.
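
Code that must import ExtractFrom across releases on both sides of this move could try the new location first and fall back to the old one. The following is a minimal sketch, not part of the library: import_first and load_extract_from are hypothetical helper names, and only the two module paths come from the entry above.

```python
import importlib


def import_first(name, module_paths):
    """Return attribute `name` from the first listed module that provides it.

    Hypothetical helper for illustration; not part of zyte-spider-templates.
    """
    for path in module_paths:
        try:
            module = importlib.import_module(path)
        except ImportError:
            continue  # module not importable in this environment; try the next path
        if hasattr(module, name):
            return getattr(module, name)
    raise ImportError(f"{name} not found in any of: {', '.join(module_paths)}")


def load_extract_from():
    # New location first (0.7.0 and later), then the pre-0.7.0 location.
    return import_first(
        "ExtractFrom",
        [
            "zyte_spider_templates.spiders.base",
            "zyte_spider_templates.spiders.ecommerce",
        ],
    )
```

Projects that only target 0.7.0 or later can skip the helper and import directly from zyte_spider_templates.spiders.base.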

0.6.1 (2024-02-02)

  • Improved the zyte_spider_templates.spiders.base.BaseSpiderParams.url description.

0.6.0 (2024-01-31)

0.5.0 (2023-12-18)

  • The zyte_spider_templates.page_objects module is now deprecated in favor of zyte_spider_templates.pages, in line with web_poet.pages.

0.4.0 (2023-12-14)

  • Products outside of the target domain can now be crawled using zyte_spider_templates.middlewares.AllowOffsiteMiddleware.

  • Updated the documentation to also set up zyte_common_items.ZyteItemAdapter.

  • The max_requests spider parameter now has a default value of 100. Previously, the default was None, which meant no limit.

  • Improved the description of the max_requests spider parameter.

  • Added official support for Python 3.12.

  • Miscellaneous documentation improvements.
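
As a rough illustration of the offsite change in this release, a Scrapy project could swap the built-in offsite spider middleware for AllowOffsiteMiddleware in its settings. This is a sketch under assumptions: it assumes a Scrapy version where offsite filtering is a spider middleware, and the order value 500 is chosen only because it matches the built-in middleware's default slot. Check the project's setup documentation for the values that apply to your installed versions.

```python
# settings.py (fragment), a sketch rather than the documented configuration
SPIDER_MIDDLEWARES = {
    # Disable Scrapy's built-in offsite filtering so it does not drop
    # requests that leave the target domain.
    "scrapy.spidermiddlewares.offsite.OffsiteMiddleware": None,
    # Enable the replacement from zyte-spider-templates.
    # The order value 500 is an assumption (the built-in's default slot).
    "zyte_spider_templates.middlewares.AllowOffsiteMiddleware": 500,
}
```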

0.3.0 (2023-11-03)

  • Added documentation.

  • Added a middleware that logs information about the crawl in JSON format, zyte_spider_templates.middlewares.CrawlingLogsMiddleware. This replaces the old crawling log output, which was difficult to parse with regular expressions.

0.2.0 (2023-10-30)

  • Now requires zyte-common-items >= 0.12.0.

  • Added a new crawl strategy, “Pagination Only”.

  • Improved the request priority calculation based on the metadata probability value.

  • CI improvements.

0.1.0 (2023-10-24)

Initial release.