Releases: apify/crawlee-python
0.6.9 (2025-05-02)
🚀 Features
- Add an internal `HttpClient` to be used in `send_request` for `PlaywrightCrawler` using `APIRequestContext` bound to the browser context (#1134) (e794f49) by @Mantisus
- Make timeout error log cleaner (#1170) (78ea9d2) by @Pijukatel
- Add `on_skipped_request` decorator, to process links skipped according to `robots.txt` rules (#1166) (bd16f14) by @Mantisus
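To illustrate what skipping links according to `robots.txt` rules means, here is a minimal sketch that uses only the Python standard library (`urllib.robotparser`). It is not Crawlee's implementation and does not use the Crawlee API; the `split_by_robots` helper, the sample rules, and the example URLs are all hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, parsed from a string so no network access is needed.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def split_by_robots(urls, robots_txt, user_agent="*"):
    """Split URLs into (allowed, skipped) according to the given robots.txt rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    allowed, skipped = [], []
    for url in urls:
        if parser.can_fetch(user_agent, url):
            allowed.append(url)
        else:
            # A crawler could hand these URLs to a "skipped request" callback.
            skipped.append(url)
    return allowed, skipped


urls = [
    "https://example.com/",
    "https://example.com/private/report",
    "https://example.com/blog/post",
]
allowed, skipped = split_by_robots(urls, ROBOTS_TXT)
print("allowed:", allowed)
print("skipped:", skipped)
```

In a crawler, the `skipped` list corresponds to the links that a skipped-request handler would receive, for example to log them or store them for later review.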
🐛 Bug Fixes
0.6.8 (2025-04-25)
🚀 Features
- Handle unprocessed requests in `add_requests_batched` (#1159) (7851175) by @Pijukatel
- Add `respect_robots_txt_file` option (#1162) (c23f365) by @Mantisus
🐛 Bug Fixes
- Update `UnprocessedRequest` to match actual data (#1155) (a15a1f3) by @Pijukatel
- Fix the order in which cookies are saved to the `SessionCookies` and the handler is executed for `PlaywrightCrawler` (#1163) (82ff69a) by @Mantisus
- Call `failed_request_handler` for `SessionError` when session rotation count exceeds maximum (#1147) (b3637b6) by @Mantisus
0.6.7
0.6.6
0.6.5
0.6.4
0.6.3
0.6.2 (2025-03-05)
🚀 Features
- Extend ErrorTracker with error grouping (#1014) (561de5c) by @Pijukatel
0.6.1 (2025-03-03)
🐛 Bug Fixes
- Add `browserforge` to mandatory dependencies (#1044) (ddfbde8) by @Pijukatel
0.6.0 (2025-03-03)
🚀 Features
- Integrate browserforge fingerprints (#829) (2b156b4) by @Pijukatel
- Add AdaptivePlaywrightCrawler (#872) (5ba70b6) by @Pijukatel
- Implement `_snapshot_client` for `Snapshotter` (#957) (ba4d384) by @Mantisus
- Add adaptive context helpers (#964) (e248f17) by @Pijukatel
- [breaking] Enable additional status codes arguments to PlaywrightCrawler (#959) (87cf446) by @Pijukatel
- Replace `HeaderGenerator` implementation by `browserforge` implementation (#960) (c2f8c93) by @Pijukatel
🐛 Bug Fixes
- Fix playwright template and dockerfile (#972) (c33b34d) by @janbuchar
- Fix installing dependencies via pip in project template (#977) (1e3b8eb) by @janbuchar
- Fix default migration storage (#1018) (6a0c4d9) by @Pijukatel
- Fix logger name for http based loggers (#1023) (bfb3944) by @Pijukatel
- Remove allow_redirects override in CurlImpersonateHttpClient (#1017) (01d855a) by @2tunnels
- Remove follow_redirects override in HttpxHttpClient (#1015) (88afda3) by @2tunnels
- Fix flaky test_common_headers_and_user_agent (#1030) (58aa70e) by @Pijukatel
Refactor
- [breaking] Remove unused config properties (#978) (4b7fe29) by @vdusek
- [breaking] Remove Base prefix from abstract class names (#980) (8ccb5d4) by @vdusek
- [breaking] Change default `incognito context` to `persistent context` for `Playwright` (#985) (f01520d) by @Mantisus
- [breaking] Change `Session` cookies from `dict` to `SessionCookies` with `CookieJar` (#984) (6523b3a) by @Mantisus
- [breaking] Replace enum with literal for `EnqueueStrategy` (#1019) (d2481ef) by @vdusek
- [breaking] Update status code handling (#1028) (6b59471) by @Mantisus
- [breaking] Move `cli` dependencies to optional dependencies (#1011) (4382959) by @Mantisus
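As a rough illustration of the "replace enum with literal" change for `EnqueueStrategy`, the sketch below contrasts the two styles using only the standard library. The member names and string values are assumptions made for the example and may not match the library's actual definition; `EnqueueStrategyEnum` and `validate_strategy` are hypothetical helpers.

```python
from enum import Enum
from typing import Literal, get_args


# Old style (hypothetical): callers import the enum and pass a member.
class EnqueueStrategyEnum(str, Enum):
    ALL = "all"
    SAME_DOMAIN = "same-domain"
    SAME_HOSTNAME = "same-hostname"
    SAME_ORIGIN = "same-origin"


# New style (assumed values): callers pass a plain string,
# and a type checker verifies it against the Literal alias.
EnqueueStrategy = Literal["all", "same-domain", "same-hostname", "same-origin"]


def validate_strategy(value: str) -> str:
    """Runtime check that mirrors what a type checker enforces statically."""
    if value not in get_args(EnqueueStrategy):
        raise ValueError(f"Unknown enqueue strategy: {value!r}")
    return value


# Migration: EnqueueStrategyEnum.SAME_DOMAIN becomes the string "same-domain".
old_value = EnqueueStrategyEnum.SAME_DOMAIN.value
new_value = validate_strategy("same-domain")
print(old_value == new_value)
```

With a literal type, call sites no longer need an extra import, which is one common motivation for this kind of change; code that referenced enum members has to switch to the string values.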