Replies: 2 comments 4 replies
-
Hi, I would like to work on this issue if no one else is actively fixing it. |
Beta Was this translation helpful? Give feedback.
-
@oxqnd Thanks for the help! I'm not entirely sure about the issue itself. @mrschyte, by default, crawling is scoped to the input domain to keep the output relevant.
Or, if you want to disable host-based scoping entirely:
Let me know if these options don’t cover what you're trying to achieve. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
katana version:
[INF] Current version: v1.2.1
Current Behavior:
Katana only checks the currently crawled domain when evaluating dn, rdn, fqdn checks in the scope manager.
This means, that if two URLs are in scope: (https://a.com, https://b.com) and there are cross domain links such as https://a.com/b -> https://b.com/resourceB or https://b.com/a -> https://a.com/resourceA, katana will fail to detect both resourceA and resourceB.
Expected Behavior:
Katana should loop through all URLs read through stdin and from the URLs list / file and check if any of the URLs match the dn, rdn or fqdn based on the current mode.
Steps To Reproduce:
Run katana with two input urls, where there are cross-domain links. Katana will fail to list the linked resources.
Anything else:
Beta Was this translation helpful? Give feedback.
All reactions