Optimizations, bug fixes, command line arguments, out of scope paths and domaisn, and headless browser support #91

lkotlus · 2024-08-02T22:22:30Z

Feature Description

Optimizations: some general things, and implemented a hash set that prevents the same url from being visited multiple times, which frequently lead to infinite crawling.
Bug fixes: potential issues with how the domain restrictions were being handled.
Out of scope paths and domains: users can now enter domains and paths that are out of the scope of the scan (useful for pentests).
Headless browser support: the ability to use a headless browser rather than just requests.get() when making requests. This is more thorough, as dynamic content of the site is accessed due to the web page actually being rendered. This does lead to longer waits, but can be worth it depending on how the target site is put together. In the future, user-like interaction with the site can be implemented. This feature was implemented using selenium.

Checklist

I wrote at least some documentation for this feature.

Checklist

This Pull will not add the same thing as another currently-open request.
Your Pull was made against the rivermont:dev branch and not rivermont:master.
This Pull does not commit any keys, passwords, personal data, or other private information.
I updated lines 20 and 21 in the README to reflect any changed I made.

…er and requirements

rivermont and others added 25 commits November 1, 2021 17:01

Use f-strings instead of .format()

3a62f5e

Remove stray parenthesis.

59e124d

Remove obselete configs.

15d4e8c

Adding argparse stuff

a242c7c

Basic outline of out of scope options

584a6ac

Add out of scope functionality and adjust the restricted domain logic

043a834

Fix my wording on out of scope stuff

cb0e33e

Fix syntax error (I am a programming genius)

69e4255

Fix some of my logic

6155f7b

If the argument is used, don't go looking for user input

251230d

Check if OUT_OF_SCOPE was set

0b109c7

Scratch that previous commit...

da4d2c7

Optimize by preventing multiple checks of the same URL

91080bc

Fix some globals and whatnot

2ebe062

Update config files, add selenium (import only, no code yet) to crawl…

305ee30

…er and requirements

Fix imports

2ccf5b1

This should work

6b9f1b8

Fix interceptor function

476ccc0

Bug fixes and testing

b05e006

Fix requirements

cb4f856

Update docs and fix comments

e417d34

Contributors

1456694

Remove unnecesary print

62b4668

KNOWN_ERROR_COUNT referenced before assignment fixed.

1547563

Add maximum time

b37fd41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimizations, bug fixes, command line arguments, out of scope paths and domaisn, and headless browser support #91

Optimizations, bug fixes, command line arguments, out of scope paths and domaisn, and headless browser support #91

Uh oh!

lkotlus commented Aug 2, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Optimizations, bug fixes, command line arguments, out of scope paths and domaisn, and headless browser support #91

Are you sure you want to change the base?

Optimizations, bug fixes, command line arguments, out of scope paths and domaisn, and headless browser support #91

Uh oh!

Conversation

lkotlus commented Aug 2, 2024

Feature Description

Checklist

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants