This issue is for tracking the ongoing development of BBOT's web spider capabilities.
BBOT's web spider is already solid: it easily crawls websites and extracts URLs, JS links, etc. How it performs relative to ProjectDiscovery's Katana has not yet been benchmarked.
Here are some things we can do to build on BBOT's feature set, to make it a best-in-class web spider:
Also, consolidating and rustifying excavate, along with its custom rule integration, will let us spider at scale with the highest possible performance.
Why not a Katana module?
While a Katana module would be easy to write, it wouldn't be ideal for two main reasons:
- BBOT is already recursive, and introducing another recursive tool is likely to have unintended side effects. Examples include infinite recursion bugs, visiting the same URL multiple times, or putting heavy stress on the target.
- Many of Katana's features are already included in BBOT, including configurable web spider settings, URL extraction, and custom rules to search HTTP responses.
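As a conceptual illustration of the "custom rules to search HTTP responses" idea above, here is a minimal, self-contained Python sketch. This is not BBOT's actual implementation (excavate uses YARA rules internally); the regex and function names are assumptions made for demonstration only.

```python
import re

# Simplified stand-in for a custom extraction rule: scan an HTTP
# response body for URLs. Real excavate rules are YARA-based and far
# more robust; this only demonstrates the general technique.
URL_REGEX = re.compile(r"https?://[a-zA-Z0-9.-]+(?:/[^\s\"'<>]*)?")

def extract_urls(response_body: str) -> list[str]:
    """Return all URLs found in an HTTP response body."""
    return URL_REGEX.findall(response_body)

if __name__ == "__main__":
    html = (
        '<a href="https://evilcorp.com/login">Login</a>'
        '<script src="https://cdn.evilcorp.com/app.js"></script>'
    )
    print(extract_urls(html))
```

In a recursive scanner, each extracted URL would be emitted as a new event and fed back into the crawl queue, which is why deduplication and recursion limits matter so much.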
Therefore, the best approach is to polish BBOT's existing spider feature set to make it more effective and user-friendly.
Relevant: