A Bash script to generate a custom wordlist for a target domain by combining words scraped from the live site using CeWL and historical URLs fetched from the Wayback Machine using waybackurls.
This is useful for directory brute-forcing, parameter fuzzing, or general reconnaissance.
- Scrapes visible text from the target site using CeWL
- Extracts keywords from archived URLs using waybackurls
- Cleans and merges wordlists
- Outputs a unique, sorted wordlist ready for use
Make sure the following tools are installed and accessible :
Install using
sudo gem install cewlgo install github.com/tomnomnom/waybackurls@latestchmod +x wordlistgenerator.sh
./wordlistgenerator.sh https://example.com