A Bash script that extracts unique words from URLs, useful for building custom wordlists for penetration testing or data analysis.
- URL Parsing: Extracts path segments after the domain (third /).
- Extension Removal: Strips common file extensions (e.g., .js, .php, .html).
- Duplicate Filtering: Automatically removes duplicate entries.
- Input/Output Support: Specify custom input and output files.
- User-Friendly: Includes a help menu and execution banner.
- Clone the Repository:

      git clone https://github.com/siuxsa/Wordlist-Generator.git
      cd Wordlist-Generator

- Make the Script Executable:

      chmod +x wordlist maker.sh

- Usage

      ./wordlist maker.sh -l <input_file> -o <output_file>
- Extract Paths: Removes the domain (everything before the third /).
- Remove Extensions: Strips file extensions like .js, .jpg, etc.
- Split Words: Breaks path segments into words using /, -, and _ as delimiters.
- Deduplicate: Sorts and removes duplicates for a clean output.
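
The four steps above can be sketched as a short pipeline. This is a minimal illustration, not the repository's actual script: the function name extract_words is hypothetical, and the sketch assumes one URL per line and that the extension is the final dot-suffix of the path.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of the four steps; not the repository's actual code.
# Reads URLs on stdin (one per line) and prints unique words, one per line.
extract_words() {
  # 1. Extract paths: keep everything after the third "/".
  #    -s skips lines that contain no "/" at all.
  cut -s -d/ -f4- |
    # 2. Remove extensions: drop a trailing ".ext" (assumed to be alphanumeric).
    sed -E 's/\.[A-Za-z0-9]+$//' |
    # 3. Split words: turn "/", "_" and "-" into newlines, squeezing repeats.
    tr -s '/_-' '\n\n\n' |
    # Drop any empty line left over from a URL with no path.
    sed '/^$/d' |
    # 4. Deduplicate: sort and keep one copy of each word.
    LC_ALL=C sort -u
}
```

For example, feeding it https://example.com/admin/login.php, https://example.com/user-profile/edit_settings.html and https://example.com/admin/app.js yields admin, app, edit, login, profile, settings and user, one per line. Query strings and fragments are not handled in this sketch.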