A Python script to retrieve a list of root domains based on specific extensions (.id
, .co.id
, .ac.id
, etc.) from the Archive.org CDX API.
- Fetch domain lists from web.archive.org CDX API.
- Supports multiple extensions at once (comma-separated).
- Configurable target number of root domains per extension.
- Automatically rotates User-Agent to avoid rate limits.
- Interactive, colorful CLI using Rich.
- Automatically saves results to
.txt
files with timestamps.
Make sure Python 3.8+ is installed.
-
Clone this repository:
git clone https://github.com/bimantaraz/domain-grabber.git cd domain-grabber
-
Install dependencies:
pip install -r requirements.txt
Run:
python grabber.py
Example input:
Extensions: id, co.id, ac.id
Number of root domains per extension: 50
Cautious mode (add small delay)? [Y/n]: y
The output will be saved as:
grab_id_20250812_153045.txt
grab_co.id_20250812_153045.txt
grab_ac.id_20250812_153045.txt
abc.ac.id
def.ac.id
example.co.id
universitas.ac.id
requests>=2.31.0
rich>=13.7.1
urllib3>=2.2.2
- Use specific extensions (
co.id
,ac.id
,go.id
) to reduce the risk of rate-limiting. - If you encounter 403 or 429, try again after a few minutes or lower the target number.
- Archive.org CDX API does not always return complete results, so output may vary between runs.