Web to Kindle Telegram Bot

A Telegram bot that allows users to scrape web articles from any website, convert them to Kindle-compatible EPUB files, and send them directly to their Kindle devices via email. Substack articles are specially optimized for the best reading experience.

🚀 Try the Deployed Bot

You can try the bot immediately without any setup at: @SubstackKindleBot

Or deploy your own instance using the instructions below.

Quick Start

Option 1: Use the Deployed Bot (Easiest)

Visit @SubstackKindleBot on Telegram
Send /start to begin
Configure your Kindle email with /config
Send any article URL to convert and receive on your Kindle

Option 2: Use the CLI Tool (For Developers)

git clone https://github.com/gunneone/web-to-kindle-telegram-bot.git
cd web-to-kindle-telegram-bot
pip install -r requirements.txt
python cli.py https://your-article-url

Option 3: Deploy Your Own Bot

Follow the full installation instructions below.

Features

Multiple interfaces: Use via Telegram bot or command-line interface (CLI)
Web scraper to extract content from any website article, with special optimizations for Substack articles that remove unnecessary elements like buttons and ads.
Converts extracted content to an EPUB format compatible with Kindle devices.
Integrates with a Telegram bot for an intuitive user interface.
Sends the EPUB file to the user's Kindle email address.
Allows configuration of Kindle email within the bot.
Configurable image link preservation: Toggle whether to preserve clickable links on images (useful for zoom functionality).
Automatic file cleanup: Monitors disk space and automatically deletes old HTML and EPUB files when space is low (below 2GB) to prevent storage issues while keeping recent files for debugging.

Installation

Prerequisites

Python 3.12 or higher
SMTP server access: Required to send emails to Kindle devices.
Telegram Bot Token: Obtain from the Telegram BotFather.

Steps

Clone the Repository:

git clone https://github.com/gunneone/web-to-kindle-telegram-bot.git
cd web-to-kindle-telegram-bot

Set Up the Python Environment:

Use pyenv to ensure you're using the correct Python version.
```
pyenv install 3.12.11
pyenv local 3.12.11
pip install -r requirements.txt
```

Configuration:

Copy the example .env file and fill in your configuration.

cp .env.example .env

Populate .env with:

# Telegram Bot Token
TELEGRAM_TOKEN=YOUR:BOT_TOKEN

# Email configuration
EMAIL_FROM=your@email.domain
EMAIL_PASSWORD=your-email-password

# SMTP configuration
SMTP_HOST=your_smtp_server
SMTP_PORT=587

Run the Bot:

Option A: Direct Python execution
```
python telegram_bot.py
```
Option B: Using Docker Compose (Recommended for production)
```
docker compose up -d
```
This will start both the Telegram bot and the disk monitoring service that automatically manages old files to prevent storage issues.

Usage

1. Start the Bot:

Use /start to initialize the bot and view a welcome message.

2. Configure Kindle Email:

Use /config to set or update your Kindle email. You will be prompted to input an email address ending in @kindle.com.

3. Configure Settings:

Use /settings to view your current configuration, including:

Kindle email address
Image link preservation setting

4. Toggle Image Link Preservation:

Use /imagelinks to toggle whether clickable links on images are preserved in the EPUB. When enabled, you can click on images to zoom or view full resolution versions.

5. Send an Article:

Use /send followed by any web article URL to send content directly to your Kindle. Alternatively:

Send /send without an URL, and the bot will prompt you to provide a link.

6. Approve the Sender's Email:

Add EMAIL_FROM (as set in .env) to your approved senders list at Amazon Kindle Settings.

How it Works

Scraping Content:
- The web_scraper.py extracts the main content, title, and author from web articles. For Substack articles, it provides enhanced extraction with removal of unnecessary buttons and ads.
Converting to EPUB:
- In epub_converter.py, the content is packaged into an EPUB book, embedding metadata like the title, author, and images.
Sending Emails:
- The email_sender.py sends the generated EPUB to the user's Kindle email via an SMTP server.
Telegram Interface:
- Users interact with the bot using straightforward commands to configure their Kindle email, toggle settings, and process articles from any website.
Automatic File Management:
- The disk_monitor.py service continuously monitors disk space and automatically removes the oldest HTML and EPUB files when available space falls below 2GB, ensuring the system doesn't run out of storage while preserving recent files for debugging purposes.

Bot Commands

/start - Initialize the bot and view welcome message
/config - Configure or update your Kindle email address
/settings - View all current settings (email, image link preservation)
/imagelinks - Toggle image link preservation on/off
/cancel - Cancel current operation

CLI Usage

For users who prefer command-line tools or want to integrate the functionality into scripts, a CLI tool is available:

Quick Start with CLI

# Install dependencies
pip install -r requirements.txt

# Process any web article (basic usage)
python cli.py https://example.com/article-title

# Process article with image links preserved (clickable images) 
python cli.py --preserve-image-links https://example.com/article-title

CLI Features

Direct processing: Convert web articles to EPUB without needing a Telegram bot (supports any website, with special optimizations for Substack)
Image link preservation: Use --preserve-image-links flag to keep clickable links on images
Batch processing capability: Can be integrated into scripts for multiple articles
Email integration: Automatically sends EPUB to configured Kindle email

The CLI tool also supports the preserve image links feature:

# Process article with image links preserved
python cli.py --preserve-image-links https://example.com/article

# Process article with image links removed (default)
python cli.py https://example.com/article

EPUB Compatibility Notes

Kindle Compatibility: The EPUB files are optimized for Kindle devices. Some features might not work on other platforms.

Dependencies

The following packages are used in this project:

requests: HTTP requests for web scraping.
beautifulsoup4: HTML parsing for scraping articles.
EbookLib: For creating EPUB files.
python-telegram-bot: Telegram API integration.
python-dotenv: Managing environment variables.
SQLAlchemy: Database management using SQLite.
click: Command-line interface framework.
readability-lxml: Content extraction and readability improvements.
validators: URL and data validation.

Install dependencies via:

pip install -r requirements.txt

Support

For support, please open an issue in the repository.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
__pycache__		__pycache__
assets		assets
src		src
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Web to Kindle Telegram Bot

🚀 Try the Deployed Bot

Quick Start

Option 1: Use the Deployed Bot (Easiest)

Option 2: Use the CLI Tool (For Developers)

Option 3: Deploy Your Own Bot

Features

Installation

Prerequisites

Steps

Usage

1. Start the Bot:

2. Configure Kindle Email:

3. Configure Settings:

4. Toggle Image Link Preservation:

5. Send an Article:

6. Approve the Sender's Email:

How it Works

Bot Commands

CLI Usage

Quick Start with CLI

CLI Features

EPUB Compatibility Notes

Dependencies

Support

About

Uh oh!

Contributors 2

Uh oh!

Languages

License

Gunneone/web_to_kindle_telegram_bot

Folders and files

Latest commit

History

Repository files navigation

Web to Kindle Telegram Bot

🚀 Try the Deployed Bot

Quick Start

Option 1: Use the Deployed Bot (Easiest)

Option 2: Use the CLI Tool (For Developers)

Option 3: Deploy Your Own Bot

Features

Installation

Prerequisites

Steps

Usage

1. Start the Bot:

2. Configure Kindle Email:

3. Configure Settings:

4. Toggle Image Link Preservation:

5. Send an Article:

6. Approve the Sender's Email:

How it Works

Bot Commands

CLI Usage

Quick Start with CLI

CLI Features

EPUB Compatibility Notes

Dependencies

Support

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors 2

Uh oh!

Languages