A Model Context Protocol (MCP) server that promises to tell you exactly when the next Caltrain will arrive... and then be 10 minutes late anyway. Uses real GTFS data, so at least the disappointment is official!
- π "Real-time" train schedules - Get the next departures between any two stations (actual arrival times may vary by +/- infinity)
- π Station lookup - Because apparently 31 stations is too many to memorize π€·ββοΈ
- π Time-specific queries - Plan your commute with surgical precision, then watch it all fall apart
- β¨ Smart search - Type 'sf' instead of the full name because we're all lazy here
- π GTFS-based - We use the same data Caltrain does, so when things go wrong, we can blame them together
-
Install dependencies (aka "More stuff to break"):
# Install uv if you haven't already (because pip is apparently too mainstream now) curl -LsSf https://astral.sh/uv/install.sh | sh # Install dependencies using uv (fingers crossed it actually works) uv sync
-
Get that sweet, sweet GTFS data: The server expects Caltrain GTFS data in the
src/caltrain_mcp/data/caltrain-ca-us/
directory. Because apparently we can't just ask the trains nicely where they are.uv run python scripts/fetch_gtfs.py
This magical script downloads files that contain:
stops.txt
- All the places trains pretend to stoptrips.txt
- Theoretical journeys through space and timestop_times.txt
- When trains are supposed to arrive (spoiler: they don't)calendar.txt
- Weekday vs weekend schedules (because trains also need work-life balance)
This server is designed to be used with MCP clients like Claude Desktop, not run directly by humans (because that would be too easy). Here's how to actually use it:
Add this to your Claude Desktop MCP configuration file:
{
"mcpServers": {
"caltrain": {
"command": "uvx",
"args": ["caltrain-mcp"]
}
}
}
This will automatically install and run the latest version from PyPI.
Then restart Claude Desktop and you'll have access to Caltrain schedules directly in your conversations!
Any MCP-compatible client can use this server by starting it with:
uvx caltrain-mcp
The server communicates via stdin/stdout using the MCP protocol. It doesn't do anything exciting when run directly - it just sits there waiting for proper MCP messages.
You can test if this thing actually works by importing it directly:
from caltrain_mcp.server import next_trains, list_stations
# Test next trains functionality (prepare for disappointment)
result = await next_trains('San Jose Diridon', 'San Francisco')
print(result) # Spoiler: there are no trains
# Test stations list (all 31 of them, because apparently that's manageable)
stations = await list_stations()
print(stations)
Ask politely when the next train will show up. The server will consult its crystal ball (GTFS data) and give you times that are technically accurate.
Parameters:
origin
(str): Where you are now (probably regretting your life choices)destination
(str): Where you want to be (probably anywhere but here)when_iso
(str, optional): When you want to travel (as if time has any meaning in public transit)
Examples:
# Next trains from current time (aka "right now would be nice")
next_trains('San Jose Diridon', 'San Francisco')
# Trains at a specific time (for the optimists who think schedules matter)
next_trains('Palo Alto', 'sf', '2025-05-23T06:00:00')
# Using abbreviations (because typing is hard)
next_trains('diridon', 'sf')
Get a list of all 31 Caltrain stations, because memorizing them is apparently too much to ask.
Returns: A formatted list that will make you realize just how many places this train supposedly goes.
The server supports various ways to be lazy about typing station names:
- Full names: "San Jose Diridon Station" (for the perfectionists)
- Short names: "San Francisco" (for the slightly less perfectionist)
- Abbreviations: "sf" β "San Francisco" (for the truly lazy)
- Partial matching: "diridon" matches "San Jose Diridon Station" (for when you can't be bothered)
The server covers every single Caltrain station because we're completionists:
San Francisco to San Jose (The Main Event):
- San Francisco, 22nd Street, Bayshore, South San Francisco, San Bruno, Millbrae, Broadway, Burlingame, San Mateo, Hayward Park, Hillsdale, Belmont, San Carlos, Redwood City, Menlo Park, Palo Alto, Stanford, California Avenue, San Antonio, Mountain View, Sunnyvale, Lawrence, Santa Clara, College Park, San Jose Diridon
San Jose to Gilroy (The "Why Does This Exist?" Extension):
- Tamien, Capitol, Blossom Hill, Morgan Hill, San Martin, Gilroy
π Next Caltrain departures from San Jose Diridon Station to San Francisco Caltrain Station on Thursday, May 22, 2025:
β’ Train 153: 17:58:00 β 19:16:00 (to San Francisco)
β’ Train 527: 18:22:00 β 19:22:00 (to San Francisco)
β’ Train 155: 18:28:00 β 19:46:00 (to San Francisco)
β’ Train 429: 18:43:00 β 19:53:00 (to San Francisco)
β’ Train 157: 18:58:00 β 20:16:00 (to San Francisco)
Actual arrival times may vary. Side effects may include existential dread and a deep appreciation for remote work.
- GTFS Processing: We automatically handle the relationship between stations and their platforms (because apparently trains are complicated)
- Service Calendar: Respects weekday/weekend schedules (trains also need their beauty rest)
- Data Types: Handles the chaos that is mixed integer/string formats in GTFS files
- Time Parsing: Supports 24+ hour format for those mythical late-night services
- Error Handling: Gracefully fails when you type "Narnia" as a station name
caltrain-mcp/
βββ .github/workflows/ # GitHub Actions (the CI/CD overlords)
β βββ ci.yml # Main CI pipeline (linting, testing, the works)
β βββ update-gtfs.yml # Automated GTFS data updates
βββ src/caltrain_mcp/ # Main package (because modern Python demands structure)
β βββ data/caltrain-ca-us/ # GTFS data storage (where CSV files go to retire)
β βββ __init__.py # Package initialization (the ceremony of Python)
β βββ __main__.py # Entry point for python -m caltrain_mcp
β βββ server.py # MCP server implementation (where the magic happens)
β βββ gtfs.py # GTFS data processing (aka "CSV wrestling")
βββ scripts/ # Utility scripts (the supporting cast)
β βββ __init__.py # Makes scripts a proper Python package
β βββ fetch_gtfs.py # Downloads the latest disappointment data
β βββ lint.py # Run all CI checks locally (before embarrassment)
βββ tests/ # Test suite (because trust but verify)
β βββ conftest.py # Shared test fixtures (the common ground)
β βββ test_gtfs.py # GTFS functionality tests (8 tests of data wrangling)
β βββ test_server.py # Server functionality tests (4 tests of MCP protocol)
β βββ test_fetch_gtfs.py # Data fetching tests (7 tests of download chaos)
βββ .pre-commit-config.yaml # Pre-commit hooks configuration
βββ pyproject.toml # Modern Python config (because setup.py is so 2020)
βββ README.md # This literary masterpiece
This project uses modern Python tooling to keep the code clean and maintainable:
- Ruff: Lightning-fast linting and formatting (because life's too short for slow tools)
- MyPy: Type checking (because guessing types is for amateurs)
- Pytest: Testing framework with coverage reporting
This project uses automated versioning and publishing:
- Semantic Versioning: Version numbers are automatically determined from commit messages using Conventional Commits
- Automatic Tagging: When you push to
main
, semantic-release creates version tags automatically - PyPI Publishing: Tagged releases are automatically built and published to PyPI via GitHub Actions
- Trusted Publishing: Uses OIDC authentication with PyPI (no API tokens needed!)
Just commit using conventional commit format and push to main:
# For bug fixes (patch version bump: 1.0.0 β 1.0.1)
git commit -m "fix: correct station name lookup bug"
# For new features (minor version bump: 1.0.0 β 1.1.0)
git commit -m "feat: add support for weekend schedules"
# For breaking changes (major version bump: 1.0.0 β 2.0.0)
git commit -m "feat!: redesign API structure"
# or
git commit -m "feat: major API changes
BREAKING CHANGE: This changes the function signatures"
The semantic-release workflow will:
- Analyze your commit messages
- Determine the appropriate version bump
- Create a git tag (e.g.,
v1.2.3
) - Generate a changelog
- Trigger the release workflow to publish to PyPI
Test the build process locally before pushing:
# Build packages locally
uv run python -m build --sdist --wheel
# Validate packages
uv run twine check dist/*
# Test upload to Test PyPI (optional)
uv run twine upload --repository testpypi dist/*
Every PR and push to main triggers automatic checks:
- β Linting: Ruff checks for code quality issues
- β Formatting: Ensures consistent code style
- β Type Checking: MyPy validates type annotations
- β Tests: Full test suite with coverage reporting
- β Coverage: Test coverage reporting in CI logs
The CI will politely reject your PR if any checks fail, because standards matter.
This server implements the Model Context Protocol (MCP), which means it's designed to work seamlessly with AI assistants and other MCP clients. Once configured:
- Claude Desktop: Ask Claude about train schedules directly in conversation
- Other MCP Clients: Any MCP-compatible tool can access Caltrain data
- Real-time Integration: Your AI can check schedules, suggest routes, and help plan trips
- Natural Language: No need to remember station names or command syntax
The server exposes two main tools:
next_trains
- Get upcoming departures between stationslist_stations
- Browse all available Caltrain stations
So your AI assistant can now disappoint you about train schedules just like a real human would! The future is truly here.
This project uses official Caltrain GTFS data. If something goes wrong, blame them, not us. We're just the messenger.
Built with β€οΈ and a concerning amount of caffeine in the Bay Area, where public transit is both a necessity and a source of eternal suffering.