a Python library for scraping and document processing
uv add raggy
add extras for vectorstore backends and additional document types:
uv add 'raggy[chroma]' # ChromaDB support
uv add 'raggy[tpuf]' # TurboPuffer support
uv add 'raggy[pdf]' # PDF processing
read the docs
a simple-to-use Python library for:
- scraping the web to produce rich documents
- putting these documents in vectorstores
- querying the vectorstores to find documents similar to a query
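
to make the store-then-query flow above concrete, here is a minimal, standard-library-only sketch of the idea. it does not use raggy's API: `ToyVectorstore`, `embed`, and `cosine` are hypothetical names invented for illustration, and the word-count "embedding" is a stand-in for a real embedding model.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyVectorstore:
    """Hypothetical in-memory store for illustration; not part of raggy."""

    def __init__(self) -> None:
        self._items: list[tuple[str, Counter]] = []

    def add(self, documents: list[str]) -> None:
        # Store each document alongside its vector.
        self._items.extend((doc, embed(doc)) for doc in documents)

    def query(self, text: str, n_results: int = 1) -> list[str]:
        # Rank stored documents by similarity to the query, best first.
        query_vector = embed(text)
        ranked = sorted(
            self._items,
            key=lambda item: cosine(query_vector, item[1]),
            reverse=True,
        )
        return [doc for doc, _ in ranked[:n_results]]


store = ToyVectorstore()
store.add([
    "uv is a fast Python package manager",
    "ChromaDB stores embeddings for similarity search",
    "PDF files can be split into text chunks",
])
top = store.query("which package manager is fast?")
print(top)
```

a real vectorstore such as ChromaDB or TurboPuffer replaces the word counts with learned embeddings and the linear scan with an index, but the add-then-query shape is the same.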
this project is licensed under the Apache 2.0 License - see the LICENSE file for details.
I welcome contributions! See the contributing guide for details.