LangExtract v1.0.0 - Structured Information Extraction
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Key Features
- Extract structured data from any text using few-shot examples
- Support for Gemini and Ollama models
- Interactive HTML visualizations with source highlighting
- Optimized for long documents with parallel processing and multiple extraction passes
- Precise source grounding - every extraction maps to its location in the original text
Installation
pip install langextract
See the documentation for full usage examples.