LangExtract v1.0.0 - Structured Information Extraction

@aksg87 released this 22 Jul 21:59
· 111 commits to main since this release

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Key Features

  • Extract structured data from any text using few-shot examples
  • Support for Gemini and Ollama models
  • Interactive HTML visualizations with source highlighting
  • Optimized for long documents with parallel processing and multiple extraction passes
  • Precise source grounding: every extraction maps to its location in the original text
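
To make the source grounding idea concrete, here is a minimal standard-library sketch of what "every extraction maps to its location in the original text" can look like. It is not LangExtract's implementation or API: the names `GroundedExtraction` and `ground` are hypothetical, and it assumes a simple exact-substring match, whereas a real aligner would likely need to tolerate paraphrased or reformatted output from the model.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GroundedExtraction:
    """Hypothetical record: an extracted value plus its span in the source text."""
    extraction_class: str
    extraction_text: str
    start: Optional[int]  # inclusive character offset, None if not located
    end: Optional[int]    # exclusive character offset, None if not located


def ground(source: str, extraction_class: str, extraction_text: str) -> GroundedExtraction:
    """Attach character offsets by finding the first exact match in the source.

    Assumption: exact, case-sensitive matching. Extractions that do not appear
    verbatim (or are empty) are returned without offsets instead of guessed.
    """
    if not extraction_text:
        return GroundedExtraction(extraction_class, extraction_text, None, None)
    idx = source.find(extraction_text)
    if idx < 0:
        return GroundedExtraction(extraction_class, extraction_text, None, None)
    return GroundedExtraction(
        extraction_class, extraction_text, idx, idx + len(extraction_text)
    )


source = "The patient was given 250 mg of amoxicillin twice daily."
dosage = ground(source, "dosage", "250 mg")
missing = ground(source, "medication", "ibuprofen")

# A grounded span can always be checked against the original text.
if dosage.start is not None:
    print(dosage.extraction_class, repr(source[dosage.start:dosage.end]))
print(missing.extraction_class, "located:", missing.start is not None)
```

Storing offsets rather than only the extracted string is what lets a viewer highlight the exact span in the source, and it makes ungrounded extractions easy to flag for review.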

Installation

pip install langextract

See the documentation for full usage examples.