InvoiceParserAI is an AI-powered tool for extracting invoice data such as vendor name, date, and total amount from PDF documents. It supports both desktop (Avalonia) and web (Blazor) interfaces and uses OpenAI for parsing, with Tesseract OCR as a fallback for text extraction.
- Create folders like "income" or "expenses"
- Bulk upload invoice PDFs
- AI-driven extraction of:
  - Vendor Name
  - Invoice Date
  - Total Amount
- Visual PDF viewer with field highlighting
- Manual review and editing of extracted fields
- CSV export of all reviewed data
- Runs locally or via web app
- Configurable OpenAI model and API key
- .NET 7/8
- Avalonia (Desktop UI)
- Blazor (Web UI)
- OpenAI API (via the OpenAI NuGet package)
- PdfPig for PDF parsing
- Tesseract OCR for fallback text extraction
- .NET 7 SDK or later
- OpenAI API Key (from https://platform.openai.com)
To run the desktop app:
cd InvoiceParserAI.Desktop
dotnet run
To run the web app:
cd InvoiceParserAI.Web
dotnet run
Create or update appsettings.json in your project:
{
  "OpenAI": {
    "ApiKey": "your-api-key-here",
    "Model": "text-davinci-003"
  }
}
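As a rough illustration of how the "OpenAI" section above could be read at startup, here is a minimal sketch using System.Text.Json. The names OpenAiSettings and SettingsLoader are hypothetical and are not taken from the project, which may well bind its configuration differently (for example through Microsoft.Extensions.Configuration).

```csharp
using System;
using System.IO;
using System.Text.Json;

// Hypothetical settings type mirroring the "OpenAI" section of appsettings.json.
public sealed record OpenAiSettings(string ApiKey, string Model);

public static class SettingsLoader
{
    // Parses the JSON text of appsettings.json and returns the "OpenAI" section.
    // Throws KeyNotFoundException if the section or one of its keys is missing.
    public static OpenAiSettings Parse(string json)
    {
        using JsonDocument doc = JsonDocument.Parse(json);
        JsonElement section = doc.RootElement.GetProperty("OpenAI");

        string apiKey = section.GetProperty("ApiKey").GetString() ?? "";
        string model = section.GetProperty("Model").GetString() ?? "";

        // Reject an empty key or the placeholder value from the sample file.
        if (string.IsNullOrWhiteSpace(apiKey) || apiKey == "your-api-key-here")
            throw new InvalidOperationException("Set OpenAI:ApiKey in appsettings.json.");

        return new OpenAiSettings(apiKey, model);
    }

    // Convenience wrapper that reads the file from disk first.
    public static OpenAiSettings Load(string path = "appsettings.json") =>
        Parse(File.ReadAllText(path));
}
```

Calling SettingsLoader.Load() from the project directory would return the configured key and model, or fail early if the placeholder key was left in place.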
- Create a folder (e.g. "income") and upload invoice PDFs
- The system will mark each invoice as Processing, Review, or Complete
- You can edit extracted values manually
- After all invoices are marked Complete, export the results to CSV
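The workflow above can be sketched in a few lines of C#. This is an illustrative model only: InvoiceStatus, InvoiceRecord, and CsvExporter are hypothetical names, the column layout is assumed, and limiting the export to invoices marked Complete is an assumption about how "reviewed" is interpreted. The project's actual types and CSV format may differ.

```csharp
using System;
using System.Collections.Generic;
using System.Globalization;
using System.Linq;

// The three states named in the usage steps.
public enum InvoiceStatus { Processing, Review, Complete }

// Hypothetical record holding the three extracted fields plus the status.
public sealed record InvoiceRecord(
    string VendorName, DateTime InvoiceDate, decimal TotalAmount, InvoiceStatus Status);

public static class CsvExporter
{
    // Quotes a field when it contains a comma, quote, or line break (RFC 4180 style).
    private static string Escape(string field) =>
        field.IndexOfAny(new[] { ',', '"', '\n', '\r' }) >= 0
            ? "\"" + field.Replace("\"", "\"\"") + "\""
            : field;

    // Builds CSV text from invoices marked Complete (assumed export rule).
    public static string ToCsv(IEnumerable<InvoiceRecord> invoices)
    {
        var lines = new List<string> { "VendorName,InvoiceDate,TotalAmount" };
        foreach (var inv in invoices.Where(i => i.Status == InvoiceStatus.Complete))
        {
            // Invariant culture keeps dates and decimals stable across locales.
            lines.Add(string.Join(",",
                Escape(inv.VendorName),
                inv.InvoiceDate.ToString("yyyy-MM-dd", CultureInfo.InvariantCulture),
                inv.TotalAmount.ToString("0.00", CultureInfo.InvariantCulture)));
        }
        return string.Join("\n", lines);
    }
}
```

Formatting with the invariant culture is a deliberate choice here: it avoids a decimal comma in the total colliding with the field separator on machines with a non-English locale.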
This project is licensed under the MIT License.
Developed by Nick Posey and contributors.