🖼️📄 E2E Multi-modal Document Preprocessing for Search Indexing with Azure Document Intelligence

kimtth/rag-multimodal-semantic-chunking

📄 Multi-modal Document Preprocessing with Azure Document Intelligence

✨ Features

  1. 📝 Parse documents with Document Intelligence and write the parsed results in Markdown format. > output
  2. 🖼️ Extract figures from documents and save them as PNG images. > output
  3. 🤖 Generate figure descriptions using an Azure OpenAI multimodal model.
  4. 📝 Update the Markdown outputs with the generated descriptions. > output
  5. 📊 Extract tables and convert them into Excel files. > output
  6. 📖 Chunk the Markdown outputs using MarkdownHeaderTextSplitter, RecursiveContentChunker, and SemanticContentChunker (TBD). > markdown chunk output | recursive chunk output
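
To make the chunking step (feature 6) more concrete, here is a minimal standard-library sketch of the two general ideas: splitting Markdown at headers, then recursively splitting oversized text on progressively finer separators. This is not the repository's code, and it does not reproduce the behavior of `MarkdownHeaderTextSplitter` or `RecursiveContentChunker`. The function names `split_by_headers` and `split_recursively`, the `max_chars` parameter, and the separator order are all assumptions made for illustration.

```python
import re

HEADER_RE = re.compile(r"^(#{1,6})\s+(.*)$")


def split_by_headers(markdown: str) -> list[dict]:
    """Split Markdown into sections, one per header, keeping the header path.

    Hypothetical helper: illustrates header-based splitting in general.
    """
    sections = []
    path = {}    # header level -> title currently in effect
    buffer = []  # body lines collected since the last header

    def flush():
        text = "\n".join(buffer).strip()
        if text:
            sections.append({"headers": dict(path), "text": text})
        buffer.clear()

    for line in markdown.splitlines():
        match = HEADER_RE.match(line)
        if match:
            flush()
            level = len(match.group(1))
            # Drop headers at the same or a deeper level before recording this one.
            for stale in [k for k in path if k >= level]:
                del path[stale]
            path[level] = match.group(2).strip()
        else:
            buffer.append(line)
    flush()
    return sections


def split_recursively(text: str, max_chars: int,
                      separators=("\n\n", "\n", " ")) -> list[str]:
    """Split text on the coarsest separator that keeps chunks within max_chars.

    Hypothetical helper: the separator order is an assumption.
    """
    if len(text) <= max_chars:
        return [text] if text else []
    if not separators:
        # No separator left: fall back to fixed-width slices.
        return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]
    sep, rest = separators[0], separators[1:]
    chunks, current = [], ""
    for piece in text.split(sep):
        candidate = piece if not current else current + sep + piece
        if len(candidate) <= max_chars:
            current = candidate
            continue
        if current:
            chunks.append(current)
        if len(piece) <= max_chars:
            current = piece
        else:
            # The piece alone is too long: retry it with finer separators.
            chunks.extend(split_recursively(piece, max_chars, rest))
            current = ""
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    doc = "# Title\nintro\n## A\nalpha text\n## B\nbeta"
    for section in split_by_headers(doc):
        print(section["headers"], split_recursively(section["text"], 20))
```

Splitting on headers first keeps each chunk tied to its section path, which can later be stored as metadata in a search index; the recursive pass only runs on sections that exceed the size limit.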

🚀 Usage

python doc_intelli.py

📚 Learn More
