Skip to content

Parthiban-3997/DocMulti-Chat-Assistant-Using-LlamaIndex

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DocMulti Chat Assistant Using LlamaIndex 🦙

Overview

DocMulti Chat Assistant is a powerful Streamlit-based application that leverages LlamaIndex to provide an interactive chat interface for querying multiple documents. This tool excels at processing and understanding complex data structures, ensuring that no information is lost during the parsing and indexing stages.

Deployed Link

DocMulti-Chat-Assistant app is Deployed And Available Here

Screenshots

llamaindex_1 llamaindex_2 llamaindex_3 llamaindex_4

Examples from uploaded document file structures

llamaindex_5

Features

  • Multi-Document Support: Upload and process multiple document types.
  • Advanced Parsing: Option to use LlamaParse for complex documents with graphs and tables.
  • Customizable Models: Choose from various Groq models for text generation.
  • Flexible Embedding: Utilizes Google's Gemini for document embedding.
  • Interactive Chat Interface: Engage in a conversation about your documents.
  • Memory Retention: Maintains context across multiple queries using a chat memory buffer.
  • Custom Parsing Instructions: Tailor the document parsing process with custom instructions.
  • Custom Prompt Templates: Define your own prompts for more specific interactions.

Components

  • Streamlit: For the web interface.
  • LlamaIndex: Core indexing and querying engine.
  • Groq: Large Language Model for text generation.
  • Google Gemini: For document embedding.
  • LlamaParse: Optional advanced document parser.

Usage

  1. Launch the Streamlit app.
  2. In the sidebar:
    • Enter your API keys for Groq, Google, and Llama Cloud.
    • Select a Groq model.
    • Upload your documents.
    • Choose whether to use LlamaParse.
    • Set any advanced options like custom parsing instructions or prompt templates.
  3. Click "Start Document Indexing" to process your documents.
  4. Once indexed, use the chat interface to ask questions about your documents.

Supported Document Types

DocMulti Chat Assistant supports an extensive range of document formats, making it incredibly versatile for various use cases. Here's the full list of supported extensions:

  • Text Documents: .txt, .rtf, .md
  • Microsoft Office: .doc, .docx, .docm, .dot, .dotm, .xls, .xlsx, .xlsm, .xlsb, .ppt, .pptx, .pptm, .pot, .potm, .potx
  • OpenDocument: .odt, .ods, .odp
  • PDF: .pdf
  • eBooks: .epub
  • Images: .jpg, .jpeg, .png, .gif, .bmp, .svg, .tiff, .webp
  • Web: .htm, .html, .xml
  • Spreadsheets: .csv, .tsv, .dif, .sylk, .slk, .prn
  • Database: .dbf
  • Other Office Suites:
    • Apple iWork: .pages, .numbers, .key
    • WordPerfect: .wpd
    • Lotus: .wks, .123
    • Quattro Pro: .qpw
  • Specialized Formats: .602, .abw, .cgm, .cwk, .hwp, .lwp, .mw, .mcw, .pbd, .sda, .sdd, .sdp, .sdw, .sgl, .sti, .sxi, .sxw, .uof, .uop, .uot, .vor, .wps, .zabw, .et, .eth, .wk1, .wk2, .wk3, .wk4, .wq1, .wq2, .wb1, .wb2, .wb3, .xlr

Advanced Features

  • Custom Parsing Instructions: Tailor how LlamaParse extracts information from your documents.
  • Custom Prompt Templates: Create specific prompts to guide the AI's responses.
  • Adjustable Model Parameters: Select different Groq models to balance between speed and capability.

Notes

  • Ensure you have valid API keys for Groq, Google, and Llama Cloud before using the application.
  • Processing time may vary depending on the number and complexity of uploaded documents.

Troubleshooting

  • If you encounter errors during document processing, check your API keys and internet connection.
  • For issues with specific document types, try toggling the LlamaParse option.

Contributing

Contributions to improve DocMulti Chat Assistant are welcome. Please submit pull requests or open issues on the project repository.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages