Skip to content

Releases: souvikmajumder26/Multi-Agent-Medical-Assistant

v2.1.2 - Upgraded RAG Agent

02 May 21:12
68f7bbf
Compare
Choose a tag to compare
  • Document Processing Upgrade:

    • Unstructured.io has been replaced with Docling for document parsing and extraction of text, tables, and images to be embedded.
    • Parsed document is converted into markdown with text, tables, and image placeholders.
    • Image placeholders are replaced by image summaries generated by multi-modal LLM using the extracted images.
  • Enhanced RAG References:

    • Links to the source documents and reference images present in reranked retrieved chunks stored in local storage are added to the bottom of the RAG responses.
  • Integrated backend and frontend into one app:

    • Run a single FastAPI app.py for both backend and frontend
  • Updated Dockerfile and configured GitHub Actions:

    • Simplified Dockerfile, replaced Unstructured.io system dependency installations with that of Docling
  • Multi-platform bug fixes:

    • Modified ingested document paths to work on any OS and docker.
    • Modified FastAPI app host from 127.0.0.1 to 0.0.0.0 to be accessible from outside docker container.

What's Changed

  • Modified document path (for multi OS support) and app host (to access from outside docker container without overriding host using command line argument) by @souvikmajumder26 in #61

Full Changelog: v2.1.1...v2.1.2

v2.1.1 - Upgraded RAG Agent

29 Apr 11:11
243eb95
Compare
Choose a tag to compare
  • Document Processing Upgrade:

    • Unstructured.io has been replaced with Docling for document parsing and extraction of text, tables, and images to be embedded.
    • Parsed document is converted into markdown with text, tables, and image placeholders.
    • Image placeholders are replaced by image summaries generated by multi-modal LLM using the extracted images.
  • Enhanced RAG References:

    • Links to the source documents and reference images present in reranked retrieved chunks stored in local storage are added to the bottom of the RAG responses.
  • Integrated backend and frontend into one app:

    • Run a single FastAPI app.py for both backend and frontend
  • Updated Dockerfile and configured GitHub Actions:

    • Simplified Dockerfile, replaced Unstructured.io system dependency installations with that of Docling
  • Refer v2.0 release notes for previous updates.

What's Changed

Full Changelog: v2.0...v2.1 v2.1...v2.1.1

v2.0 - First Final Version

07 Apr 21:32
b94a08e
Compare
Choose a tag to compare
  • Successfully working overall architecture of Automated Agent Routing with LangGraph.
  • Successfully working Conversation Agent fine-tuned for medical domain.
  • Successfully working RAG agent.
  • Successfully working Web Search agent.
  • Successfully working routing from RAG to Web Search based on Retrieval Confidence score (if low).
  • Successfully working routing to appropriate Medical Computer Vision agent based on Classification of uploaded image (brain MRI / chest X-ray / skin lesion).
  • Successfully storing conversation history till specified length.
  • Successfully working backend and frontend.
  • Added ingest_rag_data.py to manually ingest new data for information retrieval.
  • Currently document parsing implemented with PyPDF2, later will provide option of unstructured.io as well (needs tesseract and poppler -installation at system level).
  • Successfully working Medical Computer Vision model agents - Chest X-ray Covid-19 classification, and Skin Lesion Segmentation.
  • Successfully integrated ElevenLabs API to enable speech-to-text and text-to-speech services in conversation.
  • Successfully integrated Input and Output Guardrails.
  • Conversation history is now maintained in Graph State rather than separately managed in the fastapi backend like in previous releases.
  • Updated Chunking Strategy including logic of semantic chunking (chunking respecting semantic boundaries - section, paragraph, sentence boundaries) utilizing section headers specific to different document types (that will be detected) such as research papers, clinical notes, patient records, medical condition reports, guidelines and protocols, and drug information. Also, included medical entity recognition to enrich the document metadata that will aid in hybrid search comparing with the medical entities detected from the user query.
  • Provided chunking strategy options to developer: 'semantic', 'sliding_window', 'recursive', 'hybrid'.
  • Due to exhaustion of Git LFS quota, large model file is now shared via gdrive which will get downloaded automatically in the correct path (added an automatic model downloader script).
  • Successfully integrated Human-in-the-loop Validation for Computer Vision Agent results.
  • Unified LLM and Embedding model definitions from config.
  • Integrated Tables Extraction with Unstructured.IO
  • Upgraded Vector Search to Hybrid Search Retrieval (BM25 sparse keyword matching + dense embedding vector similarity search) with Qdrant DB
  • Ingested new data into the vector database for the final version; corresponding demo video and README have been updated
  • Updated README installation guide regarding using docker along with pre-existing manual option

What's Changed

Full Changelog: v1.9...v2.0

v1.9 - Ingested new data

07 Apr 20:45
c4f6e05
Compare
Choose a tag to compare
  • Successfully working overall architecture of Automated Agent Routing with LangGraph.
  • Successfully working Conversation Agent fine-tuned for medical domain.
  • Successfully working RAG agent.
  • Successfully working Web Search agent.
  • Successfully working routing from RAG to Web Search based on Retrieval Confidence score (if low).
  • Successfully working routing to appropriate Medical Computer Vision agent based on Classification of uploaded image (brain MRI / chest X-ray / skin lesion).
  • Successfully storing conversation history till specified length.
  • Successfully working backend and frontend.
  • Added ingest_rag_data.py to manually ingest new data for information retrieval.
  • Currently document parsing implemented with PyPDF2, later will provide option of unstructured.io as well (needs tesseract and poppler -installation at system level).
  • Successfully working Medical Computer Vision model agents - Chest X-ray Covid-19 classification, and Skin Lesion Segmentation.
  • Successfully integrated ElevenLabs API to enable speech-to-text and text-to-speech services in conversation.
  • Successfully integrated Input and Output Guardrails.
  • Conversation history is now maintained in Graph State rather than separately managed in the fastapi backend like in previous releases.
  • Updated Chunking Strategy including logic of semantic chunking (chunking respecting semantic boundaries - section, paragraph, sentence boundaries) utilizing section headers specific to different document types (that will be detected) such as research papers, clinical notes, patient records, medical condition reports, guidelines and protocols, and drug information. Also, included medical entity recognition to enrich the document metadata that will aid in hybrid search comparing with the medical entities detected from the user query.
  • Provided chunking strategy options to developer: 'semantic', 'sliding_window', 'recursive', 'hybrid'.
  • Due to exhaustion of Git LFS quota, large model file is now shared via gdrive which will get downloaded automatically in the correct path (added an automatic model downloader script).
  • Successfully integrated Human-in-the-loop Validation for Computer Vision Agent results.
  • Unified LLM and Embedding model definitions from config.
  • Integrated Tables Extraction with Unstructured.IO
  • Upgraded Vector Search to Hybrid Search Retrieval (BM25 sparse keyword matching + dense embedding vector similarity search) with Qdrant DB
  • Ingested new data into the vector database for the final version
  • Corresponding demo video and README have been updated

What's Changed

Full Changelog: v1.8...v1.9

v1.8 - Integrated Unstructured.IO

31 Mar 21:49
70a3677
Compare
Choose a tag to compare
  • Successfully working overall architecture of Automated Agent Routing with LangGraph.
  • Successfully working Conversation Agent fine-tuned for medical domain.
  • Successfully working RAG agent.
  • Successfully working Web Search agent.
  • Successfully working routing from RAG to Web Search based on Retrieval Confidence score (if low).
  • Successfully working routing to appropriate Medical Computer Vision agent based on Classification of uploaded image (brain MRI / chest X-ray / skin lesion).
  • Successfully storing conversation history till specified length.
  • Successfully working backend and frontend.
  • Added ingest_rag_data.py to manually ingest new data for information retrieval.
  • Currently document parsing implemented with PyPDF2, later will provide option of unstructured.io as well (needs tesseract and poppler -installation at system level).
  • Successfully working Medical Computer Vision model agents - Chest X-ray Covid-19 classification, and Skin Lesion Segmentation.
  • Successfully integrated ElevenLabs API to enable speech-to-text and text-to-speech services in conversation.
  • Successfully integrated Input and Output Guardrails.
  • Conversation history is now maintained in Graph State rather than separately managed in the fastapi backend like in previous releases.
  • Updated Chunking Strategy including logic of semantic chunking (chunking respecting semantic boundaries - section, paragraph, sentence boundaries) utilizing section headers specific to different document types (that will be detected) such as research papers, clinical notes, patient records, medical condition reports, guidelines and protocols, and drug information. Also, included medical entity recognition to enrich the document metadata that will aid in hybrid search comparing with the medical entities detected from the user query.
  • Provided chunking strategy options to developer: 'semantic', 'sliding_window', 'recursive', 'hybrid'.
  • Due to exhaustion of Git LFS quota, large model file is now shared via gdrive which will get downloaded automatically in the correct path (added an automatic model downloader script).
  • Successfully integrated Human-in-the-loop Validation for Computer Vision Agent results.
  • Unified LLM and Embedding model definitions from config.
  • Integrated Tables Extraction with Unstructured.IO
  • Upgraded Vector Search to Hybrid Search Retrieval (BM25 sparse keyword matching + dense embedding vector similarity search) with Qdrant DB

What's Changed

Full Changelog: v1.7...v1.8

v1.7 - Integrated Human-in-the-loop Validation

26 Mar 13:19
441435f
Compare
Choose a tag to compare
  • Successfully working overall architecture of Automated Agent Routing with LangGraph.
  • Successfully working Conversation Agent fine-tuned for medical domain.
  • Successfully working RAG agent.
  • Successfully working Web Search agent.
  • Successfully working routing from RAG to Web Search based on Retrieval Confidence score (if low).
  • Successfully working routing to appropriate Medical Computer Vision agent based on Classification of uploaded image (brain MRI / chest X-ray / skin lesion).
  • Successfully storing conversation history till specified length.
  • Successfully working backend and frontend.
  • Added ingest_rag_data.py to manually ingest new data for information retrieval.
  • Currently document parsing implemented with PyPDF2, later will provide option of unstructured.io as well (needs tesseract and poppler -installation at system level).
  • Successfully working Medical Computer Vision model agents - Chest X-ray Covid-19 classification, and Skin Lesion Segmentation.
  • Successfully integrated ElevenLabs API to enable speech-to-text and text-to-speech services in conversation.
  • Successfully integrated Input and Output Guardrails.
  • Conversation history is now maintained in Graph State rather than separately managed in the fastapi backend like in previous releases.
  • Updated Chunking Strategy including logic of semantic chunking (chunking respecting semantic boundaries - section, paragraph, sentence boundaries) utilizing section headers specific to different document types (that will be detected) such as research papers, clinical notes, patient records, medical condition reports, guidelines and protocols, and drug information. Also, included medical entity recognition to enrich the document metadata that will aid in hybrid search comparing with the medical entities detected from the user query.
  • Provided chunking strategy options to developer: 'semantic', 'sliding_window', 'recursive', 'hybrid'.
  • Due to exhaustion of Git LFS quota, large model file is now shared via gdrive which will get downloaded automatically in the correct path (added an automatic model downloader script).
  • Successfully integrated Human-in-the-loop Validation for Computer Vision Agent results.

What's Changed

Full Changelog: v1.6...v1.7

v1.6 - Advanced Chunking Strategy

24 Mar 21:48
c9c4eaf
Compare
Choose a tag to compare
  • Successfully working overall architecture of Automated Agent Routing with LangGraph.
  • Successfully working Conversation Agent fine-tuned for medical domain.
  • Successfully working RAG agent.
  • Successfully working Web Search agent.
  • Successfully working routing from RAG to Web Search based on Retrieval Confidence score (if low).
  • Successfully working routing to appropriate Medical Computer Vision agent based on Classification of uploaded image (brain MRI / chest X-ray / skin lesion).
  • Successfully storing conversation history till specified length.
  • Successfully working backend and frontend.
  • Added ingest_rag_data.py to manually ingest new data for information retrieval.
  • Currently document parsing implemented with PyPDF2, later will provide option of unstructured.io as well (needs tesseract and poppler -installation at system level).
  • Successfully working Medical Computer Vision model agents - Chest X-ray Covid-19 classification, and Skin Lesion Segmentation.
  • Successfully integrated ElevenLabs API to enable speech-to-text and text-to-speech services in conversation.
  • Successfully integrated Input and Output Guardrails.
  • Conversation history is now maintained in Graph State rather than separately managed in the fastapi backend like in previous releases.
  • Updated Chunking Strategy including logic of semantic chunking (chunking respecting semantic boundaries - section, paragraph, sentence boundaries) utilizing section headers specific to different document types (that will be detected) such as research papers, clinical notes, patient records, medical condition reports, guidelines and protocols, and drug information. Also, included medical entity recognition to enrich the document metadata that will aid in hybrid search comparing with the medical entities detected from the user query.
  • Provided chunking strategy options to developer: 'semantic', 'sliding_window', 'recursive', 'hybrid'.
  • Due to exhaustion of Git LFS quota, large model file is now shared via gdrive which will get downloaded automatically in the correct path (added an automatic model downloader script).

What's Changed

Full Changelog: v1.5...v1.6

v1.5 - Conversation history now maintained in graph state

21 Mar 20:34
f2401ca
Compare
Choose a tag to compare
  • Successfully working overall architecture of Automated Agent Routing with LangGraph.
  • Successfully working Conversation Agent fine-tuned for medical domain.
  • Successfully working RAG agent.
  • Successfully working Web Search agent.
  • Successfully working routing from RAG to Web Search based on Retrieval Confidence score (if low).
  • Successfully working routing to appropriate Medical Computer Vision agent based on Classification of uploaded image (brain MRI / chest X-ray / skin lesion).
  • Successfully storing conversation history till specified length.
  • Successfully working backend and frontend.
  • Added ingest_rag_data.py to manually ingest new data for information retrieval.
  • Currently document parsing implemented with PyPDF2, later will provide option of unstructured.io as well (needs tesseract and poppler -installation at system level).
  • Successfully working Medical Computer Vision model agents - Chest X-ray Covid-19 classification, and Skin Lesion Segmentation.
  • Successfully integrated ElevenLabs API to enable speech-to-text and text-to-speech services in conversation.
  • Successfully integrated Input and Output Guardrails.
  • Conversation history is now maintained in Graph State rather than separately managed in the fastapi backend like in previous releases.

What's Changed

Full Changelog: v1.4...v1.5

v1.4 - Integrated Guardrails

20 Mar 20:23
9b028ad
Compare
Choose a tag to compare
  • Successfully working overall architecture of Automated Agent Routing with LangGraph.
  • Successfully working Conversation Agent fine-tuned for medical domain.
  • Successfully working RAG agent.
  • Successfully working Web Search agent.
  • Successfully working routing from RAG to Web Search based on Retrieval Confidence score (if low).
  • Successfully working routing to appropriate Medical Computer Vision agent based on Classification of uploaded image (brain MRI / chest X-ray / skin lesion).
  • Successfully storing conversation history till specified length.
  • Successfully working backend and frontend.
  • Added ingest_rag_data.py to manually ingest new data for information retrieval.
  • Currently document parsing implemented with PyPDF2, later will provide option of unstructured.io as well (needs tesseract and poppler -installation at system level).
  • Successfully working Medical Computer Vision model agents - Chest X-ray Covid-19 classification, and Skin Lesion Segmentation.
  • Successfully integrated ElevenLabs API to enable speech-to-text and text-to-speech services in conversation.
  • Successfully integrated Input and Output Guardrails.

What's Changed

Full Changelog: v1.3...v1.4

v1.3 - Integrated speech-to-text and text-to-speech

19 Mar 18:27
bea1f50
Compare
Choose a tag to compare
  • Successfully working overall architecture of Automated Agent Routing with LangGraph.

  • Successfully working Conversation Agent fine-tuned for medical domain.

  • Successfully working RAG agent.

  • Successfully working Web Search agent.

  • Successfully working routing from RAG to Web Search based on Retrieval Confidence score (if low).

  • Successfully working routing to appropriate Medical Computer Vision agent based on Classification of uploaded image (brain MRI / chest X-ray / skin lesion).

  • Successfully storing conversation history till specified length.

  • Successfully working backend and frontend.

  • Added ingest_rag_data.py to manually ingest new data for information retrieval.

  • Currently document parsing implemented with PyPDF2, later will provide option of unstructured.io as well (needs tesseract and poppler installation at system level).

  • Successfully working Medical Computer Vision model agents - Chest X-ray Covid-19 classification, and Skin Lesion Segmentation.

  • Successfully integrated ElevenLabs API to enable speech-to-text and text-to-speech services in conversation.

What's Changed

Full Changelog: v1.2...v1.3