GitHub - grayashh/dify-upstage-document-parser-plugin

Upstage Document Parsing Plugin for Dify

A powerful document parsing plugin for the Dify platform that leverages the Upstage Document Parse API to convert various document formats into structured Markdown, HTML, or plain text.

Features

Broad format support: Handles PDF, DOCX, and various image formats
Intelligent document understanding: Extracts text, tables, charts, and figures while preserving original structure
Multiple output formats: Converts documents to Markdown, HTML, or plain text
Efficient caching: Content-based caching prevents reprocessing of identical files
OCR capability: Extracts text from scanned documents and images
Chart recognition: Identifies and extracts charts from documents
Batch processing: Efficiently processes multi-page documents
Coordinate extraction: Obtains bounding box coordinates for document elements

Installation

This plugin is under active development.

pip install -r requirements.txt

Configure the plugin in the Dify platform.

Configuration

Required credentials

The plugin requires the following credentials:

upstage_api_key: Upstage API key (obtain it from the Upstage Console)
base_url: Base URL for your Dify instance (default: "https://cloud.dify.ai")

Parameter options

You can configure the following parameters when using the tool:

result_type: Output format (options: "md", "html", "text")
as_file: Whether to return the result as a file or text (options: "file", "text")

Usage

In a Dify application

Add the Upstage Document Parse tool to your application.
Configure the required credentials.
Use the tool within your application flow to process documents.

Direct usage in Python

You can also use the client directly in Python:

from tools.upstage_client import UpstageDocumentParseClient

# Initialize the client
client = UpstageDocumentParseClient(
    api_key="your_upstage_api_key",
    output_dir="exported_documents"
)

# Convert a document to Markdown
markdown_content = client.convert_to_markdown("path/to/your/document.pdf")

# Convert a document to HTML
html_content = client.convert_to_html("path/to/your/document.docx")

# Convert a document to plain text
text_content = client.convert_to_text("path/to/your/image.jpg")

API Parameters

The plugin uses the following parameters when calling the Upstage Document Parse API:

Parameter	Type	Description	Default
`document`	File	The document file to process	Required
`ocr`	String	Control OCR behavior: "auto" (applies only to images) or "force" (convert everything to images first)	"auto"
`coordinates`	Boolean	Whether to return bounding box coordinates	false
`chart_recognition`	Boolean	Whether to use chart recognition	true
`output_formats`	List[String]	Formats for layout elements: "text", "html", "markdown"	["html", "markdown", "text"]
`model`	String	Model used for inference	"document-parse-250618"
`base64_encoding`	List[String]	Layout categories to provide as base64-encoded strings	["table", "figure", "chart"]

Caching Mechanism

The plugin implements an efficient caching system:

File content hashing to identify duplicate documents
Result caching based on content hash and output format
TTL-based cache expiration (default: 1 hour)

Examples

Convert a PDF to Markdown

client = UpstageDocumentParseClient(api_key="your_api_key")
markdown = client.convert_to_markdown("sample.pdf")
print(markdown)

Handling large documents

client = UpstageDocumentParseClient(api_key="your_api_key")
exported_files = client.process_document(
    "large_document.pdf",
    wait=True,
    poll_interval=2,
    max_wait=600
)
print(f"Exported files: {exported_files}")

Development

Project structure

upstage-documentparse.py: Main Dify plugin integration
upstage_client.py: Core client that interacts with the Upstage API
requirements.txt: Python dependencies

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github/workflows		.github/workflows
_assets		_assets
provider		provider
temp_output		temp_output
tools		tools
.difyignore		.difyignore
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
GUIDE.md		GUIDE.md
PRIVACY.md		PRIVACY.md
README.md		README.md
main.py		main.py
manifest.yaml		manifest.yaml
requirements.txt		requirements.txt
upstage-document-parser.difypkg		upstage-document-parser.difypkg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Upstage Document Parsing Plugin for Dify

Features

Installation

Configuration

Required credentials

Parameter options

Usage

In a Dify application

Direct usage in Python

API Parameters

Caching Mechanism

Examples

Convert a PDF to Markdown

Handling large documents

Development

Project structure

About

Uh oh!

Releases

Packages

Languages

grayashh/dify-upstage-document-parser-plugin

Folders and files

Latest commit

History

Repository files navigation

Upstage Document Parsing Plugin for Dify

Features

Installation

Configuration

Required credentials

Parameter options

Usage

In a Dify application

Direct usage in Python

API Parameters

Caching Mechanism

Examples

Convert a PDF to Markdown

Handling large documents

Development

Project structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages