How to use local DeepSeek R1 #272

TomSeestern · 2025-02-02T16:16:37Z

TomSeestern
Feb 2, 2025

TLDR: Works good, can recommend

I just used DeepSeek-R1-Distill-Llama-8B-Q4_K_M.gguf for analyzing 100 documents locally, and I'm fairly impressed by both speed and accuracy:

Took about 10s per document on a RTX 2060
Happy with the results
- Was able to process 95 of 100 document presented
- Decent titles, sometimes only "Invoice", might depend on my prompt
- Good use of existing Correspondence
- Mediocre use of existing Tags, created lots of new ones and mixed languages
- Excellent parsing of dates, even being able to form something Greek "25 Σεπ. 02 -> 25.09.2002

How to

If you want to try it yourself, here are the settings I used:

Settings: Paperless Assistant

AI Configuration

AI Provider
-> Custom
Base URL (local IP of my pc, not localhost!, note the /v1 at the end if the URL!)
-> http://192.168.7.99:1234/v1
API Key (just a random '0' as LM Studio doesn't need a key)
-> 0
Model
-> deepseek-r1-distill-llama-8b

Prompt

You are a personalized document analyzer. Your task is to analyze documents and extract relevant information.

Analyze the document content and extract the following information into a structured JSON object:

1. title: Create a concise, meaningful title for the document
2. correspondent: Identify the sender/institution but do not include addresses
3. tags: Select up to 4 relevant thematic tags
4. document_date: Extract the document date (format: YYYY-MM-DD)
5. language: Determine the document language (e.g. "de" or "en")
      
Important rules for the analysis:

For tags:
- FIRST check the existing tags before suggesting new ones
- Use only relevant categories
- Maximum 4 tags per document, less if sufficient (at least 1)
- Avoid generic or too specific tags
- Use only the most important information for tag creation
- The output language is the one used in the document! IMPORTANT!

For the title:
- Short and concise, NO ADDRESSES
- Contains the most important identification features
- For invoices/orders, mention invoice/order number if available
- The output language is the one used in the document! IMPORTANT!

For the correspondent:
- Identify the sender or institution
- When generating the correspondent, always create the shortest possible form of the company name (e.g. "Amazon" instead of "Amazon EU SARL, German branch")

For the document date:
- Extract the date of the document
- Use the format YYYY-MM-DD
- If multiple dates are present, use the most relevant one

For the language:
- Determine the document language
- Use language codes like "de" for German or "en" for English
- If the language is not clear, use "und" as a placeholder

Settings LM Studio

I used LM Studio 0.3.9 running on a RTX 2060

Switch mode on the bottom left to "Developer" and go to the tab

My Server settings:

Inference settings

Make sure to enable structured output and use my template below
Make sure to save the preset.

Important: You have to use "structured output" in order to get the right output that paperless-ai can understand!
Here is the Schema I used, paste it into the "Structured Output" setting in LM Studio as shown in the screenshot above.

{
  "type": "object",
  "properties": {
    "title": {
      "type": "string",
      "description": "The title of the document."
    },
    "correspondent": {
      "type": "string",
      "description": "The name of the correspondent related to the document."
    },
    "tags": {
      "type": "array",
      "items": {
        "type": "string"
      },
      "description": "A list of tags associated with the document."
    },
    "document_type": {
      "type": "string",
      "description": "The type of document, e.g., 'invoice', 'report', 'email'."
    },
    "document_date": {
      "type": "string",
      "format": "date",
      "description": "The date of the document in YYYY-MM-DD format."
    },
    "language": {
      "type": "string",
      "description": "The language of the document in ISO 639-1 format."
    }
  },
  "required": [
    "title",
    "document_type",
    "correspondent",
    "tags",
    "document_date",
    "language"
  ]
}

Load Settings

Had some big files, so I cranked up the context length to 13k

mamema · 2025-02-04T13:58:25Z

mamema
Feb 4, 2025

hmm, yeah, deepseek the new kid on the block....
..but you wrote for example: Mediocre use of existing Tags, created lots of new ones and mixed languages
So.... can you compare ito to gemma2:latest, as i haven't those issues with it you are describing?

deepseek is great.... for jailbreak prompts as it lacks every safety measure around. :-)

1 reply

TomSeestern Feb 5, 2025
Author

A bit of Prompt tuning helped with the Language stuff, but I think the Tag problem is related to #232 . I tried putting all my Tags as part of the prompt and that somewhat helped. So that should be a problem/fix with any model in paperless-ai.
I will try gemma2 next!

mamema · 2025-02-05T10:00:19Z

mamema
Feb 5, 2025

putting all my tags as part of the prompt, would break every prompting, as i have +/- 450 tags. This whole "put all tags in prompt" i still don't understand to use and/or cannot use because of prompt size.

0 replies

cmorlok · 2025-02-25T12:12:04Z

cmorlok
Feb 25, 2025

Thanks for figuring out how to use DeepSeek. I have created #381 to add support for DeepSeek (and others) on Ollama by adding support for structured output.

However, I am not happy with the results. I have tried the same document a couple of times. 2 out of 10 times the results are garbage, values are english (instead of german) and even the date is in a wrong format (document says 31.01.2025, DeepSeek returns 3101-02-25). In 2 out of 10 times it is OK, but not good, and in the remaining 6 times, it is perfect. No idea why, it is always the exact same prompt. Similar things happen with other models as well, but never as much as with DeepSeek. Might be a problem that all of my documents are german.

0 replies

mamema · 2025-02-25T17:03:04Z

mamema
Feb 25, 2025

no i guess thats AI like it is today. Even ChatGPT is like that sometimes

1 reply

cmorlok Feb 25, 2025

I am thinking about adding validation step, asking the AI whether the answer fulfills the required structure and content and if not, try again

haldi4803 · 2025-07-19T18:43:02Z

haldi4803
Jul 19, 2025

Mediocre use of existing Tags, created lots of new ones and mixed languages

I can Confirm this. Holy crap Google Gemini was already freespirited in that regard, but this is WAY over the top.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

How to use local DeepSeek R1 #272

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 5 comments 2 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

How to use local DeepSeek R1 #272

Uh oh!

Uh oh!

TomSeestern Feb 2, 2025

How to

Settings: Paperless Assistant

AI Configuration

Prompt

Settings LM Studio

My Server settings:

Inference settings

Load Settings

Replies: 5 comments · 2 replies

Uh oh!

mamema Feb 4, 2025

Uh oh!

TomSeestern Feb 5, 2025 Author

Uh oh!

mamema Feb 5, 2025

Uh oh!

cmorlok Feb 25, 2025

Uh oh!

mamema Feb 25, 2025

Uh oh!

cmorlok Feb 25, 2025

Uh oh!

haldi4803 Jul 19, 2025

TomSeestern
Feb 2, 2025

Replies: 5 comments 2 replies

mamema
Feb 4, 2025

TomSeestern Feb 5, 2025
Author

mamema
Feb 5, 2025

cmorlok
Feb 25, 2025

mamema
Feb 25, 2025

haldi4803
Jul 19, 2025