Anybody tried LLMLingua ? #4711

x4080 · 2023-12-31T04:54:53Z

x4080
Dec 31, 2023

https://github.com/microsoft/LLMLingua/tree/main

It tried to compress prompt and document so can be much smaller, I just wondered if we can use it with llama cpp (gguf)

Michael-F-Ellis · 2024-01-06T18:34:58Z

Michael-F-Ellis
Jan 6, 2024

Seems really interesting. Microsoft developed it as a way to save $ when interacting with paid servers like OpenAI, but I'm wondering if it could be beneficial for llama.cpp. MS claim the method can achieve prompt compressions of up to 20x and get ~same response from the LLM. I'd certainly love to have 160K effective context for a mistral model!

0 replies

mirek190 · 2024-01-06T19:35:23Z

mirek190
Jan 6, 2024

Can you imagine? Current 32K context models upgraded to 640K ..wow.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Anybody tried LLMLingua ? #4711

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Anybody tried LLMLingua ? #4711

Uh oh!

x4080 Dec 31, 2023

Replies: 2 comments

Uh oh!

Michael-F-Ellis Jan 6, 2024

Uh oh!

mirek190 Jan 6, 2024

x4080
Dec 31, 2023

Michael-F-Ellis
Jan 6, 2024

mirek190
Jan 6, 2024