Replies: 1 comment
This is obsolete, given the session interface.
Currently the inference API is defined by named inference schemas, with "llama" being different from "whisper". In this particular case the difference makes sense, but the borders become fuzzier once we start talking about model classes such as "text-to-text LLM", "TTS model", etc.
So instead of differentiating at the root level, I propose we define instance interfaces. A model would then support instance interfaces rather than a particular schema, and a loader could be chosen by an instance interface, say "chat" or "tts".
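To make the idea concrete, here is a minimal sketch of interface-keyed loader selection. All names here (`ModelDescriptor`, `pick_loader`, the `"chat"`/`"tts"` strings) are hypothetical and only illustrate the shape of the proposal, not any existing API:

```python
from dataclasses import dataclass

@dataclass
class ModelDescriptor:
    name: str
    interfaces: set  # instance interfaces the model supports, e.g. {"chat"}

# Loaders are registered per instance interface, not per root-level
# schema name ("llama", "whisper", ...).
LOADERS = {}

def register_loader(interface):
    def wrap(fn):
        LOADERS[interface] = fn
        return fn
    return wrap

@register_loader("chat")
def load_chat(model):
    return f"chat loader for {model.name}"

@register_loader("tts")
def load_tts(model):
    return f"tts loader for {model.name}"

def pick_loader(model, interface):
    # A loader is chosen by the interface the model implements,
    # not by a model-specific schema.
    if interface not in model.interfaces:
        raise ValueError(f"{model.name} does not implement {interface!r}")
    return LOADERS[interface](model)
```

Under this scheme, a "llama"-family model and any other text-to-text model would both load through the `"chat"` interface, e.g. `pick_loader(ModelDescriptor("some-llm", {"chat"}), "chat")`.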
This has a lot of implications for schema generation and the associated codegen. Much more specification is needed.