feat: add new generic AI communication model #15409
Conversation
Force-pushed from 1c27668 to a06f488
Thank you very much Stefan!
This already looks very good. It is great to get rid of the clunky manual recording, and this should be a solid foundation for extracting all sorts of useful data for testing, simulation, etc.
I was wondering how we reflect function calls. I don't see them being recorded at the moment, right?
The remaining comments I'd have are related to naming:
AiSession, AiSemanticRequest, AiRequest
I'm not sure "Ai" is a great prefix. Essentially it is a record of a LanguageModelRequest or session. Also I find the use of the term "Semantic" a bit unclear.
Here are a few ideas for switching to a different prefix than "Ai" and avoiding "semantic":
- AiSession -> RawLanguageModelSession or LanguageModelInteraction
- AiSemanticRequest -> ClientRequest (the original request, for tracing raw language model requests to a client event)
- AiRequest -> RawLanguageModelRequest
Thank you!
Tool calls, including their IDs, parameters, and results, are recorded as part of the responses; see here.
With "semantic request" I wanted to express that it represents what the user might think of as a single request. I'll think further about the naming. Thanks for the feedback!
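For illustration, a recorded response carrying tool calls could look roughly like this (a minimal sketch; the type and field names here are hypothetical, not the actual Theia API):

```typescript
// Hypothetical shapes sketching how tool calls could be recorded
// as part of a response (names are illustrative, not Theia's API).
interface RecordedToolCall {
    id: string;         // tool-call id assigned by the model
    name: string;       // tool/function that was invoked
    arguments: string;  // serialized parameters
    result?: string;    // serialized tool result, once available
}

interface RecordedResponse {
    text?: string;
    toolCalls?: RecordedToolCall[];
}

// Example: a response that invoked one tool and then answered.
const response: RecordedResponse = {
    text: "The file contains 42 lines.",
    toolCalls: [{
        id: "call_1",
        name: "readFile",
        arguments: JSON.stringify({ path: "README.md" }),
        result: "42",
    }],
};
```

Recording the id alongside parameters and result lets a consumer correlate each tool invocation with the model turn that requested it.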
Thanks! Yes, that makes a lot of sense to capture. It could also be any client, such as a service, though, right? Therefore I was leaning towards
Adds a new AI communication model for tracking raw LLM requests and responses. Tracking is automated via the language-model-service. This model serves as the foundation for:
- Extracting communication history from Theia AI, allowing LLM testing without running the full Theia-based application
- Replaying LLM communication without using an actual LLM, enabling deterministic integration tests
- Removing the clunky communication recording service, making the ai-history package optional

Resolves eclipse-theia#15221
Contributed on behalf of STMicroelectronics
Force-pushed from a06f488 to 9bd59e9
Hi @planger, thanks for your input! I adapted the naming a bit and split the PR into two PRs:
Please check whether you like it.
Excellent, thank you! This looks very clean and useful to me. I tested it with #15540
/**
 * An exchange unit representing a logical operation which may involve multiple model requests.
 */
export interface LanguageModelExchange {
Just a nitpick: I am OK with LanguageModelExchange, but I was wondering what you think about LanguageModelCompletion as a more direct term for what actually happens (i.e. an LLM request completion). "Exchange" sounds a bit bidirectional to me and not a perfect fit.
The exchange consists of request-response pairs, so it feels bidirectional to me. Personally I like "exchange" better than "completion". I would have never guessed that a "LanguageModelCompletion" is a set of request-response pairs, unrelated to "normal" LLM completion.
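To make the bidirectionality concrete, an exchange bundling several request-response pairs could look like this (a sketch with hypothetical field names around the LanguageModelExchange interface quoted earlier; not the actual PR code):

```typescript
// Hypothetical sketch: an exchange groups the request-response
// pairs that together make up one logical operation.
interface RecordedPair {
    request: { messages: string[] };
    response: { text: string };
}

interface LanguageModelExchange {
    id: string;
    requests: RecordedPair[]; // may involve multiple model requests
}

// One logical user action that internally issued two model requests.
const exchange: LanguageModelExchange = {
    id: "exchange-1",
    requests: [
        { request: { messages: ["Summarize the diff"] }, response: { text: "Summary: ..." } },
        { request: { messages: ["Refine the summary"] }, response: { text: "Refined summary." } },
    ],
};
```

Each entry pairs an outgoing request with its incoming response, which is the bidirectional flow "exchange" is meant to convey.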
What it does
Adds a new AI communication model for tracking raw LLM requests and responses. Tracking is automated via the language-model-service.
This model serves as the foundation for:
- Extracting communication history from Theia AI, allowing LLM testing without running the full Theia-based application
- Replaying LLM communication without using an actual LLM, enabling deterministic integration tests
- Removing the clunky communication recording service, making the ai-history package optional

Resolves #15221
Remaining tasks:
- Adapt History View UI
- Track timestamps and response timestamps differently
- Do not highlight all semantic requests
- Show logged streamed responses in a nicer UI
- Remove communication service, adapt all affected code and make ai-history package optional
- Update changelog for breaking change

Moved the remaining tasks into a new PR (#15540) to make the PRs smaller.
How to test
Debug the language-model-service.
Alternatively you can check the follow-up PR (#15540) to see the new model integrated in the AI History view.
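As a rough illustration of the replay idea from the description, a recorded exchange could back a stub language model in tests (hypothetical names and shapes; not the actual Theia interfaces):

```typescript
// Hypothetical replay stub: answers requests from a recording
// instead of calling a real LLM, so integration tests stay
// deterministic (names are illustrative, not Theia's API).
type Recording = Map<string, string>; // prompt -> recorded response

class ReplayLanguageModel {
    constructor(private readonly recording: Recording) {}

    request(prompt: string): string {
        const recorded = this.recording.get(prompt);
        if (recorded === undefined) {
            throw new Error(`No recorded response for prompt: ${prompt}`);
        }
        return recorded;
    }
}

// Usage in a test: the same prompt always yields the same answer.
const model = new ReplayLanguageModel(new Map([["Hello", "Hi there!"]]));
const answer = model.request("Hello");
```

Because the stub never contacts a real model, a test exercising the surrounding application logic produces identical results on every run.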
Follow-ups
Breaking changes
Attribution
Contributed on behalf of STMicroelectronics