Agent prompt suggestions & chat summary #15427

colin-grant-work · 2025-04-08T20:10:10Z

What it does

This PR adds a number of capabilities to the AI chat system discussed in #15094:

The ability for particular agents to contribute suggestions to the chat, which are rendered above the input.

At the moment, these are a static part of the agent's declaration. They could be implemented as a push system (agent fires event when suggestions changed) or pull system (agent can implement a method that is passed, e.g. a ChatSession and returns suggestions).

A new context variable type: session summaries, along with facilities to generate summaries, resolve summary variables, and resolve a set of summaries as part of the system prompt of a given agent.
The ability to open editors to view the values of context variables.

I'm not 100% sold on this feature. For one thing, it's a bit redundant for files. For another, at the moment, the content would be basically static and reflect the value of the variable at the time that it was resolved (the time at which the editor was opened). It might be possible to use the resolution context to provide updates when the variable service resolves a variable - e.g. if the same session is involved - but the way things are declared at the moment, it's hard to use the resolution context in a generic way.

Closes #15094

To do:

2.1: I would introduce this in the default renderer, i.e. actions in the text plus optionally additional actions that are shown under "more" in a popup. Follow-up created: Theia AI - Epic - next #15068
show progress when creating the summary (or create it earlier).
I think I'm watching for file changes but not doing anything when they happen - implement updates if files are deleted / modified on disk.
Maybe hook up response rendering system instead of current (almost) direct markdown?

How to test

Have a conversation with an agent.
Start a new chat.

If your first chat was with Coder, it should give you two suggestions about starting a new chat, or starting a new chat with a summary of the current chat. If you choose the latter, it should automatically add a context variable.

You can add a summary of the first chat to the current chat in a couple of ways:

it can be added to the context using the + in the bottom left of the input. Choose 'Session Summary' and then the session you're interested in.
it can be added to a message using #session-summary:<id> - you should get auto-completion.

If you add a session summary, it should be included in the message to the LLM - check the history.

There's bit of a UX hiccup here. It takes several seconds to get the AI-generated summary for the first time (or to generate a new one if the session to be summarized adds messages). Only once that interaction is complete do we initiate the request that required the summary. Perhaps we should show the user message in the UI before we've parsed all of its variables, and then show the progress message until we start getting content back for the main request?

Separately, if you add any kind of variable to the context, you should be able to click on the element in the context display and see the resolved value of that variable, e.g. the summary it provides of a given chat session.

Follow-ups

Breaking changes

This PR introduces breaking changes and requires careful review. If yes, the breaking changes section in the changelog has been updated.

Attribution

Review checklist

As an author, I have thoroughly tested my changes and carefully followed the review guidelines

Reminder for reviewers

As a reviewer, I agree to behave in accordance with the review guidelines

eneufeld · 2025-04-09T07:23:41Z

I tested this. The ui looks nice and I think this is a great improvement.

This was my test:

initial prompt: @Coder suggets improvement for #currentRelativeFilePath
After the answer (which contained reading the file and a changeset) I then started a new chat with a summary (the load took a bit without a progress what was concerning)
Looking at the context, it did not contain any of the file references
Then I prompted: how do we continue? and it searched the workspace to find the file from the first prompt but only based on the class name it had in the summary.

Based on this I would suggest:

resolve variables in the summary
show progress when creating the summary

JonasHelming · 2025-04-09T12:06:08Z

Quick tested this, looks cool, but i want to dig into it more. We should collect all feedback before iterating on it, but thanks Colin for preparing this!

packages/ai-ide/src/browser/coder-agent.ts

packages/ai-chat/src/browser/session-summary-variable-contribution.ts

packages/ai-chat/src/common/chat-session-summary-agent.ts

planger

First of all, I just want to say how much I like these powerful features added here—really great work!

A couple of thoughts from my side:

Suggestions Handling: I feel the suggestions are conceptually quite close to the chat session’s change set in their lifecycle, how they are created and updated, and also how they are shown to the user (as a dedicated area near the input, in reference to a chat session). So I feel like it might be better to manage these suggestions in a similar manner, i.e. directly within the chat session itself. That way, the agent or tools could update them dynamically based on the current context—like proposing follow-up questions or suggesting the user start a new task once the session only after it reached a certain length. That flexibility could really open up a lot of additional use cases.

UI Representation: I agree with Jonas—it’d be great to support going beyond markdown and support richer UI in suggestions area. That said, this raises some interesting questions. If we keep suggestions with the chat session and have them filled by agents or tools, it might feel awkward to directly create React nodes. So far, agents rather produce chat response nodes, and those are transferred to the UI using dedicated renderers. Should we stick with that model or would that be a misuse of the chat response infrastructure we already have?

TaskContextService: I also like the idea of pulling out the chat task management logic and persistence into a separate service and the summarization into a specific agent. That abstraction would make creating summaries and managing task contexts accessible not only through variables but also through a more direct API to agents and potentially other consumers. This feels like a good move for configurability of the summarization (LLMs and prompts) and reusability of reading or creating tasks.

Thanks again for the great work!

colin-grant-work · 2025-04-18T18:52:07Z

Should we stick with that model or would that be a misuse of the chat response infrastructure we already have?

I think that that is the closest analogue we have if we decide we want to allow LLM's to contribute suggestions of their own, rather than hardcoding them on specific agents. Then it wouldn't be an abuse of the system, but just the system: if the LLM is generating them, then they are just responses :-). For now, though, I haven't provided a mechanism for the LLM's to suggest things, but only for agents to provide suggestions. How they get them is left open, though.

packages/ai-ide/src/common/coder-replace-prompt-template.ts

packages/ai-ide/src/browser/context-session-summary-variable.ts

packages/ai-chat/src/browser/session-summary-variable-contribution.ts

JonasHelming · 2025-04-18T21:34:59Z

You can click now on any context variable, e.g. also instances of the file variable. However, for the file variable, it opens an empty read only editor for me. This is unexpected, it should open the underlying file instead.
As this behavior is not really part of this feature, I am also perfectly fine with restricting the click to the new taskContext variable, only.

packages/ai-chat/src/browser/ai-chat-preferences.ts

JonasHelming · 2025-04-18T21:40:17Z

Is there a reason why we make the editor for tastContexts read only for the case that there is no underlying file? Could we easily allow the user to modify it in memory instead? This can for sure be a follow-up, but we should capture it.

JonasHelming · 2025-04-18T21:45:11Z

If persistence for taskContexts is set-up, the in memory version is still kept, so you have two versions of the same taskContex then.

Create a task context (via Coder suggestion click)
Resolve it (e.g. via saying something in a new chat
type "#session-summary:" => You see two version of the tastContext, the file and the in-memory one

=> If persistence is active, we should fully get rid of the in memory ones I think

JonasHelming · 2025-04-18T21:46:29Z

When I click on a taskContext variable in the chat which points to an underlying file, it opens a read only editor. This is unexpected, it should open the underlying file.

packages/ai-ide/src/browser/coder-agent.ts

JonasHelming · 2025-04-18T21:51:00Z

packages/ai-ide/src/browser/coder-agent.ts

+    }
+    async suggest(context: ChatSession | ChatRequestModel): Promise<void> {
+        const model = ChatRequestModel.is(context) ? context.session : context.model;
+        const session = this.chatService.getSessions().find(candidate => candidate.model.id === model.id);


@planger Is this the expected way to get sessions now?

@planger Ping

I think in this case we don't even need to retrieve the actual chat session, because model.id should always be equivalent to the ChatSession.id (see chat-serivce.ts:187).

In general, we do have a distinction there:

The ChatModel encompasses all information on a conversation (also referred to as session in the chat model, e.g. ChatRequestModel.session).

The Chat Service wraps each ChatModel into another object, called ChatSession, which also adds more UI-related state relevant for the chat service, such as whether the session is currently active in the chat view, the currently pinned agent, last interaction, etc.

But the IDs should always be the same.

because model.id should always be equivalent to the ChatSession.id (see chat-serivce.ts:187).
... But the IDs should always be the same.

This is (currently) true (for our implementation of ChatSessions), but not guaranteed by the interfaces / API, so I'd prefer to keep the structure that matches ID's on elements that are supposed to be the same.

The alternative is the getSession API, which does the same iteration, so this isn't much less efficient:

theia/packages/ai-chat/src/common/chat-service.ts

Lines 180 to 182 in 505c885

getSession(id: string): ChatSessionInternal | undefined {

return this._sessions.find(session => session.id === id);

}

Alternatively, we could add a reference to the session on the model, or an API on the chat service to retrieve a session given a model?

colin-grant-work · 2025-04-18T21:56:45Z

When I click on a taskContext variable in the chat which points to an underlying file, it opens a read only editor. This is unexpected, it should open the underlying file.

We can do this, but it would require different services. At the moment, we create one kind of resource and then populate that using the variable resolution system, which returns text. The editor is readonly (and not hooked up to language services, etc.) because the variable resolution system doesn't know anything except that it can get a string. If we want to open the underlying file, then we have to add to the variable resolver interface the possibility of opening a variable so that resolver itself can decide whether to use a variable-specific resource or a standard URI. Would you like that as part of this PR?

Is there a reason why we make the editor for tastContexts read only for the case that there is no underlying file? Could we easily allow the user to modify it in memory instead? This can for sure be a follow-up, but we should capture it.

The answer is the same for this: we're not opening the file, we're retrieving the value of a variable.

packages/ai-ide/src/browser/coder-agent.ts

packages/ai-chat-ui/src/browser/chat-view-commands.ts

JonasHelming · 2025-04-18T22:10:37Z

When I click on a taskContext variable in the chat which points to an underlying file, it opens a read only editor. This is unexpected, it should open the underlying file.

We can do this, but it would require different services. At the moment, we create one kind of resource and then populate that using the variable resolution system, which returns text. The editor is readonly (and not hooked up to language services, etc.) because the variable resolution system doesn't know anything except that it can get a string. If we want to open the underlying file, then we have to add to the variable resolver interface the possibility of opening a variable so that resolver itself can decide whether to use a variable-specific resource or a standard URI. Would you like that as part of this PR?

Is there a reason why we make the editor for tastContexts read only for the case that there is no underlying file? Could we easily allow the user to modify it in memory instead? This can for sure be a follow-up, but we should capture it.

The answer is the same for this: we're not opening the file, we're retrieving the value of a variable.

I see! @planger WDYT about this? I think we will want this anyways, also for the file variable, right?

packages/ai-chat-ui/src/browser/ai-chat-ui-contribution.ts

packages/ai-chat/src/common/chat-session-summary-agent-prompt.ts

tsmaeder · 2025-04-25T14:41:09Z

packages/core/src/common/resource.ts

@@ -224,29 +224,54 @@ export class DefaultResourceProvider {

 }

+export type ResourceInitializationOptions = Pick<Resource, 'autosaveable' | 'initiallyDirty' | 'readOnly'>


This is hard to understand as opposed to a simple interface type.

It's also guaranteed to maintain the correct type if anything changes in Resource. For example, readOnly had changed from boolean to boolean | MarkdownString since the original declaration was written, and the declaration hadn't kept up.

tsmaeder · 2025-04-25T14:42:54Z

packages/core/src/common/resource.ts

-    protected contents: string = '';
+    protected contents: string | Promise<string>;
+
+    constructor(readonly uri: URI, protected options?: ResourceInitializationOptions) { }


This makes all of the fields in this class effectively mutable.

I consider that an advantage, given that it does have Mutable in its name. You could consider these changes an answer to the objection that previously, the only thing 'mutable' about a MutableResource was its content and not any of the other fields on a Resource, when there are good UX grounds for wishing for an in-memory resource that can do whatever other resources can do. Why not be able to mark it readonly? Why not be able to say why it's readonly? Why not be able to say that it's initially dirty? Why not be able to customize its save behavior without having to create a new resource class (and so new resource resolver) for every particular use case?

sdirix

Thanks for the great work. All comments from mine besides the regression mentioned below could be handled in follow ups if the time until release runs out.

I had a quick look through the code however I did not check every line in detail. I tried breaking the functionality but was not able too.

Architecturally I understand that each agent can add own suggestions. However it's a bit weird to me that such a generic suggestion like we have now for the Coder is not handled generically too.

Could be a good follow up.

Blocker regression: The edit chat button UI is broken because the input is not shown anymore.

Minor: The pointer is shown over the whole text instead of only the actions:

packages/ai-core/src/common/ai-variable-resource.ts

packages/ai-chat-ui/src/browser/ai-chat-ui-contribution.ts

packages/ai-chat-ui/src/browser/chat-view-commands.ts

packages/ai-chat-ui/src/browser/ai-chat-ui-contribution.ts

sdirix · 2025-04-25T20:20:50Z

packages/ai-chat/src/common/chat-session-summary-agent.ts

+import { CHAT_SESSION_SUMMARY_PROMPT } from './chat-session-summary-agent-prompt';
+
+@injectable()
+export class ChatSessionSummaryAgent extends AbstractStreamParsingChatAgent implements ChatAgent {


The ChatSessionSummaryAgent should not be a chat agent as it is not intended to be addressed by the user within the chat. Instead it should just be a regular agent like the chat naming agent.

Removed tag.

I think we should not even implement the ChatAgent interface

packages/ai-chat/src/browser/task-context-service.ts

colin-grant-work · 2025-04-25T23:36:15Z

Blocker regression: The edit chat button UI is broken because the input is not shown anymore.

Fixed the initialization of the new AIChatTreeInputWidget so that the error causing the apparent disappearance wouldn't occur.

Minor: The pointer is shown over the whole text instead of only the actions:

Tweaked the check here so that it wouldn't appear on non-callback suggestions.

sdirix

Works for me! Thanks for the work!

Potential follow up:
The agent can now be successfully disabled, however the UI does not react.

If the user tries to trigger a summary, they do not get an error shown, so they don't know what is going wrong (maybe they forgot that they disabled the agent)
In case the agent is disabled, we could hide the "start new chat with summary" text in case the summary does not exist yet (as it will not be creatable)
In case the agent is disabled, we could hide the "summary chat" action in the toolbar (but keep the "show summary one)

sdirix · 2025-04-27T19:45:04Z

packages/ai-chat/src/common/chat-session-summary-agent.ts

+import { CHAT_SESSION_SUMMARY_PROMPT } from './chat-session-summary-agent-prompt';
+
+@injectable()
+export class ChatSessionSummaryAgent extends AbstractStreamParsingChatAgent implements ChatAgent {


I think we should not even implement the ChatAgent interface

And trigger lint, maybe?

github-project-automation bot added this to PR Backlog Apr 8, 2025

github-project-automation bot moved this to Waiting on reviewers in PR Backlog Apr 8, 2025

colin-grant-work force-pushed the feature/agent-suggestions branch 4 times, most recently from d58e8c2 to b249a03 Compare April 8, 2025 21:13

JonasHelming reviewed Apr 11, 2025

View reviewed changes

planger self-requested a review April 14, 2025 16:12

planger reviewed Apr 14, 2025

View reviewed changes

JonasHelming mentioned this pull request Apr 16, 2025

Add task management functionality to CoderAgent #15094

Closed

2 tasks

colin-grant-work force-pushed the feature/agent-suggestions branch from b249a03 to 8ffb092 Compare April 18, 2025 03:57