Sequence Flow
- User utterance is sent to the Director Supervisor
- Predict next step: The Director Supervisor reviews the current "world state":
  - Short-term memory: conversation history, e.g. {userMessage1: "Hello", AIMessage1: "Hi, how can I help?"}
  - Supervisors and agents: {taskSupervisor: "has a team of agents that execute tasks", financialAdvisorAgent: "is an expert in helping the user with their finances"}
  - Previously completed, pending, and failed tasks
  - Self-reflections & verifications
  It is then prompted with Language Agent Tree Search (LATS): we combine GPT-4 with Monte Carlo Tree Search to reason about what the next step should be.
- Select Supervisor: After the Director Supervisor predicts the next step, it is given the availability of other supervisor agents it can use and picks the most suitable one to handle the next step.
- Summarise conversation history: Based on the step 1 prediction and the supervisor the Director has picked, the complete chat history is "contextualised": only the most relevant messages are used for the final evaluation step.
- Evaluate & deliver: A final evaluation step is run to determine which agent function is executed to further achieve the user's goal.
Returning of task: The Director is passed a summary of the task it assigned to another supervisor and is told whether it was completed successfully. If not, a detailed review describes why and what is needed to succeed next time.
- If the task is completed successfully: this pattern loops until all tasks listed in the canvas are completed and the response is handed back to the user.
- If the task is not completed successfully: the Director passes the failed task and feedback to the automatic curriculum agent, which suggests a new, smaller, easier step to take. This is passed on to the quartermaster, which analyses the error and the new step/task and creates new agents or tools to fulfil it. Once complete, this is passed back to the Director, which once again chooses an agent to complete the task, with that agent's new tool at its disposal.
This concludes the responsibilities of the Director Supervisor.
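The Director Supervisor loop above can be sketched in code. This is a minimal illustration under assumptions, not the project's actual implementation: the type names, the stubbed prediction step (standing in for LATS/MCTS), and the keyword-based supervisor matching are all hypothetical.

```typescript
// Hypothetical sketch of the Director Supervisor loop described above.

interface WorldState {
  shortTermMemory: { role: string; content: string }[]; // conversation history
  supervisors: Record<string, string>;                  // name -> description
  completedTasks: string[];
  failedTasks: { task: string; feedback: string }[];
}

interface TaskResult { task: string; success: boolean; feedback?: string }

// Placeholder for the LATS/MCTS-backed prediction step.
function predictNextStep(state: WorldState): string {
  return state.failedTasks.length > 0 ? "retry-with-new-tool" : "execute-next-task";
}

// Pick a supervisor whose description mentions the step; fall back to taskSupervisor.
function selectSupervisor(state: WorldState, step: string): string {
  const match = Object.entries(state.supervisors).find(([, desc]) => desc.includes(step));
  return match ? match[0] : "taskSupervisor";
}

// Route the returned task: success loops onward; failure goes to the
// automatic curriculum agent / quartermaster path described above.
function handleResult(state: WorldState, result: TaskResult): void {
  if (result.success) {
    state.completedTasks.push(result.task);
  } else {
    state.failedTasks.push({ task: result.task, feedback: result.feedback ?? "unknown" });
  }
}
```

In a real system, `predictNextStep` and `selectSupervisor` would each be LLM calls; the point of the sketch is only the loop's data flow.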
Additional Context: We leverage GPT-3.5 to self-ask questions that provide additional context and increase the chance of task success. Each question is paired with a concept that is used to retrieve the most relevant document from the knowledge graph. We feed the document content to GPT-3.5 to answer the self-written questions.
Additional Context Prompt:
You are a helpful assistant that answers my question about {topic}.
I will give you the following information: Question: ... You will answer the question based on the context ( only if available and helpful ) and your own knowledge of {topic}.
- Start your answer with "Answer:".
- Answer "Answer: Unknown" if you don’t know the answer.
Automatic curriculum prompt: You are a versatile assistant tasked with guiding me through the next immediate step in accomplishing my goal related to [specific task or goal]. My ultimate ambition is to explore a broad array of [relevant field or activity] topics, master various skills, and become the top expert in [specific field or area]. To assist you, I will provide the following details: {'Question 1: ...': 'Answer: ...', 'Question 2: ...': 'Answer: ...'}, etc.
Tools available to the Supervisor & agents include: [list of tools]. Tasks completed so far include: [list of completed tasks].
Tasks deemed too challenging and left incomplete: [list of failed tasks]. You must adhere to the following criteria in your guidance:
- Act as a mentor, guiding me to the next task based on my learning progress and the information provided.
- Specify the knowledge I need to acquire or the steps needed to complete a task, including any relevant tools or methods.
- Propose the next task in a clear, concise format, such as "Gather [quantity] [item or information]", "Create [quantity] [item]", "Study [topic] for [duration]", etc. Issue one task at a time and avoid mentioning unrelated details.
- Ensure the task is achievable with my current resources and skill level to prevent overwhelming challenges.
- Each task should introduce a new and interesting element to avoid repetition and maintain engagement. Encourage exploration of uncommon resources, improvement of tools or skills, and discovery of new concepts.
- Repeat tasks only when necessary to accumulate resources or skills for more advanced objectives.
- Avoid suggesting tasks that limit exploration or creativity, such as constructing or remaining stationary for safety. The focus should be on expansion and discovery.
- Refrain from tasks that depend on real-time verification or visual confirmation, such as specific placements or constructions.
Focus on actions that are verifiable through the information provided.
RESPONSE FORMAT: Reasoning: Based on the details above, explain the rationale behind the suggested next task. Task: Clearly state the next task.
Example response: Reasoning: Considering your current knowledge in [field] and the resources at your disposal, it's important to build foundational skills. Task: Study [specific topic] for [duration].
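A small helper can assemble the curriculum prompt above from the world state. The field names and the exact assembly order are assumptions; only the template text comes from this document.

```typescript
// Illustrative builder for the automatic-curriculum prompt template above.

interface CurriculumInput {
  qa: Record<string, string>;   // "Question 1: ..." -> "Answer: ..."
  tools: string[];
  completedTasks: string[];
  failedTasks: string[];
}

function buildCurriculumPrompt(goal: string, input: CurriculumInput): string {
  return [
    `You are a versatile assistant tasked with guiding me through the next immediate step in accomplishing my goal related to ${goal}.`,
    `Context: ${JSON.stringify(input.qa)}`,
    `Tools available to the Supervisor & agents include: ${input.tools.join(", ")}.`,
    `Tasks completed so far include: ${input.completedTasks.join(", ")}.`,
    `Tasks deemed too challenging and left incomplete: ${input.failedTasks.join(", ")}.`,
  ].join("\n");
}
```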
Inspired by the delocalised processing of the human brain: this agent is a central coordinating agent, responsible for managing the overall process, including chat interactions, predicting the best next steps, and finding the best agent for executing specific tasks.
The process loops:
- Predict Next Step: For each iteration of the execution loop, DirectorAgent starts by making an informed prediction about what the best next step should be. The predictBestNextStep method takes three parameters:
- messages: An array of ChatMessage objects from working memory, representing the conversation history.
- previousAgent: An optional parameter representing the agent that was used in the previous step.
- terminationStr: An optional string used to signal the termination of the process.
- Select Best Agent: Based on this prediction, DirectorAgent selects a best-fit agent persona.
- Contextualize Chat History: Based on the prediction from step 1 and the agent persona from step 2, the complete chat history is "contextualized" and only the most relevant messages are used for the final evaluation step.
- Evaluate and Execute: A final evaluation step is run to determine what agent function is executed to try and further achieve the user's goal.
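The three parameters listed above suggest a signature along these lines. The `ChatMessage` and `Agent` shapes and the stub body are assumptions for illustration; only the parameter names come from the text.

```typescript
// Sketch of the DirectorAgent prediction interface described above.

interface ChatMessage { role: "user" | "assistant" | "system"; content: string }

interface Agent { name: string; description: string }

class DirectorAgent {
  predictBestNextStep(
    messages: ChatMessage[],           // conversation history from working memory
    previousAgent?: Agent,             // agent used in the previous step, if any
    terminationStr?: string,           // signals termination of the process
  ): string {
    const last = messages[messages.length - 1];
    // The termination string short-circuits the execution loop.
    if (terminationStr && last?.content.includes(terminationStr)) return "terminate";
    // A real implementation would call the LLM here; this stub routes by history.
    return previousAgent ? `continue-with-${previousAgent.name}` : "select-agent";
  }
}
```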
The PlannerAgent, inspired by the prefrontal cortex, is involved in planning and project management, with a focus on achieving set goals. It continually checks that the criteria for the goals set by the user have been met.
The process loops:
- GoalRequired: Is a goal required to answer the question? If so:
- GoalCheck: Does the user have clear goals that relate to the question in mind?
- ContextCheck: Do I need any further context to answer this question?
- IfContextMissing: Calls QuestionGeneratorAgent to find the missing context and generate the actions or questions needed.
- IfNotContextMissing: Either carries out the task with TaskAgent, or calls WorkingMemoryAgent and pulls the appropriate guidance or information.
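The decision chain above reduces to a few ordered checks. In the sketch below the boolean fields are stand-ins for what would really be LLM calls; the action names are illustrative labels mirroring the list.

```typescript
// Minimal sketch of the PlannerAgent decision chain described above.

interface PlannerState {
  goalRequired: boolean;   // GoalRequired
  hasGoal: boolean;        // GoalCheck
  contextMissing: boolean; // ContextCheck
}

type PlannerAction =
  | "answer-directly"
  | "elicit-goal"
  | "QuestionGeneratorAgent"
  | "TaskAgent-or-WorkingMemoryAgent";

function planNext(state: PlannerState): PlannerAction {
  if (!state.goalRequired) return "answer-directly";
  if (!state.hasGoal) return "elicit-goal";
  if (state.contextMissing) return "QuestionGeneratorAgent";   // IfContextMissing
  return "TaskAgent-or-WorkingMemoryAgent";                    // IfNotContextMissing
}
```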
Calls the NextBestQuestion function in Neo4j, which infers what information is missing from the graph and generates a list of questions to be answered. It pushes the question count to the front end and tells the user how many questions need to be asked.
The WorkingMemoryAgent reads and writes to the working memory:
- conversation context
- new web search information
- new user information
We take inspiration from how the brain stores events using "sparse representation": individual memories are represented by small sets of neurons spread across the hippocampus, rather than concentrated in local areas. In graph terms, this means storing only "essential" connections and relationships, rather than trying to link every piece of information with everything else, which is what would otherwise happen in the working memory. Our ambition is to represent each key concept or entity in the graph with a limited number of highly relevant connections, avoiding excessive interlinking that could dilute the significance of each connection. How this functions exactly is yet to be worked out.
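One possible reading of that ambition, sketched below: cap each node at a small number of highest-weight edges when writing back to the graph. The cap, the weight field, and the pruning rule are all assumptions; the document explicitly says this is not yet worked out.

```typescript
// Hypothetical "sparse representation" pruning: keep only the most relevant
// (highest-weight) connections per node, dropping the rest.

interface Edge { from: string; to: string; weight: number }

function pruneToSparse(edges: Edge[], maxPerNode: number): Edge[] {
  const byNode = new Map<string, Edge[]>();
  for (const e of edges) {
    const list = byNode.get(e.from) ?? [];
    list.push(e);
    byNode.set(e.from, list);
  }
  // Sort each node's edges by descending weight and keep the top maxPerNode.
  return [...byNode.values()].flatMap((list) =>
    list.sort((a, b) => b.weight - a.weight).slice(0, maxPerNode),
  );
}
```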
We're unsure whether few-shot prompting will be enough to "clean up" the graph after a conversation has ended.
ProspectiveAgent: Reads and writes to prospective memory for appointments, dates or events that are due to happen in the future.
CsvAnalystAgent: This agent is designed to perform various operations on CSV data, such as adding columns, filtering rows, joining by column, ordering columns, sorting by column, summing up column values, and writing files.
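Two of the listed operations (filtering rows and summing a column) are illustrated below with a plain row representation. The real agent's API is not shown in this document; these helper names and signatures are assumptions.

```typescript
// Illustrative versions of two CsvAnalystAgent operations named above.

type Row = Record<string, string | number>;

// Filter rows by an arbitrary predicate.
function filterRows(rows: Row[], predicate: (r: Row) => boolean): Row[] {
  return rows.filter(predicate);
}

// Sum the numeric values of one column, treating missing cells as 0.
function sumColumn(rows: Row[], column: string): number {
  return rows.reduce((total, r) => total + Number(r[column] ?? 0), 0);
}
```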
DeveloperAgent: This agent seems to be focused on development-related tasks, including reading and writing files, and reading directories. It also has commented-out functions for running Pytest and initializing Poetry, indicating potential capabilities for Python development workflows.
ResearcherAgent: This agent is equipped for conducting web research, scraping text, and handling file operations (reading and writing files, reading directories).
SynthesizerAgent: The SynthesizerAgent is focused on synthesizing or combining information, supported by file reading and writing capabilities.
Coming soon
Coming Soon
Uses an inference engine (yet to be decided, could be as simple as a property check) that checks for the goal and decides what information is needed before being able to provide recommendations, options, advice or carry out a task.
Next Best Question contenders:
- DeepMind's AMIE
When a new conversation starts, a copy of the episodic/semantic graph is made to store the conversation, tasks, etc. It is also used to infer and carry out most of the functions below. Once the conversation has finished, the conversation and any actions taken are summarised; if new nodes and relationships have been made, the graph is then restructured and formatted back into long-term memory (see long-term processing) to be more suitable and efficient for later use.
Stores: Workspace, Tasks, and current conversation chain.
Episodic memory is needed to recall past events – recent or distant. You use episodic memory when you remember personal experiences, such as what you had for lunch or when you attended a family gathering. Semantic Memory is needed to recall facts.
Used to remember appointments, dates or events that are due to happen in the future.
- Content
- ContentSource
- LastSourced
More to come!