langchain, openai, and smolagents all produce slightly different trace formats, even though they are all instrumented with openinference and exported with the JSONExporter.
To correctly parse specific information (e.g. tool use, input/output messages, etc.), we need to standardize the outputs before feeding them into the evaluation harness.
Build a class that loads the JSON trace from openai, langchain, or smolagents and normalizes it. Use this normalized output when feeding into the evaluation code. A rough sketch of the idea is below.
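A minimal sketch of what that class could look like. Everything here is illustrative, not confirmed against the actual exporter output: the `NormalizedSpan` schema and method names are invented, and it assumes the file holds a JSON array of span dicts with an `openinference.span.kind` attribute.

```python
import json
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class NormalizedSpan:
    """Framework-agnostic view of one span (hypothetical schema)."""
    name: str
    kind: str  # e.g. "LLM", "TOOL", "CHAIN"
    attributes: Dict[str, Any] = field(default_factory=dict)


class TraceNormalizer:
    """Load a JSONExporter trace file and map it to NormalizedSpan objects."""

    SUPPORTED = {"openai", "langchain", "smolagents"}

    def __init__(self, framework: str):
        # The raw files don't reliably identify their producer, so the
        # caller has to name the framework explicitly.
        if framework not in self.SUPPORTED:
            raise ValueError(f"unsupported framework: {framework}")
        self.framework = framework

    def load(self, path: str) -> List[NormalizedSpan]:
        # Assumes the exporter wrote a JSON array of span dicts; adjust if
        # the file is actually line-delimited JSON.
        with open(path) as f:
            raw_spans = json.load(f)
        parse = getattr(self, f"_parse_{self.framework}")
        return [parse(span) for span in raw_spans]

    # One hook per framework; today they are identical, but this is where
    # per-framework quirks would be absorbed.
    def _parse_openai(self, span: Dict[str, Any]) -> NormalizedSpan:
        return self._from_openinference(span)

    def _parse_langchain(self, span: Dict[str, Any]) -> NormalizedSpan:
        return self._from_openinference(span)

    def _parse_smolagents(self, span: Dict[str, Any]) -> NormalizedSpan:
        return self._from_openinference(span)

    def _from_openinference(self, span: Dict[str, Any]) -> NormalizedSpan:
        attrs = span.get("attributes", {})
        return NormalizedSpan(
            name=span.get("name", ""),
            kind=attrs.get("openinference.span.kind", "UNKNOWN"),
            attributes=attrs,
        )
```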
Digging into this now, and it appears that normalization is something openinference is already working on; the issues we are seeing are things openinference is working to fix.
What doesn't appear to be covered is that, because different agent frameworks have different processing steps, the actual traces and sequences of spans that come out are still completely different. If the span names and metadata are standardized, that will help us identify what each span is doing, but we still need to figure out how to use them.
The answer may not be to normalize the traces per se; instead, we may need normalized access functions.
For example, we could implement methods like retrieve_tool_calls or retrieve_span_name_sequence (see the sketch below). Trying to normalize the transcripts before accessing specific characteristics of the trace is looking like a mammoth task that is probably beyond this POC.