sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_tool_call_accuracy/tool_call_accuracy.prompty
+3 -3 (3 additions & 3 deletions)
@@ -126,7 +126,7 @@ Your output should consist only of a JSON object, as provided in the examples, t
 - additional_details: a dictionary that contains the following keys:
   - tool_calls_made_by_agent: total number of tool calls made by the agent
   - correct_tool_calls_made_by_agent: total number of correct tool calls made by the agent
-  - per_tool_details: a list of dictionaries, each containing:
+  - details: a list of dictionaries, each containing:
     - tool_name: name of the tool
     - total_calls_required: total number of calls required for the tool
     - correct_calls_made_by_agent: number of correct calls made by the agent
@@ -135,12 +135,12 @@ Your output should consist only of a JSON object, as provided in the examples, t
     - tool_success_result: 'pass' or 'fail' based on the evaluation of the tool call accuracy for this tool
   - excess_tool_calls: a dictionary with the following keys:
     - total: total number of excess, unnecessary tool calls made by the agent
-    - per_tool_details: a list of dictionaries, each containing:
+    - details: a list of dictionaries, each containing:
       - tool_name: name of the tool
       - excess_count: number of excess calls made for this query
   - missing_tool_calls: a dictionary with the following keys:
     - total: total number of missing tool calls that should have been made by the agent to be able to answer the query
-    - per_tool_details: a list of dictionaries, each containing:
+    - details: a list of dictionaries, each containing:
       - tool_name: name of the tool
       - missing_count: number of missing calls for this query
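
Note for consumers of this output: the rename from per_tool_details to details happens at three places in the additional_details object, so stored results written under the old prompt will no longer match the new key. The snippet below is a minimal sketch of one way to bring an older payload in line with the new naming. It is not part of the SDK. The helper name rename_detail_keys, the tool name fetch_weather, the sample values, and the nesting of the payload (read off the key descriptions in the prompt) are all assumptions made for illustration.

```python
from typing import Any

OLD_KEY = "per_tool_details"  # key name before this change
NEW_KEY = "details"           # key name after this change


def rename_detail_keys(node: Any) -> Any:
    """Return a copy of node with every OLD_KEY dictionary key renamed to NEW_KEY.

    Hypothetical helper, not an SDK function. It walks nested dicts and lists
    and never mutates its input. If a dict somehow holds both keys, whichever
    appears later in iteration order wins.
    """
    if isinstance(node, dict):
        renamed = {}
        for key, value in node.items():
            target = NEW_KEY if key == OLD_KEY else key
            renamed[target] = rename_detail_keys(value)
        return renamed
    if isinstance(node, list):
        return [rename_detail_keys(item) for item in node]
    return node


# Illustrative payload in the old shape; all values are made up.
legacy_additional_details = {
    "tool_calls_made_by_agent": 3,
    "correct_tool_calls_made_by_agent": 2,
    "per_tool_details": [
        {
            "tool_name": "fetch_weather",  # hypothetical tool
            "total_calls_required": 2,
            "correct_calls_made_by_agent": 2,
            "tool_success_result": "pass",
        }
    ],
    "excess_tool_calls": {
        "total": 1,
        "per_tool_details": [{"tool_name": "fetch_weather", "excess_count": 1}],
    },
    "missing_tool_calls": {"total": 0, "per_tool_details": []},
}

migrated = rename_detail_keys(legacy_additional_details)
```

After the call, migrated carries details at the top level and inside both excess_tool_calls and missing_tool_calls, while legacy_additional_details is left unchanged. A recursive walk is used here so that all three occurrences are handled without hard-coding their paths.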