Spell check fixes

Salma Elshafey · Salma Elshafey · commit eaf493af6cfb · 2025-06-24T13:26:09.000+03:00
diff --git a/sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_tool_call_accuracy/_tool_call_accuracy.py b/sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_tool_call_accuracy/_tool_call_accuracy.py
@@ -26,7 +26,7 @@ class ToolCallAccuracyEvaluator(PromptyEvaluatorBase[Union[str, float]]):
     The evaluator uses a scoring rubric of 1 to 5:
         - Score 1: The tool calls are irrelevant
         - Score 2: The tool calls are partially relevant, but not enough tools were called or the parameters were not correctly passed
-        - Score 3: The tool calls are relevant, but there were unncessary, excessive tool calls made
+        - Score 3: The tool calls are relevant, but there were unnecessary, excessive tool calls made
         - Score 4: The tool calls are relevant, but some tools returned errors and agent retried calling them again and succeeded
         - Score 5: The tool calls are relevant, and all parameters were correctly passed
 
diff --git a/sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_tool_call_accuracy/tool_call_accuracy.prompty b/sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_tool_call_accuracy/tool_call_accuracy.prompty
@@ -43,7 +43,7 @@ user:
 # Ratings
 ## [Tool Call Accuracy: 1] (Irrelevant)
 **Definition:**
-Tool calls were not relevant to the user's query, resulting in anirrelevant or unhelpful final output.
+Tool calls were not relevant to the user's query, resulting in an irrelevant or unhelpful final output.
 This level is a 'fail'.
 
 **Example:**
@@ -122,7 +122,7 @@ TOOL DEFINITION: {{tool_definition}}
 Your output should consist only of a JSON object, as provided in the examples, that has the following keys:
   - chain_of_thought: a string that explains your thought process to decide on the tool call accuracy level. Start this string with 'Let's think step by step:', and think deeply and precisely about which level should be chosen based on the agent's tool calls and how they were able to address the user's query.
   - tool_calls_success_level: a integer value between 1 and 5 that represents the level of tool call success, based on the level definitions mentioned before. You need to be very precise when deciding on this level. Ensure you are correctly following the rating system based on the description of each level.
-  - tool_calls_sucess_result: 'pass' or 'fail' based on the evaluation level of the tool call accuracy. Levels 1 and 2 are a 'fail', levels 3, 4 and 5 are a 'pass'.
+  - tool_calls_success_result: 'pass' or 'fail' based on the evaluation level of the tool call accuracy. Levels 1 and 2 are a 'fail', levels 3, 4 and 5 are a 'pass'.
   - additional_details: a dictionary that contains the following keys:
         - tool_calls_made_by_agent: total number of tool calls made by the agent
         - correct_tool_calls_made_by_agent: total number of correct tool calls made by the agent