# BugBash - PR #40894
## Conversation
### Pull Request Overview
This PR introduces several new sample scripts and documentation updates to demonstrate the use of Azure AI Evaluation APIs, including simulation, content safety evaluation, and direct evaluator usage.
- Added simulation_and_eval.py that integrates the AdversarialSimulator and evaluation API.
- Added content_safety_using_evaluate_api.py and content_safety_evaluator.py to demonstrate different content safety evaluation approaches.
- Updated bugbash_instructions.md with setup and usage instructions.
### Reviewed Changes
Copilot reviewed 6 out of 7 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| sdk/evaluation/azure-ai-evaluation/samples/onedp/simulation_and_eval.py | Introduces a simulation example with evaluation API usage and writes output to a file. |
| sdk/evaluation/azure-ai-evaluation/samples/onedp/content_safety_using_evaluate_api.py | Adds a sample script to call the Evaluate API for content safety evaluation. |
| sdk/evaluation/azure-ai-evaluation/samples/onedp/content_safety_evaluator.py | Provides a standalone example for using the ContentSafetyEvaluator. |
| sdk/evaluation/azure-ai-evaluation/samples/onedp/bugbash_instructions.md | Supplies detailed instructions and prerequisites for the bug bash. |
Files not reviewed (1)
- sdk/evaluation/azure-ai-evaluation/samples/onedp/oai-integration-testing/test_eval_input.jsonl: Language not supported
```python
    session_state: Any = None,
    context: Dict[str, Any] = None,
) -> dict:
    query = messages["messages"][0]["content"]
```
The 'messages' parameter is annotated as a List[Dict] but is accessed as a dictionary with a 'messages' key. Consider updating the type annotation to Dict[str, List[Dict]] (or adjusting the usage) to prevent potential runtime errors.
```python
with open(path, "w") as file:
    file.write(JsonLineChatProtocol(simulator_output[0]).to_eval_qr_json_lines())
```
Accessing simulator_output[0] without checking if simulator_output contains any elements might lead to an IndexError if the output is empty. It is recommended to validate the output before indexing.
Suggested change:

```python
if not simulator_output:
    raise ValueError("Simulator output is empty. Cannot write to file.")
with open(path, "w") as file:
    file.write(JsonLineChatProtocol(simulator_output[0]).to_eval_qr_json_lines())
```
**API change check:** API changes are not detected in this pull request.