Skip to content

Add Tool Call Accuracy Evaluator Bugbash Notebook #42121

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 44 commits into
base: users/singankit/agent_evaluators_v2_bug_bash
Choose a base branch
from

Conversation

salma-elshafey
Copy link
Contributor

Description

Please add an informative description that covers that changes made by the pull request and link all relevant issues.

If an SDK is being regenerated based on a new swagger spec, a link to the pull request containing these swagger spec changes has been included above.

All SDK Contribution checklist:

  • The pull request does not introduce [breaking changes]
  • CHANGELOG is updated for new features, bug fixes or other significant changes.
  • I have read the contribution guidelines.

General Guidelines and Best Practices

  • Title of the pull request is clear and informative.
  • There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

  • Pull request includes test coverage for the included changes.

Salma Elshafey and others added 30 commits June 22, 2025 22:40
* Made response_format option type public and update docstr

* resolved comment

* update

* Update CHANGELOG.md
…d noisy messages (Azure#41852)

* Initial plan

* Fix lazy loading for optional dependency imports to avoid noisy messages

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

* Refactor lazy import mechanism to use generalized function

- Replace separate _try_import_aiagentconverter and _try_import_skagentconverter functions with a single _create_lazy_import function
- The new function accepts class_name, module_path, and dependency_name as parameters
- Reduces code duplication and makes the pattern more maintainable
- Maintains exact same functionality and error messages
- Updates corresponding unit tests to reflect the new implementation

Addresses feedback from @singankit to generalize the lazy import pattern.

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

* Remove [INFO] prefix from error messages in lazy imports

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

* Simplify lazy import error handling to let ImportError propagate naturally

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

* Implement lazy loading for red_team module to avoid noisy import messages

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

* Revert red_team module to traditional import pattern as requested

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

* Change red_team module to raise ImportError instead of print statement

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

* Apply black formatting to fix code style issues

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

* Remove AIAgentConverter test since azure-ai-projects is always in dev requirements

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

* Skip lazy import tests when semantic-kernel is installed

Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>
…e#41822)

Signed-off-by: Paul Van Eck <paulvaneck@microsoft.com>
… owner) (Azure#41496)

* code and test

* update-tasecase

* Update CHANGELOG.md

---------

Co-authored-by: azure-sdk <PythonSdkPipelines>
Co-authored-by: ChenxiJiang333 <v-chenjiang@microsoft.com>
Co-authored-by: ChenxiJiang333 <119990644+ChenxiJiang333@users.noreply.github.com>
… by SDK owner) (Azure#41859)

* code and test

* update changelog

* Update CHANGELOG.md

---------

Co-authored-by: azure-sdk <PythonSdkPipelines>
Co-authored-by: ChenxiJiang333 <v-chenjiang@microsoft.com>
Co-authored-by: ChenxiJiang333 <119990644+ChenxiJiang333@users.noreply.github.com>
* [Identity Broker] Allow default account usage in WSL

Signed-off-by: Paul Van Eck <paulvaneck@microsoft.com>

* Apply suggestions from code review

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Signed-off-by: Paul Van Eck <paulvaneck@microsoft.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
… merged by SDK owner) (Azure#41861)

* code and test

* update changelog

* Update CHANGELOG.md

---------

Co-authored-by: azure-sdk <PythonSdkPipelines>
Co-authored-by: ChenxiJiang333 <v-chenjiang@microsoft.com>
Co-authored-by: ChenxiJiang333 <119990644+ChenxiJiang333@users.noreply.github.com>
…arailSimulator, _SafetyEvaluation (Azure#41978)

* Add retry logic for high concurrency scenarios

* Update sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/simulator/_model_tools/_proxy_completion_model.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* black fixes

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…writes (Azure#42024)

* Initial plan

* Fix red team scan output_path issue - prevent interim evaluation overwrites

Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com>

* Update CHANGELOG.md to document red team scan output_path bug fix

Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com>

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com>
@github-actions github-actions bot added AI Agents Azure.Identity Community Contribution Community members are working on the issue Cosmos customer-reported Issues that are reported by GitHub users external to the Azure organization. Evaluation Issues related to the client library for Azure AI Evaluation Health Deidentification labels Jul 21, 2025
Copy link

Thank you for your contribution @salma-elshafey! We will review the pull request and get back to you soon.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
AI Agents Azure.Identity Community Contribution Community members are working on the issue Cosmos customer-reported Issues that are reported by GitHub users external to the Azure organization. Evaluation Issues related to the client library for Azure AI Evaluation Health Deidentification
Projects
Status: Untriaged
Status: No status
Development

Successfully merging this pull request may close these issues.

8 participants