-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Add Tool Call Accuracy Evaluator Bugbash Notebook #42121
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
salma-elshafey
wants to merge
44
commits into
Azure:users/singankit/agent_evaluators_v2_bug_bash
Choose a base branch
from
salma-elshafey:selshafey/improve_tool_call_accuracy
base: users/singankit/agent_evaluators_v2_bug_bash
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Add Tool Call Accuracy Evaluator Bugbash Notebook #42121
salma-elshafey
wants to merge
44
commits into
Azure:users/singankit/agent_evaluators_v2_bug_bash
from
salma-elshafey:selshafey/improve_tool_call_accuracy
+191,028
−994,233
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
….com/salma-elshafey/azure-sdk-for-python into selshafey/improve_tool_call_accuracy
* Made response_format option type public and update docstr * resolved comment * update * Update CHANGELOG.md
…d noisy messages (Azure#41852) * Initial plan * Fix lazy loading for optional dependency imports to avoid noisy messages Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> * Refactor lazy import mechanism to use generalized function - Replace separate _try_import_aiagentconverter and _try_import_skagentconverter functions with a single _create_lazy_import function - The new function accepts class_name, module_path, and dependency_name as parameters - Reduces code duplication and makes the pattern more maintainable - Maintains exact same functionality and error messages - Updates corresponding unit tests to reflect the new implementation Addresses feedback from @singankit to generalize the lazy import pattern. Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> * Remove [INFO] prefix from error messages in lazy imports Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> * Simplify lazy import error handling to let ImportError propagate naturally Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> * Implement lazy loading for red_team module to avoid noisy import messages Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> * Revert red_team module to traditional import pattern as requested Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> * Change red_team module to raise ImportError instead of print statement Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> * Apply black formatting to fix code style issues Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> * Remove AIAgentConverter test since azure-ai-projects is always in dev requirements Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> * Skip lazy import tests when semantic-kernel is installed Co-authored-by: singankit <30610298+singankit@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: singankit <30610298+singankit@users.noreply.github.com>
…e#41822) Signed-off-by: Paul Van Eck <paulvaneck@microsoft.com>
… owner) (Azure#41496) * code and test * update-tasecase * Update CHANGELOG.md --------- Co-authored-by: azure-sdk <PythonSdkPipelines> Co-authored-by: ChenxiJiang333 <v-chenjiang@microsoft.com> Co-authored-by: ChenxiJiang333 <119990644+ChenxiJiang333@users.noreply.github.com>
… by SDK owner) (Azure#41859) * code and test * update changelog * Update CHANGELOG.md --------- Co-authored-by: azure-sdk <PythonSdkPipelines> Co-authored-by: ChenxiJiang333 <v-chenjiang@microsoft.com> Co-authored-by: ChenxiJiang333 <119990644+ChenxiJiang333@users.noreply.github.com>
* [Identity Broker] Allow default account usage in WSL Signed-off-by: Paul Van Eck <paulvaneck@microsoft.com> * Apply suggestions from code review Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Signed-off-by: Paul Van Eck <paulvaneck@microsoft.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
… merged by SDK owner) (Azure#41861) * code and test * update changelog * Update CHANGELOG.md --------- Co-authored-by: azure-sdk <PythonSdkPipelines> Co-authored-by: ChenxiJiang333 <v-chenjiang@microsoft.com> Co-authored-by: ChenxiJiang333 <119990644+ChenxiJiang333@users.noreply.github.com>
…arailSimulator, _SafetyEvaluation (Azure#41978) * Add retry logic for high concurrency scenarios * Update sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/simulator/_model_tools/_proxy_completion_model.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * black fixes --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…writes (Azure#42024) * Initial plan * Fix red team scan output_path issue - prevent interim evaluation overwrites Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com> * Update CHANGELOG.md to document red team scan output_path bug fix Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: slister1001 <103153180+slister1001@users.noreply.github.com>
Thank you for your contribution @salma-elshafey! We will review the pull request and get back to you soon. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
AI Agents
Azure.Identity
Community Contribution
Community members are working on the issue
Cosmos
customer-reported
Issues that are reported by GitHub users external to the Azure organization.
Evaluation
Issues related to the client library for Azure AI Evaluation
Health Deidentification
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
Please add an informative description that covers that changes made by the pull request and link all relevant issues.
If an SDK is being regenerated based on a new swagger spec, a link to the pull request containing these swagger spec changes has been included above.
All SDK Contribution checklist:
General Guidelines and Best Practices
Testing Guidelines