Change the enum according to APIView Review (#41880)
* Prepare evals SDK Release
* Fix bug
* Fix for ADV_CONV for FDP projects
* Update release date
* re-add pyrit to matrix
* Change grader ids
* Update unit test
* replace all old grader IDs in tests
* Update platform-matrix.json
Add pyrit without removing the other one
* Update test to ensure everything is mocked
* tox/black fixes
* Skip that test with issues
* update grader ID according to API View feedback
* Update test
* remove string check for grader ID
* Update changelog and officially start freeze
* update the enum according to suggestions
* update the changelog
* Finalize logic
---------
Co-authored-by: Nagkumar Arkalgud <nagkumar@naarkalg-work-mac.local>
Co-authored-by: Nagkumar Arkalgud <nagkumar@Mac.lan>
sdk/evaluation/azure-ai-evaluation/CHANGELOG.md
Lines changed: 2 additions & 1 deletion
@@ -5,13 +5,14 @@
 ### Features Added
 
 - Added support for Azure Open AI evaluation via `AzureOpenAIScoreModelGrader` class, which serves as a wrapper around Azure Open AI score model configurations. This new grader object can be supplied to the main `evaluate` method as if it were a normal callable evaluator.
+- Added new experimental risk categories ProtectedMaterial and CodeVulnerability for redteam agent scan.
 
 
 ### Bugs Fixed
 
 - Significant improvements to IntentResolution evaluator. New version has less variance, is nearly 2x faster and consumes fewer tokens.
-
 - Fixed MeteorScoreEvaluator and other threshold-based evaluators returning incorrect binary results due to integer conversion of decimal scores. Previously, decimal scores like 0.9375 were incorrectly converted to integers (0) before threshold comparison, causing them to fail even when above the threshold. [#41415](https://github.com/Azure/azure-sdk-for-python/issues/41415)
+- Added a new enum `ADVERSARIAL_QA_DOCUMENTS` which moves all the "file_content" type prompts away from `ADVERSARIAL_QA` to the new enum
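
Note on the threshold fix listed under "Bugs Fixed": the failure mode can be reproduced in isolation with a minimal sketch. The function names and the 0.5 threshold below are hypothetical, chosen only for illustration; this is not the SDK's actual code.

```python
# Hypothetical illustration of the integer-conversion bug described in the
# changelog entry; names and the 0.5 threshold are assumptions, not SDK code.

def passes_threshold_buggy(score: float, threshold: float) -> bool:
    # int() truncates toward zero, so 0.9375 becomes 0 before the comparison.
    return int(score) >= threshold


def passes_threshold_fixed(score: float, threshold: float) -> bool:
    # Comparing the decimal score directly preserves its fractional part.
    return float(score) >= threshold


if __name__ == "__main__":
    score, threshold = 0.9375, 0.5
    print("buggy:", passes_threshold_buggy(score, threshold))
    print("fixed:", passes_threshold_fixed(score, threshold))
```

With the truncating version, any score strictly between 0 and 1 is compared as 0, so it fails every positive threshold no matter how high the original score was.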