Track and persist training history per class #47

codelion · 2025-07-24T02:11:27Z

Adds a training_history dictionary to AdaptiveClassifier to track the cumulative number of training examples per class. Updates prediction logic to use training_history for weighting, ensures training_history is saved/loaded with backward compatibility, and includes tests for confidence consistency, continuous learning, backward compatibility, and new class detection. Also adds a GitHub Actions workflow for running tests and coverage.

codelion · 2025-07-24T02:11:42Z

This will fix #44

Ensures the FAISS index is rebuilt after new examples are added to maintain up-to-date nearest neighbor search. Test assertions for confidence thresholds are relaxed to account for lower confidence scores due to prototype normalization, improving test robustness.

Bump package version to 0.0.15 in setup.py. Add CHANGELOG.md documenting recent changes and fixes. Update GitHub Actions workflow to test only with Python 3.12 instead of a matrix of versions.

Added tests to verify that prediction confidence remains consistent before and after saving/loading AdaptiveClassifier, especially for single-example-per-class cases. Updated test_memory.py to skip memory efficiency test if psutil is unavailable. Updated GitHub Actions workflow to install psutil for testing.

Label IDs are now assigned in alphabetical order when new classes are added, ensuring consistent mappings regardless of input order. Updated the README to document order dependency in online learning and added comprehensive tests to verify order independence and label assignment behavior.

Removed unnecessary sorting and reordering of input texts in the _get_embeddings method. The function now directly processes the input texts and returns embeddings in the original order, improving code clarity.

codelion added 3 commits July 24, 2025 10:26

Release version 0.0.15 and update test workflow

9d9291f

Bump package version to 0.0.15 in setup.py. Add CHANGELOG.md documenting recent changes and fixes. Update GitHub Actions workflow to test only with Python 3.12 instead of a matrix of versions.

codelion mentioned this pull request Jul 24, 2025

Training data ordering bias #45

Closed

codelion added 3 commits July 24, 2025 12:27

Simplify _get_embeddings by removing sorting logic

2d0a3ec

Removed unnecessary sorting and reordering of input texts in the _get_embeddings method. The function now directly processes the input texts and returns embeddings in the original order, improving code clarity.

Update README.md

b2baef0

codelion merged commit 0ae37c5 into main Jul 24, 2025
2 checks passed

codelion deleted the fix-bugs branch July 24, 2025 06:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Track and persist training history per class #47

Track and persist training history per class #47

Uh oh!

codelion commented Jul 24, 2025

Uh oh!

codelion commented Jul 24, 2025

Uh oh!

Uh oh!

Uh oh!

Track and persist training history per class #47

Track and persist training history per class #47

Uh oh!

Conversation

codelion commented Jul 24, 2025

Uh oh!

codelion commented Jul 24, 2025

Uh oh!

Uh oh!

Uh oh!