More evaluations, metrics and documentation (MIAs, ES, EM etc.) #90

molereddy · 2025-04-07T03:04:28Z

What does this PR do?

metrics

adds six of MIMIR's MIA attacks: minK, minK++, ref, zlib, loss, GradNorm
adds ES and EM metrics
simplifies and standardizes the structure and code of the MIA evals
adds a holdout dataset so that TOFU supports MIA-based privleak evaluations; the holdout set was created by generating IID examples with the same GPT-4 endpoint and the original prompt used to create TOFU
generalises the KS-test and relative-difference metrics
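
To illustrate the kind of scores the loss, zlib and minK attacks produce, here is a minimal, self-contained sketch. It is not the code added in this PR and not MIMIR's implementation: the function names, the `k` default, the normalisation choices and the "lower score suggests member" sign convention are all assumptions made for the example. It takes per-token log-probabilities as plain Python floats, and shows how scores on a forget set and a holdout set could be compared with an AUC.

```python
import zlib


def loss_score(token_logprobs):
    """Mean negative log-likelihood over the tokens of one example."""
    if not token_logprobs:
        raise ValueError("token_logprobs must be non-empty")
    return -sum(token_logprobs) / len(token_logprobs)


def zlib_score(token_logprobs, text):
    """Mean NLL divided by the zlib-compressed size of the text in bytes.

    Assumption: other implementations may normalise differently.
    """
    compressed_len = len(zlib.compress(text.encode("utf-8")))
    return loss_score(token_logprobs) / compressed_len


def min_k_score(token_logprobs, k=0.2):
    """Mean NLL over the fraction k of tokens with the lowest log-probability."""
    if not token_logprobs:
        raise ValueError("token_logprobs must be non-empty")
    n = max(1, int(len(token_logprobs) * k))
    lowest = sorted(token_logprobs)[:n]
    return -sum(lowest) / n


def auc_lower_is_member(member_scores, nonmember_scores):
    """Probability that a random member scores lower than a random non-member.

    Ties count as half. A value of 0.5 means the scores do not separate
    the two sets. The sign convention here is hypothetical.
    """
    wins = 0.0
    for m in member_scores:
        for n in nonmember_scores:
            if m < n:
                wins += 1.0
            elif m == n:
                wins += 0.5
    return wins / (len(member_scores) * len(nonmember_scores))


if __name__ == "__main__":
    # Made-up log-probabilities, purely for illustration.
    forget = [[-0.1, -0.3, -0.2], [-0.4, -0.2, -0.6]]
    holdout = [[-1.5, -2.0, -0.9], [-1.1, -0.8, -2.4]]
    forget_scores = [min_k_score(lp, k=0.5) for lp in forget]
    holdout_scores = [min_k_score(lp, k=0.5) for lp in holdout]
    print(auc_lower_is_member(forget_scores, holdout_scores))
```

An AUC near 0.5 on forget versus holdout examples would indicate that the attack cannot tell the two sets apart; how the PR's privleak metric turns such AUCs into a final number is not shown here.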

updated documentation

documents source links for all features implemented in links.md (addresses [Feature Request] More documentations #70)
updates & simplifies leaderboard and repro.md files with new evals

misc

adds seed in train and eval
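
As a rough sketch of what seeding a train or eval run usually involves (the helper name `seed_everything` is hypothetical and this is not the code added in this PR), the function below seeds Python's `random` module and, only if they happen to be installed, NumPy and PyTorch:

```python
import random


def seed_everything(seed: int) -> None:
    """Seed the common sources of randomness for a run (illustrative helper)."""
    random.seed(seed)
    try:
        import numpy as np  # optional dependency

        np.random.seed(seed)
    except ImportError:
        pass
    try:
        import torch  # optional dependency

        torch.manual_seed(seed)
        if torch.cuda.is_available():
            torch.cuda.manual_seed_all(seed)
    except ImportError:
        pass


if __name__ == "__main__":
    seed_everything(0)
    first = random.random()
    seed_everything(0)
    second = random.random()
    print(first == second)
```

Seeding alone does not guarantee bit-identical results on GPU; it only fixes the random number streams.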

Fixes #70

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Have you gone through the contributions guide?
Are your changes documented? Read documentation guidelines here.

* IdkDPO script fix in tofu_unlearn.sh (locuslab#65)
* Fix hyperlinks in README
* Download I don't know data in setup_data.py
* Fix tofu_unlearn.sh for IdkDPO
---------
Co-authored-by: Anmol Mekala <49127549+molereddy@users.noreply.github.com>
* overwrite=True
* RMU added
* Fix ref model device
* ruff fix
* RMU updated
* Update rmu.py
* Update README.md: add RMU
* Added references and renamed functions
---------
Co-authored-by: Anmol Mekala <49127549+molereddy@users.noreply.github.com>

…on (#8)
* docs: updates, small corrections, re-formats
* modified ruff commands
* modified ruff commands
* CI/CD minor updates
* added contributing + leaderboard
* fix minor spelling misatkes
* docs: bunch of minor updates
* docs fixes
---------
Co-authored-by: molereddy <m.anmolreddy@gmail.com>

* update docs
* setup: add license
* Update results docs
* re-factor MIA probs extraction code
* Ensure model and tokenizer are compatible wrt vocab size
* Model-tokenizer vocab check: treat as warning
* Document metric usage
* Merge main properly
* pre-commit enforcement to be set only in dev mode
* revert vocab size warning
* Revert "revert vocab size warning" This reverts commit eded904.
* Revert "pre-commit enforcement to be set only in dev mode" This reverts commit 2bb5e19.
* small fixes: keep pre-commit only in dev mode prereqs, remove vocab size check
* privleak changes: change relative_auc handler to privleak. move min_k logic to metric file from utils
* Simplified privleak version
* Access key option for datasets
* feat: MIA attack implementation from mimir, pipelined into open-unlearning format
* Tokenwise, example wise log probs computation
* configs: add MIA new version of privleak support
* feat: privleak in TOFU
* More hyperparam details: bsz
* load and register mia metrics, cleanup old
* MIA init bug fixes + convert numpy scalars to floats
* fix: load ref model for ref mia
* misc: comments and nits
* fix: ref attack load to same device, log things
* describe standard mia score signage
* fix: handle division by 0
* standardize auc convention (fix muse's) make relative diff metric separate and clear
* generalise the ks-test usage beyond forget quality
* update docs: repo updates, clarify example commands, small fixes
* fix: wrong command in readme
* fit gradnorm to conventions
* updated list of metrics
* Add MIA attack configs
* load holdout dataset from local path (must be updated)
* docs: Update MIA count
* remove hanging comments
* fixes: docs, comments etc.
addressing comments
* Replace scrollable with dropdown
* Ruff style changes
* Clarify metrics kwargs, small docs updates
* Delete todos.txt
* Alert rendering
* Mark alerts again
* More alert rendering
* more rendering changes
* Add star history
* Make heading smaller
* Add holdout dataset to metrics, datasets and experiment configs
* Add holdout evals to scripts
* Fix cfg spell mistake (#9)
* fix spelling of holdout
* fix typo
* Add links documentation
* Update misc documentation
* Small docs updates
* Clean up links
* Clarify new target models
* Add ES, EM metrics + Minor fixes (#10)
* Add seed in eval
* Add ES, EM metrics
* Add tqdm in MIA
* Fix get_data
* Ruff format fix (pre-commit and code)
* Update docs
* Update Readme
* Evaluation and leaderboard modifications (#11)
* Remove different splits in leaderboard
* update default metrics
* Fix: Only required metrics in summary
* Minor fixes in docs and gitignore
---------
Co-authored-by: Vineeth <48151992+Dornavineeth@users.noreply.github.com>
Co-authored-by: Dornavineeth <vineethdorna@gmail.com>

molereddy and others added 17 commits March 1, 2025 09:13

Fix hyperlinks in README (#2)

54d3560

* testing commit * Fixes * cleanup

Fixed DPO command

4c36e4f

download idk

f7a69de

Merge pull request #3 from Dornavineeth/dpo_fix

1bd4411

Fix tofu_unlearn.sh for IdKDPO method.

Revert "Dpo fix"

332af36

Merge pull request #4 from Dornavineeth/revert-3-dpo_fix

7d6aef3

Revert "Dpo fix"

download idk data

f468efb

fix dpo experiment config

ca8d503

Merge pull request #5 from Dornavineeth/dpo_fix

dde60d3

IdkDPO fix

Merge branch 'locuslab:main' into main

6367fb6

Merge branch 'locuslab:main' into main

855c5f3

Merge branch 'locuslab:main' into main

4fb577c

Merge branch 'locuslab:main' into main

e3c4709

Update README.md (describe new updates)

3e42003

molereddy temporarily deployed to tests April 7, 2025 03:04 — with GitHub Actions Inactive

molereddy linked an issue Apr 7, 2025 that may be closed by this pull request

[Feature Request] More documentations #70

Closed

5 tasks

molereddy merged commit 2e16546 into locuslab:main Apr 7, 2025
1 check passed

molereddy mentioned this pull request Apr 13, 2025

Reproduction issues on LLaMA3.1 (TOFU): checkpoint vs final‑model evaluation + epoch step‑count mismatch #88

Closed

4 tasks
