Add UNDIAL, a new unlearning algorithm #89
Conversation
* testing commit * Fixes * cleanup
Fix tofu_unlearn.sh for IdKDPO method.
Revert "Dpo fix"
* IdkDPO script fix in tofu_unlearn.sh (locuslab#65)
* Fix hyperlinks in README
* Download I don't know data in setup_data.py
* Fix tofu_unlearn.sh for IdkDPO
---------
Co-authored-by: Anmol Mekala <49127549+molereddy@users.noreply.github.com>
* overwrite=True
* RMU added
* Fix ref model device
* ruff fix
* RMU updated
* Update rmu.py
* Update README.md: add RMU
* Added references and renamed functions
---------
Co-authored-by: Anmol Mekala <49127549+molereddy@users.noreply.github.com>
…on (locuslab#8)
* docs: updates, small corrections, re-formats
* modified ruff commands
* modified ruff commands
* CI/CD minor updates
* added contributing + leaderboard
* fix minor spelling misatkes
* docs: bunch of minor updates
* docs fixes
---------
Co-authored-by: molereddy <m.anmolreddy@gmail.com>
Thank you for this! Can you make these additions?
Summary of steps to contribute
We've provided a template here.
@molereddy Thank you for the suggestions. I fixed the parts I could understand and sent a private message about the things I didn't. I hope that what I have done in this PR can at least be merged. Let me know if there are any fatal issues. Thank you!
I've merged our upstream changes onto your branch. If you can pull, test the code, and add some results, that would be great. Otherwise, mark the PR for review; we can then test on our end and merge the code in the coming days.
Sorry for the delay. I had some trouble running my method on the 7B model: it requires a LoRA implementation, because my method needs to run two 7B models in parallel. It would be great if LoRA could be supported. I would really appreciate it if you could test my method like all the others, using the same standard and following the hyperparameter search in my PR description (which should not be hard, given that you have the pipeline ready). I have also marked the PR as ready for review.
Great Work.
Currently it is tested only on the TOFU 1B model.
What does this PR do?
Before submitting