You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I am a student in Korea working on a 6 week project.
I want to fine-tune a CodeLlama model using your paper's methodology for the Code Repair task. How do you estimate the GPU resources and time required for this project?
I also have two new ideas:
Can a static code analyzer's output improve the dataset?
Can a RLHF based approach using DPO help the model generate better code?
Thank you for your time and guidance.
Best regards,
Won