I ran the following training and evaluation commands:
python train.py --digit --fix_src --dataset gsm8k --steps 120000 --weights_path /huyang/r1/diffusion-of-thoughts/plaid1b_weights/
python evaluation_batch.py --weights_path outputs/gsm8k-bs16-fix_src-digit-steps120000 --fix_src --digit --dataset gsm8k --score_temp 0.5
The result is:
[2025-02-24 13:14:58,570] total: 1319, corr: 68, acc: 0.05155420773313116
[2025-02-24 13:14:58,570] time: 315.3894371986389s
[2025-02-24 13:14:58,571] Mean: 0.05155420773313116, Std: 0.0
Am I doing this right? Thank you so much for looking into this issue.
I found that the acc of 0.05 was caused by incomplete training data. After switching to the correct gsm8k data, the result is much better, but there is still a gap.
The training and evaluation commands I used are: python train.py --digit --fix_src --dataset gsm8k --steps 120000 --weights_path /huyang/r1/diffusion-of-thoughts/plaid1b_weights/
The final result is acc: 0.19863532979529946, which still falls short of the 32.6 reported in the paper.
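For anyone comparing their own runs, here is a minimal sketch that puts the numbers from this thread on the same scale. It is not part of the repository: the `parse_eval_line` helper and the regular expression are hypothetical and only match the log format pasted above. It also assumes that the second run was scored on the same 1319 test problems and that the paper's 32.6 is a percentage.

```python
import re

# Values copied from this thread.
FIRST_RUN_LOG = "[2025-02-24 13:14:58,570] total: 1319, corr: 68, acc: 0.05155420773313116"
SECOND_RUN_ACC = 0.19863532979529946
PAPER_ACC_PERCENT = 32.6  # assumption: the paper figure is a percentage


def parse_eval_line(line):
    """Hypothetical helper: pull total, corr and acc out of one log line."""
    match = re.search(r"total: (\d+), corr: (\d+), acc: ([0-9.]+)", line)
    if match is None:
        raise ValueError("line does not look like an evaluation summary")
    return int(match.group(1)), int(match.group(2)), float(match.group(3))


total, corr, acc = parse_eval_line(FIRST_RUN_LOG)

# Accuracy in the log is a fraction (corr / total), not a percentage.
first_percent = 100.0 * corr / total

# Assumption: the second run used the same test set of `total` problems.
second_correct = round(SECOND_RUN_ACC * total)
second_percent = 100.0 * SECOND_RUN_ACC

# Remaining gap to the paper figure, in percentage points.
gap = PAPER_ACC_PERCENT - second_percent

print(f"first run:  {corr}/{total} = {first_percent:.2f}%")
print(f"second run: {second_correct}/{total} = {second_percent:.2f}%")
print(f"gap to paper: {gap:.2f} points")
```

Under these assumptions the second run corresponds to roughly 19.9% accuracy, so the remaining difference from the paper is about 12.7 percentage points and is not a unit mismatch between fractions and percentages.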