Skip to content

Commit d48deaf

Browse files
authored
【开源实习】bertweet模型微调 (#1964)
1 parent 9149ebc commit d48deaf

File tree

2 files changed

+901
-0
lines changed

2 files changed

+901
-0
lines changed

llm/finetune/bertweet/README.md

Lines changed: 38 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,38 @@
1+
# FineTune BERTweet with hate_speech_twitter
2+
3+
## Data
4+
huggingface dataset [thefrankhsu/hate_speech_twitter](https://huggingface.co/datasets/thefrankhsu/hate_speech_twitter)
5+
6+
## Results
7+
### my results on mindspore
8+
|Epoch|Training Loss|Validation Loss|Accuracy|Precision|Recall|F1|
9+
|-----|-------------|---------------|--------|---------|------|--|
10+
|1|0.305200|0.896717|0.670000|0.942708|0.362000|0.523121|
11+
|2|0.143000|0.876202|0.738000|0.940741|0.508000|0.659740|
12+
|3|0.096300|0.689730|0.790000|0.947531|0.614000|0.745146|
13+
|<span style="color:red">4</span>|<span style="color:red">0.063500</span>|<span style="color:red">0.754796</span>|<span style="color:red">0.801000</span>|<span style="color:red">0.943953</span>|<span style="color:red">0.640000</span>|<span style="color:red">0.762813</span>|
14+
|5|0.052800|0.935889|0.770000|0.944079|0.574000|0.713930|
15+
16+
requirements:
17+
- Ascend 910B
18+
- Python 3.9
19+
- MindSpore 2.3.1
20+
- MindNLP 0.4.1
21+
- datasets emoji scikit-learn
22+
23+
### my results on pytorch
24+
|Epoch|Training Loss|Validation Loss|Accuracy|Precision|Recall|F1|
25+
|-----|-------------|---------------|--------|---------|------|--|
26+
|1|0.228100|0.682149|0.750000|0.956204|0.524000|0.677003|
27+
|<span style="color:red">2</span>|<span style="color:red">0.134900</span>|<span style="color:red">0.585958</span>|<span style="color:red">0.804000</span>|<span style="color:red">0.947059</span>|<span style="color:red">0.644000</span>|<span style="color:red">0.766667</span>|
28+
|3|0.088700|0.848252|0.763000|0.942761|0.560000|0.702635|
29+
|4|0.058300|0.956421|0.763000|0.945763|0.558000|0.701887|
30+
|5|0.036500|0.894330|0.788000|0.950000|0.608000|0.741463|
31+
32+
requirements:
33+
- GPU V100
34+
- CUDA 11.8.0
35+
- Python 3.10
36+
- Pytorch 2.1.0
37+
- Transformers 4.45.2
38+
- datasets emoji accelerate scikit-learn

0 commit comments

Comments
 (0)