Skip to content

Commit 828b9c0

Browse files
authored
Merge pull request #10 from CaoWJ/master
202108 version
2 parents ff05393 + eb49216 commit 828b9c0

File tree

11 files changed

+1111624
-1036657
lines changed

11 files changed

+1111624
-1036657
lines changed

README-zh_CN.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ CaCl2是开放项目CaOCl(CA开放中文词法分析工具包)重要组成
1515

1616
| 时间 | 总词条数 | 候选词条 | 已公开词条 | 预览版词条 |
1717
| :----: | :----: | :----: | :----: | :----: |
18-
| 2021-04-01 | 约21,000,000 | 约3,000,000 | 5,405,531 | 280,000 |
18+
| 2021-04-01 | 约21,000,000 | 约3,000,000 | 5,480,494 | 280,000 |
1919

2020
#### 2.行业字典数
2121
| 时间 | 行业 | 词典数 | 已公开 | 预览版 | 未公开 |
@@ -248,6 +248,7 @@ ICWB2标准数据集上测试分词的评分结果:
248248
### 2.自动发布版本
249249
| 最新版本 | 发布周期 | 发布时间 | 变更日志 |
250250
| :----: | :----: | :----: | :---- |
251+
| v0.2.21.07 | monthly | 2021-08-05 | 添加农林牧渔、交通运输、公用事业等行数数据 |
251252
| v0.2.21.06 | monthly | 2021-07-05 | 添加化工、钢铁、有色金属等行数数据 |
252253
| v0.2.21.05 | monthly | 2021-06-06 | 添加农林牧渔、商业贸易、房地产等行数数据 |
253254
| v0.2.21.04 | monthly | 2021-05-07 | 添加ICWB2标准数据集测试结果 |

README.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ CaCl2 project aims to build a consistent, complete and accurate industrial lexic
2626
#### Entries
2727
| Date | All | Candidate | Released | Preview |
2828
| :----: | :----: | :----: | :----: | :----: |
29-
| 2021-02-01 | 21,000,000 | 3,000,000 | 5,405,531 | 280,000 |
29+
| 2021-02-01 | 21,000,000 | 3,000,000 | 5,480,494 | 280,000 |
3030

3131
#### Dictionaries
3232
| Date | Class | Industries | Released | Preview | Closing |
@@ -258,6 +258,7 @@ Score for ICWB:
258258
### 2.Monthly/Quarterly releases
259259
| Version | Circle | Date | Changelogs |
260260
| :----: | :----: | :----: | :---- |
261+
| v0.2.21.07 | monthly | 2021-08-05 | Dictionaries for agriculture,transportation and utility added |
261262
| v0.2.21.06 | monthly | 2021-07-05 | Dictionaries for chemical, ferrous and nonferrous metal added |
262263
| v0.2.21.05 | monthly | 2021-06-06 | Dictionaries for agriculture, commerce & trade and real estate added |
263264
| v0.2.21.04 | monthly | 2021-05-07 | ICWB2 test and code added |

STATUES-zh_CN.md

Lines changed: 8 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,7 @@
66

77
| 时间 | 总词条数 | 候选词条 | 已公开词条 | 预览版词条 |
88
| :----: | :----: | :----: | :----: | :----: |
9+
| 2021-07-01 | 约21,000,000 | 约3,000,000 | 5,480,494 | 280,000 |
910
| 2021-06-01 | 约21,000,000 | 约3,000,000 | 5,405,531 | 280,000 |
1011
| 2021-05-01 | 约21,000,000 | 约3,000,000 | 3,919,527 | 280,000 |
1112
| 2021-04-01 | 约21,000,000 | 约3,000,000 | 3,279,518 | 280,000 |
@@ -22,9 +23,9 @@
2223
### 一级行业词库
2324
| 行业代码 | 词库名称 | 词条数量 | 当前状态 | 公开时间 | 当前版本 | 格式 | 下载地址 |
2425
| :----: | :---- | :----: | :----: | :----: | :----: | :----: | :----: |
25-
| 110000 | 农林牧渔-通用 | 109,920 | 预览版 | - | v0.1 | txt | [110000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/110000.zip) |
26+
| 110000 | 农林牧渔-通用 | 134,729 | 预览版 | - | v0.1 | txt | [110000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/110000.zip) |
2627
| 210000 | 采掘-通用 | 25,892 | 预览版 | - | v0.1 | txt | [210000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/210000.zip) |
27-
| 220000 | 化工-通用 | 100,018 | 预览版 | - | v0.1 | txt | [220000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/220000.zip) |
28+
| 220000 | 化工-通用 | 106,380 | 预览版 | - | v0.1 | txt | [220000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/220000.zip) |
2829
| 230000 | 钢铁-通用 | 23,998 | 预览版 | - | v0.1 | txt | [230000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/230000.zip) |
2930
| 240000 | 有色金属-通用 | 455,219 | 预览版 | - | v0.1 | txt | [240000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/240000.zip) |
3031
| 270000 | 电子-通用 | 180,488 | 预览版 | - | v0.1 | txt | [270000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/270000.zip) |
@@ -34,19 +35,19 @@
3435
| 350000 | 纺织服装-通用 | 40,525 | 预览版 | - | v0.1 | txt | [350000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/350000.zip) |
3536
| 360000 | 轻工制造-通用 | 157,655 | 预览版 | - | v0.1 | txt | [360000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/360000.zip) |
3637
| 370000 | 医药生物-通用 | 301,067 | 预览版 | - | v0.1 | txt | [370000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/370000.zip) |
37-
| 410000 | 公用事业-通用 | 115,806 | 预览版 | - | v0.1 | txt | [410000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/410000.zip) |
38-
| 420000 | 交通运输-通用 | 65,305 | 预览版 | - | v0.1 | txt | [420000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/420000.zip) |
38+
| 410000 | 公用事业-通用 | 150,987 | 预览版 | - | v0.1 | txt | [410000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/410000.zip) |
39+
| 420000 | 交通运输-通用 | 66,567 | 预览版 | - | v0.1 | txt | [420000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/420000.zip) |
3940
| 430000 | 房地产-通用 | 127,658 | 预览版 | - | v0.1 | txt | [430000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/430000.zip) |
40-
| 450000 | 商业贸易-通用 | 352,365 | 预览版 | - | v0.1 | txt | [450000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/450000.zip) |
41-
| 460000 | 休闲服务-通用 | 257,780 | 预览版 | - | v0.1 | txt | [460000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/460000.zip) |
41+
| 450000 | 商业贸易-通用 | 354,551 | 预览版 | - | v0.1 | txt | [450000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/450000.zip) |
42+
| 460000 | 休闲服务-通用 | 262,838 | 预览版 | - | v0.1 | txt | [460000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/460000.zip) |
4243
| 480000 | 银行-通用 | 52,105 | 发布 | 2020-02 | v0.2 | txt | [480000.zip](https://github.com/limccn/cacl2/blob/master/archive/v0.2/480000.zip) |
4344
| 490000 | 非银金融-通用 | 365,878 | 发布 | 2020-02 | v0.2 | txt | [490000.zip](https://github.com/limccn/cacl2/blob/master/archive/v0.2/490000.zip) |
4445
| 510000 | 综合-通用 | 326,846 | 预览版 | - | v0.1 | txt | [510000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/510000.zip) |
4546
| 610000 | 建筑材料-通用 | 83,724 | 预览版 | - | v0.1 | txt | [610000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/610000.zip) |
4647
| 620000 | 建筑装饰-通用 | 75,351 | 预览版 | - | v0.1 | txt | [620000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/620000.zip) |
4748
| 630000 | 电气设备-通用 | 83,699 | 预览版 | - | v0.1 | txt | [630000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/630000.zip) |
4849
| 640000 | 机械设备-通用 | 234,233 | 预览版 | - | v0.1 | txt | [640000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/640000.zip) |
49-
| 650000 | 国防军工-通用 | 37,535 | 预览版 | - | v0.1 | txt | [650000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/650000.zip) |
50+
| 650000 | 国防军工-通用 | 37,640 | 预览版 | - | v0.1 | txt | [650000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/650000.zip) |
5051
| 710000 | 计算机-通用 | 128,559 | 预览版 | - | v0.1 | txt | [710000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/710000.zip) |
5152
| 720000 | 传媒-通用 | 177,489 | 预览版 | - | v0.1 | txt | [720000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/720000.zip) |
5253
| 730000 | 通信-通用 | 70,788 | 预览版 | - | v0.1 | txt | [730000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/730000.zip) |

STATUES.md

Lines changed: 8 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@
44
### Entries
55
| Date | Total | Candidate | Released | Preview |
66
| :----: | :----: | :----: | :----: | :----: |
7+
| 2021-07-01 | 21,000,000 | 3,000,000 | 5,480,494 | 280,000 |
78
| 2021-06-01 | 21,000,000 | 3,000,000 | 5,405,531 | 280,000 |
89
| 2021-05-01 | 21,000,000 | 3,000,000 | 3,919,527 | 280,000 |
910
| 2021-04-01 | 21,000,000 | 3,000,000 | 3,279,518 | 280,000 |
@@ -21,9 +22,9 @@
2122

2223
| Code | Name | Entries | Status | Date | Version | Format | Download |
2324
| :----: | :---- | :----: | :----: | :----: | :----: | :----: | :----: |
24-
| 110000 | Agriculture-Common | 109,920 | Preview | - | v0.1 | txt | [110000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/110000.zip) |
25+
| 110000 | Agriculture-Common | 134,729 | Preview | - | v0.1 | txt | [110000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/110000.zip) |
2526
| 210000 | Mining-Common | 25,892 | Preview | - | v0.1 | txt | [210000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/210000.zip) |
26-
| 220000 | Chemical-Common | 100,018 | Preview | - | v0.1 | txt | [220000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/220000.zip) |
27+
| 220000 | Chemical-Common | 106,380 | Preview | - | v0.1 | txt | [220000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/220000.zip) |
2728
| 230000 | Ferrous Metal-Common | 23,998 | Preview | - | v0.1 | txt | [230000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/230000.zip) |
2829
| 240000 | Nonferrous Metal-Common | 455,219 | Preview | - | v0.1 | txt | [240000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/240000.zip) |
2930
| 270000 | Electronics-Common | 180,488 | Preview | - | v0.1 | txt | [270000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/270000.zip) |
@@ -33,19 +34,19 @@
3334
| 350000 | Textile & Apparel-Common | 40,525 | Preview | - | v0.1 | txt | [350000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/350000.zip) |
3435
| 360000 | Light-industry Manufacture-Common | 157,655 | Preview | - | v0.1 | txt | [360000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/360000.zip) |
3536
| 370000 | Health Care-Common | 301,067 | Preview | - | v0.1 | txt | [370000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/370000.zip) |
36-
| 410000 | Utility-Common | 115,806 | Preview | - | v0.1 | txt | [410000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/410000.zip) |
37-
| 420000 | Transportation-Common | 65,305 | Preview | - | v0.1 | txt | [420000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/420000.zip) |
37+
| 410000 | Utility-Common | 150,987 | Preview | - | v0.1 | txt | [410000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/410000.zip) |
38+
| 420000 | Transportation-Common | 66,567 | Preview | - | v0.1 | txt | [420000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/420000.zip) |
3839
| 430000 | Real Estate-Common | 127,658 | Preview | - | v0.1 | txt | [430000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/430000.zip) |
39-
| 450000 | Commerce & Trade-Common | 352,365 | Preview | - | v0.1 | txt | [450000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/450000.zip) |
40-
| 460000 | Leisure Services-Common | 257,780 | Preview | - | v0.1 | txt | [460000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/460000.zip) |
40+
| 450000 | Commerce & Trade-Common | 354,551 | Preview | - | v0.1 | txt | [450000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/450000.zip) |
41+
| 460000 | Leisure Services-Common | 262,838 | Preview | - | v0.1 | txt | [460000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/460000.zip) |
4142
| 480000 | Banking-Common | 52,105 | Release | - | v0.2 | txt | [480000.zip](https://github.com/limccn/cacl2/blob/master/archive/v0.2/480000.zip) |
4243
| 490000 | Financials-Common | 365,878 | Release | - | v0.2 | txt | [490000.zip](https://github.com/limccn/cacl2/blob/master/archive/v0.2/490000.zip) |
4344
| 510000 | Conglomerate-Common | 326,846 | Preview | - | v0.1 | txt | [510000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/510000.zip) |
4445
| 610000 | Construction Material-Common | 83,724 | Preview | - | v0.1 | txt | [610000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/610000.zip) |
4546
| 620000 | Architectural Decoration-Common | 75,351 | Preview | - | v0.1 | txt | [620000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/620000.zip) |
4647
| 630000 | Electrical Equipment-Common | 83,699 | Preview | - | v0.1 | txt | [630000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/630000.zip) |
4748
| 640000 | Machinery Equipment-Common | 234,233 | Preview | - | v0.1 | txt | [640000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/640000.zip) |
48-
| 650000 | National Defense-Common | 37,535 | Preview | - | v0.1 | txt | [650000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/650000.zip) |
49+
| 650000 | National Defense-Common | 37,640 | Preview | - | v0.1 | txt | [650000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/650000.zip) |
4950
| 710000 | Information Services-Common | 128,559 | Preview | - | v0.1 | txt | [710000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/710000.zip) |
5051
| 720000 | Media-Common | 177,489 | Preview | - | v0.1 | txt | [720000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/720000.zip) |
5152
| 730000 | Telecom-Common | 70,788 | Preview | - | v0.1 | txt | [730000.zip](https://github.com/limccn/cacl2/blob/master/archive/preview/730000.zip) |

0 commit comments

Comments
 (0)