This repository is the source code for the paper 'Semantics from Look and Sound: Learning Chinese Character Representations from Function-Specific Components and Pronunciations'.
- Build env with requirements.txt;
- Download the BERT model 'chinese_L-12_H-768_A-12' and change the path in config file;
- Download the data from 'Weibo' and change the path in config file;
- Run:
bash script/run_weibo.sh