T5-GlyF (Glyph Fixing) is a method for correcting text that has been attacked with homoglyphs. It builds on a pre-trained T5 model released by the SberDevices team as part of the SAGE project.
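For readers unfamiliar with the term, the snippet below is a minimal sketch of what a homoglyph attack does to a string. It is an illustration only and is not taken from this repository: the `HOMOGLYPHS` table and the `attack` and `non_ascii_chars` helpers are hypothetical names, and the mapping covers just five Latin letters and their Cyrillic look-alikes.

```python
import unicodedata

# Hypothetical, deliberately tiny mapping of Latin letters to Cyrillic
# look-alikes. Real attacks draw on far larger confusable tables.
HOMOGLYPHS = {
    "a": "\u0430",  # CYRILLIC SMALL LETTER A
    "e": "\u0435",  # CYRILLIC SMALL LETTER IE
    "o": "\u043e",  # CYRILLIC SMALL LETTER O
    "p": "\u0440",  # CYRILLIC SMALL LETTER ER
    "c": "\u0441",  # CYRILLIC SMALL LETTER ES
}


def attack(text: str) -> str:
    """Replace every mapped Latin letter with its look-alike."""
    return "".join(HOMOGLYPHS.get(ch, ch) for ch in text)


def non_ascii_chars(text: str) -> list:
    """Return (character, Unicode name) pairs for each non-ASCII character."""
    return [(ch, unicodedata.name(ch)) for ch in text if ord(ch) > 127]


clean = "peace"
attacked = attack(clean)

print(attacked)           # renders like "peace" in most fonts
print(attacked == clean)  # False: the code points differ
for ch, name in non_ascii_chars(attacked):
    print(repr(ch), name)
```

The attacked string looks unchanged to a human reader but no longer matches the original byte for byte, which is why such text can slip past keyword filters and degrade downstream models until it is corrected.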
To get started, clone the repository and install the requirements:
git clone https://github.com/YRL-AIDA/T5-GlyF.git
cd T5-GlyF
pip install -r requirements.txt
To train the model, run the following command:
python train.py --config_path configs/train_config.json
To test the model, run:
python test.py --config_path configs/test_config.json