🤗 Transformers NER Model Trainer

See in spanish/Ver en español

🤗 Transformers NER Model Trainer

Train your own NER Model using HuggingFace's Transformers with this Python Scripts.

📝 Technology Explanation

A NER Model (Named Entity Recognition) is a AI tool capable of recognizing words and patterns and clasify them, depending of the training data.
There's already trained models like SpaCy but with this simple script you can train your own model with custom training datasets.

🛠️ Setup

You'll obviously need Python to install the dependencies and run the scripts.
Open a CMD and install the necessary depndencies:

pip install transformers datasets seqeval scikit-learn torch transformers[torch] accelerate>=0.26.0

Or if it fails or you're using a newer version of Python:

py -m pip install transformers datasets seqeval scikit-learn torch transformers[torch] accelerate>=0.26.0

Check if Python is in Windows' PATH: C:\Users\USER_NAME\AppData\Local\Programs\Python\Python313\Scripts

Note

The folder Python313\ represents that the installed version is the '3.13', if you have other version installed, change it.

🚀 Project Usage Explanation

In trainmodel.py, change the default dataset with the data you want to use to train your NER Model (explained in comments).
In the line 77 there are the training arguments, you can change them to test the results.
Run the training Script using:

python trainmodel.py

Or if it fails or you're using a newer version of Python:

py trainmodel.py

Now, in inferencemodel.py, change the label map to match the used in the training step.
Run the inference Script:

python inferencemodel.py

Or if it fails or you're using a newer version of Python:

py inferencemodel.py

📂 Files

If the scripts ran succesfully, the model will be generated in the same folder the script is, this files are:

my_ner_model: here is stored all the model's data and configuration.
ner_model: here are the different Model's checkpoints.

💻 Technologies Used

Programming Language: Python
Framework: seqeval (1.2.2)
Libraries:
- datasets (3.3.2)
- scikit-learn (1.6.1)
- torch (2.6.0)
- transformers (with PyTorch)
- accelerate (0.26.0)
Other:
- Model Base: xlm-roberta-base
Recommended IDE: VS Code

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
LICENSE		LICENSE
README.es.md		README.es.md
README.md		README.md
inferencemodel.py		inferencemodel.py
trainmodel.py		trainmodel.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🤗 Transformers NER Model Trainer

📝 Technology Explanation

🛠️ Setup

🚀 Project Usage Explanation

📂 Files

💻 Technologies Used

About

Uh oh!

Languages

License

LuisMiSanVe/TransformersNERTrainer

Folders and files

Latest commit

History

Repository files navigation

🤗 Transformers NER Model Trainer

📝 Technology Explanation

🛠️ Setup

🚀 Project Usage Explanation

📂 Files

💻 Technologies Used

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages