Skip to content

Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models

License

Notifications You must be signed in to change notification settings

palomapiot/geographic-bias

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models

Overview 🌟

This project presents a debias tuning technique to mitigate bias in LLMs, specifically, geographic bias.

Models Hugging Face

Soon! 🚀

Contents 📚

  • data/ - Directory containing the datasets used in the research.
  • debias-tuning/ - Directory with the code for the models.
  • prompts/ - Directory holding the prompts utilised for training and evaluation.

Data Access 🔍

In the data/ directory, you will find the IDs and labels used in our research. For those interested in the complete dataset, it can be accessed here.

Citation 📑

If you use any part of the code included in this repository or the Geographic MetaHate dataset, please cite the following reference:

@misc{piot2025personalisationprejudiceaddressinggeographic,
      title={Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models}, 
      author={Paloma Piot and Patricia Martín-Rodilla and Javier Parapar},
      year={2025},
      eprint={2505.02252},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2505.02252}, 
}

If you use MetaHate dataset or Geographic MetaHate dataset, please cite the following reference:

@article{Piot_Martín-Rodilla_Parapar_2024,
  title={MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection},
  volume={18},
  url={https://ojs.aaai.org/index.php/ICWSM/article/view/31445},
  DOI={10.1609/icwsm.v18i1.31445},
  number={1},
  journal={Proceedings of the International AAAI Conference on Web and Social Media},
  author={Piot, Paloma and Martín-Rodilla, Patricia and Parapar, Javier},
  year={2024},
  month={May},
  pages={2025-2039}
}

Disclaimer ⚠️

This repository includes content that may contain hate speech, offensive language, or other forms of inappropriate and objectionable material. The content present in the dataset or code is not created or endorsed by the authors or contributors of this project. It is collected from various sources and does not necessarily reflect the views or opinions of the project maintainers. The purpose of using this repository is for research, analysis, or educational purposes only. The authors do not endorse or promote any harmful, discriminatory, or offensive behavior conveyed in the dataset.

Users are advised to exercise caution and sensitivity when interacting with or interpreting the repository. If you choose to use the datasets or models, it is recommended to handle the content responsibly and in compliance with ethical guidelines and applicable laws. The project maintainers disclaim any responsibility for the content within the repository and cannot be held liable for how it is used or interpreted by others.

Acknowledgements 🙏

The authors thank the funding from the Horizon Europe research and innovation programme under the Marie Skłodowska-Curie Grant Agreement No. 101073351. The authors thank the financial support supplied by the grant PID2022-137061OB-C21 funded by MI-CIU/AEI/10.13039/501100011033 and by “ERDF/EU”. The authors also thank the funding supplied by the Consellería de Cultura, Educación, Formación Profesional e Universidades (accreditations ED431G 2023/01 and ED431C 2025/49) and the European Regional Development Fund, which acknowledges the CITIC, as a center accredited for excellence within the Galician University System and a member of the CIGUS Network, receives subsidies from the Department of Education, Science, Universities, and Vocational Training of the Xunta de Galicia. Additionally, it is co-financed by the EU through the FEDER Galicia 2021-27 operational program (Ref. ED431G 2023/01).

License 📜

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

The Apache License 2.0 is an open-source license that allows you to use the software for any purpose, to distribute it, to modify it, and to distribute modified versions of the software under the terms of the license.

For more details, please refer to the Apache License 2.0.

Contact 📬

For further questions, inquiries, or discussions related to this project, please feel free to reach out via email:

If you encounter any issues or have specific questions about the code, we recommend opening an issue on GitHub for better visibility and collaboration.

About

Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages