MartaMarchiori/Exposing-Racial-Dialect-Bias
Exposing Racial Dialect Bias in Abusive Language Detection: Can Explainability Play a Role?

Biases can arise and be introduced during each phase of a supervised learning pipeline, eventually leading to harm. Within the task of automatic abusive language detection, this matter becomes particularly severe, since unintended bias towards sensitive topics such as gender, sexual orientation, or ethnicity can harm underrepresented groups. The datasets used to train these models play a crucial role in addressing these challenges. In this contribution, we investigate whether explainability methods can expose racial dialect bias attested within a popular dataset for abusive language detection. Through preliminary experiments, we found that pure explainability techniques cannot effectively uncover biases within the dataset under analysis: the rooted stereotypes are often implicit and harder to retrieve.
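As a minimal illustration of the kind of explainability technique referred to above (not the paper's actual method), the sketch below applies occlusion-based token attribution to a toy abusive-language scorer: each token's importance is the drop in the model's score when that token is removed. The lexicon, scorer, and function names are invented for illustration only.

```python
# Hypothetical sketch: occlusion-based token attribution for a toy
# abusive-language scorer. Lexicon and scoring are invented examples.

def toy_abuse_score(tokens):
    """Toy classifier: fraction of tokens found in a small 'abusive' lexicon."""
    lexicon = {"stupid", "idiot", "trash"}  # invented for illustration
    if not tokens:
        return 0.0
    return sum(t in lexicon for t in tokens) / len(tokens)

def occlusion_attributions(tokens, score_fn):
    """Attribute to each token the score change caused by removing it."""
    base = score_fn(tokens)
    attributions = {}
    for i, tok in enumerate(tokens):
        reduced = tokens[:i] + tokens[i + 1:]
        attributions[tok] = base - score_fn(reduced)
    return attributions

tokens = "you are so stupid".split()
attr = occlusion_attributions(tokens, toy_abuse_score)
# Tokens whose removal lowers the score most receive the highest attribution.
```

Such surface-level attributions surface explicit trigger words, which is exactly why, as noted above, they can miss bias rooted in implicit stereotypes rather than in individual tokens.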

Marta Marchiori Manerba and Virginia Morini. "Exposing Racial Dialect Bias in Abusive Language Detection: Can Explainability Play a Role?". XKDD, 2022.

BibTeX for citations:

TBD

