This repository contains all material needed to participate in "Introduction To Profile Area" exercise of the research group Human-Centered Computing (HCC) at Freie Universität Berlin.
You can find the introduction slides to the exercise inside this repository here.
- Introduction: 4:15 PM
- Forming groups: 4:25 PM
- Working on the task: 4:30 PM
- Discussion: 5:30 PM
- End: 5:45 PM
Goal: Getting familiar with LIME as an explanation method for evaluating classifiers.
LIME was introduced as a model-agnostic and local explanation/interpretability method in the lecture. We will use the tutorial Basic usage, two class. We explain random forest classifiers. provided in the LIME repository on GitHub.
We will be using miro for this exercise. Please add your answers to this board for the discussion.
- First things first: Get familiar with the LIME repository on GitHub. Please read the README: https://github.com/marcotcr/lime.
- Get the tutorial notebook running in a collaborative environment. You can use one of the options listed below the task, or one of you shares the screen.
- Please step through the notebook and understand what is happening in each line. Please add markdown cells to your notebook to document your understanding of what is happening. You can also write down the question you have in mind, while stepping through the notebook or include links or references you used to get a deeper understanding of the notebook.
- Look at explanations for at least three different documents of the given newsgroup dataset (currently document number 83 is used).
- Reflection: Please answer the following questions for the discussion in miro:
- What did you learn about the model?
- How well do you think the classifier works? Why?
- How useful is LIME for a non-data-scientist (e.g. non-ML-experts or designer)? Why?
- What questions is LIME able to answer? Why?
- Would you incorporate tools like LIME into your data science practice and how?
- (Optional) Play around with LIME and different datasets or models. Are you able to generate some helpful explanations?
❗ Additional Information
There are many sites available that offer collaboration services with jupyter notebooks (with a different degree of collaboration functionality). Here is a non-exhaustive list of options:
Jupyter notebook is by itself running a local server. You can also host that that server and enable collaboration using the --collaborative
flag.