Social Foundations of Computation

All

15 repositories

error-parity
Public
Achieve error-rate fairness between societal groups for any score-based classifier.
Python
•
MIT License
•4•19•0•2•Updated Aug 21, 2025Aug 21, 2025
benchmark-prediction
Public
Python
•
MIT License
•1•3•0•0•Updated Aug 19, 2025Aug 19, 2025
lm-harmony
Public
Python
•
MIT License
•0•4•0•0•Updated Aug 5, 2025Aug 5, 2025
lm-evaluation-harness
Public
A framework for few-shot evaluation of language models.
Python
•
MIT License
•2.7k•1•0•0•Updated May 4, 2025May 4, 2025
folktexts
Public
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!
python machine-learning tabular-data transformers uncertainty fairness large-language-models
Jupyter Notebook
•
MIT License
•4•24•0•0•Updated Apr 8, 2025Apr 8, 2025
causal-features
Public
Code to reproduce the paper "Do causal predictors generalize better to new domains?"
Python
•
Other
•14•12•0•0•Updated Feb 7, 2025Feb 7, 2025
twitter-predictability
Public
Jupyter Notebook
•
MIT License
•0•1•0•0•Updated Jan 22, 2025Jan 22, 2025
surveying-language-models
Public
Code to reproduce the paper "Questioning the Survey Responses of Large Language Models"
Jupyter Notebook
•
MIT License
•2•9•0•0•Updated Dec 8, 2024Dec 8, 2024
training-on-the-test-task
Public
Code to reproduce the experiments in the paper Training on the Test Task Confounds Evaluation and Emergence.
Jupyter Notebook
•1•11•0•0•Updated Dec 3, 2024Dec 3, 2024
lawma
Public
Lawma: A lightly fine-tuned Llama model for legal classification tasks.
language-model legaltech legaltools
Jupyter Notebook
•0•21•0•0•Updated Sep 14, 2024Sep 14, 2024
benchbench
Public
BenchBench is a Python package to evaluate multi-task benchmarks.
Python
•
MIT License
•1•16•0•0•Updated Jul 18, 2024Jul 18, 2024
folktables
Public
Datasets derived from US census data
Python
•
MIT License
•22•268•7•4•Updated May 15, 2024May 15, 2024
tttlm
Public
Test-time-training on nearest neighbors for large language models
Python
•
MIT License
•5•45•0•0•Updated Apr 18, 2024Apr 18, 2024
backward_baselines
Public
Code for "Is your model predicting the past?"
Jupyter Notebook
•
MIT License
•0•2•0•0•Updated Mar 10, 2024Mar 10, 2024
whynot
Public
A Python sandbox for decision making in dynamics
Python
•
MIT License
•43•422•8•2•Updated Aug 21, 2023Aug 21, 2023