A collection of experiments exploring Large Language Models (LLMs) and their performance on SAT-style tasks, powered by Streamlit.
- Multiple LLM-based experiments
- Interactive Streamlit UI
- Easy to extend and customize
- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/talk-llm-sats-ftw-code.git
  cd talk-llm-sats-ftw-code
  ```
- Create a virtual environment (optional but recommended):

  ```bash
  python3 -m venv venv
  source venv/bin/activate
  ```
- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```
To use these experiments, you must provide your own OpenAI API key, which you can obtain by signing up at OpenAI. The app will prompt you to enter the key when you run an experiment.
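The key lookup an experiment performs could be sketched roughly like this — a minimal, hypothetical helper (the function name and the fallback to an `OPENAI_API_KEY` environment variable are assumptions for illustration, not taken from this repo):

```python
import os

def resolve_api_key(entered: str = "") -> str:
    """Hypothetical helper: prefer a key typed into the app's prompt
    (e.g. a Streamlit text input), then fall back to the
    OPENAI_API_KEY environment variable."""
    key = entered.strip() or os.environ.get("OPENAI_API_KEY", "")
    if not key:
        raise ValueError("No OpenAI API key provided")
    return key
```

The resolved key would then be passed to the OpenAI client when the experiment makes its calls.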
Each experiment is implemented as a separate Streamlit app in the main directory, named `experiment-<number>.py`.
To run an experiment, use:

```bash
streamlit run experiment-<number>.py
```

For example:

```bash
streamlit run experiment-1-starburst.py
```
- `experiment-1-starburst.py`: Starburst
- `experiment-2-ach.py`: Analysis of Competing Hypotheses (ACH)
- `experiment-3-kac.py`: Key Assumptions Check (KAC)
MIT License