Skip to content

Commit e13b772

Browse files
committed
Update tutorial with 3.7 trajectories
1 parent f005abb commit e13b772

File tree

14 files changed

+21790
-1642
lines changed

14 files changed

+21790
-1642
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ cd data-analysis-crow
2929
# Install dependencies
3030
pip install -e .
3131

32-
# OPTIONAL:ull the docker image with bioinformatics packages
32+
# OPTIONAL:pull the docker image with bioinformatics packages
3333
docker pull futurehouse/bixbench:aviary-notebook-env
3434
```
3535

@@ -79,7 +79,7 @@ BixBench tests AI agents' ability to:
7979
- Perform long, multi-step computational analyses
8080
- Interpret nuanced results in the context of a research question
8181

82-
You can find the BixBench dataset in [Hugging Face](https://huggingface.co/datasets/futurehouse/BixBench), the paper [here](), and the blog post [here](https://futurehouse.org/blog/bixbench/).
82+
You can find the BixBench dataset in [Hugging Face](https://huggingface.co/datasets/futurehouse/BixBench), the paper [here](https://storage.googleapis.com/bixbench-results/BixBench.pdf), and the blog post [here](https://futurehouse.org/blog/bixbench/).
8383

8484
### Running BixBench Evaluations
8585

tutorial/example.ipynb

Lines changed: 745 additions & 177 deletions
Large diffs are not rendered by default.

tutorial/tmp_results_dir/bf222a115d3970be6e12430b1cd57eb67d2f36b1950ed5765a247a1f071e7569-1740969537.698443/notebook.ipynb

Lines changed: 0 additions & 558 deletions
This file was deleted.

tutorial/tmp_results_dir/bf222a115d3970be6e12430b1cd57eb67d2f36b1950ed5765a247a1f071e7569-1740969537.698443/notebook.md

Lines changed: 0 additions & 289 deletions
This file was deleted.

0 commit comments

Comments
 (0)