Highlights
- Pro
Pinned Loading
-
Trace_Check_QA
Trace_Check_QA PublicCode for Invesitgating Trace-based Knowledge Distillation on Question-Answering
Python
-
React_Brittleness
React_Brittleness PublicCode for TMLR paper: "Do Think Tags Really Help LLMs Plan? A Critical Evaluation of ReAct-Style Prompting"
Jupyter Notebook
-
camera_model_and_stereo_depth_sensing
camera_model_and_stereo_depth_sensing PublicCamera model and stereo depth sensing using OpenCV
Python 10
-
wordle_using_rollouts
wordle_using_rollouts PublicThis repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertseka…
Jupyter Notebook 5
-
LLMs_for_Sparse_RL
LLMs_for_Sparse_RL PublicCode for "Efficient Reinforcement Learning via Large Language Model-based Search"
Python
If the problem persists, check the GitHub status page or contact support.