Pinned
- Trace_Check_QA (Public)
  Code for Investigating Trace-based Knowledge Distillation on Question-Answering
  Python
- React_Brittleness (Public)
  Code for TMLR paper: "Do Think Tags Really Help LLMs Plan? A Critical Evaluation of ReAct-Style Prompting"
  Jupyter Notebook
- camera_model_and_stereo_depth_sensing (Public)
  Camera model and stereo depth sensing using OpenCV
  Python 12
- wordle_using_rollouts (Public)
  This repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertseka…
  Jupyter Notebook 5
- LLMs_for_Sparse_RL (Public)
  Code for "Efficient Reinforcement Learning via Large Language Model-based Search"
  Python
 

