Skip to content
View sbhambr1's full-sized avatar
🐈‍⬛
🐈‍⬛

Highlights

  • Pro

Block or report sbhambr1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Trace_Check_QA Trace_Check_QA Public

    Code for Invesitgating Trace-based Knowledge Distillation on Question-Answering

    Python

  2. React_Brittleness React_Brittleness Public

    Code for TMLR paper: "Do Think Tags Really Help LLMs Plan? A Critical Evaluation of ReAct-Style Prompting"

    Jupyter Notebook

  3. camera_model_and_stereo_depth_sensing camera_model_and_stereo_depth_sensing Public

    Camera model and stereo depth sensing using OpenCV

    Python 10

  4. wordle_using_rollouts wordle_using_rollouts Public

    This repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertseka…

    Jupyter Notebook 5

  5. LLMs_for_Sparse_RL LLMs_for_Sparse_RL Public

    Code for "Efficient Reinforcement Learning via Large Language Model-based Search"

    Python