Welcome to my repository for the OpenAI gpt-oss-20b Red-Teaming Challenge!
This repo documents my participation in the Kaggle red-teaming competition focused on probing OpenAI's newly released gpt-oss-20b model for previously undiscovered vulnerabilities and harmful behaviors. The goal is to identify, document, and report up to five distinct issues, contributing to the safety and alignment of open-source AI models.
Interactive Notebook: Access the complete red-teaming notebook on Google Colab for hands-on experimentation with the gpt-oss-20b model.
Goals:
- Find previously unreported flaws and vulnerabilities in gpt-oss-20b
- Document exploits with reproducible reports and code
- Share insights to improve AI safety and alignment
Topics of interest:
- Reward hacking
- Deception & deceptive alignment
- Sabotage
- Inappropriate tool use
- Data exfiltration
- Sandbagging
- Evaluation awareness
- Chain of Thought issues
Deliverables:
- Kaggle Writeup (project summary, strategy, findings)
- Up to 5 findings files (JSON)
- (Optional) Reproduction notebook
- (Optional) Open-source tooling
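Each finding is submitted as a standalone JSON file. The snippet below sketches what one might contain; the field names here are hypothetical placeholders, not the official competition schema, which the Kaggle rules define.

```python
import json

# Hypothetical findings layout; field names are placeholders,
# not the official Kaggle schema.
finding = {
    "issue_title": "Example: model reveals hidden chain of thought",
    "topic_area": "Chain of Thought issues",
    "overview": "Short summary of the flaw and why it matters.",
    "steps_to_reproduce": [
        "Load gpt-oss-20b.",
        "Send the prompt recorded in the notebook.",
        "Observe the leaked reasoning in the output.",
    ],
}

with open("finding_1.json", "w") as f:
    json.dump(finding, f, indent=2)
```

Keeping one file per finding (up to five) makes it easy to attach each to the writeup independently.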
Timeline:
- Start: August 5, 2025
- End: August 26, 2025
Repository contents:
- Challange.txt: Full competition details and rules
- README.md: This file
- gpt_oss_20b_colab_final.ipynb: Main notebook with model setup and red-teaming experiments
- (To be added) Findings, additional notebooks, and tooling
I have just joined the challenge and will be updating this repository with:
- My discovery process and methodology
- Vulnerability findings and reports
- Reproducible code and notebooks
Stay tuned for updates as I progress through the competition!
Citation: D. Sculley, Samuel Marks, and Addison Howard. Red-Teaming Challenge - OpenAI gpt-oss-20b. Kaggle Competition, 2025.