Click the green "Use this template" button at the top of the page, then choose "Create a new repository".
This will create your own copy of this project, which you can modify freely — no need to fork!
Your first hands-on lab in designing, simulating, and analyzing A/B tests for causal inference.
This project is a comprehensive walkthrough designed for:
- Data scientists starting their journey in A/B testing
- Students learning causal inference through simulation
- Professionals who want a clean, well-explained portfolio project
You'll simulate online experiments, perform statistical tests (manually and via `scipy`), visualize results with confidence intervals, and explore how low power affects false positives. This lab is both a learning tool and a public showcase for your GitHub portfolio.
- Simulate binary outcome data (e.g., conversions)
- Run hypothesis tests: a manual Z-test and a `scipy` t-test
- Calculate and visualize confidence intervals
- Understand p-values, alpha levels, and Type I/II errors
- Simulate and explore the False Positive Risk (FPR)
- Visualize “what if” scenarios: low power, small effects, bad design
```
AB-Testing_Causal-Inference-Starter/
├── notebooks/
│   └── 01_ab_test_simulation.ipynb   # Main walkthrough notebook
├── data/
│   └── synthetic_ab_test.csv         # Simulated experiment dataset
├── scripts/
│   └── abtest_utils.py               # Reusable functions
├── images/
│   └── pvalue_distribution.png       # Visual outputs for README/docs
├── README.md                         # Project overview (you are here)
└── LICENSE                           # MIT License
```
```bash
git clone https://github.com/your-username/AB-Testing_Causal-Inference-Starter.git
cd AB-Testing_Causal-Inference-Starter
pip install numpy pandas matplotlib scipy jupyter
jupyter notebook notebooks/01_ab_test_simulation.ipynb
```
**1. Simulate the experiment**
- Set conversion rates (e.g., 10% vs. 11%)
- Generate binary outcome data for 10,000 users per group (sketched below)
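A minimal sketch of this step, assuming `numpy` (the seed and variable names are illustrative, not the notebook's actual code):

```python
import numpy as np

rng = np.random.default_rng(42)  # illustrative seed

p_control, p_treatment = 0.10, 0.11  # assumed conversion rates
n = 10_000                           # users per group

# Each user is a Bernoulli draw: 1 = converted, 0 = did not convert
control = rng.binomial(1, p_control, size=n)
treatment = rng.binomial(1, p_treatment, size=n)

print(f"Control CR:   {control.mean():.4f}")
print(f"Treatment CR: {treatment.mean():.4f}")
```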
**2. Run a manual hypothesis test**
- Pooled standard error
- Z-statistic
- Manual p-value calculation (worked sketch below)
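A worked sketch of the manual two-proportion Z-test (counts are illustrative draws; `scipy` is used only for the normal CDF):

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(42)
n1 = n2 = 10_000
x1 = rng.binomial(n1, 0.10)  # conversions in control
x2 = rng.binomial(n2, 0.11)  # conversions in treatment

# Pooled conversion rate under H0 (no difference between groups)
p_pool = (x1 + x2) / (n1 + n2)

# Pooled standard error of the difference in proportions
se = np.sqrt(p_pool * (1 - p_pool) * (1 / n1 + 1 / n2))

# Z-statistic for the observed lift; two-sided p-value from the normal tail
z = (x2 / n2 - x1 / n1) / se
p_value = 2 * norm.sf(abs(z))

print(f"z = {z:.3f}, p = {p_value:.4f}")
```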
**3. Verify with `scipy`**
- `scipy.stats.ttest_ind` or a z-proportion test
- Compare with the manual results (example below)
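For example (a sketch, not the notebook's exact code), Welch's t-test on the raw 0/1 outcomes; for a true z-proportion test, `statsmodels.stats.proportion.proportions_ztest` is an alternative:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
control = rng.binomial(1, 0.10, size=10_000)
treatment = rng.binomial(1, 0.11, size=10_000)

# Welch's t-test on the binary outcomes; with samples this large it
# agrees closely with the manual two-proportion Z-test above
result = stats.ttest_ind(treatment, control, equal_var=False)
print(f"t = {result.statistic:.3f}, p = {result.pvalue:.4f}")
```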
**4. Visualize the results**
- Bar plots with error bars (example below)
- Bootstrap sampling distributions
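A minimal bar-plot sketch with normal-approximation 95% error bars (illustrative data, assuming `matplotlib`):

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(42)
control = rng.binomial(1, 0.10, size=10_000)
treatment = rng.binomial(1, 0.11, size=10_000)

rates = [control.mean(), treatment.mean()]
# 95% normal-approximation CI half-width for each proportion
errs = [1.96 * np.sqrt(r * (1 - r) / 10_000) for r in rates]

plt.bar(["Control", "Treatment"], rates, yerr=errs, capsize=8)
plt.ylabel("Conversion rate")
plt.title("Conversion rates with 95% confidence intervals")
plt.show()
```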
**5. Simulate the False Positive Risk (FPR)**
- Run 1,000 null experiments (simulation sketch below)
- Show that ~5% come out “significant” by chance
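A sketch of the null simulation (seed and parameters are illustrative):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n, p, alpha = 10_000, 0.10, 0.05

# Both groups share the same true rate, so H0 is true by construction
p_values = np.empty(1000)
for i in range(1000):
    a = rng.binomial(1, p, size=n)
    b = rng.binomial(1, p, size=n)
    p_values[i] = stats.ttest_ind(a, b, equal_var=False).pvalue

# At alpha = 0.05, roughly 5% of null experiments look "significant"
print(f"Significant by chance: {(p_values < alpha).mean():.1%}")
```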
**6. Explore low-power scenarios**
- What happens when power is low
- Low sample size vs. a high minimum detectable effect (MDE)
- Winner’s curse demonstration (sketched below)
- Sign vs. magnitude errors
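One way to sketch the low-power scenario and the winner's curse (the sample size, rates, and seed are illustrative assumptions, not the notebook's):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n, alpha = 500, 0.05     # deliberately small sample -> low power
p_a, p_b = 0.10, 0.11    # true lift: +1 percentage point

sig_lifts = []
for _ in range(1000):
    a = rng.binomial(1, p_a, size=n)
    b = rng.binomial(1, p_b, size=n)
    if stats.ttest_ind(b, a, equal_var=False).pvalue < alpha:
        sig_lifts.append(b.mean() - a.mean())

print(f"Power (detection rate): {len(sig_lifts) / 1000:.1%}")
# Winner's curse: conditioning on significance inflates the estimate,
# and a few "significant" lifts may even have the wrong sign
print(f"Mean significant lift: {np.mean(sig_lifts):+.4f} vs. true +0.0100")
```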
Each notebook section includes:
- 👩‍🏫 Teaching comments and annotated code
- 📌 Real-world context from online experimentation
- 💡 Insights from industry practice (Expedia, Microsoft, Airbnb)
See the `/images` folder for example outputs:
- p-value distributions
- confidence interval charts
- bootstrap effect distributions
- Python 3.8+
- `numpy`, `pandas`, `scipy`, `matplotlib`
- Jupyter Notebook
This project is licensed under the MIT License - see the LICENSE file for details.
- *Trustworthy Online Controlled Experiments* by Ron Kohavi et al.
- "False Positives in A/B Tests" (KDD '24)
- Andrew Gelman & John Carlin on sign and magnitude errors
If you’re a recruiter, data science student, or just excited about experimentation, feel free to reach out or star this repo!
Let’s learn causal inference—one experiment at a time. 🌱