🚀 Did a new search ranking system increase user bookings?
This project investigates that question through a full A/B testing workflow — from sanity checks to deep segment analysis.
In this analysis, we simulate a real-world scenario where an online travel agency is testing a new search algorithm (variant) against the current one (control). The goal is to evaluate conversion uplift, booking speed, and performance across user segments.
We analyze 17,000+ user sessions using Python, applying statistical techniques to determine whether the new experience positively impacts key business metrics.
- ✅ Increase booking conversions
- ✅ Reduce time to booking
- ✅ Identify which user segments benefit most
The leadership team needs statistically sound evidence to decide whether to roll out the new ranking algorithm.
The dataset consists of two files:
sessions_data.csv
: Contains session-level browsing and booking behaviorusers_data.csv
: Contains user type (guest
vslogged_in
) and experiment group assignment (control
orvariant
)
Metric Type | Metric | Test Used | Significance Level |
---|---|---|---|
Sanity Check | Sample Ratio Mismatch (SRM) | Chi-square test | α = 0.01 |
Primary Metric | Conversion Rate | Chi-square test | α = 0.1 |
Guardrail Metric | Time to Booking | Mann-Whitney U test | α = 0.1 |
Segment Analysis | Conversion by User Segment | Grouped Mean Comparison | - |
- ✅ Passed: Users were evenly split between control and variant.
- p-value = 0.6658, Chi-squared = 0.19
Ensures random assignment wasn't biased and experiment setup was valid.
Group | Conversion Rate |
---|---|
Control | 15.92% |
Variant | 18.19% |
Lift | +14.22% |
p-value | 0.0002 |
- ✅ Statistically significant improvement
- ✅ Business-impactful effect size
- 🧬 90% Confidence Interval for Lift: [+1.26%, +3.27%]
💡 Recommendation: Roll out the new search algorithm. The variant significantly increased bookings.
Group | Avg Time to Book |
---|---|
Control | ~X mins (simulated) |
Variant | ~0.79% faster |
p-value | 0.3699 (not significant) |
⛔ The variant didn’t make booking significantly faster — but it also didn’t make it worse.
Segment | Control Conversion | Variant Conversion |
---|---|---|
Casual Users | 16.14% | 17.72% |
Engaged Users | 15.82% | 18.42% |
- ✅ Both user types saw improvement
- 📈 Larger lift for Engaged Users
- Python (Pandas, NumPy, Seaborn, Matplotlib)
- Statistical Tests: Chi-square, Mann-Whitney U, Confidence Intervals
- Data Merging, Cleaning, Aggregation
- Segmented Visualizations (Engagement segments, conversion rate distributions)
The new search experience significantly increased conversion rates across all user types without negatively affecting time to booking. There is strong statistical evidence to support a company-wide rollout of the new algorithm.
📢 Final Verdict: ✅ Roll out the new search ranking system!
✅ Real-world scenario
✅ Full experimentation flow
✅ Business-driven insights
✅ Visual + statistical interpretation
✅ Storytelling ready for stakeholders
notebook.ipynb
: Full analysissegmented_conversion_rate.png
: Visualization of conversions by user segmentAB_testing_report.pdf
: Original problem statement and reference