Skip to content

IBM/ITBench-Scenarios

Repository files navigation

ITBench-Scenarios

This repository contains infrastructure automation scripts for both deploying the environments and configuring the scenarios required to run ITBench. These scenarios are realistic simulations based on actual IT automation challenges faced by CISO, SRE, and FinOps teams. For example, one of the SRE scenarios is to resolve a “high error rate on service checkout” while a CISO scenario involves assessing the compliance posture for a “new control rule detected for RHEL 9.” Every ITBench scenario is deployed in a sandboxed operational Kubernetes (K8s) environment.

These scenarios simulate compliance-related misconfigurations. Each scenario provides:

  • A pre-configured environment with specific compliance issues
  • Tools to detect misconfigurations
  • Validation methods to verify successful remediation

CISO scenarios are located here.

These scenarios focus on observability and incident response. Each scenario includes:

  • A comprehensive observability stack deployment featuring:
    • Prometheus for metrics collection
    • Clickhouse and OpenSearch for search and analytics
    • Jaeger for distributed tracing
    • OpenTelemetry for Kubernetes event logs collection
  • Simulated faults that trigger service degradation
  • Thereby leading to alerts associated with application performance issues such as increased error rates and latency spikes

SRE scenarios are located here.

Each scenario includes:

  • The core SRE observability stack
  • OpenCost integration for cost monitoring
  • Simulated faults trigger cost overrun alerts

FinOps scenarios are located here along-side SRE scenarios.

ITBench Ecosystem and Related Repositories

  • ITBench: Central repository providing an overview of the ITBench ecosystem, related announcements, and publications.
  • CISO-CAA Agent: CISO (Chief Information Security Officer) agents that automate compliance assessments by generating policies from natural language, collecting evidence, integrating with GitOps workflows, and deploying policies for assessment.
  • SRE Agent: SRE (Site Reliability Engineering) agents designed to diagnose and remediate problems in Kubernetes-based environments. Leverage logs, metrics, traces, and Kubernetes states/events from the IT enviroment.
  • ITBench Leaderboard: Service that handles scenario deployment, agent evaluation, and maintains a public leaderboard for comparing agent performance on ITOps use cases.
  • ITBench Utilities: Collection of supporting tools and utilities for participants in the ITBench ecosystem and leaderboard challenges.
  • ITBench Tutorials: Repository containing the latest tutorials, workshops, and educational content for getting started with ITBench.

Maintainers

About

Code repository for scenarios and environment setup as part of ITBench

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 7