AWS Database Configuration

This repository contains an Upbound project, tailored for users establishing their initial control plane with Upbound. This configuration deploys fully managed AWS database instances.

Overview

The core components of a custom API in Upbound Project include:

CompositeResourceDefinition (XRD): Defines the API's structure.
Composition(s): Configures the Functions Pipeline
Embedded Function(s): Encapsulates the Composition logic and implementation within a self-contained, reusable unit

In this specific configuration, the API contains:

an AWS Database custom resource type.
Composition: Configured in /apis/composition.yaml
Embedded Function: The Composition logic is encapsulated within embedded function

Intelligent RDS Scaling Experiment

Architecture

Pipeline Flow: RDS Instance → CloudWatch Metrics → AI Analysis → Scaling Decisions

┌─────────────────┐    ┌──────────────────┐    ┌─────────────────┐
│   RDS Instance  │    │ function-rds-    │    │ function-claude │
│                 │───▶│    metrics       │───▶│                 │
│ (CloudWatch)    │    │                  │    │ (Claude AI)     │
└─────────────────┘    └──────────────────┘    └─────────────────┘
                              │                        │
                              ▼                        ▼
                    ┌──────────────────┐    ┌─────────────────┐
                    │ Context:         │    │ Scaling Actions │
                    │ performanceMetrics│    │ + Audit Trail   │
                    └──────────────────┘    └─────────────────┘

Component Integration

1. function-rds-metrics

Fetches CloudWatch metrics: CPU, memory, IOPS, connections, storage
Writes to both status.performanceMetrics and context.performanceMetrics
Provides structured metric data with timestamps and units

2. function-claude

Reads metrics from pipeline context via contextFields: ["performanceMetrics"]
Analyzes complex observed resources (300+ line YAML with full metadata)
Makes scaling decisions using Claude Sonnet 4.0 with configurable token limits
Returns clean desired spec (no system fields) for server-side apply

Key Technical Challenges Solved

1. Large YAML Handling

Problem: Claude failed with empty tool parameters for complex observed resources
Solution: Increased maxTokens from 1024 → 8192 and refined prompt engineering

2. Pipeline Context Flow

Problem: function-claude wasn't receiving metrics from previous pipeline step
Solution: Extended input schema with contextFields parameter for generic context access

3. Resource State Management

Problem: First reconciliation had empty observed resources
Solution: Fallback logic to use desired resources when observed is empty

4. Schema Validation

Problem: Returning full observed state caused resourceRefs uid schema errors
Solution: Prompt engineering to return only clean desired spec excluding system fields

Decision Making Process

Input: Real CloudWatch metrics

performanceMetrics:
  metrics:
    CPUUtilization: { value: 1.98, unit: "Percent" }
    DatabaseConnections: { value: 0, unit: "Count" }
    FreeStorageSpace: { value: 2775404544, unit: "Bytes" }
    # ... other metrics

AI Analysis: Claude evaluates against thresholds

CPU >80% → scale up instance class
Memory <20% free → memory-optimized instance
IOPS >80% → increase storage/instance
Connections >80% → larger instance

Output: Structured decision with audit trail

status:
  claudeDecision:
    reasoning: "CPU utilization at 1.98% is well below 80% threshold..."
    timestamp: "2025-07-29T22:05:18Z"

Production Results

✅ End-to-end pipeline: All components synced and ready
✅ Real-time decisions: ~2-5 second analysis latency
✅ Cost efficiency: ~$0.06-0.12 per scaling decision
✅ Audit compliance: Full reasoning captured in XR status
✅ Safety: Only scales up, prevents accidental downsizing

Usage

Deploy an intelligent scaling database:

apiVersion: aws.platform.upbound.io/v1alpha1
kind: XSQLInstance
metadata:
  name: my-intelligent-db
spec:
  compositionSelector:
    matchLabels:
      type: intelligent
      scaling: ai-driven
  parameters:
    engine: mariadb
    engineVersion: "10.11"
    storageGB: 20
    region: us-west-2
    # ... other parameters

Prerequisites

Claude API Secret: Store Anthropic API key in control plane

kubectl create secret generic claude \
  --from-literal=ANTHROPIC_API_KEY=your-api-key \
  -n crossplane-system

AWS Credentials: Ensure CloudWatch metrics access for RDS monitoring

Current Status: Experimental - Successfully proven concept with real infrastructure, ready for production validation and monitoring.

Load Testing and Benchmarking

Database Stress Testing for AI Scaling Validation

To validate the intelligent scaling system, you can stress test the RDS instance to trigger high CPU utilization and observe AI-driven scaling decisions:

MySQL/MariaDB Stress Test

# Trigger high CPU load with multiple concurrent MD5 hash computations
for i in {1..8}; do
    mysql \
      --host=your-rds-endpoint.region.rds.amazonaws.com \
      --user=masteruser \
      --password=your-password \
      --default-auth=mysql_native_password \
      --execute="SELECT BENCHMARK(1000000000, MD5('trigger_scaling_$i'));" &
done

PostgreSQL Stress Test

# Alternative for PostgreSQL instances
for i in {1..8}; do
    psql "postgresql://masteruser:password@your-rds-endpoint.region.rds.amazonaws.com/upbound" \
      -c "SELECT md5(generate_series(1,10000000)::text);" &
done

Expected Behavior

CPU Spike: Database CPU should reach 80%+ utilization within 1-2 minutes
Metrics Collection: function-rds-metrics captures high CPU in context
AI Analysis: Claude detects threshold breach and recommends scaling
Infrastructure Change: Instance class upgraded (e.g., db.t3.micro → db.t3.small)
Performance Recovery: CPU utilization drops after scaling completes

Monitoring Scaling Events

# Watch Claude's scaling decisions
kubectl get xsqlinstance your-db-name -o jsonpath='{.status.claudeDecision}' | jq .

# Check current instance class via Kubernetes
kubectl get instance.rds -l crossplane.io/composite=your-db-name -o jsonpath='{.items[0].spec.forProvider.instanceClass}'

# Monitor instance class changes with watch
kubectl get instance.rds -l crossplane.io/composite=your-db-name -o custom-columns=NAME:.metadata.name,CLASS:.spec.forProvider.instanceClass,STATUS:.status.conditions[-1].type --watch

# Check performance metrics from XR status
kubectl get xsqlinstance your-db-name -o jsonpath='{.status.performanceMetrics}' | jq .

# Alternative: Monitor AWS console for instance class changes
aws rds describe-db-instances --db-instance-identifier your-db-name --query 'DBInstances[0].DBInstanceClass'

Cleanup

# Stop all background processes
pkill -f "mysql.*BENCHMARK"
# or for PostgreSQL
pkill -f "psql.*generate_series"

Testing

The configuration can be tested using:

up composition render --xrd=apis/definition.yaml apis/composition.yaml examples/mariadb-xr.yaml to render the MariaDB composition
up composition render --xrd=apis/definition.yaml apis/composition.yaml examples/postgres-xr.yaml to render the PostgreSQL composition
up composition render --xrd=apis/definition.yaml apis/composition-intelligent.yaml examples/mariadb-xr-intelligent.yaml to render the intelligent scaling composition
up test run tests/* to run composition tests
up test run tests/* --e2e to run end-to-end tests

Crossplane v2.0 Operations

This configuration includes Crossplane Operations for automated intelligent scaling using both scheduled and reactive patterns. Operations are automatically deployed with up project run.

CronOperation: Scheduled Scaling Analysis

Schedule: Every minute (*/1 * * * *) for proactive monitoring
Target: XSQLInstance resources with scale: me label
Purpose: Regular scheduled analysis for predictable workloads

WatchOperation: Reactive Scaling

Trigger: Any XSQLInstance resource changes
Purpose: Immediate response to metric updates or configuration changes
Debouncing: 5-minute rate limiting to prevent scaling thrash

Operation Features

AI-Powered Decision Making: Uses upbound-function-claude for intelligent scaling decisions
Conservative Thresholds: CPU >85%, Memory <15%, Connections >85%
Instance Progression: db.t3.micro → db.t3.small → db.t3.medium → db.t3.large
Rate Limiting: Annotations-based cooldown to prevent excessive scaling
Audit Trail: Full reasoning captured in resource annotations

Quick Start with Operations

Setup Claude API Secret:

kubectl create secret generic claude \
  --from-literal=ANTHROPIC_API_KEY=your-api-key \
  -n crossplane-system

Deploy Configuration and Operations:

up project run  # Automatically includes operations/

Deploy Example with Scaling Labels:

# Network dependency
kubectl apply -f examples/network-rds-metrics.yaml

# Database with scaling enabled
kubectl apply -f examples/mariadb-xr-rds-metrics.yaml

Monitor Operations:

# Check operation status
kubectl get cronoperation,watchoperation -n crossplane-system

# Monitor operation executions
kubectl get operation -n crossplane-system --watch

Example Resources for Operations Testing

The configuration includes dedicated examples for operations testing:

examples/network-rds-metrics.yaml: Network setup for RDS metrics testing
examples/mariadb-xr-rds-metrics.yaml: XSQLInstance with scale: me label for operation targeting

Deployment

Execute up project run (automatically includes operations)
Alternatively, install the Configuration from the Upbound Marketplace
Check examples for example XR(Composite Resource)

Next steps

This repository serves as a foundational step. To enhance the configuration, consider:

create new API definitions in this same repo
editing the existing API definition to your needs

To learn more about how to build APIs for your managed control planes in Upbound, read the guide on Upbound's docs.

Name		Name	Last commit message	Last commit date
Latest commit History 142 Commits
.github		.github
apis		apis
examples		examples
functions/xsqlinstance		functions/xsqlinstance
operations/rds-intelligent-scaling-cron		operations/rds-intelligent-scaling-cron
tests		tests
.gitignore		.gitignore
.yamllint		.yamllint
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
perf-scale-demo.sh		perf-scale-demo.sh
simple-demo-scaling-monitor.sh		simple-demo-scaling-monitor.sh
upbound.yaml		upbound.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AWS Database Configuration

Overview

Intelligent RDS Scaling Experiment

Architecture

Component Integration

Key Technical Challenges Solved

Decision Making Process

Production Results

Usage

Prerequisites

Load Testing and Benchmarking

Database Stress Testing for AI Scaling Validation

MySQL/MariaDB Stress Test

PostgreSQL Stress Test

Expected Behavior

Monitoring Scaling Events

Cleanup

Testing

Crossplane v2.0 Operations

CronOperation: Scheduled Scaling Analysis

WatchOperation: Reactive Scaling

Operation Features

Quick Start with Operations

Example Resources for Operations Testing

Deployment

Next steps

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

upbound/configuration-aws-database-ai

Folders and files

Latest commit

History

Repository files navigation

AWS Database Configuration

Overview

Intelligent RDS Scaling Experiment

Architecture

Component Integration

Key Technical Challenges Solved

Decision Making Process

Production Results

Usage

Prerequisites

Load Testing and Benchmarking

Database Stress Testing for AI Scaling Validation

MySQL/MariaDB Stress Test

PostgreSQL Stress Test

Expected Behavior

Monitoring Scaling Events

Cleanup

Testing

Crossplane v2.0 Operations

CronOperation: Scheduled Scaling Analysis

WatchOperation: Reactive Scaling

Operation Features

Quick Start with Operations

Example Resources for Operations Testing

Deployment

Next steps

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages