
Open source the actual production configuration #11

@p-bizouard

Description

```yaml
models:
  - name: llama3 # must be lowercase
    model: "casperhansen/llama-3-70b-instruct-awq"
    servedModelName: ""
    quantization: "awq"
    dtype: ""
    gpuMemoryUtilization: "0.96"
    huggingface_token: ""
    ropeScaling:
      enabled: true
      jsonConfiguration: '{"type":"dynamic","factor":4.0}'
      theta: "500000"
    replicaCount: 1
    pvc:
      enabled: true
      storageSize: 60Gi

sender:
  image:
    tag: v1.1.1
consumer:
  image:
    tag: v1.1.1
inferenceserver:
  image:
    repository: vllm/vllm-openai
    tag: v0.5.0
```
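For context, the model values above roughly correspond to the following vLLM server invocation. This is a sketch only: it assumes the chart forwards these values unchanged to the `vllm/vllm-openai` entrypoint, and the flag names are taken from the vLLM OpenAI-compatible server CLI rather than from this chart.

```shell
# Hypothetical mapping of the values file to vLLM server flags —
# the actual command line is built by the chart's deployment template.
python -m vllm.entrypoints.openai.api_server \
  --model casperhansen/llama-3-70b-instruct-awq \
  --quantization awq \
  --gpu-memory-utilization 0.96 \
  --rope-scaling '{"type":"dynamic","factor":4.0}' \
  --rope-theta 500000
```

With `servedModelName` left empty, clients would address the model by its full Hugging Face path in API requests.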
