Changed file: spring-ai-docs/src/main/antora/modules/ROOT/pages/api/clients/bedrock/bedrock-anthropic.adoc (82 additions, 80 deletions)

The https://aws.amazon.com/bedrock/claude[AWS Bedrock Anthropic Model Page] and https://docs.aws.amazon.com/bedrock/latest/userguide/what-is-bedrock.html[Amazon Bedrock User Guide] contain detailed information on how to use the AWS hosted model.

== Prerequisites

Refer to the xref:api/clients/bedrock.adoc[Spring AI documentation on Amazon Bedrock] for setting up API access.
== Auto-configuration
Add the `spring-ai-bedrock-ai-spring-boot-starter` Spring Boot starter to your project's Maven `pom.xml` file (shown here with the same version as the `spring-ai-bedrock` module):

[source,xml]
----
<dependency>
    <groupId>org.springframework.ai</groupId>
    <artifactId>spring-ai-bedrock-ai-spring-boot-starter</artifactId>
    <version>0.8.0-SNAPSHOT</version>
</dependency>
----
81
-
==== Enable Anthropic Support
30
+
=== Enable Anthropic Support
Spring AI defines a configuration property named `spring.ai.bedrock.anthropic.chat.enabled` that you should set to `true` to enable support for Anthropic.
Exporting environment variables is one way to set this configuration property.
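
For example, you can set it with an environment variable, using Spring Boot's relaxed binding of the property name:

[source,shell]
----
export SPRING_AI_BEDROCK_ANTHROPIC_CHAT_ENABLED=true
----

=== Chat Properties

The prefix `spring.ai.bedrock.anthropic.chat` is the property prefix that configures the `ChatClient` implementation for Anthropic.

[cols="2,5,1"]
|====
| Property | Description | Default

| spring.ai.bedrock.anthropic.chat.enabled | Enable or disable support for Anthropic | false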
| spring.ai.bedrock.anthropic.chat.model | The model id to use. See the `AnthropicChatModel` enumeration for the supported models. | anthropic.claude-v2
| spring.ai.bedrock.anthropic.chat.options.temperature | Controls the randomness of the output. Values can range over [0.0,1.0]. | 0.8
| spring.ai.bedrock.anthropic.chat.options.topP | The maximum cumulative probability of tokens to consider when sampling. | AWS Bedrock default
| spring.ai.bedrock.anthropic.chat.options.topK | Specify the number of token choices the model uses to generate the next token. | AWS Bedrock default
| spring.ai.bedrock.anthropic.chat.options.stopSequences | Configure up to four sequences that the model recognizes. After a stop sequence, the model stops generating further tokens. The returned text doesn't contain the stop sequence. | 10
| spring.ai.bedrock.anthropic.chat.options.anthropicVersion | The version of the model to use. | bedrock-2023-05-31
| spring.ai.bedrock.anthropic.chat.options.maxTokensToSample | Specify the maximum number of tokens to use in the generated response. Note that the models may stop before reaching this maximum. This parameter only specifies the absolute maximum number of tokens to generate. We recommend a limit of 4,000 tokens for optimal performance. | 500
|====
Look at the Spring AI enumeration `AnthropicChatModel` for other model IDs. The other value supported is `anthropic.claude-instant-v1`.
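For example: `spring.ai.bedrock.anthropic.chat.model=anthropic.claude-instant-v1`.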
Model ID values can also be found in the https://docs.aws.amazon.com/bedrock/latest/userguide/model-ids-arns.html[AWS Bedrock documentation for base model IDs].
=== Sample Code
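
For example, a minimal `application.properties` might look like this (the values shown are illustrative; see the xref:api/clients/bedrock.adoc[Amazon Bedrock] page for the connection properties):

[source,properties]
----
spring.ai.bedrock.aws.region=us-east-1
spring.ai.bedrock.anthropic.chat.enabled=true
spring.ai.bedrock.anthropic.chat.options.temperature=0.8
----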
This will create a `ChatClient` implementation that you can inject into your class.
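
Here is a minimal controller sketch along those lines; the request mappings and the default prompt are illustrative, not part of the documented API:

[source,java]
----
@RestController
public class ChatController {

    private final BedrockAnthropicChatClient chatClient;

    @Autowired
    public ChatController(BedrockAnthropicChatClient chatClient) {
        this.chatClient = chatClient;
    }

    // Blocking call that returns the complete generation in one response.
    @GetMapping("/ai/generate")
    public Map<String, String> generate(@RequestParam(value = "message", defaultValue = "Tell me a joke") String message) {
        return Map.of("generation", chatClient.call(message));
    }

    // Streaming call that emits ChatResponse chunks as they are produced.
    @GetMapping("/ai/generateStream")
    public Flux<ChatResponse> generateStream(@RequestParam(value = "message", defaultValue = "Tell me a joke") String message) {
        return chatClient.stream(new Prompt(message));
    }
}
----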
== Manual Configuration
The link:./src/main/java/org/springframework/ai/bedrock/anthropic/BedrockAnthropicChatClient.java[BedrockAnthropicChatClient] implements the `ChatClient` and `StreamingChatClient` and uses the `AnthropicChatBedrockApi` library to connect to the Bedrock Anthropic service.
Add the `spring-ai-bedrock` dependency to your project's Maven `pom.xml` file:
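
For example:

[source,xml]
----
<dependency>
    <groupId>org.springframework.ai</groupId>
    <artifactId>spring-ai-bedrock</artifactId>
    <version>0.8.0-SNAPSHOT</version>
</dependency>
----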
NOTE: Refer to the xref:getting-started.adoc#_dependency_management[Dependency Management] section to add Milestone and/or Snapshot Repositories to your build file.
Next, create a `BedrockAnthropicChatClient` instance and use it for text generation requests:
[source,java]
----
AnthropicChatBedrockApi anthropicApi = new AnthropicChatBedrockApi(AnthropicChatModel.CLAUDE_V2.id(),
    EnvironmentVariableCredentialsProvider.create(), Region.US_EAST_1.id(), new ObjectMapper());

// The enum constant and the single-argument constructor follow the pattern of the
// Llama2 example; an options-based overload may also be available.
BedrockAnthropicChatClient chatClient = new BedrockAnthropicChatClient(anthropicApi);

ChatResponse response = chatClient.call(
    new Prompt("Generate the names of 5 famous pirates."));

// Or with streaming responses
Flux<ChatResponse> streamingResponse = chatClient.stream(
    new Prompt("Generate the names of 5 famous pirates."));
----
Changed file: spring-ai-docs/src/main/antora/modules/ROOT/pages/api/clients/bedrock/bedrock-llama2.adoc (72 additions, 73 deletions)

The https://aws.amazon.com/bedrock/llama-2/[AWS Llama 2 Model Page] and https://docs.aws.amazon.com/bedrock/latest/userguide/what-is-bedrock.html[Amazon Bedrock User Guide] contain detailed information on how to use the AWS hosted model.
== Prerequisites
Refer to the xref:api/clients/bedrock.adoc[Spring AI documentation on Amazon Bedrock] for setting up API access.
== Auto-configuration
Add the `spring-ai-bedrock-ai-spring-boot-starter` Spring Boot starter to your build. For Gradle, add it to the `dependencies` block of `build.gradle`:

[source,gradle]
----
dependencies {
    // Use the version that matches your Spring AI snapshot or milestone.
    implementation 'org.springframework.ai:spring-ai-bedrock-ai-spring-boot-starter:0.8.0-SNAPSHOT'
}
----
=== Enable Llama2 Chat Support
Spring AI defines a configuration property named `spring.ai.bedrock.llama2.chat.enabled` that you should set to `true` to enable support for Llama2.
Exporting environment variables is one way to set this configuration property.
[source,shell]
----
export SPRING_AI_BEDROCK_LLAMA2_CHAT_ENABLED=true
----
=== Chat Properties
The prefix `spring.ai.bedrock.aws` is the property prefix to configure the connection to AWS Bedrock.
[cols="3,3,3"]
|====
| Property | Description | Default
| spring.ai.bedrock.aws.region | AWS region to use. | us-east-1
|====

The prefix `spring.ai.bedrock.llama2.chat` is the property prefix that configures the `ChatClient` implementation for Llama2.
[cols="2,5,1"]
|====
| Property | Description | Default

| spring.ai.bedrock.llama2.chat.enabled | Enable or disable support for Llama2 | false
| spring.ai.bedrock.llama2.chat.model | The model id to use (see below) | meta.llama2-70b-chat-v1
| spring.ai.bedrock.llama2.chat.options.temperature | Controls the randomness of the output. Values can range over [0.0,1.0], inclusive. A value closer to 1.0 produces more varied responses, while a value closer to 0.0 typically produces less surprising responses. This value specifies the default used by the backend when calling the model. | 0.7
| spring.ai.bedrock.llama2.chat.options.top-p | The maximum cumulative probability of tokens to consider when sampling. The model uses combined Top-k and nucleus sampling. Nucleus sampling considers the smallest set of tokens whose probability sum is at least topP. | AWS Bedrock default
| spring.ai.bedrock.llama2.chat.options.max-gen-len | Specify the maximum number of tokens to use in the generated response. The model truncates the response once the generated text exceeds maxGenLen. | 300
|====
Look at the Spring AI enumeration `Llama2ChatModel` for other model IDs. The other value supported is `meta.llama2-13b-chat-v1`.
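For example: `spring.ai.bedrock.llama2.chat.model=meta.llama2-13b-chat-v1`.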
Model ID values can also be found in the https://docs.aws.amazon.com/bedrock/latest/userguide/model-ids-arns.html[AWS Bedrock documentation for base model IDs].
=== Sample Code
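
For example, a minimal `application.properties` might look like this (the values shown are illustrative):

[source,properties]
----
spring.ai.bedrock.aws.region=us-east-1
spring.ai.bedrock.llama2.chat.enabled=true
spring.ai.bedrock.llama2.chat.options.temperature=0.7
----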
This will create a `ChatClient` implementation that you can inject into your class.
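
A minimal controller sketch, mirroring the Anthropic example (the mappings and default prompt are again illustrative):

[source,java]
----
@RestController
public class ChatController {

    private final BedrockLlama2ChatClient chatClient;

    @Autowired
    public ChatController(BedrockLlama2ChatClient chatClient) {
        this.chatClient = chatClient;
    }

    // Blocking call.
    @GetMapping("/ai/generate")
    public Map<String, String> generate(@RequestParam(value = "message", defaultValue = "Tell me a joke") String message) {
        return Map.of("generation", chatClient.call(message));
    }

    // Streaming call.
    @GetMapping("/ai/generateStream")
    public Flux<ChatResponse> generateStream(@RequestParam(value = "message", defaultValue = "Tell me a joke") String message) {
        return chatClient.stream(new Prompt(message));
    }
}
----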
== Manual Configuration
Add the `spring-ai-bedrock` dependency to your project's Maven `pom.xml` file:
[source,xml]
----
<dependency>
    <groupId>org.springframework.ai</groupId>
    <artifactId>spring-ai-bedrock</artifactId>
    <version>0.8.0-SNAPSHOT</version>
</dependency>
----
NOTE: Refer to the xref:getting-started.adoc#_dependency_management[Dependency Management] section to add Milestone and/or Snapshot Repositories to your build file.
The link:./src/main/java/org/springframework/ai/bedrock/llama2/BedrockLlama2ChatClient.java[BedrockLlama2ChatClient] implements the `ChatClient` and `StreamingChatClient` and uses the `Llama2ChatBedrockApi` library to connect to the Bedrock Llama2 service.
Here is how to create and use a `BedrockLlama2ChatClient`:
[source,java]
----
Llama2ChatBedrockApi api = new Llama2ChatBedrockApi(Llama2ChatModel.LLAMA2_70B_CHAT_V1.id(),
    EnvironmentVariableCredentialsProvider.create(), Region.US_EAST_1.id(), new ObjectMapper());

BedrockLlama2ChatClient chatClient = new BedrockLlama2ChatClient(api,
    BedrockLlama2ChatOptions.builder()
        .withTemperature(0.5f)
        .withMaxGenLen(100)
        .withTopP(0.9f)
        .build());

ChatResponse response = chatClient.call(
    new Prompt("Generate the names of 5 famous pirates."));

// Or with streaming responses
Flux<ChatResponse> streamingResponse = chatClient.stream(
    new Prompt("Generate the names of 5 famous pirates."));
----