Co-authored-by: Yuanbo Li <ybalbert@amazon.com>tags/0.10.1
| model: anthropic.claude-3-5-sonnet-20241022-v2:0 | |||||
| label: | |||||
| en_US: Claude 3.5 Sonnet V2 | |||||
| model_type: llm | |||||
| features: | |||||
| - agent-thought | |||||
| - vision | |||||
| - tool-call | |||||
| - stream-tool-call | |||||
| model_properties: | |||||
| mode: chat | |||||
| context_size: 200000 | |||||
| # docs: https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-anthropic-claude-messages.html | |||||
| parameter_rules: | |||||
| - name: max_tokens | |||||
| use_template: max_tokens | |||||
| required: true | |||||
| type: int | |||||
| default: 4096 | |||||
| min: 1 | |||||
| max: 4096 | |||||
| help: | |||||
| zh_Hans: 停止前生成的最大令牌数。请注意,Anthropic Claude 模型可能会在达到 max_tokens 的值之前停止生成令牌。不同的 Anthropic Claude 模型对此参数具有不同的最大值。 | |||||
| en_US: The maximum number of tokens to generate before stopping. Note that Anthropic Claude models might stop generating tokens before reaching the value of max_tokens. Different Anthropic Claude models have different maximum values for this parameter. | |||||
| - name: temperature | |||||
| use_template: temperature | |||||
| required: false | |||||
| type: float | |||||
| default: 1 | |||||
| min: 0.0 | |||||
| max: 1.0 | |||||
| help: | |||||
| zh_Hans: 生成内容的随机性。 | |||||
| en_US: The amount of randomness injected into the response. | |||||
| - name: top_p | |||||
| required: false | |||||
| type: float | |||||
| default: 0.999 | |||||
| min: 0.000 | |||||
| max: 1.000 | |||||
| help: | |||||
| zh_Hans: 在核采样中,Anthropic Claude 按概率递减顺序计算每个后续标记的所有选项的累积分布,并在达到 top_p 指定的特定概率时将其切断。您应该更改温度或top_p,但不能同时更改两者。 | |||||
| en_US: In nucleus sampling, Anthropic Claude computes the cumulative distribution over all the options for each subsequent token in decreasing probability order and cuts it off once it reaches a particular probability specified by top_p. You should alter either temperature or top_p, but not both. | |||||
| - name: top_k | |||||
| required: false | |||||
| type: int | |||||
| default: 0 | |||||
| min: 0 | |||||
| # tip docs from aws has error, max value is 500 | |||||
| max: 500 | |||||
| help: | |||||
| zh_Hans: 对于每个后续标记,仅从前 K 个选项中进行采样。使用 top_k 删除长尾低概率响应。 | |||||
| en_US: Only sample from the top K options for each subsequent token. Use top_k to remove long tail low probability responses. | |||||
| - name: response_format | |||||
| use_template: response_format | |||||
| pricing: | |||||
| input: '0.003' | |||||
| output: '0.015' | |||||
| unit: '0.001' | |||||
| currency: USD |
| model: eu.anthropic.claude-3-5-sonnet-20241022-v2:0 | |||||
| label: | |||||
| en_US: Claude 3.5 Sonnet V2(EU.Cross Region Inference) | |||||
| model_type: llm | |||||
| features: | |||||
| - agent-thought | |||||
| - vision | |||||
| - tool-call | |||||
| - stream-tool-call | |||||
| model_properties: | |||||
| mode: chat | |||||
| context_size: 200000 | |||||
| # docs: https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-anthropic-claude-messages.html | |||||
| parameter_rules: | |||||
| - name: max_tokens | |||||
| use_template: max_tokens | |||||
| required: true | |||||
| type: int | |||||
| default: 4096 | |||||
| min: 1 | |||||
| max: 4096 | |||||
| help: | |||||
| zh_Hans: 停止前生成的最大令牌数。请注意,Anthropic Claude 模型可能会在达到 max_tokens 的值之前停止生成令牌。不同的 Anthropic Claude 模型对此参数具有不同的最大值。 | |||||
| en_US: The maximum number of tokens to generate before stopping. Note that Anthropic Claude models might stop generating tokens before reaching the value of max_tokens. Different Anthropic Claude models have different maximum values for this parameter. | |||||
| - name: temperature | |||||
| use_template: temperature | |||||
| required: false | |||||
| type: float | |||||
| default: 1 | |||||
| min: 0.0 | |||||
| max: 1.0 | |||||
| help: | |||||
| zh_Hans: 生成内容的随机性。 | |||||
| en_US: The amount of randomness injected into the response. | |||||
| - name: top_p | |||||
| required: false | |||||
| type: float | |||||
| default: 0.999 | |||||
| min: 0.000 | |||||
| max: 1.000 | |||||
| help: | |||||
| zh_Hans: 在核采样中,Anthropic Claude 按概率递减顺序计算每个后续标记的所有选项的累积分布,并在达到 top_p 指定的特定概率时将其切断。您应该更改温度或top_p,但不能同时更改两者。 | |||||
| en_US: In nucleus sampling, Anthropic Claude computes the cumulative distribution over all the options for each subsequent token in decreasing probability order and cuts it off once it reaches a particular probability specified by top_p. You should alter either temperature or top_p, but not both. | |||||
| - name: top_k | |||||
| required: false | |||||
| type: int | |||||
| default: 0 | |||||
| min: 0 | |||||
| # tip docs from aws has error, max value is 500 | |||||
| max: 500 | |||||
| help: | |||||
| zh_Hans: 对于每个后续标记,仅从前 K 个选项中进行采样。使用 top_k 删除长尾低概率响应。 | |||||
| en_US: Only sample from the top K options for each subsequent token. Use top_k to remove long tail low probability responses. | |||||
| - name: response_format | |||||
| use_template: response_format | |||||
| pricing: | |||||
| input: '0.003' | |||||
| output: '0.015' | |||||
| unit: '0.001' | |||||
| currency: USD |
| model: us.anthropic.claude-3-5-sonnet-20241022-v2:0 | |||||
| label: | |||||
| en_US: Claude 3.5 Sonnet V2(US.Cross Region Inference) | |||||
| model_type: llm | |||||
| features: | |||||
| - agent-thought | |||||
| - vision | |||||
| - tool-call | |||||
| - stream-tool-call | |||||
| model_properties: | |||||
| mode: chat | |||||
| context_size: 200000 | |||||
| # docs: https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-anthropic-claude-messages.html | |||||
| parameter_rules: | |||||
| - name: max_tokens | |||||
| use_template: max_tokens | |||||
| required: true | |||||
| type: int | |||||
| default: 4096 | |||||
| min: 1 | |||||
| max: 4096 | |||||
| help: | |||||
| zh_Hans: 停止前生成的最大令牌数。请注意,Anthropic Claude 模型可能会在达到 max_tokens 的值之前停止生成令牌。不同的 Anthropic Claude 模型对此参数具有不同的最大值。 | |||||
| en_US: The maximum number of tokens to generate before stopping. Note that Anthropic Claude models might stop generating tokens before reaching the value of max_tokens. Different Anthropic Claude models have different maximum values for this parameter. | |||||
| - name: temperature | |||||
| use_template: temperature | |||||
| required: false | |||||
| type: float | |||||
| default: 1 | |||||
| min: 0.0 | |||||
| max: 1.0 | |||||
| help: | |||||
| zh_Hans: 生成内容的随机性。 | |||||
| en_US: The amount of randomness injected into the response. | |||||
| - name: top_p | |||||
| required: false | |||||
| type: float | |||||
| default: 0.999 | |||||
| min: 0.000 | |||||
| max: 1.000 | |||||
| help: | |||||
| zh_Hans: 在核采样中,Anthropic Claude 按概率递减顺序计算每个后续标记的所有选项的累积分布,并在达到 top_p 指定的特定概率时将其切断。您应该更改温度或top_p,但不能同时更改两者。 | |||||
| en_US: In nucleus sampling, Anthropic Claude computes the cumulative distribution over all the options for each subsequent token in decreasing probability order and cuts it off once it reaches a particular probability specified by top_p. You should alter either temperature or top_p, but not both. | |||||
| - name: top_k | |||||
| required: false | |||||
| type: int | |||||
| default: 0 | |||||
| min: 0 | |||||
| # tip docs from aws has error, max value is 500 | |||||
| max: 500 | |||||
| help: | |||||
| zh_Hans: 对于每个后续标记,仅从前 K 个选项中进行采样。使用 top_k 删除长尾低概率响应。 | |||||
| en_US: Only sample from the top K options for each subsequent token. Use top_k to remove long tail low probability responses. | |||||
| - name: response_format | |||||
| use_template: response_format | |||||
| pricing: | |||||
| input: '0.003' | |||||
| output: '0.015' | |||||
| unit: '0.001' | |||||
| currency: USD |