Introduction to Amazon Bedrock
Nikhil Rangarajan
Data Scientist
Models have parameters to control their behavior
temperature
: Controls randomness in predictions
top_p
: Controls diversity of model's output by including top-ranked tokensmax_tokens
: Sets the maximum length of the outputResponse randomness and creativity
Low temperature (near 0): More focused, deterministic responses
High temperature (near 1): More diverse, creative outputs
Most Bedrock models default to 0.7
prompt = "Write a headline for a
tech article"
request = {
"temperature": 0.2,
"messages": [
{
"role": "user",
"content": [{"type": "text",
"text": prompt}],
}
],
...
}
Temperature = model's 'risk appetite'
Low temperature acts like a cautious decision-maker
High temperature behaves like a creative thinker willing to take risks
prompt = "Explain quantum computing"
# Focused response
request["top_p"] = 0.1
# Diverse response
request["top_p"] = 0.9
Max_tokens limits response length:
prompt = "Explain quantum computing"
# Focused shorter response
request["top_p"] = 0.1
request["max_tokens"] = 100
# Diverse longer response
request["top_p"] = 0.9
request["max_tokens"] = 500
Introduction to Amazon Bedrock