Otto configuration
Configure Otto agents to customize their behavior, model settings, tool access, and MCP server connections within your Ascend Project.
Organization
The otto.yaml configuration must be located within a dedicated otto directory in your Ascend Project:
- Screenshot
- File tree

├── otto
│ ├── agents
│ │ └── custom_agent.md
│ ├── mcp.yaml
│ ├── otto.yaml
│ └── rules
│ └── demo_welcome.yaml
│ └── git_commits.yaml
Configuration structure
otto/otto.yaml
otto:
default_model: <model-identifier> # Default model for all agents
model_settings: # Default model settings
temperature: <0.0-1.0>
max_tokens: <number>
agents:
<agent_id>:
additional_instructions: ... # Append to system instructions
instructions: ... # Replace system instructions entirely
max_turns: ... # Override conversation turn limit
model: ... # Override model for this agent
model_settings: ... # Override model settings
mcp_servers: ... # Grant access to MCP servers
tools: ... # Configure tool access
Model configuration
Default model
Set a default model for all agents:
otto/otto.yaml
otto:
default_model: gpt-4.1
Model settings
Configure model parameters like temperature and token limits:
otto/otto.yaml
otto:
model_settings:
temperature: 0.7 # Response randomness (0.0-1.0)
max_tokens: 4096 # Maximum response length
Per-agent model override
Override the model for specific agents:
otto/otto.yaml
otto:
agents:
chat:
model: gpt-4.1
model_settings:
temperature: 0.3 # Lower for more deterministic responses
"Code Reviewer":
model: claude-3-5-sonnet
model_settings:
temperature: 0.2
Extended thinking (Claude models)
Enable extended thinking for Claude models to improve reasoning on complex tasks. Extended thinking allows Claude to work through problems step-by-step before responding.
otto/otto.yaml
otto:
agents:
chat:
model: bedrock/us.anthropic.claude-sonnet-4-5-20250929-v1:0
model_settings:
thinking:
type: enabled
budget_tokens: 4096 # Max tokens for reasoning (1024-16384)
| Setting | Description |
|---|---|
type | Set to enabled to activate extended thinking |
budget_tokens | Maximum tokens Claude can use for reasoning. Higher values allow deeper analysis but increase latency and cost. |
Extended thinking is most useful for:
- Complex code analysis and debugging
- Multi-step planning tasks
- Problems requiring careful reasoning