Which models to add
AI models evolve quickly. We recommend adding at least one flagship model from each major provider. This gives your users access to the best available model for different tasks while keeping your setup manageable.
When a provider releases a new model, add it alongside the existing one rather than replacing it. Users may have agents or workflows that rely on specific models, so removing them without notice can cause disruption.
The sections below list the recommended model types and configuration values for each provider.
Model-specific configuration
Use these values when configuring models manually. With a prebuilt Langdock config, these are applied automatically.
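As a rough illustration of how one row of the tables on this page could be recorded and sanity-checked before entering it manually, here is a minimal Python sketch. The `ModelConfig` type, its field names, and the `check` helper are hypothetical and only mirror the table columns; they are not Langdock's actual settings schema or API.

```python
from dataclasses import dataclass, field


# Hypothetical record type: field names mirror the table columns on this page,
# not Langdock's actual settings schema.
@dataclass(frozen=True)
class ModelConfig:
    name: str
    api_type: str            # "Responses API" or "Completion API"
    context_size: int        # tokens
    max_output_tokens: int   # tokens
    options: dict = field(default_factory=dict)


def check(config: ModelConfig) -> list[str]:
    """Return a list of problems found in a manually entered config."""
    problems = []
    if config.api_type not in ("Responses API", "Completion API"):
        problems.append(f"unknown API type: {config.api_type}")
    if config.max_output_tokens > config.context_size:
        problems.append("max output tokens exceed context size")
    return problems


# Values copied from the GPT-5.4 Thinking row on this page.
gpt_54_thinking = ModelConfig(
    name="GPT-5.4 Thinking",
    api_type="Responses API",
    context_size=200_000,
    max_output_tokens=32_000,
    options={"always_show_reasoning": True, "reasoning": "medium", "verbosity": "medium"},
)

print(check(gpt_54_thinking))  # prints []
```

Keeping the values in one typed record makes it easier to spot a swapped context size and output limit before saving the model.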
OpenAI
Add these model types:
- Latest flagship - Most capable for complex tasks
- Efficient variant (mini/nano) - Fast, cost-effective for everyday use
- Reasoning model (o-series) - For analytical and mathematical tasks
Look for the highest version number available. OpenAI increments version numbers with each major release.
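The "highest version number" rule can be sketched in a few lines of Python. The helper name `version_key` and the sample list are illustrative assumptions; the sketch only parses the display names used on this page and does not query any provider API.

```python
import re


def version_key(model_name: str) -> tuple[int, ...]:
    """Extract the first numeric version in a name like 'GPT-5.4 Mini' as a tuple."""
    match = re.search(r"\d+(?:\.\d+)*", model_name)
    if match is None:
        return ()
    return tuple(int(part) for part in match.group().split("."))


names = ["GPT-5.2", "GPT-5.5", "GPT-4.1", "GPT-5", "GPT-4o"]
newest = max(names, key=version_key)
print(newest)  # prints GPT-5.5
```

Comparing tuples of integers rather than raw strings means a hypothetical "5.10" would sort after "5.9". The comparison is only meaningful within one model family, so o-series names such as "o3" should be compared separately.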
GPT-5.5
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| GPT-5.5 | Responses API | 200,000 | 16,000 | Enable Always show reasoning, reasoning: none, verbosity: low |
GPT-5.4
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| GPT-5.4 | Responses API | 200,000 | 32,000 | Reasoning: none, verbosity: medium |
| GPT-5.4 Thinking | Responses API | 200,000 | 32,000 | Enable Always show reasoning, reasoning: medium, verbosity: medium |
| GPT-5.4 Mini | Responses API | 200,000 | 32,000 | None |
GPT-5.2
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| GPT-5.2 | Responses API | 200,000 | 32,000 | Reasoning: none, verbosity: medium |
| GPT-5.2 Thinking | Responses API | 200,000 | 32,000 | Enable Always show reasoning, reasoning: medium, verbosity: medium |
| GPT-5.2 Pro | Responses API | 200,000 | 16,384 | Enable Always show reasoning, reasoning: medium, verbosity: low |
GPT-5.1
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| GPT-5.1 | Responses API | 200,000 | 32,000 | Reasoning: medium, verbosity: medium |
| GPT-5.1 Thinking | Responses API | 200,000 | 32,000 | Enable Always show reasoning, reasoning: medium, verbosity: medium |
| GPT-5.1 Thinking Fast | Responses API | 200,000 | 32,000 | Enable Always show reasoning, reasoning: none, verbosity: low |
GPT-5
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| GPT-5 | Responses API | 200,000 | 32,000 | Enable Always show reasoning, reasoning: minimal, verbosity: low |
| GPT-5 Thinking | Responses API | 200,000 | 32,000 | Enable Always show reasoning |
| GPT-5 Pro | Responses API | 200,000 | 16,384 | Enable Always show reasoning, reasoning: high, verbosity: low |
| GPT-5 Mini | Responses API | 200,000 | 32,000 | Reasoning: minimal, verbosity: low |
| GPT-5 Nano | Responses API | 200,000 | 32,000 | Reasoning: minimal, verbosity: low |
GPT-4.1
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| GPT-4.1 | Completion API | 200,000 | 32,768 | None (well suited to data analysis) |
| GPT-4.1 Mini | Completion API | 200,000 | 32,768 | None |
| GPT-4.1 Nano | Completion API | 200,000 | 32,768 | None |
GPT-4o
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| GPT-4o | Completion API | 128,000 | 16,384 | None |
| GPT-4o Mini | Completion API | 128,000 | 16,384 | None |
GPT oss
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| GPT oss (120b) | Completion API | 128,000 | 16,384 | Enable Always show reasoning |
Reasoning models (o-series)
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| o3 | Responses API | 200,000 | 32,000 | Enable Always show reasoning |
| o3 Mini high | Responses API | 200,000 | 32,000 | Enable Always show reasoning, reasoning: high |
| o3 Pro | Responses API | 200,000 | 32,000 | Enable Always show reasoning |
| o4 Mini | Responses API | 200,000 | 32,000 | Enable Always show reasoning |
Anthropic
Add these model types:
- Sonnet - Balanced model for most tasks, excellent at coding and writing
- Sonnet Reasoning - Enhanced version for complex logical tasks
- Opus (if available) - Most intelligent for demanding tasks
Anthropic distinguishes its models by tier name (Opus > Sonnet > Haiku) in addition to version numbers. Opus is the most capable tier and Haiku is the fastest.
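The tier ordering can be expressed as a small lookup, sketched here in Python. The `TIER_RANK` mapping and the `tier_of` helper are hypothetical names for illustration; the ranking itself simply restates the Opus > Sonnet > Haiku order described above.

```python
# Higher number = more capable tier, restating Opus > Sonnet > Haiku.
TIER_RANK = {"Opus": 3, "Sonnet": 2, "Haiku": 1}


def tier_of(model_name: str) -> str:
    """Return the tier name contained in a display name like 'Claude Sonnet 4.6'."""
    for tier in TIER_RANK:
        if tier in model_name:
            return tier
    raise ValueError(f"no known tier in {model_name!r}")


names = ["Claude Haiku 4.5", "Claude Opus 4.7", "Claude Sonnet 4.6"]
by_capability = sorted(names, key=lambda n: TIER_RANK[tier_of(n)], reverse=True)
print(by_capability)
# prints ['Claude Opus 4.7', 'Claude Sonnet 4.6', 'Claude Haiku 4.5']
```

This sorts by tier only. How a newer Sonnet compares with an older Opus is not something the tier name alone can answer.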
Claude Opus
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| Claude Opus 4.7 1M | Completion API | 1,000,000 | 32,000 | None |
| Claude Opus 4.7 | Completion API | 200,000 | 32,000 | None |
| Claude Opus 4.6 1M | Completion API | 1,000,000 | 32,000 | None |
| Claude Opus 4.6 | Completion API | 200,000 | 32,000 | None |
| Claude Opus 4.6 Reasoning | Completion API | 200,000 | 32,000 | Enable Always show reasoning |
| Claude Opus 4.5 | Completion API | 200,000 | 32,000 | None |
| Claude Opus 4.5 Reasoning | Completion API | 200,000 | 32,000 | Enable Always show reasoning |
Claude Sonnet
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| Claude Sonnet 4.6 | Completion API | 200,000 | 32,000 | None |
| Claude Sonnet 4.6 Reasoning | Completion API | 200,000 | 32,000 | Enable Always show reasoning |
| Claude Sonnet 4.5 | Completion API | 200,000 | 32,000 | None |
| Claude Sonnet 4.5 Reasoning | Completion API | 200,000 | 32,000 | Enable Always show reasoning |
| Claude Sonnet 4 | Completion API | 200,000 | 32,000 | None |
| Claude Sonnet 3.7 | Completion API | 200,000 | 32,000 | None |
| Claude Sonnet 3.7 Reasoning | Completion API | 200,000 | 32,000 | Enable Always show reasoning |
| Claude Sonnet 3.5 | Completion API | 200,000 | 8,000 | None |
Claude Haiku
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| Claude Haiku 4.5 | Completion API | 200,000 | 32,000 | None |
Google
Add these model types:
- Gemini Pro - Flagship model with advanced multimodal capabilities
- Gemini Flash - Fast, efficient for real-time applications
Google uses “Pro” for its flagship models and “Flash” for its fast variants. Higher version numbers indicate newer releases.
Gemini Pro
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| Gemini 3.1 Pro Preview | Completion API | 200,000 | 32,000 | None |
| Gemini 2.5 Pro | Completion API | 200,000 | 32,000 | None |
| Gemini 2.5 Pro Reasoning | Completion API | 200,000 | 32,000 | Enable Always show reasoning |
Gemini Flash
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| Gemini 3 Flash Preview | Completion API | 200,000 | 32,000 | None |
| Gemini 2.5 Flash | Completion API | 200,000 | 32,000 | None |
Others
Mistral
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| Mistral Large 3 | Completion API | 200,000 | 16,000 | None |
| Mistral Large 2411 | Completion API | 128,000 | 16,000 | None |
| Mistral Medium | Completion API | 128,000 | 8,000 | None |
Meta
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| Llama 4 Maverick | Completion API | 200,000 | 16,000 | None |
| Llama 3.3 70B | Completion API | 128,000 | 16,000 | None |
DeepSeek
| Model | API Type | Context Size | Max Output Tokens | Configuration |
|---|---|---|---|---|
| DeepSeek r1 | Completion API | 128,000 | 8,000 | Enable Always show reasoning |
| DeepSeek v3.1 | Completion API | 128,000 | 16,000 | None |
For the most up-to-date model information and capabilities, check the model picker in app.langdock.com. Model naming follows consistent patterns across providers. See our Model Guide for help understanding these patterns.