FAQ
1. What is AI-TokenHub?
AI-TokenHub is an enterprise-level LLM Token platform that provides a unified API interface to access different AI models. You can use the full range of doubao-seed series large models with a single API Key.
2. How to get started?
- Register and login to AI-TokenHub
- Create API Key in the console
- Use API Key to call endpoints
See Quick Start for details.
3. Which models are supported?
We fully support the doubao-seed series of large models, with specific parameters as follows:
✅ Latest Version Models
| Model ID | Capabilities | Token Limit | Rate Limit |
|---|---|---|---|
| doubao-seed-2-0-pro-260215 | Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling | Context Window: 256k Max Input: 256k Max Output (default 4k): 128k Max Chain of Thought: 128k | Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
| doubao-seed-2-0-lite-260215 | Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured Output | Context Window: 256k Max Input: 256k Max Output (default 4k): 128k Max Chain of Thought: 128k | Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
| doubao-seed-2-0-mini-260215 | Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured Output | Context Window: 256k Max Input: 256k Max Output (default 4k): 128k Max Chain of Thought: 128k | Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
| doubao-seed-2-0-code-preview-260215 | Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling | Context Window: 256k Max Input: 256k Max Output (default 4k): 128k Max Chain of Thought: 128k | Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
📜 Legacy Version Models
| Model ID | Capabilities | Token Limit | Rate Limit |
|---|---|---|---|
| doubao-seed-1-8-251228 | Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured Output | Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k | Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
| doubao-seed-code-preview-251028 | Deep Thinking, Programming Scenario Enhanced Text Generation, Multimodal Understanding, Tool Calling | Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k | Max RPM: 5000 Max TPM: 1200000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
| doubao-seed-1-6-lite-251015 | Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling | Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k | Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
| doubao-seed-1-6-flash-250828 | Deep Thinking, Text Generation, Visual Positioning, Multimodal Understanding, Tool Calling, Structured Output | Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k | Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
| doubao-seed-1-6-vision-250815 | Deep Thinking, Text Generation, Multimodal Understanding, GUI Tasks, Tool Calling, Structured Output | Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k | Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
| doubao-seed-1-6-251015 | Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured Output | Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k | Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details |
4. Is API compatible with OpenAI?
Yes, AI-TokenHub API is fully compatible with OpenAI API format. You can use OpenAI SDK directly, just modify base_url:
from openai import OpenAI
client = OpenAI(
base_url="https://ai-tokenhub.com/api/v1",
api_key="YOUR_TOKENHUB_KEY"
)5. How does billing work?
- Charged by token usage
- Different models have different pricing
- View real-time usage and costs in console
- Supports recharge and quota management
6. Is there free quota?
New users receive a certain amount of free testing quota after registration for development and debugging.
7. How to handle rate limits?
When rate limit is exceeded, API returns 429 error. Recommendations:
- Implement retry mechanism, wait and retry
- Batch process requests to reduce request count
- Use streaming to reduce concurrency
See API Key Limits for details.
8. How to set fallback models?
Use fallback_models parameter:
{
"model": "doubao-seed-2-0-pro-260215",
"fallback_models": ["doubao-seed-2-0-lite-260215", "doubao-seed-2-0-mini-260215"]
}System will automatically try fallback models when primary model is unavailable.
9. Is streaming supported?
Yes. Set stream: true to enable streaming. See Streaming for details.
10. How to get technical support?
- Check Errors and Debugging documentation
- Login to console to view usage logs
- Contact customer service for help
