FAQ

1. What is AI-TokenHub?

AI-TokenHub is an enterprise-level LLM Token platform that provides a unified API interface to access different AI models. You can use the full range of doubao-seed series large models with a single API Key.

2. How to get started?

Register and login to AI-TokenHub
Create API Key in the console
Use API Key to call endpoints

See Quick Start for details.

3. Which models are supported?

We fully support the doubao-seed series of large models, with specific parameters as follows:

✅ Latest Version Models

Model ID	Capabilities	Token Limit	Rate Limit
doubao-seed-2-0-pro-260215	Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling	Context Window: 256k Max Input: 256k Max Output (default 4k): 128k Max Chain of Thought: 128k	Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-2-0-lite-260215	Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured Output	Context Window: 256k Max Input: 256k Max Output (default 4k): 128k Max Chain of Thought: 128k	Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-2-0-mini-260215	Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured Output	Context Window: 256k Max Input: 256k Max Output (default 4k): 128k Max Chain of Thought: 128k	Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-2-0-code-preview-260215	Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling	Context Window: 256k Max Input: 256k Max Output (default 4k): 128k Max Chain of Thought: 128k	Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details

📜 Legacy Version Models

Model ID	Capabilities	Token Limit	Rate Limit
doubao-seed-1-8-251228	Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured Output	Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k	Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-code-preview-251028	Deep Thinking, Programming Scenario Enhanced Text Generation, Multimodal Understanding, Tool Calling	Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k	Max RPM: 5000 Max TPM: 1200000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-1-6-lite-251015	Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling	Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k	Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-1-6-flash-250828	Deep Thinking, Text Generation, Visual Positioning, Multimodal Understanding, Tool Calling, Structured Output	Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k	Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-1-6-vision-250815	Deep Thinking, Text Generation, Multimodal Understanding, GUI Tasks, Tool Calling, Structured Output	Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k	Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-1-6-251015	Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured Output	Context Window: 256k Max Input: 224k Max Output (default 4k): 32k Max Chain of Thought: 32k	Max RPM: 30000 Max TPM: 5000000 Non-rigid guarantee, affected by platform load/calling method, see documentation for details

4. Is API compatible with OpenAI?

Yes, AI-TokenHub API is fully compatible with OpenAI API format. You can use OpenAI SDK directly, just modify base_url:

python

from openai import OpenAI

client = OpenAI(
    base_url="https://ai-tokenhub.com/api/v1",
    api_key="YOUR_TOKENHUB_KEY"
)

5. How does billing work?

Charged by token usage
Different models have different pricing
View real-time usage and costs in console
Supports recharge and quota management

6. Is there free quota?

New users receive a certain amount of free testing quota after registration for development and debugging.

7. How to handle rate limits?

When rate limit is exceeded, API returns 429 error. Recommendations:

Implement retry mechanism, wait and retry
Batch process requests to reduce request count
Use streaming to reduce concurrency

See API Key Limits for details.

8. How to set fallback models?

Use fallback_models parameter:

json

{
  "model": "doubao-seed-2-0-pro-260215",
  "fallback_models": ["doubao-seed-2-0-lite-260215", "doubao-seed-2-0-mini-260215"]
}

System will automatically try fallback models when primary model is unavailable.

9. Is streaming supported?

Yes. Set stream: true to enable streaming. See Streaming for details.

10. How to get technical support?

Check Errors and Debugging documentation
Login to console to view usage logs
Contact customer service for help

FAQ ​

1. What is AI-TokenHub? ​

2. How to get started? ​

3. Which models are supported? ​

✅ Latest Version Models ​

📜 Legacy Version Models ​

4. Is API compatible with OpenAI? ​

5. How does billing work? ​

6. Is there free quota? ​

7. How to handle rate limits? ​

8. How to set fallback models? ​

9. Is streaming supported? ​

10. How to get technical support? ​

FAQ