Skip to content

FAQ

1. What is AI-TokenHub?

AI-TokenHub is an enterprise-level LLM Token platform that provides a unified API interface to access different AI models. You can use the full range of doubao-seed series large models with a single API Key.

2. How to get started?

  1. Register and login to AI-TokenHub
  2. Create API Key in the console
  3. Use API Key to call endpoints

See Quick Start for details.

3. Which models are supported?

We fully support the doubao-seed series of large models, with specific parameters as follows:

✅ Latest Version Models

Model IDCapabilitiesToken LimitRate Limit
doubao-seed-2-0-pro-260215Deep Thinking, Text Generation, Multimodal Understanding, Tool CallingContext Window: 256k
Max Input: 256k
Max Output (default 4k): 128k
Max Chain of Thought: 128k
Max RPM: 30000
Max TPM: 5000000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-2-0-lite-260215Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured OutputContext Window: 256k
Max Input: 256k
Max Output (default 4k): 128k
Max Chain of Thought: 128k
Max RPM: 30000
Max TPM: 5000000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-2-0-mini-260215Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured OutputContext Window: 256k
Max Input: 256k
Max Output (default 4k): 128k
Max Chain of Thought: 128k
Max RPM: 30000
Max TPM: 5000000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-2-0-code-preview-260215Deep Thinking, Text Generation, Multimodal Understanding, Tool CallingContext Window: 256k
Max Input: 256k
Max Output (default 4k): 128k
Max Chain of Thought: 128k
Max RPM: 30000
Max TPM: 5000000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details

📜 Legacy Version Models

Model IDCapabilitiesToken LimitRate Limit
doubao-seed-1-8-251228Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured OutputContext Window: 256k
Max Input: 224k
Max Output (default 4k): 32k
Max Chain of Thought: 32k
Max RPM: 30000
Max TPM: 5000000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-code-preview-251028Deep Thinking, Programming Scenario Enhanced Text Generation, Multimodal Understanding, Tool CallingContext Window: 256k
Max Input: 224k
Max Output (default 4k): 32k
Max Chain of Thought: 32k
Max RPM: 5000
Max TPM: 1200000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-1-6-lite-251015Deep Thinking, Text Generation, Multimodal Understanding, Tool CallingContext Window: 256k
Max Input: 224k
Max Output (default 4k): 32k
Max Chain of Thought: 32k
Max RPM: 30000
Max TPM: 5000000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-1-6-flash-250828Deep Thinking, Text Generation, Visual Positioning, Multimodal Understanding, Tool Calling, Structured OutputContext Window: 256k
Max Input: 224k
Max Output (default 4k): 32k
Max Chain of Thought: 32k
Max RPM: 30000
Max TPM: 5000000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-1-6-vision-250815Deep Thinking, Text Generation, Multimodal Understanding, GUI Tasks, Tool Calling, Structured OutputContext Window: 256k
Max Input: 224k
Max Output (default 4k): 32k
Max Chain of Thought: 32k
Max RPM: 30000
Max TPM: 5000000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details
doubao-seed-1-6-251015Deep Thinking, Text Generation, Multimodal Understanding, Tool Calling, Structured OutputContext Window: 256k
Max Input: 224k
Max Output (default 4k): 32k
Max Chain of Thought: 32k
Max RPM: 30000
Max TPM: 5000000
Non-rigid guarantee, affected by platform load/calling method, see documentation for details

4. Is API compatible with OpenAI?

Yes, AI-TokenHub API is fully compatible with OpenAI API format. You can use OpenAI SDK directly, just modify base_url:

python
from openai import OpenAI

client = OpenAI(
    base_url="https://ai-tokenhub.com/api/v1",
    api_key="YOUR_TOKENHUB_KEY"
)

5. How does billing work?

  • Charged by token usage
  • Different models have different pricing
  • View real-time usage and costs in console
  • Supports recharge and quota management

6. Is there free quota?

New users receive a certain amount of free testing quota after registration for development and debugging.

7. How to handle rate limits?

When rate limit is exceeded, API returns 429 error. Recommendations:

  1. Implement retry mechanism, wait and retry
  2. Batch process requests to reduce request count
  3. Use streaming to reduce concurrency

See API Key Limits for details.

8. How to set fallback models?

Use fallback_models parameter:

json
{
  "model": "doubao-seed-2-0-pro-260215",
  "fallback_models": ["doubao-seed-2-0-lite-260215", "doubao-seed-2-0-mini-260215"]
}

System will automatically try fallback models when primary model is unavailable.

9. Is streaming supported?

Yes. Set stream: true to enable streaming. See Streaming for details.

10. How to get technical support?

  • Check Errors and Debugging documentation
  • Login to console to view usage logs
  • Contact customer service for help