Token 是什么？- Higress 技术词汇表

📖 定义

Token is the basic unit for processing text in large language models. Models split input text into a series of tokens for processing, where each token may be a word, subword, or character. Token count directly affects API call costs and response time.

🔗 在 Higress 中的应用

Higress AI Gateway provides token-level traffic management, supporting quota management and rate limiting based on token consumption, helping enterprises precisely control AI call costs.

💡 示例

1 "Hello World" in English is typically split into 2 tokens
2 Chinese tokenization is usually done by character or word group
3 GPT-4's context window supports 128K tokens

⚙️ 配置示例

YAML

# Higress Token Rate Limiting Configuration Example
rule:
  limit_by_token:
    tokens_per_minute: 100000
    tokens_per_day: 1000000

🔄 相关术语

Large Language Model

Prompt

Context Window

❓ 常见问题

Token 是什么？