What is Model Routing? - Higress Technical Glossary

📖 Definition

Model Routing is a technique that dynamically selects the most appropriate LLM based on request complexity, cost requirements, response time, or content type. It can balance cost and performance to achieve optimal utilization of AI resources.

🔗 How Higress Uses This

Higress provides powerful model routing capabilities, distributing traffic to GPT-4 or lower-cost local models based on request characteristics such as user level and token length.

💡 Examples

1 Simple classification tasks routed to lightweight models to reduce costs
2 Logical reasoning tasks routed to high-performance large models to ensure accuracy
3 Automatically switches to backup model routing when the primary model responds slowly

🔄 Related Terms

Large Language Model

AI Gateway

Failover

❓ FAQ

What is Model Routing?