AI Content Safety

📖 Definition

AI Content Safety refers to reviewing the inputs (prompts) and outputs (responses) of large language models, identifying and blocking illegal, harmful, pornographic, or otherwise sensitive content so that AI applications comply with regulatory requirements.

🔗 How Higress Uses This

Higress integrates multiple content safety plugins that support sensitive-word filtering and compliance review, preventing AI models from being induced into outputting inappropriate information or leaking private data.
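Such a plugin is typically enabled through gateway configuration. The fragment below is a hypothetical sketch only: the field names (`checkRequest`, `checkResponse`, `denyMessage`) are illustrative assumptions, not a verified plugin schema; consult the Higress plugin documentation for the actual options.

```yaml
# Hypothetical content-safety plugin configuration (field names illustrative):
checkRequest: true     # review user prompts before they reach the model
checkResponse: true    # review model responses before they reach the user
denyMessage: "Your request violates the content safety policy."
```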

💡 Examples

  1. Block user inputs containing politically sensitive terms
  2. Filter fraudulent or false information from model outputs
  3. Detect and block prompt injection attacks via semantic recognition
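The examples above can be sketched in code. The following is a minimal illustration of the general technique, not the Higress implementation: a hard-coded blocklist (the term `forbidden_term` is a placeholder) and a couple of regex-based injection heuristics stand in for the managed word lists and semantic models a real deployment would use.

```python
import re

# Illustrative blocklist; a production system would use a managed,
# regularly updated word list or a moderation API, not a hard-coded set.
BLOCKED_TERMS = {"forbidden_term"}

# Crude prompt-injection heuristics; real detection is semantic, not regex.
INJECTION_PATTERNS = [
    re.compile(r"ignore (all )?previous instructions", re.IGNORECASE),
    re.compile(r"disregard your system prompt", re.IGNORECASE),
]

def check_prompt(prompt: str) -> bool:
    """Return True if the user prompt passes the input-side safety checks."""
    lowered = prompt.lower()
    if any(term in lowered for term in BLOCKED_TERMS):
        return False
    if any(p.search(prompt) for p in INJECTION_PATTERNS):
        return False
    return True

def filter_response(response: str) -> str:
    """Mask blocked terms in a model response instead of rejecting it outright."""
    for term in BLOCKED_TERMS:
        response = re.sub(re.escape(term), "***", response, flags=re.IGNORECASE)
    return response
```

In practice the input check runs before the prompt is forwarded to the model, and the output filter runs on the response stream before it reaches the user.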

FAQ

What is AI Content Safety?
AI Content Safety refers to reviewing the inputs (prompts) and outputs (responses) of large language models, identifying and blocking illegal, harmful, pornographic, or otherwise sensitive content so that AI applications comply with regulatory requirements.
How does Higress support AI Content Safety?
Higress integrates multiple content safety plugins that support sensitive-word filtering and compliance review, preventing AI models from being induced into outputting inappropriate information or leaking private data.
