We are excited to announce that AI/ML API is now officially supported by the ai-proxy, ai-proxy-multi, and ai-request-rewrite plugins in Apache APISIX. This powerful integration allows you to connect your API gateway to over 300 industry-leading AI models, including GPT-4, Claude, Gemini, and more through a single, unified interface.
Why Apache APISIX + AI/ML API?
By integrating AI/ML API directly into Apache APISIX, the popular open-source API gateway, you gain:
- Unified AI Access: One configuration in APISIX unlocks hundreds of LLMs and AI services, eliminating the need to hardcode multiple vendor endpoints.
- Dynamic Model Routing: Automatically route AI requests based on cost efficiency, latency, or specific use cases, switching effortlessly between providers like OpenAI, Anthropic, and Mistral.
- Enterprise-Grade Security: Features such as PII masking, content moderation, and request rewriting protect data and keep AI usage compliant.
- Scalable AI Applications: Power chatbots, real-time translations, summarization, and data enrichment pipelines with low-latency streaming and hybrid multi-cloud deployments—all managed via Apache APISIX’s centralized control plane.
Quick Setup: Connect APISIX to OpenAI in 5 Minutes
Getting started is simple:
- Install Apache APISIX: Installation Guide.
- Obtain your OpenAI API key: OpenAI API Keys.
- Configure a route in conf/config.yaml enabling the AI plugin with your API key and preferred model.
- Reload APISIX and send test chat requests. Apache APISIX now acts as your AI gateway.
Core Use Cases
Apache APISIX combined with AI/ML API allows enterprises and developers to build:
- Unified AI Service Management: Manage multiple AI providers seamlessly, switching models adaptively based on cost, latency, or task.
- Enterprise Security & Compliance: Use features like automatic Personally Identifiable Information (PII) masking, content moderation, and prompt sanitization.
- Cost-Efficient AI Operations: Implement token budgeting, caching of responses, and fallback mechanisms to optimize AI-related expenses.
- Scaling Real-Time AI Applications: Power low-latency chatbots and virtual agents with streaming response capability, as well as enrich APIs with on-the-fly AI transformations.
- Hybrid and Multi-Cloud Deployments: Control models deployed on-premises and in the cloud from a single gateway platform with consistent policies.
Wrap-Up
With AI/ML API’s native support in Apache APISIX, you get unmatched flexibility and control over your AI infrastructure, delivering speed, security, and scalability in one powerful gateway. A simple YAML tweak transforms APISIX into a multi-model AI powerhouse, letting you innovate faster and smarter while maintaining compliance and budget.
Learn More & Explore
Discover the AI plugins in Apache APISIX ecosystem powered by AI/ML API:
Join our growing developer community at AI/ML API Community to stay updated on the latest features and best practices for APISIX AI integration.