Overview
WAN 2.6 redefines precision video synthesis by combining the visual integrity of a reference image with the expressiveness of natural-language prompts. Unlike generic text-to-video models, it keeps your subject recognizable, consistent, and contextually animated: ideal for brands, creators, and developers who need control without compromise.
Technical Specifications
- Architecture: Hybrid diffusion-transformer backbone with cross-attention mechanisms
- Input Modalities: One reference image + text prompt (supports multi-language prompts via CLIP encoder)
- Output Resolution: Native 768×768 at 24 FPS (upscalable to 1024×1024 with optional post-refinement)
- Video Length: 2–8 seconds (adjustable via inference parameters)
- Training Data: 30M+ video-text-image triples, filtered for motion diversity and semantic alignment
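The inference constraints above (2–8 second clips, 24 FPS, native 768×768 with optional 1024×1024 upscaling) can be sketched as a small request-validation helper. This is an illustrative sketch only: `GenerationRequest` and its fields are hypothetical names, not part of any official WAN 2.6 SDK.

```python
from dataclasses import dataclass

# Constraints taken from the specification list above; class and field
# names are illustrative, not an official WAN 2.6 API.
NATIVE_RES = (768, 768)
UPSCALED_RES = (1024, 1024)
FPS = 24
MIN_SECONDS, MAX_SECONDS = 2, 8

@dataclass
class GenerationRequest:
    reference_image: str   # path or URL to the single reference image
    prompt: str            # text prompt (multi-language per the spec)
    seconds: int = 4       # adjustable within the documented 2-8 s range
    upscale: bool = False  # optional post-refinement to 1024x1024

    def validate(self) -> None:
        """Reject requests outside the documented limits."""
        if not (MIN_SECONDS <= self.seconds <= MAX_SECONDS):
            raise ValueError(f"seconds must be in [{MIN_SECONDS}, {MAX_SECONDS}]")
        if not self.prompt.strip():
            raise ValueError("a text prompt is required")

    @property
    def resolution(self) -> tuple[int, int]:
        return UPSCALED_RES if self.upscale else NATIVE_RES

    @property
    def frame_count(self) -> int:
        return self.seconds * FPS
```

For example, a valid 6-second request at native resolution yields 144 frames (6 × 24 FPS).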
API Pricing
- 720P: $0.0903126 per second of generated video
- 1080P: $0.15052065 per second of generated video
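Because billing is per second of output, clip cost is simply rate × duration. A minimal estimator using the rates listed above (the function and table names are illustrative, not an official client):

```python
# Per-second rates from the pricing table above (USD).
RATES_PER_SECOND = {
    "720P": 0.0903126,
    "1080P": 0.15052065,
}

def estimate_cost(resolution: str, seconds: float) -> float:
    """Estimated charge in USD for one generated clip."""
    if resolution not in RATES_PER_SECOND:
        raise ValueError(f"unknown resolution tier: {resolution!r}")
    return RATES_PER_SECOND[resolution] * seconds
```

At the maximum 8-second clip length, this works out to roughly $0.72 for 720P and $1.20 for 1080P.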
Key Features
- Reference Identity Lock: Preserves facial features, object structure, and style from input image
- Prompt-Directed Motion: Natural movement guided by verbs and action descriptors (e.g., “gently swaying,” “running toward camera”)
- Temporal Coherence Engine: Minimizes flicker and object drift across frames
- Style Transfer Support: Apply artistic styles (e.g., watercolor, cyberpunk) without losing motion logic
- Zero-Shot Generalization: Works on unseen domains (fashion, anime, robotics, medical imaging)
Use Cases
- Content Creation: Turn product photos into short ads or social clips
- Film & Gaming: Rapid storyboarding and animatic generation
- E-commerce: Dynamic try-on demos (clothing, accessories, cosmetics)
- Education: Visualize scientific processes from diagrams
- AI Research: Baseline for reference-conditioned video synthesis tasks
Model Comparison
vs. Sora (OpenAI)
- Sora generates longer videos (up to 60s) but lacks fine-grained reference control.
- WAN 2.6 offers superior identity preservation when animating a specific character or object.
vs. Stable Video Diffusion (SVD)
- SVD is open-source but requires multiple reference frames for stable motion.
- WAN 2.6 achieves comparable quality from a single reference image, making it more practical for workflows where only one image of the subject is available.