VoiceInfra lets you connect five leading AI providers in one platform and assign any of them to any agent or workflow node — independently. You’re never locked into a single vendor. If a better model launches, you switch from a dropdown. If a provider has an outage, your backup model takes over automatically. If you want to cut costs on simple FAQ calls, route them to a lightweight model and reserve your premium model for high-stakes conversations. The same agent configuration, prompts, and integrations work regardless of which model is underneath.

Supported Providers

| Provider | Available Models | Best For |
| --- | --- | --- |
| OpenAI | GPT-4o, GPT-4.1 Mini, GPT-4.1 Nano | General purpose, high output quality, broad capability |
| Anthropic | Claude 3.5 Sonnet, Claude 3 Haiku | Complex reasoning, nuanced multi-turn conversation |
| Google | Gemini 1.5 Pro, Gemini 1.5 Flash | Multilingual calls, multimodal context |
| Groq | Llama 3.x (8B, 70B) | Speed-critical calls, ultra-low inference latency |
| Mistral | Mistral Large, Mistral Small | Cost-efficient operation, European data residency |

Smart Cost Routing

Not every call needs the same model. A caller asking for your business hours needs a fast, cheap response. A caller troubleshooting a complex technical issue benefits from deeper reasoning. Smart cost routing lets you match the model to the moment.

Example cost comparison (approximate per-minute LLM costs):

| Model | Approximate cost | Best used for |
| --- | --- | --- |
| GPT-4.1 Nano | ~$0.004 / min | FAQs, routing, simple data collection |
| GPT-4.1 Mini | ~$0.012 / min | Standard support, lead qualification |
| GPT-4o | ~$0.06 / min | Complex reasoning, sales conversations |
| Claude 3 Haiku | ~$0.005 / min | Fast, nuanced responses at low cost |
| Groq Llama 3 70B | ~$0.008 / min | Speed-first, high-volume deployments |

Costs vary by provider pricing and are passed through at provider rates when you use your own API keys. Check each provider’s current pricing page for up-to-date figures. These estimates are for LLM usage only; voice (TTS/STT) costs are separate.
By routing simple calls to lightweight models and complex calls to premium models, teams typically reduce their blended AI cost by 40–60% compared to running every call on a single premium model.
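
To make the blended-cost arithmetic concrete, here is a minimal sketch in Python. The per-minute prices come from the table above; the traffic split in `example_mix` is a hypothetical assumption chosen for illustration, not measured data, and the function names are not part of any VoiceInfra API.

```python
# Approximate per-minute LLM costs in USD, copied from the table above.
COST_PER_MIN = {
    "GPT-4.1 Nano": 0.004,
    "GPT-4.1 Mini": 0.012,
    "GPT-4o": 0.06,
}


def blended_cost(mix: dict[str, float]) -> float:
    """Weighted average cost per minute for a traffic mix whose shares sum to 1."""
    total_share = sum(mix.values())
    if abs(total_share - 1.0) > 1e-9:
        raise ValueError(f"shares must sum to 1, got {total_share}")
    return sum(COST_PER_MIN[model] * share for model, share in mix.items())


def savings_vs(baseline_model: str, mix: dict[str, float]) -> float:
    """Fractional saving of the mix relative to running every minute on one model."""
    baseline = COST_PER_MIN[baseline_model]
    return 1.0 - blended_cost(mix) / baseline


# Hypothetical traffic split (an assumption, not measured data):
# 30% of call minutes on Nano, 30% on Mini, 40% on GPT-4o.
example_mix = {"GPT-4.1 Nano": 0.30, "GPT-4.1 Mini": 0.30, "GPT-4o": 0.40}

print(f"blended: ${blended_cost(example_mix):.4f} / min")
print(f"saving vs GPT-4o only: {savings_vs('GPT-4o', example_mix):.0%}")
```

With this assumed split, the blended cost works out to about $0.0288 per minute, roughly 52% less than running every minute on GPT-4o. Your own saving depends entirely on your real traffic mix and current provider pricing.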

Setting Up a Provider

1. Go to Settings → AI Providers
   Open the VoiceInfra dashboard and navigate to Settings in the left sidebar, then select AI Providers.

2. Select your provider and enter your API key
   Click the provider you want to add (Anthropic, OpenAI, Google, Groq, or Mistral). Paste your API key from that provider’s dashboard. VoiceInfra validates the key immediately.

3. The provider is immediately available
   Once saved, the provider and all of its supported models appear in the model picker across every agent and workflow node in your account. No restart or republish required.

4. Select the model per agent or per workflow node
   In the no-code builder, select your model under the Model & Voice tab. In the workflow builder, each Conversation node has its own model selector, so different nodes in the same workflow can use different providers.
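
One way to picture per-node model selection is as a simple mapping from nodes to models. The sketch below is illustrative only: `ConversationNode`, the node names, and `providers_used` are hypothetical and do not reflect VoiceInfra's internal data model or any export format. Model selection itself happens in the dashboard, as described in the steps above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConversationNode:
    """Hypothetical stand-in for a workflow Conversation node."""
    name: str
    provider: str
    model: str


# Hypothetical workflow: lightweight models up front, a stronger one for troubleshooting.
workflow = [
    ConversationNode("greeting", "OpenAI", "GPT-4.1 Nano"),
    ConversationNode("collect_details", "Groq", "Llama 3 70B"),
    ConversationNode("troubleshooting", "Anthropic", "Claude 3.5 Sonnet"),
]


def providers_used(nodes: list[ConversationNode]) -> set[str]:
    """Return the distinct providers a workflow depends on."""
    return {node.provider for node in nodes}


for node in workflow:
    print(f"{node.name}: {node.provider} / {node.model}")
print(sorted(providers_used(workflow)))
```

Listing the distinct providers this way is a quick check on which providers need to be connected under Settings → AI Providers before a workflow like this can run.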

Zero Vendor Lock-In

VoiceInfra is designed so your agents are portable across providers. Your configurations, prompts, behavioral rules, knowledge bases, and tool integrations are all stored independently of the AI provider. Switching models never touches any of that.
  • Switch instantly — change the model for any agent or node from a single dropdown. The change applies to new calls immediately.
  • Automatic fallback — configure a backup model for any agent. If the primary provider is unreachable, calls automatically route to your fallback — no interruption, no manual intervention.
  • Side-by-side testing — run two agents with identical configurations but different models. Compare transcript quality, response accuracy, and cost in the call logs to find the right model for each use case.
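
The automatic fallback behavior described in the list above follows a common primary-then-backup pattern. The Python sketch below illustrates that pattern in general terms. It is not VoiceInfra's implementation: `ProviderUnavailable`, `call_with_fallback`, and the two stand-in model functions are hypothetical names, and no network requests are made.

```python
from typing import Callable


class ProviderUnavailable(Exception):
    """Raised by a model call when its provider cannot be reached (hypothetical)."""


ModelCall = Callable[[str], str]


def call_with_fallback(
    prompt: str, primary: ModelCall, fallback: ModelCall
) -> tuple[str, bool]:
    """Try the primary model; on ProviderUnavailable, use the fallback.

    Returns the reply and a flag saying whether the fallback was used.
    """
    try:
        return primary(prompt), False
    except ProviderUnavailable:
        return fallback(prompt), True


# Stand-in model calls for demonstration only.
def flaky_primary(prompt: str) -> str:
    raise ProviderUnavailable("primary provider unreachable")


def steady_fallback(prompt: str) -> str:
    return f"[fallback] {prompt}"


reply, used_fallback = call_with_fallback(
    "What are your hours?", flaky_primary, steady_fallback
)
print(reply, used_fallback)
```

Returning the `used_fallback` flag alongside the reply mirrors the idea of recording fallback events so they can be reviewed later in call logs.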

Bring Your Own API Keys

When you connect a provider using your own API key, AI inference costs are billed directly by the provider at their standard rates. VoiceInfra does not mark up pass-through AI costs. This gives you full visibility into what each call costs and lets you optimize at the provider level — including taking advantage of volume discounts, committed use agreements, or enterprise pricing you’ve negotiated directly. If you prefer not to manage individual provider accounts, VoiceInfra also offers managed access to supported models through platform credits.
Provider-specific prompt behavior: Different models have distinct strengths you can use to your advantage. Claude 3.5 Sonnet handles long, nuanced conversations especially well and is less likely to veer off-topic on complex calls. Groq’s Llama inference is optimized for speed — ideal when response latency matters more than depth. Gemini 1.5 Flash balances speed and multilingual accuracy for international deployments. When switching providers, review your system prompts to make sure tone and instruction style are aligned with the new model’s behavior.

Frequently Asked Questions

Do I need my own API keys?
No, API keys are optional. VoiceInfra provides managed access to all supported models through platform credits, so you do not need a provider account to get started. If you bring your own API keys, AI costs are passed through at the provider’s standard rates, which gives you more cost control and lets you use any pricing agreements you have directly with the provider.

Can I use different models within the same workflow?
Yes. In the workflow builder, every Conversation node has its own model selector. You can run a fast, low-cost model for the greeting and data-collection nodes, then switch to an advanced reasoning model for the troubleshooting or negotiation nodes within the same workflow.

What happens if a provider goes down?
If you’ve configured a fallback model in your agent or workflow settings, VoiceInfra automatically routes to that model when the primary provider is unreachable or returns errors. The switchover is automatic and callers experience no disruption. You’ll see the fallback events in your call logs.