Supported Providers
| Provider | Available Models | Best For |
|---|---|---|
| OpenAI | GPT-4o, GPT-4.1 Mini, GPT-4.1 Nano | General purpose, high output quality, broad capability |
| Anthropic | Claude 3.5 Sonnet, Claude 3 Haiku | Complex reasoning, nuanced multi-turn conversation |
| Gemini 1.5 Pro, Gemini 1.5 Flash | Multilingual calls, multimodal context | |
| Groq | Llama 3.x (8B, 70B) | Speed-critical calls, ultra-low inference latency |
| Mistral | Mistral Large, Mistral Small | Cost-efficient operation, European data residency |
Smart Cost Routing
Not every call needs the same model. A caller asking for your business hours needs a fast, cheap response. A caller troubleshooting a complex technical issue benefits from deeper reasoning. Smart cost routing lets you match the model to the moment. Example cost comparison (approximate per-minute LLM costs):| Model | Approximate cost | Best used for |
|---|---|---|
| GPT-4.1 Nano | ~$0.004 / min | FAQs, routing, simple data collection |
| GPT-4.1 Mini | ~$0.012 / min | Standard support, lead qualification |
| GPT-4o | ~$0.06 / min | Complex reasoning, sales conversations |
| Claude 3 Haiku | ~$0.005 / min | Fast, nuanced responses at low cost |
| Groq Llama 3 70B | ~$0.008 / min | Speed-first, high-volume deployments |
Costs vary by provider pricing and are passed through at provider rates when you use your own API keys. Check each provider’s current pricing page for up-to-date figures. These estimates are for LLM usage only; voice (TTS/STT) costs are separate.
Setting Up a Provider
Go to Settings → AI Providers
Open the VoiceInfra dashboard and navigate to Settings in the left sidebar, then select AI Providers.
Select your provider and enter your API key
Click the provider you want to add (Anthropic, OpenAI, Google, Groq, or Mistral). Paste your API key from that provider’s dashboard. VoiceInfra validates the key immediately.
The provider is immediately available
Once saved, the provider and all of its supported models appear in the model picker across every agent and workflow node in your account. No restart or republish required.
Zero Vendor Lock-In
VoiceInfra is designed so your agents are portable across providers. Your configurations, prompts, behavioral rules, knowledge bases, and tool integrations are all stored independently of the AI provider. Switching models never touches any of that.- Switch instantly — change the model for any agent or node from a single dropdown. The change applies to new calls immediately.
- Automatic fallback — configure a backup model for any agent. If the primary provider is unreachable, calls automatically route to your fallback — no interruption, no manual intervention.
- Side-by-side testing — run two agents with identical configurations but different models. Compare transcript quality, response accuracy, and cost in the call logs to find the right model for each use case.
Bring Your Own API Keys
When you connect a provider using your own API key, AI inference costs are billed directly by the provider at their standard rates. VoiceInfra does not mark up pass-through AI costs. This gives you full visibility into what each call costs and lets you optimize at the provider level — including taking advantage of volume discounts, committed use agreements, or enterprise pricing you’ve negotiated directly. If you prefer not to manage individual provider accounts, VoiceInfra also offers managed access to supported models through platform credits.Provider-specific prompt behavior: Different models have distinct strengths you can use to your advantage. Claude 3.5 Sonnet handles long, nuanced conversations especially well and is less likely to veer off-topic on complex calls. Groq’s Llama inference is optimized for speed — ideal when response latency matters more than depth. Gemini 1.5 Flash balances speed and multilingual accuracy for international deployments. When switching providers, review your system prompts to make sure tone and instruction style are aligned with the new model’s behavior.
Frequently Asked Questions
Do I need my own API keys?
Do I need my own API keys?
No, API keys are optional. VoiceInfra provides managed access to all supported models through platform credits — no provider account needed to get started. If you bring your own API keys, AI costs are passed through at the provider’s standard rates, which gives you more cost control and lets you use any pricing agreements you have directly with the provider.
Can different workflow nodes use different models?
Can different workflow nodes use different models?
Yes. In the workflow builder, every Conversation node has its own model selector. You can run a fast, low-cost model for the greeting and data-collection nodes, then switch to an advanced reasoning model for the troubleshooting or negotiation nodes within the exact same workflow.
What happens if a provider goes down?
What happens if a provider goes down?
If you’ve configured a fallback model in your agent or workflow settings, VoiceInfra automatically routes to that model when the primary provider is unreachable or returns errors. The switchover happens instantly — callers experience no disruption. You’ll see the fallback events in your call logs.