Setting up an AI phone agent in 2026 is a different exercise than setting one up in 2024. The AI itself is no longer the hard part — modern voice AI runs at sub-200ms latency, sounds natural, and handles barge-in and code-switching out of the box. The hard part is everything around the AI: which carrier, which CRM, which calendar, which compliance posture, which transfer path.
This guide walks through the practical setup, in order, using Open.cx as the example because we know its setup well. The same shape applies to most modern AI phone agents.
TL;DR — the order of operations
- Pick the carrier path (BYO existing carrier or Open's built-in).
- Wire the CRM and calendar before configuring the agent.
- Configure the agent (role, voice, languages, escalation rules).
- Connect the knowledge base (docs, help center, internal SOPs).
- Define transfer paths (when the AI escalates, where the call goes).
- Test with real callers, not just yourself.
- Launch with a single number/queue first, expand from there.
Skip any of these and you'll hit a wall in week 2 of production. Do them in order and most teams are live in 1-7 days.
Step 1 — Pick the carrier path
Two options, both valid:
Path A — Bring your own carrier. Use whatever carrier you run today. Open.cx supports 37+ as first-class SIP destinations: Twilio, Vonage, RingCentral, Zoom Phone, Aircall, BT Business, Verizon Business, AT&T Business, and more. You add Open as a SIP destination behind your existing carrier; the AI joins on calls you opt-in. No porting required.
Path B — Use a carrier we provide. Skip the carrier complexity entirely; Open provisions numbers and routes them directly. Faster for greenfield deployments without an existing carrier relationship.
For most existing operations, Path A is the right answer. You keep your numbers, billing, and reporting where they already are.
Where an AI agent sits in the support stack
Orchestration layerHelpdesk
Ticket management, agent UI, reporting. Intercom, Zendesk, Freshdesk, HubSpot, Salesforce, Twilio Flex
AI agent
OrchestratorCustomer-facing reasoning and action execution. Native (Fin, Einstein, Freddy) or third-party (Open.cx)
Knowledge base
Source of truth for policy and procedures. Intercom Articles, Zendesk Guide, Notion, custom CMS
Identity & auth
Customer authentication. Auth0, Okta, custom SSO
Transactional systems
Orders, billing, subscriptions, fulfillment. Stripe, Shopify, custom OMS
CRM
Customer history and account context. Salesforce, HubSpot, Segment
Observability
Conversation logs, confidence sampling, replay. Platform-native, data warehouse, custom dashboards
The AI agent makes the rest of the stack invisible to the customer
Step 2 — Wire the CRM and calendar before the agent
This is where most teams lose time. They configure the AI agent first, hit limitations, then realize they need integrations they haven't set up yet. Reverse the order.
Required integrations for most use cases:
- CRM — Salesforce, HubSpot, Pipedrive, Zoho, your custom system. The AI needs caller-context lookup by phone number.
- Calendar — Google, Outlook, HubSpot Meetings, Calendly, Acuity, OpenDental, Dentrix, etc., if booking is in scope.
- Helpdesk — Zendesk, Intercom, Freshdesk, HubSpot Service. The AI creates tickets and reads ticket history.
- Billing — Stripe, your billing provider, if payment is part of the agent's job.
- Knowledge base — Notion, Confluence, your help center, your docs.
Authorize the OAuth or API tokens for each system the agent will use. Open.cx ships ~50 native integrations, so most of this is point-and-click. For everything else, the REST API and webhooks cover the gap.
Step 3 — Configure the agent
With integrations live, the agent configuration becomes a meaningful exercise rather than a frustrating one. Five things to set:
Role and persona. Who is the agent representing? What's the brand voice? Specific or warm? Formal or casual? Define a system prompt that captures this — 2-4 sentences is usually enough.
Voice and language. Pick a default voice (premium TTS or your cloned brand voice). Decide on language coverage; Open supports 100+ with mid-call switching.
Tool grants. The agent can only call tools you've authorized. For an AI receptionist: CRM lookup, calendar booking, knowledge base retrieval. For an AI sales agent: CRM update, deal-stage change, meeting booking. Granular tool permissions matter for compliance.
Escalation rules. Define when the AI escalates: keywords ("emergency", "manager"), intent classifications ("refund > $X"), confidence thresholds ("if AI < 70% confident"), or fallback ("after 2 unsuccessful clarifications"). The escalation rules are the single biggest CSAT lever.
Knowledge scope. Which documents and help-center pages should the AI use? Tighter scope = better answers; broader scope = more coverage. Start tight, expand based on actual call patterns.
Step 4 — Connect the knowledge base
Three approaches, often combined:
Web crawler. Point Open at your help center URL (Zendesk Help Center, Intercom Articles, Freshdesk KB, custom). Schedule recrawls daily/weekly so the AI stays current.
Native integrations. Connect Notion, Confluence, Google Docs, GitHub, SharePoint, etc. directly. Open syncs incrementally.
Direct upload. PDFs, Word docs, structured data — upload to the agent's knowledge layer for content that doesn't live anywhere else.
Tighter knowledge scope tends to produce better answers. If the AI seems to be making things up, the first instinct should be to narrow the knowledge, not to swap models.
Step 5 — Define transfer paths
Most teams skip this and learn the hard way. Two things to define:
When does the AI escalate?
- Specific keywords or intents that always escalate (urgent, emergency, manager, complaint).
- Confidence-based rules (AI < 70% confident → escalate).
- Fallback rules (after 2 clarification attempts → escalate).
- Time-based rules (after 5 minutes → escalate to human).
Where does the escalated call go?
- A specific extension or hunt group (small teams).
- A specific queue in your CCaaS (Five9, Talkdesk, RingCX) with the AI transcript attached as a contact attribute.
- A Slack channel + warm-transfer to whoever's on call (after-hours).
- Voicemail-with-callback if no human is available (last resort).
The transcript and detected intent travel with the call via SIP REFER. Your humans never receive a "what's this about?" cold transfer.
Step 6 — Test with real callers
Your team's familiarity with the agent does not predict caller experience. Real callers will:
- Mumble.
- Talk over each other.
- Switch languages mid-call.
- Ask things in ways your knowledge base doesn't directly answer.
- Get frustrated faster than you expect.
Run a test cohort of 50-100 real callers (small business: customers; large: a sample queue with disclosure). Listen to actual recordings. The first 20 calls will surface issues no internal QA process found.
Common issues you'll hear and fix:
- Voice too fast or too slow.
- AI doesn't yield naturally on barge-in.
- Specific industry vocabulary the AI mispronounces.
- Edge-case intents that escalate too aggressively or not enough.
- Transfer paths that lose the transcript.
Open.cx's reasoning trace lets you replay any call and see exactly what the AI considered, what it called, and why it said what it said. Use it.
Step 7 — Launch with one number, expand from there
Don't go big-bang. Pick one inbound number or one queue, launch there, run for a week, expand.
30 / 60 / 90 day rollout
A realistic arc on a clean Intercom instance with weekly iteration.
Day 30
Setup + narrow launch
Audit, knowledge-base prep, one or two layer-3 workflows live.
Day 60
Account-aware workflows
Layer-4 personalization on top topics; weekly failure reviews start.
Day 90+
Action-led + iteration
Layer-5 flows resolving end-to-end; voice and proactive on deck.
A typical rollout schedule:
- Week 1: One number, business-hours only. Watch every call.
- Week 2: Expand to 24/7 on the same number. Weekend traffic surfaces different patterns.
- Week 3: Add a second number (or queue). Different use case = different agent config.
- Week 4-6: Roll out to remaining numbers. Build playbooks for new agents.
- Month 2+: Outbound campaigns, additional channels (chat, email, WhatsApp on the same agent).
Most teams hit production resolution rates of 65-75% by week 4 and continue improving as the knowledge base sharpens.
The four common gotchas
In rough order of frequency:
1. Latency. Median latency above 300ms feels robotic regardless of the rest of the experience. If your AI sounds slow, check the carrier path (extra hops add latency), the LLM region (cross-region calls add 100-200ms), and the TTS streaming setup. Sub-200ms median is achievable on modern stacks.
2. Knowledge staleness. The AI is only as good as the knowledge it can read. Set up automatic recrawls of your help center, scheduled syncs of Notion/Confluence, and a process to update the knowledge when product/policy changes. Stale knowledge is the #1 cause of regression after week 4.
3. Transfer paths that lose context. When the AI escalates, the human picking up the call should have the transcript and intent on screen — not on a separate Slack message they have to find. Configure your CCaaS or helpdesk to surface the AI metadata as the screen-pop.
4. Voice-cloning licensing. Cloned brand voices are great until you realize the licensing constraints (couriers like ElevenLabs require explicit voice-actor consent or your own recordings). Plan ahead if you want a custom brand voice; default premium voices are good enough for most launches.
Resources by carrier
We've published carrier-specific setup guides for the 37+ supported integrations:
- Twilio, Vonage, RingCentral, Zoom Phone, Aircall, Dialpad
- Bandwidth, Plivo, Telnyx, SignalWire, Sinch
- 8x8, Nextiva, GoTo Connect, Five9, Genesys, NICE CXone, Talkdesk, Amazon Connect
- Microsoft Teams, Cisco Webex, Google Voice
- 3CX, Asterisk, FreePBX, Avaya, Mitel
- BT Business, Verizon Business, AT&T Business, Orange Business
Resources by use case
- AI receptionist — first-ring inbound for service businesses.
- AI call center — at-scale inbound and outbound.
- AI answering service — 24/7 coverage, AI-resolved or human-paged.
- AI outbound calling — dialer-class outbound with real conversations.
- AI cold caller — qualified outbound at scale.
What we'd actually recommend
Pick a single use case (most common: AI receptionist on the main inbound number). Pick one carrier (whatever you already use). Pick one CRM and calendar to integrate with. Skip multi-channel and outbound for the first deployment.
A focused first deployment ships in a week. A broad first deployment takes a quarter. The teams that succeed at AI phone agent deployment in 2026 are the ones that ship narrow first, learn from real callers, and expand from there.
For carrier-specific setup steps, jump to the carrier page above. For an end-to-end demo, book a call → and we'll walk through your specific stack.