Autonomous WhatsApp agents that take action, not just answer questions.
A WhatsApp AI agent doesn't just respond — it books, qualifies, transacts, refunds, escalates, and reports. Waaru's agent layer combines a language model, tool calling, conversation memory, and a native MCP server, so the same agent that runs your support inbox can power your sales conversation and your COD verification flow.
In one line
A WhatsApp AI agent is an autonomous LLM-driven agent connected to the WhatsApp Business API that uses tools to take real actions in your systems — order lookups, bookings, payments, escalation. Waaru ships agents as the primary interface (not bolted on a flow builder) and exposes a native MCP server so your existing Claude, GPT, or Gemini agent can drive WhatsApp directly.
No credit card. No commitment.
1 agentPer workspace, all surfaces
Native MCPConnect Claude, GPT, Gemini
Tool-firstTyped schemas, validated args
0% markupOn Meta conversation fees
The problem
The 'AI agent' on most WhatsApp platforms is a chatbot in a costume.
Search 'WhatsApp AI agent' and every BSP homepage says the same five words. In practice, what they ship is a single GPT-powered node inside a flow builder with no tool calling, no memory between sessions, and no way for your existing AI agent to drive the conversation. That is not an agent. That is a chatbot with a thesaurus.
No tool calling — the 'agent' can answer questions but cannot place a refund, update an order, or book a slot in your calendar.
No conversation memory beyond the current session — the agent forgets the customer the moment the session ends.
No way to bring your own agent — if you've already built an AI agent on Claude or GPT, you cannot point it at WhatsApp without writing custom API code.
Pricing-page language ('AI agents included') hides a feature behind an enterprise tier upgrade once you actually try to ship.
What an agent really is
Agent = LLM + tools + memory + goals.
A true AI agent has four properties. A language model that reasons about what to do next. Tools (typed functions) it can call to take action in the world. Memory that persists across turns and sessions so context is not lost. And goals or policies that constrain its behaviour. Without all four, you have an enhanced chatbot, not an agent.
Waaru's agent layer ships all four out of the box. The LLM is your choice (Claude, GPT, Gemini, or any model with an OpenAI-compatible endpoint). Tools are pre-built for the common WhatsApp actions plus your custom REST endpoints. Memory persists per-contact across every conversation. Goals are configured per-workflow — qualify a lead, recover a cart, close a refund, escalate a complaint.
Tool calling
Typed tools, validated arguments, every call logged.
Every action your agent can take is a tool with a JSON-schema signature. When the agent decides to call a tool, Waaru validates the arguments against the schema before any external system is touched. If the schema is wrong, the agent gets a structured error and retries — your downstream systems never see malformed input.
Connect your existing AI agent to WhatsApp via Model Context Protocol.
Waaru is built MCP-native. That means more than 'we ship an MCP server' — the same operations available to humans in the dashboard are available to AI agents over the Model Context Protocol. Your existing Claude, GPT, or Gemini agent can send WhatsApp messages, look up conversation history, manage contacts, trigger flows, and read analytics, the same way it already uses any other MCP tool.
This is the difference between a platform that 'has an MCP server' and a platform designed for AI agents to operate it. Waaru is the second. Your agent connects in minutes, the schema is documented, and the auth model is per-workspace API keys — no shared credentials, no impersonation risk.
Memory
Per-contact, per-workspace, persists across sessions.
Memory is the unsexy feature that decides whether an agent feels human. Waaru maintains three tiers of memory. Conversation memory — the full transcript of the active session, used as the immediate context window. Contact memory — facts the agent has learned about this specific customer across every prior conversation (preferred language, last order, complaint history, name). Workspace memory — facts about your business that every agent shares (catalog, policies, FAQs).
The agent decides what to remember automatically. If a customer says 'I prefer Hindi', the agent stores that preference at contact level and switches language on the next conversation, six weeks later, on a different device.
Governance
What the agent can and cannot do, by policy.
Autonomy without guardrails is a liability. Every Waaru agent runs inside a policy envelope you configure per workspace. You set the rules for which tools the agent can call without approval, which actions require human confirmation, what the agent must refuse, and what triggers escalation. The agent obeys the policy or escalates — it never silently overrides it.
Tool allowlists — limit the agent to a subset of tools per workflow
Approval gates — actions over a threshold (refund > ₹5,000, manual booking change) require human confirmation
Refusal rules — explicit topics or actions the agent will not engage on, even if asked
Brand-voice guardrails — tone, banned phrases, mandatory disclaimers per channel
Rate limits — per-contact and per-workspace caps on tool calls
Handoff
When the agent reaches its limit, the human takes the keys with full context.
Every agent has a confidence threshold and an explicit escalate tool. When the agent escalates — or when a customer types 'speak to a human' — the conversation is routed to the team inbox with the full transcript, a one-paragraph AI summary of the situation, the contact record, and any prior notes. The human picks up at exactly the right point.
On the Scale plan, the human's resolution becomes context for the agent. The next time the same intent appears, the agent handles it alone. The deflection rate compounds over months without manual retraining.
Observability
Every reply, every tool call, every escalation — logged and explainable.
You cannot trust an autonomous system you cannot inspect. Every Waaru agent conversation produces a structured trace: the prompt, the model and version that handled it, the tools considered, the tool calls made (with arguments and responses), the reply that went out, the confidence score, and the policy decisions that fired. The trace is available in the dashboard and via API.
When something goes wrong — a wrong refund issued, a customer escalated to the wrong team, a hallucination — you can see exactly why, fix the policy or the tool, and re-run the failed turn on the new configuration.
Side-by-side
Waaru agents vs. 'AI agent' on a typical WhatsApp platform.
What you actually need to ship a WhatsApp agent that does real work.
Capability
Waaru
Typical WhatsApp platform
LLM choice (per workspace)
Claude, GPT-4, Gemini, custom
Locked to vendor's default
Native tool calling with typed schemas
Yes
Limited to Zapier-style webhooks
Per-contact memory across sessions
Yes
No
Policy envelope (allowlists, approvals, refusals)
Yes
Manual flow gates
Native MCP server for external agents
First-class, designed for it
Available on 2 of 13 surveyed platforms
Observability trace per conversation
Yes
Basic logs only
Human handoff with AI summary
Yes
Cold transfer to shared inbox
Agent included in entry plan
Included on Growth
Enterprise add-on
MCP support claim: based on the public MCP server availability of 13 WhatsApp platforms surveyed in the Waaru competitive feature scan, June 2026.
Integrations
What your agent can do, out of the box.
Tools the agent can call without you writing integration code. Coming-soon entries are on the public roadmap.
Send UPI, card, or wallet payment links inside chat. Paid event fires back as a webhook in real time.
Zapier
Coming soon
Long-tail connector to 6,000+ apps. Use Zapier where you don't need native real-time speed.
Cal.com / Calendly
Coming soon
Booking flow integration for service businesses. In-chat slot pickers tied to your calendar.
Google Calendar
Coming soon
Slot search, book, reschedule, cancel — all from inside a WhatsApp flow.
HubSpot
Coming soon
Two-way sync for contacts, deals, notes, and lifecycle stage. Conversations append to the timeline.
Pipedrive
Coming soon
Contacts, deals, and activities. WhatsApp threads log against the deal owner automatically.
Zoho CRM
Coming soon
Contacts, leads, deals. Native sync, no Zapier in the middle.
Freshdesk
Coming soon
Open, update, and resolve tickets from a WhatsApp conversation. Two-way status sync.
Zendesk
Coming soon
Ticket lifecycle bridged to WhatsApp threads. Internal notes stay internal.
FAQ
Questions worth answering.
A WhatsApp AI agent is an autonomous, LLM-driven software agent connected to the official WhatsApp Business API that uses tools to take action in your systems — book a slot, look up an order, issue a refund, qualify a lead, escalate to a human — based on real conversations with your customers on WhatsApp.
Sources & references
Sources backing the claims on this page.
Competitor MCP claims on this page are verified against publicly available repositories and developer documentation.
1
Wati's MCP server (open source on GitHub) wraps Wati API v3 endpoints — contacts, messages, templates, campaigns, channels.