How is an AI agent different from a WhatsApp chatbot?

A chatbot replies to messages. An agent reasons about what to do next, calls tools to take action, remembers context across sessions, and decides when to escalate. Most BSPs sell chatbots and call them agents; Waaru ships the four capabilities (LLM, tools, memory, policy) that make the difference.

Can I use my own LLM (Claude, GPT-4, Gemini)?

Yes. Each Waaru workspace picks its model — Claude, GPT-4, Gemini, or any OpenAI-compatible endpoint including self-hosted models. You can change the model anytime; conversation memory and tool configuration travel with the workspace, not the model.

What does 'MCP-native' actually mean?

Waaru's first-class interface is the Model Context Protocol. Every action available in the dashboard is available to AI agents over MCP — send a message, look up history, manage contacts, trigger a flow, read analytics. This means your existing Claude or GPT agent can drive WhatsApp the same way it drives any other MCP tool, without custom API code. Wati and Respond.io have shipped MCP servers; Waaru is the only platform built so the agent is the primary user.

What tools can the agent call?

At launch: native WhatsApp actions (send_message, send_template, send_flow, conversation history), MCP tools exposed by your agent, and any custom REST endpoint you define with an OpenAPI spec. Coming soon on the public roadmap: Shopify, Razorpay payment links, Cal.com / Calendly, Google Calendar, HubSpot, Pipedrive, Zoho, Freshdesk, Zendesk, and Zapier. Every tool call has a typed schema, arguments are validated before execution, and every call is logged. See the integrations section above for current status.

How do you stop the agent from doing something wrong?

Every agent runs inside a policy envelope. You configure: which tools the agent can call without approval, which actions require human confirmation, what the agent must refuse, what triggers escalation, and per-contact / per-workspace rate limits. The agent either obeys the policy or escalates — it never silently overrides it.

Does the agent remember the customer across conversations?

Yes. Waaru maintains contact-level memory that persists across every session. If a customer states a preference in March, the agent acts on it in May — even on a different device, with a different agent assignment, in a different language.

Can the agent hand off to a human?

Yes — handoff is a first-class tool. The agent calls escalate_to_human() with a reason, a priority, and an AI-generated summary; the conversation lands in the team inbox with full context. On Scale, the human's resolution is indexed and the same intent is handled by the agent next time.

How do I see why the agent did what it did?

Every conversation produces a structured trace: prompt, model + version, tools considered, tool calls with arguments and responses, reply, confidence score, and policy decisions. Available in the dashboard and via API. You can replay a failed turn against an updated tool or policy without messaging the customer again.

How much does it cost to run a WhatsApp AI agent on Waaru?

The agent layer is included from the Growth plan. Scale includes the native MCP server, self-learning HITL, and unlimited contacts. Meta's conversation fees are billed directly by Meta — Waaru charges 0% markup. 14-day free trial, no credit card required.

WhatsApp AI agents

Autonomous WhatsApp agents that take action, not just answer questions.

A WhatsApp AI agent doesn't just respond — it books, qualifies, transacts, refunds, escalates, and reports. Waaru's agent layer combines a language model, tool calling, conversation memory, and a native MCP server, so the same agent that runs your support inbox can power your sales conversation and your COD verification flow.

In one line

A WhatsApp AI agent is an autonomous LLM-driven agent connected to the WhatsApp Business API that uses tools to take real actions in your systems — order lookups, bookings, payments, escalation. Waaru ships agents as the primary interface (not bolted on a flow builder) and exposes a native MCP server so your existing Claude, GPT, or Gemini agent can drive WhatsApp directly.

No credit card. No commitment.

1 agentPer workspace, all surfaces

Native MCPConnect Claude, GPT, Gemini

Tool-firstTyped schemas, validated args

0% markupOn Meta conversation fees

The problem

The 'AI agent' on most WhatsApp platforms is a chatbot in a costume.

Search 'WhatsApp AI agent' and every BSP homepage says the same five words. In practice, what they ship is a single GPT-powered node inside a flow builder with no tool calling, no memory between sessions, and no way for your existing AI agent to drive the conversation. That is not an agent. That is a chatbot with a thesaurus.

No tool calling — the 'agent' can answer questions but cannot place a refund, update an order, or book a slot in your calendar.
No conversation memory beyond the current session — the agent forgets the customer the moment the session ends.
No way to bring your own agent — if you've already built an AI agent on Claude or GPT, you cannot point it at WhatsApp without writing custom API code.
Pricing-page language ('AI agents included') hides a feature behind an enterprise tier upgrade once you actually try to ship.

What an agent really is

Agent = LLM + tools + memory + goals.

A true AI agent has four properties. A language model that reasons about what to do next. Tools (typed functions) it can call to take action in the world. Memory that persists across turns and sessions so context is not lost. And goals or policies that constrain its behaviour. Without all four, you have an enhanced chatbot, not an agent. Waaru's agent layer ships all four out of the box. The LLM is your choice (Claude, GPT, Gemini, or any model with an OpenAI-compatible endpoint). Tools are pre-built for the common WhatsApp actions plus your custom REST endpoints. Memory persists per-contact across every conversation. Goals are configured per-workflow — qualify a lead, recover a cart, close a refund, escalate a complaint.

Tool calling

Typed tools, validated arguments, every call logged.

Every action your agent can take is a tool with a JSON-schema signature. When the agent decides to call a tool, Waaru validates the arguments against the schema before any external system is touched. If the schema is wrong, the agent gets a structured error and retries — your downstream systems never see malformed input.

WhatsApp — send_message, send_template, send_flow, list_conversations
Custom REST — define an endpoint with an OpenAPI spec and the agent gets the tool
Handoff — escalate_to_human(reason, priority, summary)
Coming soon — Shopify, Razorpay payment links, Calendar (Google / Cal.com), CRM (HubSpot, Pipedrive, Zoho), Helpdesk (Freshdesk, Zendesk), Zapier. See the integrations roadmap below.

MCP-native

Connect your existing AI agent to WhatsApp via Model Context Protocol.

Waaru is built MCP-native. That means more than 'we ship an MCP server' — the same operations available to humans in the dashboard are available to AI agents over the Model Context Protocol. Your existing Claude, GPT, or Gemini agent can send WhatsApp messages, look up conversation history, manage contacts, trigger flows, and read analytics, the same way it already uses any other MCP tool. This is the difference between a platform that 'has an MCP server' and a platform designed for AI agents to operate it. Waaru is the second. Your agent connects in minutes, the schema is documented, and the auth model is per-workspace API keys — no shared credentials, no impersonation risk.

Memory

Per-contact, per-workspace, persists across sessions.

Memory is the unsexy feature that decides whether an agent feels human. Waaru maintains three tiers of memory. Conversation memory — the full transcript of the active session, used as the immediate context window. Contact memory — facts the agent has learned about this specific customer across every prior conversation (preferred language, last order, complaint history, name). Workspace memory — facts about your business that every agent shares (catalog, policies, FAQs). The agent decides what to remember automatically. If a customer says 'I prefer Hindi', the agent stores that preference at contact level and switches language on the next conversation, six weeks later, on a different device.

Governance

What the agent can and cannot do, by policy.

Autonomy without guardrails is a liability. Every Waaru agent runs inside a policy envelope you configure per workspace. You set the rules for which tools the agent can call without approval, which actions require human confirmation, what the agent must refuse, and what triggers escalation. The agent obeys the policy or escalates — it never silently overrides it.

Tool allowlists — limit the agent to a subset of tools per workflow
Approval gates — actions over a threshold (refund > ₹5,000, manual booking change) require human confirmation
Refusal rules — explicit topics or actions the agent will not engage on, even if asked
Brand-voice guardrails — tone, banned phrases, mandatory disclaimers per channel
Rate limits — per-contact and per-workspace caps on tool calls

Handoff

When the agent reaches its limit, the human takes the keys with full context.

Every agent has a confidence threshold and an explicit escalate tool. When the agent escalates — or when a customer types 'speak to a human' — the conversation is routed to the team inbox with the full transcript, a one-paragraph AI summary of the situation, the contact record, and any prior notes. The human picks up at exactly the right point. On the Scale plan, the human's resolution becomes context for the agent. The next time the same intent appears, the agent handles it alone. The deflection rate compounds over months without manual retraining.

Observability

Every reply, every tool call, every escalation — logged and explainable.

You cannot trust an autonomous system you cannot inspect. Every Waaru agent conversation produces a structured trace: the prompt, the model and version that handled it, the tools considered, the tool calls made (with arguments and responses), the reply that went out, the confidence score, and the policy decisions that fired. The trace is available in the dashboard and via API. When something goes wrong — a wrong refund issued, a customer escalated to the wrong team, a hallucination — you can see exactly why, fix the policy or the tool, and re-run the failed turn on the new configuration.

Side-by-side

Waaru agents vs. 'AI agent' on a typical WhatsApp platform.

What you actually need to ship a WhatsApp agent that does real work.

Capability	Waaru	Typical WhatsApp platform
LLM choice (per workspace)	Claude, GPT-4, Gemini, custom	Locked to vendor's default
Native tool calling with typed schemas	Yes	Limited to Zapier-style webhooks
Per-contact memory across sessions	Yes	No
Policy envelope (allowlists, approvals, refusals)	Yes	Manual flow gates
Native MCP server for external agents	First-class, designed for it	Available on 2 of 13 surveyed platforms
Observability trace per conversation	Yes	Basic logs only
Human handoff with AI summary	Yes	Cold transfer to shared inbox
Agent included in entry plan	Included on Growth	Enterprise add-on

MCP support claim: based on the public MCP server availability of 13 WhatsApp platforms surveyed in the Waaru competitive feature scan, June 2026.

Integrations

What your agent can do, out of the box.

Tools the agent can call without you writing integration code. Coming-soon entries are on the public roadmap.

Most loved
Native MCP server
Available
Your Claude, GPT, or Gemini agent operates the full workspace through MCP tools. The headline integration.
Bring your own LLM
Available
Anthropic Claude, OpenAI GPT, Google Gemini — per-workspace key. Switch any time without losing config.
Custom REST + webhooks
Available
Define an endpoint with an OpenAPI spec and Waaru registers it as a typed tool. Inbound webhooks trigger flows in real time.
Trending
Shopify
Coming soon
Native two-way: orders, carts, catalog, fulfilments. Powers abandoned cart, COD-to-prepaid, native catalog browsing.
Razorpay payment links
Coming soon
Send UPI, card, or wallet payment links inside chat. Paid event fires back as a webhook in real time.
Zapier
Coming soon
Long-tail connector to 6,000+ apps. Use Zapier where you don't need native real-time speed.
Cal.com / Calendly
Coming soon
Booking flow integration for service businesses. In-chat slot pickers tied to your calendar.
Google Calendar
Coming soon
Slot search, book, reschedule, cancel — all from inside a WhatsApp flow.
HubSpot
Coming soon
Two-way sync for contacts, deals, notes, and lifecycle stage. Conversations append to the timeline.
Pipedrive
Coming soon
Contacts, deals, and activities. WhatsApp threads log against the deal owner automatically.
Zoho CRM
Coming soon
Contacts, leads, deals. Native sync, no Zapier in the middle.
Freshdesk
Coming soon
Open, update, and resolve tickets from a WhatsApp conversation. Two-way status sync.
Zendesk
Coming soon
Ticket lifecycle bridged to WhatsApp threads. Internal notes stay internal.

FAQ

Questions worth answering.

A WhatsApp AI agent is an autonomous, LLM-driven software agent connected to the official WhatsApp Business API that uses tools to take action in your systems — book a slot, look up an order, issue a refund, qualify a lead, escalate to a human — based on real conversations with your customers on WhatsApp.

Sources & references

Sources backing the claims on this page.

Competitor MCP claims on this page are verified against publicly available repositories and developer documentation.

Wati's MCP server (open source on GitHub) wraps Wati API v3 endpoints — contacts, messages, templates, campaigns, channels.
Wati · wati-io/wati-mcp-server on GitHub · Accessed 9 June 2026
Respond.io's open-source MCP server exposes 28 tools across contacts, messaging, conversations, comments, and workspace.
Respond.io · respond-io/mcp-server on GitHub · Accessed 9 June 2026

Evaluating platforms? Every claim cited, every price dated.

Ready to ship a WhatsApp flow that doesn't break?

Apply for founding access. No credit card. Cancel anytime.

See pricing