Waaru
WhatsApp AI agents

Autonomous WhatsApp agents that take action, not just answer questions.

A WhatsApp AI agent doesn't just respond — it books, qualifies, transacts, refunds, escalates, and reports. Waaru's agent layer combines a language model, tool calling, conversation memory, and a native MCP server, so the same agent that runs your support inbox can power your sales conversation and your COD verification flow.

In one line

A WhatsApp AI agent is an autonomous LLM-driven agent connected to the WhatsApp Business API that uses tools to take real actions in your systems — order lookups, bookings, payments, escalation. Waaru ships agents as the primary interface (not bolted on a flow builder) and exposes a native MCP server so your existing Claude, GPT, or Gemini agent can drive WhatsApp directly.

No credit card. No commitment.

1 agentPer workspace, all surfaces
Native MCPConnect Claude, GPT, Gemini
Tool-firstTyped schemas, validated args
0% markupOn Meta conversation fees
The problem

The 'AI agent' on most WhatsApp platforms is a chatbot in a costume.

Search 'WhatsApp AI agent' and every BSP homepage says the same five words. In practice, what they ship is a single GPT-powered node inside a flow builder with no tool calling, no memory between sessions, and no way for your existing AI agent to drive the conversation. That is not an agent. That is a chatbot with a thesaurus.

  • No tool calling — the 'agent' can answer questions but cannot place a refund, update an order, or book a slot in your calendar.
  • No conversation memory beyond the current session — the agent forgets the customer the moment the session ends.
  • No way to bring your own agent — if you've already built an AI agent on Claude or GPT, you cannot point it at WhatsApp without writing custom API code.
  • Pricing-page language ('AI agents included') hides a feature behind an enterprise tier upgrade once you actually try to ship.
What an agent really is

Agent = LLM + tools + memory + goals.

A true AI agent has four properties. A language model that reasons about what to do next. Tools (typed functions) it can call to take action in the world. Memory that persists across turns and sessions so context is not lost. And goals or policies that constrain its behaviour. Without all four, you have an enhanced chatbot, not an agent. Waaru's agent layer ships all four out of the box. The LLM is your choice (Claude, GPT, Gemini, or any model with an OpenAI-compatible endpoint). Tools are pre-built for the common WhatsApp actions plus your custom REST endpoints. Memory persists per-contact across every conversation. Goals are configured per-workflow — qualify a lead, recover a cart, close a refund, escalate a complaint.

Tool calling

Typed tools, validated arguments, every call logged.

Every action your agent can take is a tool with a JSON-schema signature. When the agent decides to call a tool, Waaru validates the arguments against the schema before any external system is touched. If the schema is wrong, the agent gets a structured error and retries — your downstream systems never see malformed input.

  • WhatsApp — send_message, send_template, send_flow, list_conversations
  • Custom REST — define an endpoint with an OpenAPI spec and the agent gets the tool
  • Handoff — escalate_to_human(reason, priority, summary)
  • Coming soon — Shopify, Razorpay payment links, Calendar (Google / Cal.com), CRM (HubSpot, Pipedrive, Zoho), Helpdesk (Freshdesk, Zendesk), Zapier. See the integrations roadmap below.
MCP-native

Connect your existing AI agent to WhatsApp via Model Context Protocol.

Waaru is built MCP-native. That means more than 'we ship an MCP server' — the same operations available to humans in the dashboard are available to AI agents over the Model Context Protocol. Your existing Claude, GPT, or Gemini agent can send WhatsApp messages, look up conversation history, manage contacts, trigger flows, and read analytics, the same way it already uses any other MCP tool. This is the difference between a platform that 'has an MCP server' and a platform designed for AI agents to operate it. Waaru is the second. Your agent connects in minutes, the schema is documented, and the auth model is per-workspace API keys — no shared credentials, no impersonation risk.

Memory

Per-contact, per-workspace, persists across sessions.

Memory is the unsexy feature that decides whether an agent feels human. Waaru maintains three tiers of memory. Conversation memory — the full transcript of the active session, used as the immediate context window. Contact memory — facts the agent has learned about this specific customer across every prior conversation (preferred language, last order, complaint history, name). Workspace memory — facts about your business that every agent shares (catalog, policies, FAQs). The agent decides what to remember automatically. If a customer says 'I prefer Hindi', the agent stores that preference at contact level and switches language on the next conversation, six weeks later, on a different device.

Governance

What the agent can and cannot do, by policy.

Autonomy without guardrails is a liability. Every Waaru agent runs inside a policy envelope you configure per workspace. You set the rules for which tools the agent can call without approval, which actions require human confirmation, what the agent must refuse, and what triggers escalation. The agent obeys the policy or escalates — it never silently overrides it.

  • Tool allowlists — limit the agent to a subset of tools per workflow
  • Approval gates — actions over a threshold (refund > ₹5,000, manual booking change) require human confirmation
  • Refusal rules — explicit topics or actions the agent will not engage on, even if asked
  • Brand-voice guardrails — tone, banned phrases, mandatory disclaimers per channel
  • Rate limits — per-contact and per-workspace caps on tool calls
Handoff

When the agent reaches its limit, the human takes the keys with full context.

Every agent has a confidence threshold and an explicit escalate tool. When the agent escalates — or when a customer types 'speak to a human' — the conversation is routed to the team inbox with the full transcript, a one-paragraph AI summary of the situation, the contact record, and any prior notes. The human picks up at exactly the right point. On the Scale plan, the human's resolution becomes context for the agent. The next time the same intent appears, the agent handles it alone. The deflection rate compounds over months without manual retraining.

Observability

Every reply, every tool call, every escalation — logged and explainable.

You cannot trust an autonomous system you cannot inspect. Every Waaru agent conversation produces a structured trace: the prompt, the model and version that handled it, the tools considered, the tool calls made (with arguments and responses), the reply that went out, the confidence score, and the policy decisions that fired. The trace is available in the dashboard and via API. When something goes wrong — a wrong refund issued, a customer escalated to the wrong team, a hallucination — you can see exactly why, fix the policy or the tool, and re-run the failed turn on the new configuration.

Side-by-side

Waaru agents vs. 'AI agent' on a typical WhatsApp platform.

What you actually need to ship a WhatsApp agent that does real work.

CapabilityWaaruTypical WhatsApp platform
LLM choice (per workspace)Claude, GPT-4, Gemini, customLocked to vendor's default
Native tool calling with typed schemasYesLimited to Zapier-style webhooks
Per-contact memory across sessionsYesNo
Policy envelope (allowlists, approvals, refusals)YesManual flow gates
Native MCP server for external agentsFirst-class, designed for itAvailable on 2 of 13 surveyed platforms
Observability trace per conversationYesBasic logs only
Human handoff with AI summaryYesCold transfer to shared inbox
Agent included in entry planIncluded on GrowthEnterprise add-on

MCP support claim: based on the public MCP server availability of 13 WhatsApp platforms surveyed in the Waaru competitive feature scan, June 2026.

Integrations

What your agent can do, out of the box.

Tools the agent can call without you writing integration code. Coming-soon entries are on the public roadmap.

  • Most loved

    Native MCP server

    Available

    Your Claude, GPT, or Gemini agent operates the full workspace through MCP tools. The headline integration.

  • Bring your own LLM

    Available

    Anthropic Claude, OpenAI GPT, Google Gemini — per-workspace key. Switch any time without losing config.

  • Custom REST + webhooks

    Available

    Define an endpoint with an OpenAPI spec and Waaru registers it as a typed tool. Inbound webhooks trigger flows in real time.

  • Trending

    Shopify

    Coming soon

    Native two-way: orders, carts, catalog, fulfilments. Powers abandoned cart, COD-to-prepaid, native catalog browsing.

  • Razorpay payment links

    Coming soon

    Send UPI, card, or wallet payment links inside chat. Paid event fires back as a webhook in real time.

  • Zapier

    Coming soon

    Long-tail connector to 6,000+ apps. Use Zapier where you don't need native real-time speed.

  • Cal.com / Calendly

    Coming soon

    Booking flow integration for service businesses. In-chat slot pickers tied to your calendar.

  • Google Calendar

    Coming soon

    Slot search, book, reschedule, cancel — all from inside a WhatsApp flow.

  • HubSpot

    Coming soon

    Two-way sync for contacts, deals, notes, and lifecycle stage. Conversations append to the timeline.

  • Pipedrive

    Coming soon

    Contacts, deals, and activities. WhatsApp threads log against the deal owner automatically.

  • Zoho CRM

    Coming soon

    Contacts, leads, deals. Native sync, no Zapier in the middle.

  • Freshdesk

    Coming soon

    Open, update, and resolve tickets from a WhatsApp conversation. Two-way status sync.

  • Zendesk

    Coming soon

    Ticket lifecycle bridged to WhatsApp threads. Internal notes stay internal.

FAQ

Questions worth answering.

A WhatsApp AI agent is an autonomous, LLM-driven software agent connected to the official WhatsApp Business API that uses tools to take action in your systems — book a slot, look up an order, issue a refund, qualify a lead, escalate to a human — based on real conversations with your customers on WhatsApp.

Sources & references

Sources backing the claims on this page.

Competitor MCP claims on this page are verified against publicly available repositories and developer documentation.

  1. Wati's MCP server (open source on GitHub) wraps Wati API v3 endpoints — contacts, messages, templates, campaigns, channels.

    Wati · wati-io/wati-mcp-server on GitHub · Accessed 9 June 2026

  2. Respond.io's open-source MCP server exposes 28 tools across contacts, messaging, conversations, comments, and workspace.

    Respond.io · respond-io/mcp-server on GitHub · Accessed 9 June 2026

Ready to ship a WhatsApp flow that doesn't break?

Apply for founding access. No credit card. Cancel anytime.

See pricing