How to Decide: Build a Chatbot or an Agent

Updated May 2026
Choosing between building a chatbot and building an AI agent requires a structured evaluation of your use case requirements, technical capacity, and business case. This framework walks you through the decision process step by step, helping you make a choice grounded in analysis rather than hype.

The worst technology decisions are made based on excitement about capabilities rather than analysis of requirements. Both chatbots and agents are powerful technologies, but applying the wrong one to your use case wastes resources and produces a poor outcome. This framework helps you avoid that mistake.

Define Your Core Use Case

Before evaluating technology options, write a clear, specific description of what the system needs to accomplish. Avoid technology-specific language. Instead of "I want to build a chatbot for customer support," write "I need a system that answers common customer questions about our products, helps customers track their orders, and resolves simple issues like password resets and return initiation."

List the specific tasks the system needs to perform. For each task, note what inputs are required, what actions need to be taken, what systems are involved, and what the successful outcome looks like. This task inventory becomes the foundation for the rest of the evaluation.

Be honest about scope. It is tempting to list aspirational capabilities alongside genuine requirements. Distinguish between what you need for the initial deployment and what would be nice to have eventually. The initial deployment drives the technology choice; future capabilities can be added later.

Score the Five Decision Criteria

Evaluate your use case against each criterion using a simple low/medium/high scale.

Task autonomy (low/medium/high). Low: the system provides information and the user acts on it. Medium: the system takes some actions but requires user confirmation for each one. High: the system takes actions independently with minimal human oversight.

Integration depth (low/medium/high). Low: zero or one external system. Medium: two to three external systems with simple interactions. High: four or more external systems requiring coordinated workflows.

Workflow complexity (low/medium/high). Low: single-step responses. Medium: two to four coordinated steps. High: five or more steps with dependencies and branching logic.

Memory requirements (low/medium/high). Low: each interaction is independent. Medium: session continuity is helpful but not essential. High: persistent knowledge across sessions is critical to the application's value.

Error tolerance (low/medium/high). Low: mistakes are inconsequential. Medium: mistakes are costly but recoverable. High: mistakes have serious consequences requiring prevention and autonomous recovery.

If you score low on all five criteria, build a chatbot. If you score medium or high on one criterion, consider adding that specific capability to a chatbot. If you score medium or high on three or more criteria, evaluate an agent or hybrid architecture.

The scoring exercise itself often reveals insights beyond the final recommendation. Teams that go through the evaluation process frequently discover that their assumptions about what users need do not match the actual task requirements. A team convinced they need an agent may find, after honest scoring, that their use case scores low on most criteria. Conversely, a team planning a simple chatbot may realize that their core workflow scores high on integration depth and workflow complexity, signaling that a chatbot will struggle with the most important use cases.

Document your scoring rationale carefully. The decision will likely be questioned later, either when the project encounters difficulties or when stakeholders see competing approaches at other organizations. Having a documented, data-driven rationale for the technology choice protects the team from second-guessing and provides a framework for re-evaluating the decision as requirements evolve.

Assess Your Technical Capacity

Technology choices must be realistic given your team's capabilities and resources. A small team without agent engineering experience should not attempt a complex agent deployment, regardless of what the requirements analysis suggests. Assess your team's experience with LLM APIs and prompt engineering, your infrastructure for running and monitoring AI systems, your budget for development and ongoing operations, and your timeline for delivery.

If your requirements point to an agent but your capacity points to a chatbot, consider starting with a chatbot that covers the core use case and planning for an agent upgrade as your team builds experience. Alternatively, consider using managed agent platforms or consulting with specialized firms that can accelerate the agent implementation.

Be realistic about your timeline constraints as well. If you need a working solution within weeks, a chatbot is the only viable option. Agent development cannot be meaningfully accelerated below a certain minimum because the testing, safety validation, and integration work require thoroughness regardless of team size or urgency. Rushing an agent deployment leads to reliability problems that are far more costly to fix in production than to prevent during development.

Build a Business Case

Calculate the cost and value for both options. For the chatbot option, estimate the development cost (typically $5,000 to $25,000 for a custom build), monthly operational cost (API usage plus hosting, typically $200 to $2,000), and the value delivered (support ticket deflection, lead conversion improvement, time savings for users).

For the agent option, estimate the development cost (typically $50,000 to $300,000), monthly operational cost (API usage plus infrastructure plus maintenance, typically $2,000 to $20,000), and the value delivered (automated workflows, reduced manual labor, capabilities that were previously impossible).

Compare the ROI for each option over a 12-month horizon. The agent needs to deliver substantially more value than the chatbot to justify its higher cost. If the ROI difference is marginal, the lower-risk chatbot option is usually the better choice.

Choose and Prototype

Based on your scoring, capacity assessment, and business case, make a decision and build a prototype to validate your assumptions. A chatbot prototype can be built in one to two weeks. An agent prototype typically takes four to six weeks. The prototype should cover the single most important use case, not the full scope of the eventual deployment.

Test the prototype with real users and real scenarios. Does it handle the core use case effectively? Are users satisfied with the experience? Does the cost per interaction align with your projections? The prototype phase is where assumptions meet reality, and it is far cheaper to discover problems here than after a full production build.

If the prototype succeeds, proceed with the full implementation. If it reveals issues, revisit your requirements analysis and consider whether a different approach (simpler chatbot, more targeted agent, or hybrid architecture) would better serve the use case.

Key Takeaway

Make the chatbot-versus-agent decision through structured analysis: define the use case, score the requirements, assess your capacity, build the business case, and validate with a prototype. This disciplined approach prevents both under-building (a chatbot that cannot solve the problem) and over-building (an agent that costs ten times more than necessary for what is essentially a chatbot use case).