Each pattern has up to 4 challenge types across 3 difficulty tiers. Build architectures, debug broken pipelines, write prompts, and optimize costs.
The foundational design pattern where a complex task is decomposed into a linear sequence of smaller...
Dynamically directing a user request to the most appropriate specialized agent, model, or processing...
Executing multiple independent agentic tasks simultaneously — such as voting on a decision, generati...
Enables an agent to critique its own output (or the output of another agent) to identify errors, hal...
Extends the capabilities of an LLM by allowing it to interact with the external world (APIs, databas...
The capability of an agent to break down a high-level, ambiguous user goal into a sequence of execut...
Structuring a system as a team of specialized agents (e.g., a Researcher, a Writer, a Reviewer, a Co...
Mechanisms for agents to store, index, and retrieve information over time, spanning short-term (conv...
The ability of an agent to improve its performance over time based on feedback, user interactions, o...
The Model Context Protocol (MCP), a standardized protocol for connecting AI models to data sources and tools,...
Mechanisms for agents to define high-level objectives, break them into trackable sub-goals, and cont...
Strategies for agents to detect failures (e.g., API errors, hallucinations, logic loops) and self-co...
Integrating human oversight at critical decision points to validate agent actions, provide feedback,...
Retrieval-Augmented Generation (RAG) equips agents with the ability to query external knowledge base...
Protocols and formats for autonomous agents to exchange messages, tasks, and state information to co...
Agents that track their token consumption, API costs, and computational limits, optimizing their strat...
Advanced cognitive architectures like Chain-of-Thought (CoT), Tree-of-Thought (ToT), and ReAct (Reas...
Architectural safeguards (input/output filters) to prevent agents from executing harmful actions, le...
Frameworks for measuring agent performance (accuracy, faithfulness, tool usage) and monitoring behav...
The ability of an agent to rank disparate tasks or goals based on urgency, importance, and constrain...
Agents proactively seeking new information, testing hypotheses, or exploring an environment to expan...
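To make the first three patterns in this list concrete (a linear sequence of smaller steps, routing a request to a specialized handler, and running independent tasks simultaneously with a vote), here is a minimal Python sketch. It is an illustration under stated assumptions, not a reference implementation: `Step`, `run_chain`, `route`, and `vote` are hypothetical names, and plain Python callables stand in for the model or tool calls a real system would make.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List

# Hypothetical type: one model call, tool call, or agent, reduced to text in, text out.
Step = Callable[[str], str]


def run_chain(steps: List[Step], task: str) -> str:
    """Linear sequence: each step consumes the previous step's output."""
    result = task
    for step in steps:
        result = step(result)
    return result


def route(
    request: str,
    handlers: Dict[str, Step],
    classify: Callable[[str], str],
    default: str,
) -> str:
    """Routing: a classifier picks one specialized handler per request."""
    label = classify(request)
    # Fall back to the default handler when the label is unknown.
    handler = handlers.get(label, handlers[default])
    return handler(request)


def vote(workers: List[Step], question: str) -> str:
    """Parallel voting: run independent workers and keep the most common answer."""
    with ThreadPoolExecutor(max_workers=len(workers)) as pool:
        answers = list(pool.map(lambda worker: worker(question), workers))
    return Counter(answers).most_common(1)[0][0]


if __name__ == "__main__":
    # Stub steps stand in for model calls (assumption for illustration only).
    slug = run_chain(
        [str.strip, str.lower, lambda s: s.replace(" ", "-")],
        "  Build A Pipeline ",
    )
    print(slug)

    handlers = {
        "billing": lambda r: "billing:" + r,
        "general": lambda r: "general:" + r,
    }
    classify = lambda r: "billing" if "invoice" in r else "general"
    print(route("invoice 42", handlers, classify, default="general"))

    print(vote([lambda q: "yes", lambda q: "no", lambda q: "yes"], "Ship it?"))
```

In a real pipeline each stub would wrap a model or API call; keeping every stage behind the same text-in, text-out signature is one possible design choice that lets chaining, routing, and voting compose freely.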