Agent Theory

    Cognitive Architectures for Intelligent Agents: From BDI to Modern AI

    11 min read
    By Fritz Lauer
    Cognitive Architecture
    Agent Design
    Theory
    BDI

    Modern AI agents didn't emerge from nowhere. They build on decades of research in cognitive architectures—frameworks for modeling how intelligent agents represent knowledge, reason, and act.

    Understanding classical architectures like BDI (Belief-Desire-Intention), SOAR, and ACT-R provides a foundation for designing AI agents that go beyond pattern matching to exhibit genuine reasoning capabilities.

    What is a Cognitive Architecture?

    A cognitive architecture is a blueprint for the mind—a specification of the structures and processes that enable intelligent behavior. For AI agents, it defines:

    • How agents represent knowledge (beliefs about the world)
    • How agents set goals (desires, intentions, objectives)
    • How agents reason (planning, problem-solving, learning)
    • How agents act (selecting and executing actions)

    Unlike LLMs that generate text based on statistical patterns, cognitive architectures provide structured mental models that enable agents to reason about beliefs, goals, and plans explicitly.

    BDI: Belief-Desire-Intention Architecture

    The most influential cognitive architecture for agent systems, BDI models agents as having:

    Beliefs: What the agent knows about the world

    • Current state of the environment
    • Properties of objects and entities
    • Causal relationships
    • Uncertainty and confidence levels

    Example: Document intelligence agent beliefs

    beliefs = {
      "document_123": {
        "type": "patent_filing",
        "contains_novel_claims": True,
        "confidence": 0.87,
        "related_documents": ["doc_45", "doc_78"],
        "key_concepts": ["autonomous_vehicles", "sensor_fusion"]
      },
      "stakeholder_legal_team": {
        "needs_citation_accuracy": True,
        "prefers_detailed_analysis": True,
        "last_feedback_rating": 8.5
      }
    }
    

    Desires: What the agent wants to achieve

    • High-level goals
    • Preferences and priorities
    • Success criteria

    Example: Document intelligence agent desires

    desires = {
      "extract_novel_insights": {
        "priority": "high",
        "success_metric": "stakeholder_rating > 8.0"
      },
      "maintain_citation_accuracy": {
        "priority": "critical",
        "success_metric": "accuracy > 95%"
      },
      "minimize_analysis_time": {
        "priority": "medium",
        "success_metric": "time_per_doc < 5min"
      }
    }
    

    Intentions: What the agent has committed to doing

    • Selected plans being executed
    • Active commitments
    • Resource allocations

    Example: Document intelligence agent intentions

    intentions = {
      "current_plan": "analyze_document_123",
      "plan_steps": [
        {"action": "extract_text", "status": "completed"},
        {"action": "identify_key_concepts", "status": "in_progress"},
        {"action": "assess_novelty", "status": "pending"},
        {"action": "generate_stakeholder_summary", "status": "pending"}
      ],
      "allocated_resources": {"llm_tokens": 15000, "vector_db_queries": 50}
    }
    

    BDI Reasoning Cycle

    1. Update Beliefs: Perceive environment, update belief state
    2. Generate Options: Given beliefs and desires, what plans could achieve goals?
    3. Select Intention: Choose plan based on priorities, resource constraints, likelihood of success
    4. Execute: Perform next action in selected plan
    5. Revise: If plan fails or context changes, reconsider intentions

    Why BDI Matters for Modern AI Agents

    BDI provides explainability. When an agent makes a decision, we can inspect:

    • What beliefs led to this decision?
    • Which goals was the agent trying to achieve?
    • What alternative plans did it consider?
    • Why was this plan selected over others?

    This transparency is critical for high-stakes applications (finance, healthcare, legal) where "the model said so" isn't sufficient justification.

    SOAR: State, Operator, And Result Architecture

    SOAR models cognition as problem-solving through state space search:

    Core Concepts

    States: Representations of problem situations Operators: Actions that transform one state into another Rules: If-then rules that propose operators when conditions match Impasse: When no operator can be applied, agent enters impasse

    SOAR's Unique Contribution: Universal Subgoaling

    When SOAR hits an impasse (can't solve main problem), it creates a subgoal:

    Main goal: "Analyze document for novel insights" Impasse: "Can't assess novelty—don't know what's already known in this domain" Subgoal: "Query knowledge base for existing work in this domain"

    Once subgoal is solved, SOAR returns to main goal with new knowledge.

    SOAR for AI Agents: Hierarchical Task Decomposition

    Modern agents implement SOAR-inspired hierarchical reasoning:

    def analyze_document(doc):
        # Main goal: Extract insights
        if not has_domain_knowledge(doc.domain):
            # Impasse → create subgoal
            retrieve_domain_knowledge(doc.domain)
        
        if not has_stakeholder_preferences(doc.stakeholder):
            # Another impasse → another subgoal
            load_stakeholder_preferences(doc.stakeholder)
        
        # Now proceed with main goal
        insights = extract_insights(doc)
        return format_for_stakeholder(insights, doc.stakeholder)
    

    This pattern—detect impasse, solve subgoal, resume main goal—enables agents to handle complex tasks by breaking them down dynamically.

    ACT-R: Adaptive Control of Thought—Rational

    ACT-R models human cognition with two memory systems:

    Declarative Memory: Factual knowledge

    • Stored as chunks (structured knowledge units)
    • Retrieved based on activation (recency, frequency, relevance)
    • Parallel to our long-term memory in agent systems

    Procedural Memory: Skills and procedures

    • Stored as production rules (if-then patterns)
    • Execute automatically when conditions match
    • Parallel to learned agent behaviors

    ACT-R's Contribution: Activation and Learning

    ACT-R chunks have activation levels that decay over time but strengthen with use:

    activation = base_level + sum(relevance_to_context) - decay_over_time
    

    This models human memory: recently used, frequently used, and contextually relevant knowledge is easier to retrieve.

    ACT-R for AI Agents: Memory Prioritization

    Implement ACT-R-inspired memory systems:

    def retrieve_relevant_knowledge(query, context):
        # Retrieve candidate knowledge chunks
        candidates = vector_db.search(query, limit=100)
        
        # Score by activation
        for chunk in candidates:
            chunk.activation = (
                chunk.base_activation +  # How often retrieved historically
                chunk.recency_score +    # How recently retrieved
                chunk.context_match(context)  # Relevance to current context
            )
        
        # Return highest activation chunks
        return sorted(candidates, key=lambda c: c.activation, reverse=True)[:10]
    

    This ensures agents retrieve not just semantically similar knowledge, but knowledge that's practically useful given recency and usage patterns.

    Integrating Cognitive Architectures with Modern LLMs

    LLMs are powerful but lack structured reasoning. Cognitive architectures provide the scaffolding:

    Pattern 1: BDI + LLM

    • Beliefs: Maintained in structured databases (vector DB, graph DB)
    • Desires: Defined explicitly (goal hierarchies, constraints)
    • Intentions: Selected via LLM reasoning over beliefs and desires
    • Execution: LLM generates actions, structured system tracks state

    Pattern 2: SOAR + LLM

    • State representation: Structured (JSON, database schemas)
    • Operator proposal: LLM generates possible actions
    • Impasse detection: Rule-based (if no valid operator, enter impasse)
    • Subgoaling: LLM proposes subgoals to resolve impasses

    Pattern 3: ACT-R + LLM

    • Declarative memory: Vector DB with activation-based retrieval
    • Procedural memory: Prompt templates + learned patterns
    • Memory retrieval: Hybrid search (semantic + activation)
    • Learning: Update activation scores based on usage

    Real-World Application: Multi-Stakeholder Document Analysis

    We implemented a BDI architecture for a document intelligence agent:

    Beliefs:

    • Document properties (length, domain, complexity)
    • Stakeholder preferences (legal team wants citations, executives want summaries)
    • Domain knowledge (retrieved from vector DB)
    • Past analysis outcomes (episodic memory)

    Desires:

    • Extract novel insights (priority: high)
    • Maintain citation accuracy (priority: critical)
    • Deliver in 5 minutes (priority: medium)
    • Adapt to stakeholder preferences (priority: high)

    Intention Selection:

    • LLM reasons over beliefs and desires to select plan:
      • If document is complex AND stakeholder is legal team → detailed analysis plan
      • If document is routine AND stakeholder is executive → summary plan
      • If domain knowledge is missing → retrieve-then-analyze plan

    Execution:

    • Execute selected plan step-by-step
    • If plan fails (e.g., retrieval returns no relevant knowledge) → revise intentions

    Results:

    • 92% stakeholder satisfaction (vs. 78% with non-BDI baseline)
    • Explainable decisions ("I chose detailed analysis because legal team requires citation accuracy and document contains novel claims")
    • Adaptive behavior (same agent serves multiple stakeholders effectively)

    Related Reading

    Conclusion

    Cognitive architectures aren't academic curiosities—they're blueprints for building agents that reason, not just generate text. BDI provides explainable goal-directed behavior. SOAR enables hierarchical problem-solving. ACT-R models human-like memory and learning.

    By integrating these architectures with modern LLMs, we get the best of both worlds: the broad knowledge and language understanding of LLMs, plus the structured reasoning and explainability of cognitive architectures.

    The future of AI agents isn't just bigger models—it's models embedded in cognitive architectures that enable genuine intelligence.

    We Value Your Privacy

    We use cookies to enhance your browsing experience, analyze site traffic, and personalize content. You can choose which cookies to accept. Read our Privacy Policy to learn more.