# Part 6: Planning in Agents + Reasoning Models
---
## Woah! We’re more than halfway through our course!
Over the past few parts, we talked about what agents can do:
- Use tools
- Retrieve information through RAG
- Pass everything in a clean format using MCP
But all of that assumes something fundamental:
That the agent actually knows what to do next.
And that’s where things often break.
Today, we shift focus from tools and inputs to how agents think — more specifically, how modern models are starting to plan and why that changes how we design real-world systems.
---
## Why Planning Matters in Agentic Systems
Here are a few examples to start with.
If you ask an agent:
> “What’s 13 multiplied by 47?”
…it can either solve it directly or call a calculator. This is a one-step task — no real planning needed.
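For a one-step task like this, the entire "plan" is a single tool call. Here's a minimal sketch in Python; the `calculator` tool and its wiring are illustrative, not tied to any real agent framework:

```python
# Minimal sketch of a one-step tool call. No agent framework is assumed;
# the model's "decision" to use the tool is hard-coded to show that
# there is nothing to plan.

def calculator(expression: str) -> float:
    """Toy calculator tool: evaluates a simple arithmetic expression."""
    # eval() is acceptable for a controlled demo; never use it on untrusted input.
    return eval(expression, {"__builtins__": {}}, {})

TOOLS = {"calculator": calculator}

def run_one_step(task: str) -> float:
    # A real agent would let the model emit this tool call itself.
    tool_name, tool_args = "calculator", {"expression": "13 * 47"}
    return TOOLS[tool_name](**tool_args)

print(run_one_step("What's 13 multiplied by 47?"))  # 611
```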
Now imagine asking:
> “Find all our Q1 clients in the healthcare sector, check which ones are overdue on payments, and draft personalized emails with new payment links.”
In this case, the agent needs to:
- Understand the instruction
- Break it into manageable parts
- Retrieve the right data
- Choose tools
- Perform steps in order
- Handle exceptions
- Know when the task is done
That loop of interpreting, sequencing, and acting is planning.
The agent (meaning the model) is expected to figure this out on its own — including which tools to use and how to apply the information it has.
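Stripped down to code, that loop looks something like the sketch below. The step names and the `plan` and `act` helpers are hypothetical scaffolding to show the shape of the loop, not a real agent framework:

```python
from dataclasses import dataclass, field

# Hypothetical scaffolding: a plan is an ordered list of steps, and the
# agent works through them while tracking state.

@dataclass
class Step:
    description: str   # e.g. "retrieve Q1 healthcare clients"
    tool: str          # e.g. "crm_search", "email_drafter"
    done: bool = False

@dataclass
class AgentState:
    goal: str
    steps: list[Step] = field(default_factory=list)
    results: dict = field(default_factory=dict)

def plan(state: AgentState) -> list[Step]:
    """In a real agent, this is the model call that breaks the goal into ordered steps."""
    return [
        Step("find Q1 clients in the healthcare sector", tool="crm_search"),
        Step("check which clients are overdue on payments", tool="billing_lookup"),
        Step("draft personalized emails with new payment links", tool="email_drafter"),
    ]

def act(step: Step, state: AgentState) -> str:
    """Placeholder for the tool call the model chooses for this step."""
    return f"ran {step.tool} for: {step.description}"

def run_agent(goal: str) -> AgentState:
    state = AgentState(goal=goal)
    state.steps = plan(state)                               # interpret + break down
    for step in state.steps:                                # sequence
        state.results[step.description] = act(step, state)  # act
        step.done = True
    return state                                            # done once every step is done

state = run_agent("Email overdue Q1 healthcare clients with new payment links")
for description, result in state.results.items():
    print(description, "->", result)
```

A real agent would also re-plan when a step fails or returns something unexpected; the point here is that ordering, state, and a stopping condition all live outside the individual model calls.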
---
## Why Traditional LLMs Struggle With Planning
Most general-purpose LLMs were never trained to do this.
They are trained to predict the next token based on the previous context — nothing more.
They excel at:
- Continuing sentences
- Generating summaries
- Answering direct questions
…but they behave more like short-sighted generators.
They complete what’s in front of them but aren’t wired to think ahead.
When asked to act as agents in multi-step, decision-making tasks, they tend to:
- Skip steps
- Repeat actions
- Overcomplicate simple things
- Lose the plot halfway through
---
## Early Attempts to Improve Reasoning
To patch this gap, builders experimented with prompting techniques to nudge planning behavior.
A popular example: Chain-of-Thought prompting — adding “Let’s think step by step” to break tasks into stages.
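In practice this was purely a prompt change. A minimal sketch of the idea, where `call_model` is a stand-in for whatever LLM client you use:

```python
# Chain-of-Thought is a prompting trick, not a change to the model.
# `call_model` is a placeholder for any chat-completion client.

def call_model(prompt: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

question = "A store has 13 boxes with 47 pens each. How many pens in total?"

plain_prompt = question
cot_prompt = question + "\n\nLet's think step by step."

# The second prompt tends to elicit intermediate reasoning before the final
# answer, but it gives the model no tools, no state, and no real plan.
```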
This worked for logic puzzles and structured Q&A, but fell short for real agents working with:
- Tools
- Unpredictable inputs
- Changing state
Because underneath, these models still weren’t trained for planning — they were just responding to prompt tricks.
---
## Then Came Reasoning Models
The next shift: train models to plan by design.
This gave rise to Large Reasoning Models (LRMs).
- LLMs: input → LLM → output statement
- LRMs: input → LRM → plan steps + output statement
All still text, but LRMs are nudged during training to think before acting.
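To make that concrete, here is an illustrative pair of response shapes for the same request. The field names are made up and vary by provider; some expose the reasoning, others keep it hidden:

```python
# Illustrative only: hypothetical response shapes, not any provider's API.

llm_response = {
    "output": "Here are the drafted emails for the overdue clients..."
}

lrm_response = {
    "reasoning": [
        "1. Pull Q1 clients tagged 'healthcare' from the CRM.",
        "2. Cross-check billing to find overdue invoices.",
        "3. Generate a payment link per client, then draft each email.",
    ],
    "output": "Here are the drafted emails for the overdue clients...",
}

# An agent runtime can log or inspect lrm_response["reasoning"] to see why
# the model chose its next action, a planning signal a plain LLM never exposes.
```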
---
Examples:
- OpenAI’s o-series (o1, o3) — first public examples
- DeepSeek’s DeepSeek-R1 — an open-weights model trained with reinforcement learning to reason before it answers
- Google’s Gemini thinking models
- Anthropic’s Claude 3.7 Sonnet (extended thinking mode)
Some even activate reasoning only when needed.
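For example, several providers expose per-request controls over how much reasoning to spend. The sketch below uses the `reasoning_effort` parameter from the OpenAI Python SDK as it existed at the time of writing; treat the exact parameter and model names as assumptions and check your provider's current docs:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def ask(question: str, hard: bool) -> str:
    # reasoning_effort and the model name are assumptions that may differ
    # across SDK versions and providers.
    response = client.chat.completions.create(
        model="o3-mini",
        reasoning_effort="high" if hard else "low",
        messages=[{"role": "user", "content": question}],
    )
    return response.choices[0].message.content

print(ask("What's 13 multiplied by 47?", hard=False))
print(ask("Plan the overdue-payments email workflow for Q1 healthcare clients.", hard=True))
```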
---
## How They Fit in Agentic Design
The main value of reasoning models is in improving the planning component — the part that asks:
> “What should I do next, and why?”
In enterprise use cases, planning is where agents often fail.
Reasoning models can help, but they aren’t magic.
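One common way to slot them in, sketched below with placeholder model names and a stubbed `call` helper: let the reasoning model produce the plan, and hand each step to a cheaper model (or plain code) for execution.

```python
# Sketch of a split-role design: an expensive reasoning model plans, a cheaper
# model executes. Model names are placeholders; `call` is a stub for your client.

PLANNER_MODEL = "your-reasoning-model"
EXECUTOR_MODEL = "your-cheaper-model"

def call(model: str, prompt: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

def make_plan(goal: str) -> list[str]:
    raw = call(PLANNER_MODEL, f"Break this goal into numbered steps:\n{goal}")
    return [line.strip() for line in raw.splitlines() if line.strip()]

def execute(step: str, context: str) -> str:
    return call(EXECUTOR_MODEL, f"Context so far:\n{context}\n\nDo this step:\n{step}")

def run(goal: str) -> str:
    context = ""
    for step in make_plan(goal):
        context += "\n" + execute(step, context)
    return context
```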
---
## Use Them With Caution
Reasoning models are still new and come with tradeoffs:
- Overthink simple tasks
- Generate longer outputs
- Increase latency and cost
- Hallucinate logical-sounding but incorrect plans
Rule of thumb:
- Don’t start with a reasoning model.
- Begin with a mid-size base model.
- Only switch if you see clear planning failures — and even then, evaluate the real impact (a routing sketch follows below).
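That rule of thumb can be wired directly into routing logic. In the sketch below, the model names, the `call` stub, and the failure check are all placeholders you would replace with your own client and evaluations:

```python
# Sketch of "escalate only on planning failure". Everything here is a
# placeholder: swap in your own client, models, and real eval checks.

BASE_MODEL = "mid-size-base-model"
REASONING_MODEL = "reasoning-model"

def call(model: str, task: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

def looks_like_planning_failure(output: str) -> bool:
    """Replace with real checks: skipped steps, repeated actions, unmet acceptance criteria."""
    return output.strip() == "" or "i'm not sure" in output.lower()

def run_task(task: str) -> str:
    output = call(BASE_MODEL, task)
    if looks_like_planning_failure(output):
        # Escalate only when the cheaper model demonstrably failed to plan.
        output = call(REASONING_MODEL, task)
    return output
```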
---
## Up Next
In the next part, we’ll shift to another core component of agents: memory. We’ll look at how agents can remember effectively and why it matters.