Context engineering is the practice of building dynamic systems that provide the right information and tools, in the right format, so that an AI application can accomplish a task. Context can be characterized along two key dimensions:
By mutability:
Static context: Immutable data that doesn’t change during execution (e.g., user metadata, database connections, tools)
Dynamic context: Mutable data that evolves as the application runs (e.g., conversation history, intermediate results, tool call observations)
By lifetime:
Runtime context: Data scoped to a single run or invocation
Cross-conversation context: Data that persists across multiple conversations or sessions
Runtime context refers to local context: data and dependencies your code needs to run. It does not refer to:
The LLM context, which is the data passed into the LLM’s prompt.
The “context window”, which is the maximum number of tokens that can be passed to the LLM.
Runtime context is a form of dependency injection and can be used to optimize the LLM context. It lets to provide dependencies (like database connections, user IDs, or API clients) to your tools and nodes at runtime rather than hardcoding them. For example, you can use user metadata in the runtime context to fetch user preferences and feed them into the context window.
LangGraph provides three ways to manage context, which combines the mutability and lifetime dimensions:
Static runtime context represents immutable data like user metadata, tools, and database connections that are passed to an application at the start of a run via the context argument to invoke/stream. This data does not change during execution.
from langchain.tools import tool, ToolRuntime@tooldef get_user_email(runtime: ToolRuntime[ContextSchema]) -> str: """Retrieve user information based on user ID.""" # simulate fetching user info from a database email = get_user_email_from_db(runtime.context.user_name) return email
The Runtime object can be used to access static context and other utilities like the active store and stream writer.
See the Runtime documentation for details.
Dynamic runtime context represents mutable data that can evolve during a single run and is managed through the LangGraph state object. This includes conversation history, intermediate results, and values derived from tools or LLM outputs. In LangGraph, the state object acts as short-term memory during a run.
In an agent
In a workflow
Example shows how to incorporate state into an agent prompt.State can also be accessed by the agent’s tools, which can read or update the state as needed. See tool calling guide for details.
from langchain.agents import create_agentfrom langchain.agents.middleware import dynamic_prompt, ModelRequestfrom langchain.agents import AgentStateclass CustomState(AgentState): user_name: str@dynamic_promptdef personalized_prompt(request: ModelRequest) -> str: user_name = request.state.get("user_name", "User") return f"You are a helpful assistant. User's name is {user_name}"agent = create_agent( model="claude-sonnet-4-6", tools=[...], state_schema=CustomState, middleware=[personalized_prompt],)agent.invoke({ "messages": "hi!", "user_name": "John Smith"})
Turning on memory
Please see the memory guide for more details on how to enable memory. This is a powerful feature that allows you to persist the agent’s state across multiple invocations. Otherwise, the state is scoped only to a single run.
Dynamic cross-conversation context represents persistent, mutable data that spans across multiple conversations or sessions and is managed through the LangGraph store. This includes user profiles, preferences, and historical interactions. The LangGraph store acts as long-term memory across multiple runs. This can be used to read or update persistent facts (e.g., user profiles, preferences, prior interactions).