Amazon Bedrock AgentCore Runtime now supports shell command execution via InvokeAgentRuntimeCommand API. Builders can execute commands, stream output over HTTP/2, and receive exit codes without custom infrastructure.

Agents can now execute shell commands directly within runtime sessions with real-time streaming, eliminating custom command infrastructure and enabling interactive infrastructure automation.
Signal analysis
Here at industry sources, we tracked this release closely because it fundamentally changes how agents interact with host systems. Amazon Bedrock AgentCore Runtime now exposes the InvokeAgentRuntimeCommand API, allowing agents to execute arbitrary shell commands directly within an active runtime session. This is not a wrapper - it's native support with real-time streaming over HTTP/2 and proper exit code handling.
Previously, agents that needed system-level access required builders to architect custom command execution layers, manage subprocess safety, and handle streaming/buffering manually. This addition collapses that complexity into an API call. The runtime handles the plumbing - your agent focuses on what to execute and when.
The HTTP/2 streaming is the operational detail that matters most. You get output as it arrives, not batched at the end. For long-running diagnostics, log aggregation, or interactive troubleshooting, that streaming behavior changes the user experience entirely.
This changes the calculus for agent-based infrastructure automation and DevOps tooling. If you've been hesitant to build agents that perform system tasks because command execution felt bolted-on, this removes that friction. You can now have agents execute system diagnostics, deploy code, manage resources, or parse logs as a first-class runtime capability.
The streaming output is critical for user feedback loops. When an agent runs a 30-second deployment script, users see progress in real time rather than waiting for the full result. This matters for interactive CLI tools, development environments, and real-time monitoring dashboards where agents assist human operators.
Security and error handling become explicit concerns. Exit codes let you build proper error detection - your agent can branch logic based on command success or failure. But you're now responsible for what commands agents can invoke. Build allowlists, not open shells. Consider output filtering if agents interact with sensitive systems.
The session context is important. These commands execute within the scope of an active AgentCore Runtime session, meaning state carries across invocations. An agent can navigate directories, set environment variables, and maintain context between separate command calls. Builders should design agent workflows with this statefulness in mind.
System administration and infrastructure management becomes agent-native. An agent can diagnose server issues by running diagnostic commands, parsing output, and recommending fixes in natural language. No separate command execution service needed. This is particularly relevant for AWS Lambda-based agents that need to manage EC2 instances, RDS databases, or S3 operations through shell scripts.
Development tooling and CI/CD acceleration are clear targets. An agent can review pull requests, run tests, execute build commands, and stream results back to developers. The real-time streaming matters here - developers see test progress as it happens rather than waiting for aggregated results.
Log aggregation and analysis workflows benefit directly. Agents can query logs, run parsing commands, aggregate results, and provide structured summaries. Commands can chain - one command feeds into the next, all within the same session context.
Interactive troubleshooting assistants become feasible. A user describes a problem, the agent runs diagnostic commands, streams the output, interprets results, and recommends next steps. The conversation flows naturally because the agent has direct access to system state.
This release accelerates AWS's push toward agentic infrastructure. Lambda functions have long been compute primitives; agents are becoming control primitives. When agents can execute shell commands natively, they move from observing systems to actively managing them. This is table stakes for enterprise adoption of LLM-based automation.
The HTTP/2 streaming is AWS signaling that interactive agent experiences matter. Batch processing is sufficient for background work, but real-time streaming unlocks assistant and co-pilot use cases. Expect more AWS primitives to add streaming capabilities as agents become more central to the platform.
This also reveals a maturity threshold - Bedrock AgentCore Runtime is moving from prototype to production infrastructure. Direct system access, proper error handling, and streaming output are production requirements. When foundational services add these capabilities, it signals the market is consolidating around certain patterns.
The momentum in this space continues to accelerate.
Best use cases
Open the scenarios below to see where this shift creates the clearest practical advantage.
One concise email with the releases, workflow changes, and AI dev moves worth paying attention to.
More updates in the same lane.
Mistral Forge allows organizations to convert proprietary knowledge into custom AI models, enhancing enterprise capabilities.
Version 8.1 of the MongoDB Entity Framework Core Provider brings essential updates. This article analyzes the implications for builders.
The latest @composio/core update enhances Toolrouter with custom tool integration, expanding flexibility for developers.