Summary
RLM Agents (Reinforcement Learning from Model agents) achieve more reliable and maintainable behavior when they communicate using structured outputs — such as JSON schemas — rather than free-form natural language. This design pattern reduces parsing errors, improves interoperability, and makes agent systems easier to debug and monitor.
Key Points
- Structured outputs enforce a predictable contract between agents, eliminating ambiguity in multi-agent conversations.
- Healthier agent systems result from reduced hallucination, simpler error handling, and clearer traceability.
- This approach is especially valuable in autonomous systems where agents must coordinate without human intervention.
- The pattern mirrors best practices in software engineering (e.g., typed APIs) applied to agent communication.
Concepts
- RLM Agent: An agent built on reinforcement learning from models, often used in multi-agent frameworks where policies are learned through interaction with an environment or other agents.
- Structured Outputs: Data formatted according to a predefined schema (e.g., JSON, Protocol Buffers) that agents produce and consume, as opposed to unstructured text.
- Inter-agent Communication: The exchange of messages between multiple autonomous agents; when structured, it becomes machine-verifiable and schema-compliant.
Details
The original observation by @neural_avb notes that RLM Agents exhibit more robust and "healthy" behavior when their inter-agent messages follow a structured schema rather than being free-form text. The core reasoning is that structured outputs impose a formal contract: each agent knows exactly what fields to expect, what types they must be, and what values are valid.

In practice, this means:
- Reduced parsing failures – Agents no longer need to guess meaning from ambiguous natural language; they can directly deserialize validated data.
- Easier debugging – Logs and traces become machine-readable, enabling automated inspection and alerting.
- Better scaling – As more agents join a system, a shared schema prevents miscommunication and allows independent development of agent policies.
- Alignment with tool-use – Structured outputs naturally map to function calls and external API interactions, which many modern LLM-based agents already use.
The recommendation applies broadly to any multi-agent system where reliability and maintainability are priorities. While the original post focuses on RLM Agents, the principle extends to LLM-powered agents in general. Adopting structured outputs is a lightweight change that yields disproportionate improvements in agent system health.
See also: Open Knowledge Format, The Log Is the Agent, Agent Harness, Loop Engineering