The term "Agent Operating System" went from research paper to product category between January and June 2026. What started with one viral project has become a distinct layer of the AI stack — sitting above model providers and below agent applications, providing the runtime substrate that agents share.
This survey covers the landscape as of mid-2026: what an agent OS actually is, the architectural approaches in play, the players, and where the category is heading.
What is an Agent OS?
The clearest definition comes from Kevin Kim at yarnnn: "An agent framework is a library you import. An agent operating system is a substrate. Frameworks help you compose model calls into something useful for one task. Operating systems hold persistent state for many agents across many tasks, indefinitely."
The Knowlee team's analysis identifies six primitives that define the category:
- A process model — agents are processes with lifecycle management (start, stop, pause, resume, checkpoint)
- A memory hierarchy — shared state across agents, sessions, and time
- A scheduling / orchestration layer — what runs when, with what priority, under what constraints
- A security model — capability-based access control, identity, sandboxing
- An observability surface — the operator can see what every agent is doing
- A coordination layer — inter-agent communication without explicit message-passing
The Namzu team draws the boundary through two questions:
- Frameworks answer: "What does the agent do?" — composition, prompts, tool definitions, state graphs.
- Kernels answer: "How does the agent run?" — process lifecycle, scheduling, memory boundaries, IPC, sandboxing, checkpoint/resume, runtime observability.
The best production architectures use both. Build agents with a framework; run them inside an OS.
Why the category exists now
The AIOS paper (Mei et al., 2024) from Rutgers proposed the architecture that took the operating-system metaphor seriously — not as marketing language, but as an actual kernel: a scheduler for agent execution, a memory manager for agent context, an I/O layer for tool calls, and an isolation model for concurrent sessions.
The Vonng analysis maps the analogy to traditional OS subsystems:
| Subsystem | Traditional OS | Agent OS | Current State |
|---|
| Memory Management | Virtual memory, page swapping | Context Engineering, RAG | Most complex, highest value |
| File System | ext4/ZFS | State persistence, memory storage | Highly deterministic (databases) |
| Process Management | fork/exec/scheduler | Agent lifecycle, task orchestration | Red ocean (LangGraph et al.) |
| I/O Management | Device drivers | Tool calling, MCP/CLI | Currently hot (MCP, Skills) |
| Security | Permissions, audit, sandbox | Isolation, observability, decision audit | About to explode |
The market forces driving the OS layer: the Jacar production analysis found that on teams running 10+ production agents, the cost is no longer in building agents — it's in operating them. An agent OS with uniform observability, shared memory, and built-in governance saves weeks per production agent.
The OS landscape
CortexPrism — the open-source, Apache 2.0 agent OS
Built in TypeScript on Deno. Single binary. Apache 2.0 license. Ships as a complete stack: agent loop, 5-tier memory, Parallax security, plugin marketplace, web UI, REST API, CLI.
Architecture: Microkernel-inspired. 12 capability groups with RBAC. A syscall dispatcher mediates every agent action. The kernel is split into 6 packages (core ← gate ← ai ← server ← cli and core ← ai ← infra ← cli).
Key features:
- 5-tier memory: episodic, semantic, procedural, working, graph — with hybrid FTS5 + vector retrieval
- Parallax 3-stage security gate: capability check → LLM supervisor → encrypted vault
- 30 LLM providers through unified interface with CascadeRouter and self-learning Model Quartermaster
- Plugin system: ESM, MCP, and WASM. Marketplace for discovery.
- Single binary, no Docker required.
Positioning: The open-source agent OS for developers who want a complete platform with full stack integration — not a framework to build on, but an OS to run agents in.
OpenFang — the Rust-based security-first OS
Rust-based agent OS. Single ~32MB binary. 16 security systems. WASM dual-metered sandbox. Ed25519 manifest signing. Merkle audit trails.
Key features:
- Microkernel architecture with strict layering
- Four-layer memory stratification (L1 Raw → L2 Feature → L3 Structure → L4 Pattern)
- 40 messaging channels
- Pre-built autonomous capability packages: Clip, Lead, Collector, Predictor, Researcher
- Token efficiency savings of up to 500% vs traditional frameworks
Positioning: Security-first. Designed for environments where process isolation is a hard requirement. The "Rust agent OS with 40 channels and autonomous capability packages."
Pilox — the federated self-hosted OS
Self-hosted agent OS under BSL 1.1. Built-in A2A protocol federation, visual workflow engine, and marketplace.
Key features:
- Mesh federation with HMAC integrity and Ed25519 manifest signing
- Firecracker + WASM agent isolation
- Native MCP and A2A support
- Zero-trust identity (SPIFFE, roadmap)
- GPU fractional scheduling (MIG + HAMi)
Positioning: The federated option. "The only self-hosted agent OS with A2A federation, visual workflows, and a built-in marketplace."
XKernel — the AI-native OS experiment
XKernel is an operating system built from the ground up for AI agents — not on top of Linux, but as a complete 4-layer microkernel architecture with 21 Rust crates.
Key features:
- Real process supervisor daemon with captured I/O, restart policies, and health monitoring
- Capability-based security (seL4/OCap model)
- Cognitive task scheduling in 4 dimensions (criticality, deadline, efficiency, cost)
- Framework translation: LangChain, CrewAI, AutoGen concepts map to kernel primitives
- 22-syscall kernel interface specification
Positioning: Research-grade. Proves the OS concept at the kernel level, not the application level.
AgentRT / OpenFang — the microkernel
Part of the OpenFang/AgentOS ecosystem. Pure kernel with atomic mechanisms. Four-layer security built-in: sandbox isolation, permission arbitration, input sanitization, audit trail.
Positioning: The thin kernel approach. Provides only mechanisms; frameworks provide policy on top.
OSTack / Kronos / Flowork — the long tail
The ecosystem is expanding with specialized OSes:
Architectural approaches: a taxonomy
Approach 1: The microkernel (OpenFang, XKernel, AgentRT)
A thin kernel provides only atomic mechanisms — process lifecycle, IPC, scheduling, memory boundaries. Agent logic lives in user space. Security enforced at syscall boundaries. Inspired by seL4 and capability-based OS design.
Strengths: Minimal trusted computing base. Strong isolation. Formally verifiable security properties (in theory).
Weaknesses: Higher inter-component latency. More complex to build agents (more boundaries to cross).
Approach 2: The monolithic OS (CortexPrism, Pilox)
A complete stack in one process. Kernel, agent loop, memory, security, tools, UI — all in the same binary. Tight integration between subsystems.
Strengths: Lower latency. Simpler deployment (single binary). Coherent developer experience.
Weaknesses: Larger trusted computing base. Harder to formally verify.
Approach 3: The filesystem OS (OSTack)
Agents coordinate through the filesystem. No explicit message-passing. No orchestrator. The OS is invisible to agents — coordination happens at write time.
Strengths: Minimal agent awareness of the OS. Elegant for certain workloads. Natural fit for Unix-centric teams.
Weaknesses: Limited to filesystem-compatible coordination patterns. Latency depends on storage performance.
Approach 4: The bootable OS (Flowork)
A full operating system that boots from USB. Agents are WASM citizens in a hardened kernel. Sovereign computing — no host OS, no cloud dependency.
Strengths: Maximum isolation. True sovereignty.
Weaknesses: Niche use case. Significant deployment friction.
The Knowlee categorization is the clearest:
-
Agent Framework (LangChain, CrewAI, LangGraph): a library for building agents. Your application owns the runtime. Best for single-purpose agents, short-lived tasks, engineering teams that need graph-level control.
-
Agent Platform (Salesforce Agentforce, Microsoft Copilot): a vertical product for a specific domain. Excellent at one thing, opaque to everything else. Best for bounded verticals (sales, support, recruiting).
-
Agent Operating System: the runtime and governance layer across multiple verticals, multiple agents, multiple frameworks. Agents share memory, observability, security, and coordination. Best for fleets of 10+ agents, multi-operator environments, regulated industries.
The Yarnnn prediction: "The agent OS market will look more like macOS-vs-Windows-vs-Linux than like the current 'fifty agent startups' landscape — three or four serious systems, each with an installable application ecosystem."
When an Agent OS is the right call
From the Knowlee decision framework and the Jacar production analysis:
You need an Agent OS when:
- You're running 10+ production agents that need to coordinate
- Agents must persist across days or weeks with accumulating context
- Multiple teams build agents with different frameworks, but need shared memory and observability
- Governance and audit requirements are non-negotiable (EU AI Act, SOC 2)
- Non-engineers need to operate and oversee agents
You don't need an Agent OS when:
- You have fewer than 5 agents with no coordination needs
- Your agents are stateless, short-lived, single-purpose
- You're in research/experimentation where iteration speed matters more than production reliability
- Your problem is a single vertical with no ambition to add a second
- You're an engineering team that genuinely needs graph-level control
The OS primitives comparison
| Feature | CortexPrism | OpenFang | Pilox | XKernel |
|---|
| Language | TypeScript/Deno | Rust | Go | Rust |
| License | Apache 2.0 | MIT/Apache | BSL 1.1 | Open source |
| Architecture | Monolithic OS | Microkernel | Federated | 4-layer microkernel |
| Memory tiers | 5 (episodic, semantic, procedural, working, graph) | 4 (L1-L4 stratification) | N/A | N/A |
| Security model | Parallax (3-stage gate) | 16 security systems | Zero-trust SPIFFE | Capability (seL4 model) |
| Sandboxing | Docker + capability groups | WASM dual-metered | Firecracker + WASM | Process-level |
| LLM providers | 24 | Multiple | Multiple | N/A |
| Plugin system | ESM + MCP + WASM | Autonomous packages | MCP + A2A | Framework translation |
| Channels | CLI + Web UI + API | 40+ | Multiple | N/A |
| Marketplace | Yes | No | Yes (built-in) | No |
| Binary size | Single binary | ~32MB | N/A | 21 crates |
Note: This table reflects publicly documented features as of mid-2026. Some projects are earlier in their development cycles.
The road ahead
The Rankability analysis of 48 months of search demand shows agent-related search growing ~22× from 2022–23 to 2025–26, with the category now transitioning from speculative hype to uneven but real deployment.
The VoltusWave analysis makes the critical point: generic category searches ("what are AI agents") are declining, while specific product and comparison searches ("best AI agent for X," "[Product A] vs [Product B]") are rising. Buyers have moved past the category question and are evaluating specific vendors. This is how markets mature — and it's happening now.
Three trends to watch:
-
Agent-to-agent protocols. The Agent2Agent (A2A) protocol, MCP, and emerging standards for agent discovery and interop. The OS that supports these protocols natively will have an advantage as multi-agent systems become the norm.
-
Regulatory integration. The EU AI Act and similar frameworks will require per-agent, per-run, per-data-category audit trails. OSes that embed governance in the runtime — rather than bolting it on — will be the default for regulated deployments.
-
Specialization vs. consolidation. The ecosystem will likely consolidate around a small number of OSes, but specialized variants will persist for specific constraints: security-first (OpenFang), lightweight/edge, fully offline, enterprise-regulated.
Further reading
CortexPrism is an open-source, Apache 2.0 licensed AI Agent Operating System. Single binary. 5-tier memory. Parallax security. 30 LLM providers. Plugin marketplace. Install in one command.