ブログに戻る
Open Source AI Agent Operating Systems: The 2026 Landscape

Open Source AI Agent Operating Systems: The 2026 Landscape

A comprehensive survey of the emerging Agent OS category — what an agent operating system actually is, who's building them, how they compare architecturally, and why they're the next platform layer for AI.

CortexPrismJune 23, 202611 分で読めます3 ビュー

The term "Agent Operating System" went from research paper to product category between January and June 2026. What started with one viral project has become a distinct layer of the AI stack — sitting above model providers and below agent applications, providing the runtime substrate that agents share.

This survey covers the landscape as of mid-2026: what an agent OS actually is, the architectural approaches in play, the players, and where the category is heading.

What is an Agent OS?

The clearest definition comes from Kevin Kim at yarnnn: "An agent framework is a library you import. An agent operating system is a substrate. Frameworks help you compose model calls into something useful for one task. Operating systems hold persistent state for many agents across many tasks, indefinitely."

The Knowlee team's analysis identifies six primitives that define the category:

  1. A process model — agents are processes with lifecycle management (start, stop, pause, resume, checkpoint)
  2. A memory hierarchy — shared state across agents, sessions, and time
  3. A scheduling / orchestration layer — what runs when, with what priority, under what constraints
  4. A security model — capability-based access control, identity, sandboxing
  5. An observability surface — the operator can see what every agent is doing
  6. A coordination layer — inter-agent communication without explicit message-passing

The Namzu team draws the boundary through two questions:

  • Frameworks answer: "What does the agent do?" — composition, prompts, tool definitions, state graphs.
  • Kernels answer: "How does the agent run?" — process lifecycle, scheduling, memory boundaries, IPC, sandboxing, checkpoint/resume, runtime observability.

The best production architectures use both. Build agents with a framework; run them inside an OS.

Why the category exists now

The AIOS paper (Mei et al., 2024) from Rutgers proposed the architecture that took the operating-system metaphor seriously — not as marketing language, but as an actual kernel: a scheduler for agent execution, a memory manager for agent context, an I/O layer for tool calls, and an isolation model for concurrent sessions.

The Vonng analysis maps the analogy to traditional OS subsystems:

SubsystemTraditional OSAgent OSCurrent State
Memory ManagementVirtual memory, page swappingContext Engineering, RAGMost complex, highest value
File Systemext4/ZFSState persistence, memory storageHighly deterministic (databases)
Process Managementfork/exec/schedulerAgent lifecycle, task orchestrationRed ocean (LangGraph et al.)
I/O ManagementDevice driversTool calling, MCP/CLICurrently hot (MCP, Skills)
SecurityPermissions, audit, sandboxIsolation, observability, decision auditAbout to explode

The market forces driving the OS layer: the Jacar production analysis found that on teams running 10+ production agents, the cost is no longer in building agents — it's in operating them. An agent OS with uniform observability, shared memory, and built-in governance saves weeks per production agent.

The OS landscape

CortexPrism — the open-source, Apache 2.0 agent OS

Built in TypeScript on Deno. Single binary. Apache 2.0 license. Ships as a complete stack: agent loop, 5-tier memory, Parallax security, plugin marketplace, web UI, REST API, CLI.

Architecture: Microkernel-inspired. 12 capability groups with RBAC. A syscall dispatcher mediates every agent action. The kernel is split into 6 packages (core ← gate ← ai ← server ← cli and core ← ai ← infra ← cli).

Key features:

  • 5-tier memory: episodic, semantic, procedural, working, graph — with hybrid FTS5 + vector retrieval
  • Parallax 3-stage security gate: capability check → LLM supervisor → encrypted vault
  • 30 LLM providers through unified interface with CascadeRouter and self-learning Model Quartermaster
  • Plugin system: ESM, MCP, and WASM. Marketplace for discovery.
  • Single binary, no Docker required.

Positioning: The open-source agent OS for developers who want a complete platform with full stack integration — not a framework to build on, but an OS to run agents in.

OpenFang — the Rust-based security-first OS

Rust-based agent OS. Single ~32MB binary. 16 security systems. WASM dual-metered sandbox. Ed25519 manifest signing. Merkle audit trails.

Key features:

  • Microkernel architecture with strict layering
  • Four-layer memory stratification (L1 Raw → L2 Feature → L3 Structure → L4 Pattern)
  • 40 messaging channels
  • Pre-built autonomous capability packages: Clip, Lead, Collector, Predictor, Researcher
  • Token efficiency savings of up to 500% vs traditional frameworks

Positioning: Security-first. Designed for environments where process isolation is a hard requirement. The "Rust agent OS with 40 channels and autonomous capability packages."

Pilox — the federated self-hosted OS

Self-hosted agent OS under BSL 1.1. Built-in A2A protocol federation, visual workflow engine, and marketplace.

Key features:

  • Mesh federation with HMAC integrity and Ed25519 manifest signing
  • Firecracker + WASM agent isolation
  • Native MCP and A2A support
  • Zero-trust identity (SPIFFE, roadmap)
  • GPU fractional scheduling (MIG + HAMi)

Positioning: The federated option. "The only self-hosted agent OS with A2A federation, visual workflows, and a built-in marketplace."

XKernel — the AI-native OS experiment

XKernel is an operating system built from the ground up for AI agents — not on top of Linux, but as a complete 4-layer microkernel architecture with 21 Rust crates.

Key features:

  • Real process supervisor daemon with captured I/O, restart policies, and health monitoring
  • Capability-based security (seL4/OCap model)
  • Cognitive task scheduling in 4 dimensions (criticality, deadline, efficiency, cost)
  • Framework translation: LangChain, CrewAI, AutoGen concepts map to kernel primitives
  • 22-syscall kernel interface specification

Positioning: Research-grade. Proves the OS concept at the kernel level, not the application level.

AgentRT / OpenFang — the microkernel

Part of the OpenFang/AgentOS ecosystem. Pure kernel with atomic mechanisms. Four-layer security built-in: sandbox isolation, permission arbitration, input sanitization, audit trail.

Positioning: The thin kernel approach. Provides only mechanisms; frameworks provide policy on top.

OSTack / Kronos / Flowork — the long tail

The ecosystem is expanding with specialized OSes:

Architectural approaches: a taxonomy

Approach 1: The microkernel (OpenFang, XKernel, AgentRT)

A thin kernel provides only atomic mechanisms — process lifecycle, IPC, scheduling, memory boundaries. Agent logic lives in user space. Security enforced at syscall boundaries. Inspired by seL4 and capability-based OS design.

Strengths: Minimal trusted computing base. Strong isolation. Formally verifiable security properties (in theory).

Weaknesses: Higher inter-component latency. More complex to build agents (more boundaries to cross).

Approach 2: The monolithic OS (CortexPrism, Pilox)

A complete stack in one process. Kernel, agent loop, memory, security, tools, UI — all in the same binary. Tight integration between subsystems.

Strengths: Lower latency. Simpler deployment (single binary). Coherent developer experience.

Weaknesses: Larger trusted computing base. Harder to formally verify.

Approach 3: The filesystem OS (OSTack)

Agents coordinate through the filesystem. No explicit message-passing. No orchestrator. The OS is invisible to agents — coordination happens at write time.

Strengths: Minimal agent awareness of the OS. Elegant for certain workloads. Natural fit for Unix-centric teams.

Weaknesses: Limited to filesystem-compatible coordination patterns. Latency depends on storage performance.

Approach 4: The bootable OS (Flowork)

A full operating system that boots from USB. Agents are WASM citizens in a hardened kernel. Sovereign computing — no host OS, no cloud dependency.

Strengths: Maximum isolation. True sovereignty.

Weaknesses: Niche use case. Significant deployment friction.

The OS vs. framework vs. platform distinction

The Knowlee categorization is the clearest:

  • Agent Framework (LangChain, CrewAI, LangGraph): a library for building agents. Your application owns the runtime. Best for single-purpose agents, short-lived tasks, engineering teams that need graph-level control.

  • Agent Platform (Salesforce Agentforce, Microsoft Copilot): a vertical product for a specific domain. Excellent at one thing, opaque to everything else. Best for bounded verticals (sales, support, recruiting).

  • Agent Operating System: the runtime and governance layer across multiple verticals, multiple agents, multiple frameworks. Agents share memory, observability, security, and coordination. Best for fleets of 10+ agents, multi-operator environments, regulated industries.

The Yarnnn prediction: "The agent OS market will look more like macOS-vs-Windows-vs-Linux than like the current 'fifty agent startups' landscape — three or four serious systems, each with an installable application ecosystem."

When an Agent OS is the right call

From the Knowlee decision framework and the Jacar production analysis:

You need an Agent OS when:

  1. You're running 10+ production agents that need to coordinate
  2. Agents must persist across days or weeks with accumulating context
  3. Multiple teams build agents with different frameworks, but need shared memory and observability
  4. Governance and audit requirements are non-negotiable (EU AI Act, SOC 2)
  5. Non-engineers need to operate and oversee agents

You don't need an Agent OS when:

  1. You have fewer than 5 agents with no coordination needs
  2. Your agents are stateless, short-lived, single-purpose
  3. You're in research/experimentation where iteration speed matters more than production reliability
  4. Your problem is a single vertical with no ambition to add a second
  5. You're an engineering team that genuinely needs graph-level control

The OS primitives comparison

FeatureCortexPrismOpenFangPiloxXKernel
LanguageTypeScript/DenoRustGoRust
LicenseApache 2.0MIT/ApacheBSL 1.1Open source
ArchitectureMonolithic OSMicrokernelFederated4-layer microkernel
Memory tiers5 (episodic, semantic, procedural, working, graph)4 (L1-L4 stratification)N/AN/A
Security modelParallax (3-stage gate)16 security systemsZero-trust SPIFFECapability (seL4 model)
SandboxingDocker + capability groupsWASM dual-meteredFirecracker + WASMProcess-level
LLM providers24MultipleMultipleN/A
Plugin systemESM + MCP + WASMAutonomous packagesMCP + A2AFramework translation
ChannelsCLI + Web UI + API40+MultipleN/A
MarketplaceYesNoYes (built-in)No
Binary sizeSingle binary~32MBN/A21 crates

Note: This table reflects publicly documented features as of mid-2026. Some projects are earlier in their development cycles.

The road ahead

The Rankability analysis of 48 months of search demand shows agent-related search growing ~22× from 2022–23 to 2025–26, with the category now transitioning from speculative hype to uneven but real deployment.

The VoltusWave analysis makes the critical point: generic category searches ("what are AI agents") are declining, while specific product and comparison searches ("best AI agent for X," "[Product A] vs [Product B]") are rising. Buyers have moved past the category question and are evaluating specific vendors. This is how markets mature — and it's happening now.

Three trends to watch:

  1. Agent-to-agent protocols. The Agent2Agent (A2A) protocol, MCP, and emerging standards for agent discovery and interop. The OS that supports these protocols natively will have an advantage as multi-agent systems become the norm.

  2. Regulatory integration. The EU AI Act and similar frameworks will require per-agent, per-run, per-data-category audit trails. OSes that embed governance in the runtime — rather than bolting it on — will be the default for regulated deployments.

  3. Specialization vs. consolidation. The ecosystem will likely consolidate around a small number of OSes, but specialized variants will persist for specific constraints: security-first (OpenFang), lightweight/edge, fully offline, enterprise-regulated.

Further reading


CortexPrism is an open-source, Apache 2.0 licensed AI Agent Operating System. Single binary. 5-tier memory. Parallax security. 30 LLM providers. Plugin marketplace. Install in one command.

Related posts

Agent Operating Systems: When They Make Sense and When They Don't
+2

Agent Operating Systems: When They Make Sense and When They Don't

The industry is converging on the Agent OS as the next platform layer. But when is it the right choice — and when is it overengineering? A synthesis of the best thinking from across the agent ecosystem.

不明June 22, 202611 分で読めます4