⌁ Agent ecosystem intelligence

Signals, updates, and field notes for the AgentOS era.

AgentOS Watch tracks OpenClaw, Hermes, Mercury, Claude Code, skills, workflows, social validation, and risks — then turns noisy updates into practical decisions.

Last scan May 4 · 05:30 UTC Sources X · GitHub · V2EX · ClawHub Processed 27 → 5 published

Today’s radar feed

Five-minute scan: what changed, why it matters, and what to do next.

All signals →
2026-05-03 Product update Watch

OpenClaw 2026.5.2 ships provider, plugin, gateway, and channel fixes

The release claims sturdier plugin installs, leaner gateway hot paths, multi-channel fixes, and voice/TTS polish. Community feedback also flags context overflow and excessive tool-use regressions, so upgrade deliberately.

ImpactEmerging Sources4 ConfidenceB+

Next: Read the changelog, scan issue threads, and test your existing workflows before upgrading production agents.

2026-05-04 Skill Try

Agent skills are exploding; discovery is now the bottleneck

ClawHub, Agent Skills, and community awesome lists show a large and fast-growing skill ecosystem. The opportunity is not another raw directory; users need scenario-based curation, risk notes, and install guidance.

ImpactMedium Sources6 ConfidenceB+

Next: Start with scenario packs: research brief, browser automation, GitHub workflow, content studio, inbox ops, and security.

2026-05-04 New product Watch

Mercury positions itself as an always-on, permission-hardened personal agent

Mercury combines markdown-owned identity, Telegram/CLI channels, daemon mode, scheduled tasks, tool permissions, and token budgeting. The concept matches a real user need: persistent personal agents that do not silently overreach.

ImpactEmerging Sources3 ConfidenceB

Next: Track real usage reports and compare reliability, memory behavior, and permission boundaries against OpenClaw/Hermes.

2026-05-02 Community feedback Watch

Users are asking which personal agent is actually usable

A V2EX discussion captures the core market pain: many agents feel like LLM + tools + skills + IM, but users worry about instability, memory resets, and unreliable workflows.

ImpactEmerging Sources4 ConfidenceB+

Next: Create product scorecards focused on reliability, memory persistence, permission safety, and workflow repeatability.

Product tracker

Scorecards for momentum, reliability, skill ecosystem, memory, and risk.

Local personal agent OSWatch

OpenClaw

A locally running, multi-channel AI assistant with tools, skills, gateway routing, and broad provider support.

Momentum4/5
Reliability3/5
Skills5/5
Memory3/5
Risk3/5

Best for: Power users who want a self-hosted agent connected to real apps and communication channels.

Watch: Fast releases can introduce regressions; validate context handling, tool-use behavior, and channel reliability.

Learning / memory-oriented agentWatch

Hermes

Often discussed as an agent with stronger learning and memory loops.

Momentum5/5
Reliability3/5
Skills4/5
Memory5/5
Risk4/5

Best for: Users exploring self-improving or memory-heavy agent behavior.

Watch: Memory systems can become opaque; evaluate inspectability and recovery before relying on it.

Always-on Telegram/CLI agentWatch

Mercury

A soul-driven agent with permissioned tools, token budgets, scheduled tasks, and daemon mode.

Momentum4/5
Reliability2/5
Skills2/5
Memory3/5
Risk3/5

Best for: Users who want a persistent personal agent reachable from Telegram or CLI.

Watch: Needs real-world validation around reliability, security defaults, and ecosystem depth.

Coding agentAdopt

Claude Code

A mature coding agent increasingly connected to reusable Agent Skills workflows.

Momentum5/5
Reliability4/5
Skills5/5
Memory3/5
Risk2/5

Best for: Software development, code review, refactoring, debugging, and structured implementation tasks.

Watch: Skill sprawl becomes real when multiple agents share different folders and conventions.

Skills watch

Scenario-based skill packs instead of raw directories.

Skill Packresearch-brief

Research & daily brief

Collect sources, summarize changes, and produce daily/weekly intelligence.

Risk note: Watch for hallucinated citations and stale sources.

Skill Packbrowser-automation

Browser automation

Turn repeated browser workflows into reusable agent skills or scripts.

Risk note: Permission boundaries and account safety matter more than speed.

Skill Packgithub-workflow

GitHub issue workflow

Triage issues, draft PRs, summarize reviews, and monitor CI.

Risk note: Never auto-merge or push without explicit review.

No noise

5 signals max, every brief.

Daily Brief for fast scanning. Weekly Field Guide for deeper adoption notes, skill picks, and risk summaries.