The Agent Standard Wars: Comparing Developer Platforms for the LLM Era

From AGENTS.md to MCP and Skills.sh, the LLM dev ecosystem is fragmenting. We compare leading platforms, pricing, and protocols to help you choose your stack.

By Giwan Persaud

TL;DR: The “Chat with your Code” era is evolving into the “Agentic Workflow” era. While Skills.sh focuses on portable procedural memory and MCP provides a universal adapter for tools, AGENTS.md has emerged as the simplest way to give agents repository-level context. Your choice depends on whether you value tool interoperability (MCP), procedural automation (Skills), or low-friction documentation (AGENTS.md).


The landscape of LLM-powered development is shifting under our feet. A year ago, we were impressed by simple autocompletion. Six months ago, we were amazed by “Chat with Codebase.” Today, we are in the midst of the Agent Standard Wars.

As developers, we are no longer just choosing an IDE; we are choosing an ecosystem of instructions, protocols, and specialized “skills” that define how our AI partners interact with our work. This post breaks down the three major pillars of this new world: Context Standards, Protocol Layers, and Execution Platforms. We’ll also look at the rising influence of European tools in this space.

1. Context Standards: How Agents “Know” Your Project

Before an agent can help, it needs to understand the “rules of the house.” Two major standards have emerged to solve this:

AGENTS.md: The “README for Robots”

AGENTS.md is a simple, open format designed to give agents a predictable place to find setup commands, code styles, and project-specific quirks. It is intentionally low-tech: just a Markdown file.

  • Target Audience: Every developer who wants to “onboard” an agent to their repo in 30 seconds.
  • Strengths: Zero dependencies, human-readable, and supported by almost every major agent (Cursor, Windsurf, Claude Code, Aider).
  • Weaknesses: It is passive. It doesn’t “do” anything; it just “tells.”
  • Real-World Example: Check out the AGENTS.md in this very repository! It guides agents on how to run our Astro build, where the PWA config lives, and how to verify changes.

How to Structure Your AGENTS.md

The best AGENTS.md files follow a three-phase protocol to ensure high-quality output:

  1. Phase 1: Context & Planning: Instruct the agent to analyze requirements and draft a plan before writing any code.
  2. Phase 2: Implementation: Define your coding standards (e.g., “Use Tailwind first,” “Avoid any in TypeScript”).
  3. Phase 3: Verification: List the mandatory checks, such as npm run build and npm test, that must be performed before the agent marks a task as complete.

[!TIP] Hot Tip: Don’t bloat your main AGENTS.md. Use it as a directory. Use an @import style approach or link to specialized sub-documents (like agent-docs/protocol.md or agent-docs/guidelines.md) to keep the primary instructions lean but powerful.

Skills.sh / SKILL.md: The “Procedural Brain”

While AGENTS.md is a README, Skills.sh (utilizing the SKILL.md format) is a plugin. It allows you to package instructions with executable scripts.

  • Target Audience: Developers building specialized workflows (e.g., “Run this specific DB migration and then verify the SEO tags”).
  • Strengths: Portable, version-controlled, and actionable. It can activate specific logic only when needed, preventing “prompt bloat.”
  • Pricing: Open standard, with a community registry at Skills.sh.
  • More Detail: We’ve previously covered how to leverage agent skills for web development.

Why Use Skills?

Skills excel when your workflow requires a specific sequence of “thought” and “action.” Instead of hoping the agent remembers to check for broken links after a build, you create a skill that forces that check. It transforms the LLM from a generalist into a specialized artisan for your codebase.

[!TIP] Hot Tip: Use Skills.sh to codify “Tribal Knowledge.” If there’s a weird bug that only happens on M1 Macs when the Docker cache is full, don’t just put it in a README. Write a skill that detects the environment and provides the exact fix automatically.

2. The Protocol Layer: MCP (Model Context Protocol)

Introduced by Anthropic, MCP is the “Universal Serial Bus” (USB) of the AI world. Instead of writing a custom integration for every tool, you build an MCP server once, and any MCP-compatible client (like Claude Desktop or Cursor) can use it.

  • Features: Real-time data access (Google Drive, Slack, Postgres) and tool execution.
  • Case Study: A developer at a fintech startup used an MCP server to let Claude query their internal staging database directly. This turned a 20-minute manual data lookup into a 5-second chat query.
  • Beginner Tip: Use Smithery.ai to find and install pre-built MCP servers without writing code.

The Security Edge

MCP servers run locally. This means your sensitive API keys or database credentials never need to be sent to the LLM provider. The agent only gets the output of the tool, keeping your “secrets” within your own infrastructure.

[!TIP] Hot Tip: Use the MCP Inspector. It’s a dedicated debugger for MCP developers that lets you test your servers in isolation before connecting them to a complex agent. It saves hours of troubleshooting when your tool isn’t responding as expected.

3. Execution Platforms: Where the Work Happens

The choice of IDE or CLI is where most developers feel the “cost” and “friction” of these tools. Here is how the leaders stack up as of March 2026:

PlatformKey FeaturePricing (Pro/Max)Best For
AI-First IDEs
Cursor”Composer” (multi-file edits)$20/moThe “Gold Standard” for AI IDEs.
Windsurf”Cascade” engine & Codemaps$15/moDeep context in large codebases.
Agentic CLIs
Claude CodeTerminal-native agent$20-$200/moFast, autonomous CLI workflows.
Gemini CLIReAct loop & 1M contextFree / Usage-basedDeep research and massive analysis.
Mistral Vibe 🇪🇺Minimalist “vibe-coding”Usage-basedOpen-source, EU-based workflows.

User Sentiment & Reviews

  • Cursor: Remains the favorite for its polish and UI. However, some “power users” complain about the lack of a higher-tier plan to bypass usage limits on the newest models.
  • Windsurf: Praised for its speed and its “Codemaps” feature, which many users find superior to Cursor’s indexing for massive legacy repos.
  • Claude Code: Polarizing. Developers love the speed and the “unfiltered” access to Claude 3.7 Sonnet, but the usage-based pricing can lead to “bill shock” if not monitored.
  • Gemini CLI: The research powerhouse. It stands out for its massive 1-million-token context window, though some developers find its “agentic” capabilities slightly less aggressive than Claude’s.
  • Mistral Vibe 🇪🇺: France’s Mistral AI has entered the fray with a minimalist, open-source (Apache 2.0) CLI agent. For developers concerned about data sovereignty and European hosting, Vibe combined with Mistral’s EU-based infrastructure is becoming a strong alternative to US-based incumbents.

[!TIP] Hot Tip: Don’t just treat these as chat windows. The “killer feature” of Cursor (Composer) and Windsurf (Cascade) is their structural editing engine. They can refactor code across multiple files while maintaining import integrity—something a standard terminal-based agent often struggles with.

Strategic Advice: Which Should You Use?

For Beginners

Start with Cursor and add an AGENTS.md to your project. It’s the lowest barrier to entry. Copy a template from agents.md and customize it with your npm commands.

[!TIP] Hot Tip: Use the “Explain” feature religiously. When an agent changes code, ask it why it chose that specific pattern. It’s the fastest way to level up your own senior engineering skills.

For Experienced Developers

Build your own MCP servers for your internal tools and start codifying your complex deployment steps into Skills. This moves you away from “pair programming” and toward “engineering management” of a fleet of specialized subagents.

[!TIP] Hot Tip: Treat your AI instructions as Tier-1 code. Version control your AGENTS.md and SKILL.md files, and include them in your PR reviews. If the instructions are stale, the agent will hallucinate.

For Solo-Preneurs

If you are budget-conscious, Goose (open source) or Mistral Vibe paired with Mistral’s competitive pricing are excellent starting points. Pair them with Skills.sh to ensure you aren’t wasting tokens on repetitive explanations.

[!TIP] Hot Tip: Leverage “Context Pinning.” Most IDEs allow you to pin specific files or documentation. Pin your architectural ADRs (Architecture Decision Records) so the agent always stays within your established technical boundaries.


Learn More

Ready to dive deeper? Check out these resources:

The “standard wars” are far from over, but the winners will be the protocols that remain open and the tools that stay out of the developer’s way. Whether you are writing a simple AGENTS.md or a complex MCP server, you are ultimately doing one thing: teaching the machine how to be your best colleague.