APIs connected software to software. Skills connect human intent to agent execution. With 349+ published skills, an open standard adopted by 20+ platforms, and a progressive disclosure architecture that solves the context window problem — the SKILL.md file is becoming the universal interface layer for agentic development.
There are 349 agent skills catalogued across 12 categories as of this week. The most-installed skill — Vercel's find-skills — has over 418,000 installs. The same skill file works in Claude Code, GitHub Copilot, Cursor, OpenAI Codex, Gemini CLI, and 15 other platforms.
Three months ago, that number was roughly 50.
The pattern here is the same one that played out when REST APIs displaced SOAP in the late 2000s and early 2010s: a simpler interface format wins because it reduces the cost of integration to near zero.
Skills are doing that for agentic development. A SKILL.md file is becoming the universal interface between what a human wants and what an agent does.
## What a Skill Actually Is (And Isn't)
A skill is a folder. That's it. A folder containing a SKILL.md file with YAML frontmatter and markdown instructions, plus optional scripts, reference docs, and templates.
Here's an example — a deployment skill for a staging environment:
```yaml
---
name: deploy-staging
description: "Deploy the current branch to the staging environment.
  Use when the user says 'deploy to staging', 'push to staging',
  or 'ship it to staging'. Requires all tests to pass first."
---
```
```markdown
# Deploy to Staging

## Prerequisites

- All tests must pass before deploying
- Current branch must be up to date with main

## Steps

1. Run the full test suite: `bun run test`
2. Build the production bundle: `bun run build`
3. Run the staging deploy script: `bash scripts/deploy-staging.sh`
4. Verify the deployment health check: `curl -f https://staging.example.com/health`
5. Report the deployment URL and health status

## On Failure

- If tests fail: report which tests failed and stop
- If build fails: check for TypeScript errors first
- If deploy fails: run `bash scripts/rollback-staging.sh` and report
```
That's a complete skill. No SDK. No authentication flow. No client library. No versioning scheme. No rate limiting configuration. No webhook setup.
Compare this to what it takes to integrate a traditional API: read the docs, get API keys, install the client library, handle authentication, write the integration code, handle errors, manage rate limits, set up monitoring. That's a week of work for a senior engineer, minimum. And at the end, you have a point-to-point integration that works with exactly one service.
A skill takes about fifteen minutes to write and works with every agent that supports the standard.
*A skill replaces complex SDK integrations with a simple markdown file*
## The Architecture That Makes This Work
Skills aren't just convenient — they solve a fundamental technical problem that traditional APIs never had to think about: context windows.
An AI agent can't load every possible instruction into memory at startup. If you had fifty skills, each averaging 2,000 tokens, you'd burn 100,000 tokens of context before the agent processed a single user request. That's not a theoretical concern — it's the reason most early agent frameworks felt sluggish and expensive.
Anthropic solved this with a three-stage progressive disclosure architecture:
Stage 1 — Metadata (always loaded): Every skill's name and description sits in Claude's context at all times. This costs roughly 100 tokens per skill. With fifty installed skills, that's 5,000 tokens — negligible. Claude uses this metadata to decide which skills are relevant to the current task.
Stage 2 — Instructions (loaded on demand): When Claude determines a skill is relevant, it reads the full SKILL.md from the filesystem. The instructions enter context only when needed. This is the key architectural decision — skills exist as files on disk, not as API payloads or embedded configurations. Claude uses bash to read them, the same way it reads any other file.
Stage 3 — Resources (loaded during execution): If the skill's instructions reference scripts, templates, or documentation in subdirectories, Claude reads those files as it executes the skill. A deployment skill might reference scripts/deploy-staging.sh — that script never enters context unless the skill is actually running.
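The three stages can be sketched with ordinary filesystem tools, which is essentially how the article says Claude operates. This is an illustrative sketch under stated assumptions, not Anthropic's implementation; the `skills/` directory layout and the helper names are invented for the example:

```shell
# Sketch of the three-stage loading model using plain filesystem
# tools. Paths and function names are illustrative assumptions.

SKILLS_DIR="${SKILLS_DIR:-skills}"

# Stage 1: collect only the cheap metadata (name + description)
# from every installed skill's frontmatter.
list_skill_metadata() {
  for f in "$SKILLS_DIR"/*/SKILL.md; do
    [ -f "$f" ] || continue
    grep -E '^(name|description):' "$f"
  done
}

# Stage 2: load one skill's full instructions, on demand.
load_skill() {
  cat "$SKILLS_DIR/$1/SKILL.md"
}

# Stage 3: read a referenced script or doc only during execution.
load_resource() {
  cat "$SKILLS_DIR/$1/$2"
}
```

The point of the sketch is the cost profile: stage 1 touches a couple of lines per skill, while stages 2 and 3 read full files only for the one skill actually in play.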
This is a fundamentally different architecture from a REST API call. An API is stateless and atomic — you send a request, you get a response. A skill is contextual and progressive — the agent loads exactly what it needs, when it needs it, and discards it when it's done.
The directory structure looks like this:
```
my-skill/
├── SKILL.md              # Core instructions (required)
├── scripts/
│   ├── deploy.sh         # Executable automation
│   └── healthcheck.py    # Validation scripts
├── references/
│   └── runbook.md        # Context docs loaded on demand
└── assets/
    └── config.template   # Templates and static files
```
Everything in `scripts/` runs via bash. Everything in `references/` gets read into context only when Claude needs it. Everything in `assets/` gets used as raw material. The `SKILL.md` file orchestrates all of it.
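A quick way to scaffold that layout (the directory names match the tree above; the files start empty and get filled in by hand):

```shell
# Scaffold the skill layout shown above. All files begin as empty
# placeholders; SKILL.md carries the actual instructions.
mkdir -p my-skill/scripts my-skill/references my-skill/assets
touch my-skill/SKILL.md \
      my-skill/scripts/deploy.sh \
      my-skill/references/runbook.md \
      my-skill/assets/config.template
```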
*Progressive disclosure architecture loads only what the agent needs*
## Real Skills in Production
The theory is nice. Here's what this looks like in actual daily use.
### Infrastructure Guardrails
Here's another example — a pre-deployment checklist for Kubernetes clusters:
```yaml
---
name: k8s-pre-deploy
description: "Kubernetes pre-deployment validation. Use before any
  kubectl apply, helm upgrade, or deployment to staging/production.
  Triggers on 'deploy', 'rollout', 'ship to prod'."
---
```
```markdown
# Kubernetes Pre-Deploy Validation

Before any deployment, run these checks IN ORDER:

1. Verify cluster context: `kubectl config current-context`
   - STOP if context is a production cluster and the user didn't explicitly say "production"
2. Check resource quotas: `kubectl describe quota -n $NAMESPACE`
3. Validate manifests: `kubectl apply --dry-run=client -f $MANIFEST`
4. Check for image pull secrets: `kubectl get secrets -n $NAMESPACE | grep regcred`
5. Run the canary health check: `bash scripts/canary-check.sh`

## Hard Stops

- Never deploy to the production namespace without explicit user confirmation
- Never apply manifests that remove PersistentVolumeClaims
- If any check fails, report the failure and STOP — do not proceed
```
This skill replaced a process that depended on human memory. The instructions are deterministic. The agent runs them every time, in order, without skipping steps because it's Friday afternoon and the engineer wants to go home.
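The `scripts/canary-check.sh` referenced in the last step isn't shown above; one way to sketch its retry loop is below. The function name, the attempt count, and the idea of injecting the probe as a command string are all assumptions, not the real script:

```shell
# Hypothetical sketch of a canary check's retry loop. The probe is
# passed in as a command string so the retry logic stays testable.
canary_check() {
  local probe="$1" attempts="${2:-3}"
  for i in $(seq 1 "$attempts"); do
    if sh -c "$probe" > /dev/null 2>&1; then
      echo "canary healthy (attempt $i)"
      return 0
    fi
    sleep 1
  done
  echo "canary failed after $attempts attempts"
  return 1
}
```

Invoked as, say, `canary_check "curl -fsS https://staging.example.com/health"`, it reports success on the first healthy probe and fails once the retries are exhausted.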
### Code Review Standards
Another example — a code review skill that encodes a team's review criteria:
```yaml
---
name: review-pr
description: "Review a pull request against team engineering standards.
  Use when the user asks to review a PR, check code quality,
  or says 'review this'."
---
```
```markdown
# Pull Request Review

## Review Checklist

1. Read the PR description and linked issue
2. Check test coverage: `bun run test --coverage`
   - Flag any file with <80% line coverage
3. Check for security issues:
   - No hardcoded secrets, API keys, or tokens
   - No SQL string concatenation
   - All user input validated
4. Check for performance:
   - No N+1 query patterns
   - No unbounded loops over user-provided data
5. Verify error handling:
   - All async operations have error handling
   - No swallowed exceptions

## Output Format

1. Rate the PR: APPROVE, REQUEST_CHANGES, or NEEDS_DISCUSSION
2. List specific findings with file:line references
```
This isn't revolutionary technology. It's a checklist in a markdown file. But the impact is significant because the agent executes it consistently, and the checklist is version-controlled alongside the code it reviews. When our standards change, we update the skill, and every engineer's agent gets the update immediately.
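One item on that checklist, the hardcoded-secrets scan, is mechanical enough to live in the skill's `scripts/` directory. Here is a minimal sketch; the patterns are illustrative assumptions, nowhere near an exhaustive ruleset:

```shell
# Sketch of a secrets scan over the given files. Patterns are
# illustrative: secret-ish names assigned a quoted literal value.
scan_for_secrets() {
  # grep exits 0 on a match, so invert: a hit means the check fails.
  grep -inE "(api[_-]?key|secret|token|password)[[:space:]]*[:=][[:space:]]*['\"][A-Za-z0-9_-]{8,}" "$@" \
    && return 1 || return 0
}
```

A real ruleset would also cover base64 blobs and provider-specific key prefixes, but even this crude version catches the most common mistake: a literal credential committed next to its variable name.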
*Skills in production replace processes that depend on human memory*
## The Open Standard Changes Everything
On December 18, 2025, Anthropic published Agent Skills as an open standard at agentskills.io. This is the move that turns skills from a Claude Code feature into an industry-wide interface layer.
The standard is governed by the Agentic AI Foundation under the Linux Foundation, with Anthropic, OpenAI, and Block as co-founders, and AWS, Google, and Microsoft as platinum members. That's not a single-vendor initiative — that's the entire industry agreeing on a format.
What this means in practice: a skill you write for Claude Code works in GitHub Copilot. It works in Cursor. It works in OpenAI Codex CLI. It works in Gemini CLI. It works in VS Code. The SKILL.md format is the same everywhere.
This is the moment that made me think "this is the new API." Here's why:
Traditional APIs are point-to-point integrations. You integrate with Stripe's API to process payments. You integrate with Twilio's API to send SMS. Each integration is bespoke. Each requires its own authentication, its own client library, its own error handling.
Skills are one-to-many interfaces. You write a deployment skill once. Every agent on every platform that supports the standard can execute it. The skill doesn't care which agent is running it — it's just instructions and scripts. The agent doesn't care who wrote the skill — it's just a SKILL.md file with a standard format.
This is the same pattern that made REST APIs dominant over SOAP. Not because REST was technically superior in every dimension, but because it was simple enough that the cost of adoption dropped to near zero. Anyone who could write a URL and parse JSON could use a REST API. Anyone who can write markdown and bash scripts can write a skill.
*The open standard means one skill works across every platform*
## Building Your First Production Skill
If you haven't written a skill yet, start here. This is a 10-minute exercise that will change how you think about agent-assisted development.
Step 1: Identify a repetitive workflow. Something you do at least weekly that involves multiple steps. Setting up a new service. Running a specific type of analysis. Onboarding a new team member to a codebase.
Step 2: Create the directory.
```bash
mkdir -p skills/my-first-skill
```
Step 3: Write the SKILL.md.
```yaml
---
name: my-first-skill
description: "What this skill does and when to use it.
  Be specific about trigger phrases."
---
```
Then write the instructions in plain markdown. Be specific. Include actual commands. Specify what to do when things fail.
Step 4: Test it. Open Claude Code (or your agent of choice) and invoke it with /my-first-skill. Watch what Claude does. Notice where it deviates from what you expected. Refine the instructions.
Step 5: Add scripts for anything complex. If a step involves more than a one-liner, move it to scripts/ and reference it from the instructions. This keeps the main skill file focused on the workflow, not the implementation.
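As an example of the kind of step worth extracting, here is a hypothetical `scripts/check-artifacts.sh` helper; the function name, the `dist/` path, and the bundle check are all invented for illustration:

```shell
# Hypothetical post-build validation: confirm the build actually
# produced output before a deploy step runs. Paths are illustrative.
check_artifacts() {
  local dist="${1:-dist}"
  [ -d "$dist" ] || { echo "missing build directory: $dist"; return 1; }
  # Require at least one JS bundle in the build output.
  ls "$dist"/*.js > /dev/null 2>&1 || { echo "no bundles in $dist"; return 1; }
  echo "artifacts ok"
}
```

In the actual script the function would be invoked at the bottom, and SKILL.md would reference it with a single line like `bash scripts/check-artifacts.sh`, keeping the workflow readable and the logic testable.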
Two things to keep in mind when authoring:
Keep SKILL.md under 5,000 words. If it's longer, Claude is spending too much context loading it. Move detailed reference material to references/ and let Claude load it on demand.
The description field is the most important line in the file. Claude uses it to decide whether to load the skill. If the description is vague, the skill won't trigger when it should. If it's too broad, it'll trigger when it shouldn't. Include specific phrases and contexts, not just a generic summary.
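To make that concrete, here are a vague and a specific version of the same field. The vague one is an invented counterexample; the specific one is the deploy-staging description from earlier:

```yaml
# Too vague: gives the agent nothing to match trigger phrases against.
description: "Helps with deployments."
---
# Specific: names the action, the environment, and the exact phrases.
description: "Deploy the current branch to the staging environment.
  Use when the user says 'deploy to staging', 'push to staging',
  or 'ship it to staging'. Requires all tests to pass first."
```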
### Let Claude Build It for You
If writing a skill from scratch feels like too much work, there's a shortcut: the `/skill-creator` command. It's a built-in skill that walks you through the entire process — from capturing intent to writing the SKILL.md to running test cases and iterating on the output. You describe what you want the skill to do, and it drafts it, tests it, and refines it based on your feedback. It even optimizes the description field for better triggering accuracy. For most workflows, this is the faster path.
*Building your first skill is a 10-minute exercise that changes how you work*
## Where This Goes Next
The skill ecosystem grew from 50 to 349 published skills in three months. Community collections like VoltAgent's awesome-agent-skills track over 1,000 skills across every major platform. Anthropic, Vercel, Stripe, Cloudflare, Sentry, Figma, and Hugging Face have all published official skills.
The trajectory is clear, and it mirrors what happened with APIs over the past decade: first the format stabilizes, then the ecosystem explodes, then the format becomes the assumed interface between systems.
We're in the ecosystem explosion phase right now. The format has stabilized. The governance is in place. Twenty platforms support it. The question isn't whether skills will become the standard interface between humans and agents — it's what you'll build with them while most developers are still copy-pasting prompts into chat windows.
The teams that are writing skills today are encoding their operational knowledge into portable, version-controlled, agent-executable formats. That knowledge compounds. Every skill you write makes your agents more capable and your team more consistent. Every skill you don't write is a workflow that depends on someone remembering the right steps in the right order.
APIs let software talk to software. Skills let humans talk to agents. The interface layer has shifted, and the developers who understand that shift first will build the most capable systems.