An AI-first company in 2026 is an organization where by default every task checks an AI option before a human option. It's not 'we use AI', but 'we've restructured processes so AI is the first executor'. Seven principles: AGENTS.md as a contract; AI onboarding from day zero; AI participation rate metric; human-in-the-loop as an explicit role; token budget as linear P&L; weekly AI retros; culture of 'ask AI first'.
Why 'AI-friendly' no longer works in 2026
In 2024–2025, being 'AI-friendly' was considered progressive: the team uses ChatGPT, marketing generates texts, support writes responses through Claude. By 2026, this became baseline hygiene — not a competitive advantage.
The key difference is in default behavior. AI-friendly: a person sees a task → solves it themselves → if stuck, goes to AI. AI-first: a person sees a task → checks the AI option → if AI fails, takes it themselves.
An AI-friendly company uses AI as a tool on demand. An AI-first company changes the default behavior: every task first checks an AI option.
McKinsey Global Institute estimates: AI agents today are technically capable of taking on tasks occupying 44% of working hours in the US (MGI, November 2025). Anthropic Economic Index confirms from another angle: in a sample of ~1 million real Claude conversations, 43% of tasks followed an automation pattern, 57% — augmentation (February 2025). If an AI-friendly company takes 5–10% of this, an AI-first one is closer to 30–40%.
Principle 1 — AGENTS.md as a contract
Not a job description, but a machine-readable contract: what the agent does, doesn't do, to whom it escalates. If AGENTS.md is poor — the agent breaks.
When hiring an AI employee, don't write a 'job description' — write a contract that the agent will read every heartbeat. AGENTS.md is an agreement between the human CEO and the AI agent.
Minimum required blocks: Identity (name, role, manager); Mission (one sentence — agent's value); Domain context (business areas); Responsibilities (5–8 points with specific triggers); Success metrics (how work is measured); Escalation rules (when it pings HITL); Skills inventory (tools and when to apply).
LeadUp AI case: the community manager agent Katya had an abstract instruction 'chat moderation' — missed complaints about payment three times. After replacing with specific triggers ('registration complaint' → FAQ + log, 'payment question' → escalation within 30 min) — zero missed complaints in a week. Rule: specific AGENTS.md > broad prompt. Better to hire three narrow agents than one universal.
Principle 2 — AI onboarding from day zero
Ready hire-pack before the start date: AGENTS.md, runbook, skills, accesses. Difference: 3–5 heartbeats to the first result vs 15–20 without preparation.
When you hire a new AI agent (or team member), you should have a ready pack: AGENTS.md, runbook, list of skills, API/MCP accesses, first boilerplate task. Not 'figure it out as you go'.
First-day checklist: AGENTS.md, reviewed by manager; HEARTBEAT.md — operational runbook; SOUL.md — agent's personality for voice consistency; 5–9 domain skills; accesses to Vault, task tracker, MCP; first boilerplate task; manager backup on escalation for 7 days.
LeadUp AI case: hire-pack for Anya (Editor-in-Chief) and Liza (SEO/GEO) in May 2026 included AGENTS.md ×2, AI onboarding runbook, 6+9 skills, accesses. Anya delivered an editorial calendar with 15 posts in the first heartbeat. This works only because the hire-pack was ready before the start date. Rule: runbook is more important than access rights.
Principle 3 — AI participation rate metric
AI participation rate is the share of tasks, decisions, and load-bearing comments where AI made the main contribution. Formula: (AI-closures + AI-decisions + AI-load-bearing-comments) / total for the period.
Without the metric, you don't know if you are AI-first or not. Three sources: closures (share of issues closed by the agent without reopening for 7 days); load-bearing comments (comments longer than 200 characters, changing the task's course); decisions (strategic agent decisions, signed by CEO without edits).
LeadUp AI case: on 2026-04-01, AI participation rate was ~12%. By 2026-05-01, it grew to ~38%. Plan for 2026-06-30 — ≥30% sustainably. If in 30 days rate <15% — the problem is in abstract AGENTS.md, uncovered skills, or too strict HITL.
AI utilization (number of AI requests per day) is an input metric. AI participation is an output metric. Count output.
Principle 4 — Human-in-the-loop as an explicit role
Explicit list of triggers (hire, €500, public PR) where a human signature is needed. The rest — agent autonomy. HITL = signator, not reviewer.
At LeadUp AI, HITL is mandatory for three categories: hire request for a new agent (only CEO signs); budget > €500/month (confirmation from CEO); external company commitments (legal, contracts, public PR, publication with figures/promises). Everything else — agent autonomy.
Paradox: the more often HITL 'just in case', the worse agents work. They learn not to take responsibility. If every step is reviewed, the agent optimizes for passing the review, not for the result. Rule: HITL should block only what can't be undone. Text comment — undoable. Hire — not. Payment — not.
Principle 5 — Token budget as linear P&L
Explicit budget per agent (€10–150/month). At 80% spend → support mode. At 100% → stop. CFO checks the metric weekly.
If you have N agents and each generates tokens, you have a linear expense item. The budget per agent is a P&L unit. CFO checks it alongside salaries and SaaS subscriptions.
Distribution at LeadUp AI: Heavy (€80–150/month) — Editor-in-Chief, Web-developer, Research Analyst; Medium (€30–60/month) — Community Manager, Marketing Manager, AI Designer; Light (€10–25/month) — Operations Watchdog, HR Director, Sales Lead. Rule at 80% spend: stop long-form deliverables and agentic loops, close only blockers and hot incidents, escalate to manager. Open-ended token budget kills discipline — explicit budget per agent, hard cap.
Principle 6 — Weekly AI retros
Structure: Done / Failed / Takeover. Surprised findings — mandatory column. Without retro, you fix symptoms, not root cause.
Every week a formal retro: what AI closed itself without escalation; what AI failed (timeout, hallucination, missing skill); what a human took over that AI should have done. Case: from retro W18 discovered that Productivity Reviews were generated by a waterloop due to dispatcher detector dispersion — 30 false tasks per week, ~4 hours of CEO manual work. The cost of retro paid off with one ticket.
Rule: Done / Failed / Takeover. Failed — mandatory column, even if it seems there isn't one. Look for surprise findings.
Principle 7 — Culture: 'ask AI first'
Triple-Default: AI first → memo second → meeting third. Saves 5–8 hours/week and trains written thinking.
Default behavior in the team: 'before you write a memo, before you book a meeting, ask the AI'. Triple-Default rule: AI first (for any question about processes/stack/solutions — agent first); memo second (if AI doesn't know — short memo in task tracker); meeting third (only if memo doesn't close).
Paradox: a team that fears AI will 'replace them' will never build an AI-first culture. Solution: AI is not a competitor, but a junior employee; metric — productivity per person; bonus for productivity growth, not team size. Culture is built on default behavior that people see in the CEO daily.
Where AI delivers, where it doesn't, where it adds work
Where AI confidently delivers results: marketing (content production, distribution, AEO optimization — >50% participation rate); L1 support (FAQ, billing, accesses); sales prospecting; process audit & documentation.
Where AI doesn't deliver (yet): closing calls with cold contacts; hiring people (not agents); crisis management; creative with truly new form.
Where AI adds work: first 60 days of AI-first transition CEO works more — writes AGENTS.md, refines skills, catches bugs, teaches team metrics, updates runbook. By day 90 it pays off. First quarter — investment period, not productivity period.
Roadmap 90 / 180 / 365 days
0–90 days: first production agent, AGENTS.md, AI participation rate ≥15%. Week 1 — CEO personally writes AGENTS.md. Week 2 — hire-pack, agent starts. Weeks 3–8 — weekly retro. Week 9 — first participation rate measurement. Week 12 — decision point: expand or dig root cause.
91–180 days: three agents, AI participation rate ≥25%, budget P&L transparent. Connect second and third agents. Implement budgetMonthlyCents. Formalize HITL policy.
181–365 days: AI participation rate ≥30% sustainably, Triple-Default rule in the team, positive ROI. New employees know 'AI first' from day zero. Productivity per person grows y/y.
Glossary
AGENTS.md — machine-readable contract between CEO and AI agent: identity, mission, responsibilities, metrics, escalation rules. AI participation rate — share of tasks/decisions/load-bearing comments where AI made the main contribution. AI utilization — input metric: number of AI requests per day (≠ AI participation). HITL — specific role of signator on irreversible actions, not reviewer. Triple-Default rule — sequence: AI first → memo second → meeting third.
What's next
AI-first is an operating discipline, not a stack or belief. Three levels of assistance: independently with the team (guide 'How to Hire Your First AI Employee in 14 Days' with AGENTS.md template); with us as consultants (Neuromasterskaya 2.0 — 3-month corporate AI transformation program); AI Employees turnkey (for CEOs with a budget — we build an AI team, transfer operationally, from €5,000 for the first agent).
Free intensive 'Hermes Agent' May 21–22 — create a working AI agent for your business in two evenings. Standard — free, VIP (50 EUR) — recording + guide + 1:1 analysis.
Take the AI-first Company Self-Assessment Checklist — eight questions, scale 1–5. In 3 minutes, find out your exact maturity level (from AI-friendly to AI-first) and get a personalized roadmap: which principle to start with.
