TechLevity
← Back to Insights
ai strategydigital transformationleadershipbusiness growth

How CIOs Close the Gap Between AI Investment and ROI

·9 min read

48% of AI leaders measure revenue impact vs 27% of others. Learn the token-level P&L framework that turns £186M AI budgets into measurable business value.

How CIOs Close the Gap Between AI Investment and ROI

TL;DR

Only 48% of AI leaders can measure revenue impact from their investments, compared to 27% of other organizations • UK enterprises invest £186M on average in AI initiatives over three years, but 95% can't scale AI with measurable P&L impact • Token-level P&L frameworks replace traditional CAPEX thinking with OPEX measurement for agentic AI systems • Four-layer ROI measurement tracks token costs, human displacement, revenue attribution, and compound value creation over time

How CIOs Close the Gap Between AI Investment and ROI

Excerpt: Most firms can measure their AWS bill but not their AI impact. Here's the token-level P&L framework that changes everything.

Your board approved £186 million for AI transformation. Six months later, they're asking where the money went. You can't answer because you're measuring the wrong things — a pattern we see repeatedly in why AI projects fail.

48% of AI leaders feel confident measuring revenue impact from their investments. Only 27% of everyone else can say the same. The gap isn't about tools or talent. It's about treating agentic AI like enterprise software when it behaves like a utility.

Most CIOs can't measure AI ROI because they're still thinking in CAPEX. Agentic AI is OPEX on steroids — you need a token-level P&L.

Why Do Most AI Investments Disappear Into a £186 Million Black Hole?

KPMG's latest research exposes the measurement crisis. UK enterprises plan to invest an average of £186 million in AI initiatives over the next three years. Yet only 5% have scaled AI with measurable P&L impact, according to EY's business functions study.

The maths doesn't work. Firms are pouring millions into AI whilst admitting they can't track where it lands. It's like buying petrol without a fuel gauge.

The problem isn't the technology. 95% of organisations have an AI strategy. 51% are exploring AI agents, 37% are piloting them. The infrastructure exists. The budget exists. The ROI visibility doesn't.

Traditional IT measurement breaks down because AI doesn't behave like traditional IT. Your ERP system costs £100K annually regardless of usage. Your AI system costs £100 for every million tokens processed. When your agents get better at their jobs, they consume more tokens. Success increases cost.

This is Jevon's Paradox in action: as token costs decrease, total consumption explodes. Deloitte found that Google processes 1.3 quadrillion tokens monthly. Unit economics improve whilst total spend accelerates — a dynamic explored in depth in our token economics analysis. Your finance team isn't equipped for this.

The Token Economics Reality

Most CIOs track AI spend like software licences. Per-seat costs. Annual contracts. Predictable monthly bills. Token-based AI pricing demolishes this model.

Here's what actually happens: You deploy an AI agent that processes customer support tickets. Week one, it handles 100 tickets using 500K tokens. Week four, it's handling 300 tickets using 2M tokens — because it's getting better at understanding context and providing detailed responses.

Your CFO sees AI costs tripling. Your customer success team sees resolution time dropping 60%. Traditional IT metrics show failure. Business metrics show success.

The 5% of companies measuring P&L impact track different numbers — numbers aligned with a mature AI SDLC maturity framework:

  • Token cost per business outcome (not token cost per month)
  • Human time saved per token consumed (productivity multipliers)
  • Revenue per token in customer-facing agents (direct value creation)

They've moved from cost accounting to value accounting. The shift isn't semantic — it's strategic.

The Four-Layer ROI Framework

The companies that crack AI measurement use a four-layer framework. Each layer tracks different economics.

Layer 1: Token-Level Cost Management

Track consumption patterns by agent, by task, by time of day. Not because you want to optimise tokens — because you want to understand which agents create value and which create noise.

Baseline metrics:

  • Cost per resolved customer ticket
  • Tokens per sales conversation
  • Processing cost per document analysis

Map token usage to business outcomes. An agent that uses 50K tokens to qualify a £100K sales lead has different unit economics than one using 50K tokens to answer FAQ questions.

Layer 2: Human Displacement Measurement

Calculate the FTE value of work that agents now handle autonomously. Don't count time saved — count complete handoffs.

Displacement calculation:

  • Document review: 40 hours/week → 2 hours/week = 0.95 FTE displaced
  • Initial customer screening: 15 hours/week → 1 hour/week = 0.35 FTE displaced
  • Contract analysis: 20 hours/week → 3 hours/week = 0.43 FTE displaced

Multiply displaced FTEs by fully-loaded salary costs. Include recruitment, training, and management overhead. This gives you the true economic value of automation.

Layer 3: Revenue Attribution

Connect AI agents directly to revenue outcomes. Customer-facing agents get attribution for deals they influence. Internal agents get attribution for deals they accelerate.

Attribution models:

  • First-touch: Agent initiated the conversation that led to a sale
  • Multi-touch: Agent contributed to qualification, proposal, or close
  • Acceleration: Agent reduced sales cycle length

Track these with the same rigour as marketing attribution. AI agents are not just cost centres — they're revenue drivers.

Layer 4: Compound Value Tracking

AI agents improve over time. Your measurement system must capture learning curves, not just point-in-time performance.

Month one: Customer support agent resolves 60% of tickets without escalation Month six: Same agent resolves 87% of tickets without escalation

The additional 27% improvement represents pure compound value. No additional training costs. No new hires. Just algorithmic learning creating ongoing ROI.

Why Most Companies Miss This

The 27% of companies that can't measure AI revenue impact make three consistent mistakes:

Mistake 1: Quarterly thinking for multi-year assets AI agents have learning curves measured in months, not quarters. Quarterly reviews miss the compound value story. Companies kill high-potential agents because they don't hit arbitrary 90-day benchmarks.

Mistake 2: Department-level attribution AI agents work across functions. A document processing agent might serve legal, sales, and operations simultaneously. Department budgets can't capture cross-functional value creation.

Mistake 3: Technology metrics instead of business metrics Model accuracy, latency, and uptime matter for engineering teams. They don't matter for ROI measurement. Business leaders care about cycle time reduction, error rate improvement, and customer satisfaction scores.

What Infrastructure Do You Need for Token-Level ROI Measurement?

Token-level ROI measurement needs different infrastructure than traditional IT measurement. You need:

Real-time consumption tracking: Monthly AWS bills don't work for token economics. You need daily visibility into which agents are consuming which tokens for which business outcomes.

Cross-system attribution: AI agents touch multiple systems. Your measurement infrastructure must connect tokens to CRM records, support tickets, and revenue outcomes.

Learning curve analytics: Static dashboards miss AI improvement over time. You need trending analysis that shows capability development alongside cost evolution.

Most companies try to bolt AI measurement onto existing IT systems. This fails because existing systems weren't designed for consumption-based, cross-functional, learning assets — which is why the CIOs who close the AI investment ROI gap build purpose-designed measurement stacks.

The 48% of AI leaders who measure revenue impact built purpose-designed measurement stacks. They treat AI measurement as a discrete capability requiring dedicated infrastructure investment.

What This Means for Your Organisation

If you're planning AI deployment without measurement infrastructure, you're planning to fail. 75% of enterprises want to work with service providers on priority use cases precisely because internal measurement capabilities don't exist.

The companies that crack this early gain competitive advantage. They can deploy AI agents faster because they understand unit economics. They can scale successful agents and kill unsuccessful ones based on data, not intuition.

More importantly, they can present ROI evidence to boards and investors. In PE-backed environments, demonstrable AI ROI becomes a value creation multiplier for exit planning.

Your CTO can build the agents. Your CFO can track the spending. But bridging agent capability to business value requires new measurement thinking — often guided by fractional CTO services that bring experience from multiple AI transformations.

The gap between AI investment and ROI isn't a technology problem. It's a measurement problem. The technology exists to solve it — with the right AI-native engineering support.

Key Takeaways

Token-level P&L frameworks replace traditional CAPEX measurement with consumption-based value tracking for agentic AI systems • Four-layer measurement captures token costs, human displacement, revenue attribution, and compound learning value over time • Real-time infrastructure connects AI consumption to business outcomes across CRM, support, and revenue systems • Cross-functional attribution measures AI agents that work across departments, not within traditional budget silos • Purpose-built measurement stacks separate successful AI deployments from failed pilots that can't demonstrate ROI

The Next Step

TechLevity has built token-level ROI measurement frameworks for 18 production AI systems. We've seen where the £186 million goes — and where it creates value.

Our AI ROI audit maps your current AI spending to business outcomes using the four-layer framework above. We identify which investments drive value, which create cost, and which measurement gaps prevent accurate attribution.

The audit takes three weeks. You'll know exactly where your AI budget creates ROI and where it doesn't.

**Book an AI ROI audit** — we'll show you where your £186M is going.

<!-- Internal Link Suggestions:

  • Link "why AI projects fail" to /insights/why-ai-projects-fail in the opening section about measurement failures
  • Link "AI maturity framework" to /insights/ai-sdlc-maturity-framework when discussing the 5% that measure P&L impact
  • Link "£200K AI pilot that never shipped" to /insights/200k-ai-pilot-never-shipped in the measurement crisis section
  • Link "fractional CTO" to /services/fractional-cto in the "Your CTO can build the agents" paragraph
  • Link "AI strategy" to /services/ai-strategy in the conclusion section about TechLevity services

-->

Want a second opinion on your AI initiative?

30-minute sanity check call. No pitch, no slides.

Book your call →

Newsletter

This is where I share what I can't post publicly.

AI strategy for UK scale-ups. Monthly. No fluff.

Subscribe to Beyond Growth →